|
Gradient-based Planning with World Models
Jyothir S V, Siddhartha Jalagam, Yann LeCun, Vlad Sobal
under review, 2023
paper / code
Most model predictive control (MPC) algorithms designed for visual world models have traditionally relied on gradient-free, population-based optimization methods, such as the Cross-Entropy Method (CEM) and Model Predictive Path Integral (MPPI), for planning. We explore a gradient-based alternative that fully leverages the differentiability of the world model.
|
|
A Massively Multi-System MultiReference Data Set for Dialog Metric Evaluation
Huda Khayrallah, Zuhaib Akhtar, Edward Cohen, Jyothir S V, João Sedoc
under review, 2023
paper
Automatic metrics for dialogue evaluation should be robust proxies for human judgments; however, the verification of robustness is currently far from satisfactory. To quantify the robustness correlation and understand what is necessary in a test set, we create and release an 8-reference dialog dataset. We then train 1750 systems and evaluate them, along with publicly available large models, on our novel test set and the DailyDialog dataset.
|
|
Joint Embedding Predictive Architectures Focus on Slow Features
Vlad Sobal, Jyothir S V, Siddhartha Jalagam, Nicolas Carion, Kyunghyun Cho, Yann LeCun
arXiv, 2022
arxiv / code
In this work, we analyze the performance of JEPA trained with VICReg and SimCLR objectives in the fully offline setting, without access to rewards, and compare the results to the performance of a generative architecture.
|
|