Meta's AI research division, developer of the Segment Anything Model (SAM/SAM-2).
“LeCun's approach constructs a new latent space... the problem is: after predicting a latent state, if you show it to a language model, the language model can't read it. Show it to a video model, the video model also can't read it.”
Source→“Authors of V-JEPA 2-AC (assran2025vjepa2), referenced as enabling 'planning from image goals in latent space after large-scale pre-training on video data'.”
Source→AI-extracted from podcast / newsletter / paper summaries. May contain errors.