Netflix is quietly stepping into the AI race with a new video model that could fundamentally change how movies are made. Called VOID, short for Video Object and Interaction Deletion, the model can remove objects from a video and then intelligently rebuild the scene as if those objects never existed.
Unlike traditional editing tools that simply erase objects, VOID understands how objects interact in a scene and then reimagines how everything else should behave once something is removed.
For instance, if a person jumps into a pool and creates a splash, VOID can remove the person and regenerate the water to look perfectly undisturbed.
The model was developed by researchers affiliated with Netflix and academia, and is based on a vision-language model (VLM) architecture. It analyses both visual cues and contextual relationships to generate what researchers describe as “physically plausible” outcomes after edits.
Existing tools like Runway, ProPainter and DiffuEraser already allow creators to make changes to video. But Netflix’s team claims VOID significantly outperforms these alternatives. In human evaluation study involving 25 participants, VOID-generated outputs were preferred nearly 65% of the time, far ahead of competitors.
This sounds like a good news for studios, as it could mean fewer costly reshoots, faster production cycles and greater creative flexibility. For creators outside Hollywood, the barrier to high-quality visual storytelling could drop dramatically.
