Goku is a new large-scale dataset containing 2 million high-quality, instruction-aligned video editing pairs. It expands beyond basic appearance editing to include multi-task and structural manipulations, such as precise control of subject movement. This benchmark aims to address the complex creative demands of real-world instruction-based video editing.
Read original
huggingface/daily-papers