Web Analytics

ATI

⭐ 243 stars Simplified Chinese by bytedance

ATI: 可控视频生成的任意轨迹指令

arXiv  project page 

ATI: 可控视频生成的任意轨迹指令
Angtian Wang, Haibin Huang, Jacob Zhiyuan Fang, Yiding Yang, Chongyang Ma

字节跳动智能创作团队

观看视频

这是Wan2.1 ATI(可控视频生成的任意轨迹指令)的代码库,一个基于轨迹的运动控制框架,统一了视频生成中的物体、局部和摄像机运动。该代码库基于Wan2.1官方实现

相比原始Wan2.1,我们新增了以下文件:

我们修改了以下文件:

社区作品

ComfyUI

感谢Kijai开发了ATI的ComfyUI节点: https://github.com/kijai/ComfyUI-WanVideoWrapper

FP8量化Huggingface模型:https://huggingface.co/Kijai/WanVideo_comfy/blob/main/Wan2_1-I2V-ATI-14B_fp8_e4m3fn.safetensors

指南

Benji的使用指南:https://www.youtube.com/watch?v=UM35z2L1XbI

安装

ATI需要与官方Wan 2.1相同的环境。请按照INSTALL.md(Wan2.1)的说明进行安装。

git clone https://github.com/bytedance/ATI.git
cd ATI
安装软件包

pip install .
首先,您需要下载Wan2.1的14B原始模型。

huggingface-cli download Wan-AI/Wan2.1-I2V-14B-480P --local-dir ./Wan2.1-I2V-14B-480P
然后从我们的 huggingface 仓库下载 ATI-Wan 模型。

huggingface-cli download bytedance-research/ATI --local-dir ./Wan2.1-ATI-14B-480P
最后,将VAE、T5及其他杂项检查点从原始Wan2.1文件夹复制到ATI检查点位置

cp ./Wan2.1-I2V-14B-480P/Wan2.1_VAE.pth ./Wan2.1-ATI-14B-480P/
cp ./Wan2.1-I2V-14B-480P/models_t5_umt5-xxl-enc-bf16.pth ./Wan2.1-ATI-14B-480P/
cp ./Wan2.1-I2V-14B-480P/models_clip_open-clip-xlm-roberta-large-vit-huge-14.pth ./Wan2.1-ATI-14B-480P/
cp -r ./Wan2.1-I2V-14B-480P/xlm-roberta-large ./Wan2.1-ATI-14B-480P/
cp -r ./Wan2.1-I2V-14B-480P/google ./Wan2.1-ATI-14B-480P/

Run

We provide a demo sript to run ATI.

bash run_example.sh -p examples/test.yaml -c ./Wan2.1-ATI-14B-480P -o samples
where -p 是配置文件的路径,-c 是检查点文件的路径,-o 是输出目录的路径,-g 定义使用的 GPU 数量(如果未指定,将使用所有可用 GPU;如果给定 1,则以单进程模式运行)。

完成后,您将看到以下内容:

预期结果:

输入图像及轨迹 生成的视频(叠加轨迹)
Image 0 Image 0
Image 1 Image 1
Image 2 Image 2
Image 3 Image 3
Image 4 Image 4
Image 5 Image 5

创建您自己的轨迹

我们提供了一个交互式工具,允许用户在图像上绘制和编辑轨迹。

cd tools/trajectory_editor
python3 app.py
then open this url localhost:5000 in the browser. Note if you run the editor on the server, you need to replace localhost with the server's IP address.

Interface Screenshot

Trajectory Functions

a. Free Trajectory: Click and then drag with the mouse directly on the image. b. Circular (Camera Control):

c. Static Point: A point that remains stationary over time.

Note: Pay attention to the progress bar in the box to control motion speed. Progress Control

Trajectory Editing

Camera Pan Control

Store Tracks

Citation

Please cite our paper if you find our work useful:
@article{wang2025ati,
  title={{ATI}: Any Trajectory Instruction for Controllable Video Generation},
  author={Wang, Angtian and Huang, Haibin and Fang, Zhiyuan and Yang, Yiding, and Ma, Chongyang}
  journal={arXiv preprint},
  volume={arXiv:2505.22944},
  year={2025}
}

--- Tranlated By Open Ai Tx | Last indexed: 2025-07-19 ---