
On-device-Sora

⭐ 131 stars English by eai-lab

[arXiv] On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices

Paper: https://arxiv.org/abs/2502.04363

Code implementation of the arXiv paper On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices. The code is based on Open-Sora: Democratizing Efficient Video Production for All.

Introduction

This repository provides the code for On-device Sora, the open-source implementation of the paper On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices.

On-Device Sora

On-device Sora applies Linear Proportional Leap (LPL), Temporal Dimension Token Merging (TDTM), and Concurrent Inference with Dynamic Loading (CI-DL) to enable efficient video generation on the iPhone 15 Pro.
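To make the idea behind TDTM more concrete, here is a minimal sketch of merging tokens along the temporal dimension. It assumes that consecutive frames are merged pairwise by averaging and later restored by repetition; that choice, and the names `merge_temporal` and `unmerge_temporal`, are illustrative assumptions and not taken from this repository.

```python
from typing import List

# One frame is a list of tokens; one token is a list of channel values.
Frame = List[List[float]]


def merge_temporal(frames: List[Frame]) -> List[Frame]:
    """Average each pair of consecutive frames, halving the temporal length.

    Assumption: pairwise averaging is used as the merge rule.
    """
    if len(frames) % 2 != 0:
        raise ValueError("expected an even number of frames")
    merged: List[Frame] = []
    for i in range(0, len(frames), 2):
        first, second = frames[i], frames[i + 1]
        merged.append(
            [
                [(x + y) / 2.0 for x, y in zip(tok_a, tok_b)]
                for tok_a, tok_b in zip(first, second)
            ]
        )
    return merged


def unmerge_temporal(merged: List[Frame]) -> List[Frame]:
    """Restore the original temporal length by repeating each merged frame."""
    restored: List[Frame] = []
    for frame in merged:
        restored.append([list(tok) for tok in frame])
        restored.append([list(tok) for tok in frame])
    return restored
```

With four frames as input, an attention layer placed between `merge_temporal` and `unmerge_temporal` would process two frames' worth of tokens instead of four, which is where the savings would come from under these assumptions.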

On-Device_Sora

Open-Sora

Open-Sora is the baseline model for On-device Sora. It is an open-source video generation project whose text-to-video (T2V) diffusion model produces videos from text input.

How to convert each model to MLPackage for On-device Sora

Package Dependencies

Dependency

cd Device_conversion

conda create -n convert python=3.10

conda activate convert

pip install -r requirements/requirements-convert.txt

pip install -v .

Converting

T5 Converting

cd t5
python3 export-t5.py

STDiT Converting

cd stdit3
python3 export-stdit3.py

VAE Converting

Running export-vae-spatial.py may fail with the error Fatal Python error: PyEval_SaveThread. To work around it, run only one code block for each VAE part at a time and comment out the rest.
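As an alternative to commenting blocks in and out by hand, one option is to select a single block from the command line. The sketch below only illustrates that pattern: `export_part_a`, `export_part_b`, and the `--part` flag are hypothetical placeholders and do not exist in export-vae-spatial.py.

```python
import argparse
from typing import Callable, Dict, List, Optional


def export_part_a() -> str:
    # Hypothetical placeholder for the first conversion block.
    return "part_a"


def export_part_b() -> str:
    # Hypothetical placeholder for the second conversion block.
    return "part_b"


# Registry of the blocks that should never run in the same process.
PARTS: Dict[str, Callable[[], str]] = {
    "part_a": export_part_a,
    "part_b": export_part_b,
}


def main(argv: Optional[List[str]] = None) -> str:
    """Run exactly one block, chosen with --part, and return its name."""
    parser = argparse.ArgumentParser(description="Run one conversion block")
    parser.add_argument("--part", required=True, choices=sorted(PARTS))
    args = parser.parse_args(argv)
    return PARTS[args.part]()
```

Calling `main(["--part", "part_a"])` runs only the first block, so each block gets its own fresh Python process when the script is invoked once per part.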

cd vae

For the VAE's temporal part:

python3 export-vae-temporal.py

For the VAE's spatial part:

python3 export-vae-spatial.py
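The conversion steps above can also be collected into a small driver that runs each export script in its own process. This is a sketch under two assumptions: the `t5`, `stdit3`, and `vae` directories all sit directly under `Device_conversion`, and the `convert` environment is already active. The names `STEPS`, `plan`, and `run_all` are illustrative.

```python
import subprocess
import sys
from pathlib import Path
from typing import List, Tuple

# (subdirectory, script) pairs taken from the steps in this README.
STEPS: List[Tuple[str, str]] = [
    ("t5", "export-t5.py"),
    ("stdit3", "export-stdit3.py"),
    ("vae", "export-vae-temporal.py"),
    ("vae", "export-vae-spatial.py"),
]


def plan(root: Path) -> List[Tuple[Path, List[str]]]:
    """Return (working directory, argv) for every conversion step."""
    return [(root / sub, [sys.executable, script]) for sub, script in STEPS]


def run_all(root: Path, dry_run: bool = True) -> List[str]:
    """Describe each step and, unless dry_run is set, execute it.

    Each script runs in a separate process; a failing script raises
    subprocess.CalledProcessError and stops the remaining steps.
    """
    described: List[str] = []
    for cwd, argv in plan(root):
        described.append(f"(cd {cwd} && {' '.join(argv)})")
        if not dry_run:
            subprocess.run(argv, cwd=cwd, check=True)
    return described
```

`run_all(Path("Device_conversion"))` only lists the commands; pass `dry_run=False` to execute them. The driver does not remove the need to run one code block at a time inside export-vae-spatial.py.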

How to Use

Required

Download converted MLPackage (if you don't want to convert each model to MLPackage)

You can download and use the converted models from the following link. [Download]
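After downloading or converting, a quick check that the packages are in place can save a failed app launch. The sketch below relies on the fact that a Core ML model package is a directory ending in `.mlpackage` that contains a `Manifest.json` file; the function name `find_mlpackages` and the folder you pass in are illustrative, and this README does not state which packages the app expects.

```python
from pathlib import Path
from typing import List


def find_mlpackages(root: Path) -> List[Path]:
    """Return every *.mlpackage directory under root that has a Manifest.json."""
    return sorted(
        path
        for path in root.rglob("*.mlpackage")
        if path.is_dir() and (path / "Manifest.json").is_file()
    )
```

For example, `find_mlpackages(Path("Downloads"))` lists the packages found in a hypothetical `Downloads` folder; an empty list suggests the download is incomplete or was unpacked elsewhere.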

Run the app

Example artifacts

On-device-Sora-Example1
