UniSH：在一次前向传递中统一场景与人体重建

李孟飞¹，李鹏¹，张铮²，卢家豪¹，赵成峰¹，薛伟¹，
刘启峰¹，彭思达³，张文晓¹，罗文翰¹，刘远^1†，郭奕柯^1†

¹香港科技大学，²北京邮电大学，³浙江大学

TL;DR

给定单目视频作为输入，我们的UniSH能够在一次前向传递中联合重建场景和人体，实现对场景几何、相机参数和SMPL参数的有效估计。

🛠️ 安装

我们提供了一种无需sudo权限的安装方法，适用于大多数Linux服务器（包括无头服务器）。

第一步：克隆仓库

git clone https://github.com/murphylmf/UniSH.git
cd UniSH

步骤 2：创建 Conda 环境

这将安装 Python、系统编译器和 OpenGL 驱动程序。

conda env create -f environment.yml
conda activate unish

第3步：编译依赖项

此脚本使用第2步中安装的编译器从源代码编译PyTorch3D、MMCV和SAM2。

该环境已在CUDA 12.1和CUDA 11.8上进行测试。您可以通过将CUDA版本作为参数传递给安装脚本来指定CUDA版本。

# Default (Auto-detect or 12.1)
bash install.sh
For CUDA 11.8
bash install.sh 11.8
For CUDA 12.1
bash install.sh 12.1

第4步：下载SMPL模型

请下载SMPL模型并将其放置在body_models文件夹中。目录结构应按以下方式组织：

UniSH/
├── body_models/
│   └── smpl/
│       └── smpl/
│           ├── SMPL_FEMALE.pkl
│           ├── SMPL_MALE.pkl
│           └── SMPL_NEUTRAL.pkl

🚀 快速开始（推理）

运行推理

运行以下命令从视频中重建场景和人物：

python inference.py --output_dir inference_results/example --video_path examples/example_video.mp4

请参阅 inference.py 以获取有关附加参数的更多信息。

📝 引用

如果您发现此代码对您的研究有帮助，请考虑引用我们的论文：

@misc{li2026unishunifyingscenehuman,
      title={UniSH: Unifying Scene and Human Reconstruction in a Feed-Forward Pass}, 
      author={Mengfei Li and Peng Li and Zheng Zhang and Jiahao Lu and Chengfeng Zhao and Wei Xue and Qifeng Liu and Sida Peng and Wenxiao Zhang and Wenhan Luo and Yuan Liu and Yike Guo},
      year={2026},
      eprint={2601.01222},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2601.01222}, 
}

🙏 致谢

我们感谢以下项目的卓越贡献：

📄 许可协议

本项目采用 Apache 2.0 许可协议。详情请参见 LICENSE。

--- Tranlated By Open Ai Tx | Last indexed: 2026-03-24 ---