聯絡方式 📫
本倉庫的主要貢獻者是一位將於 2026 年畢業的碩士生,歡迎聯絡合作或交流機會。>
本倉庫的主要貢獻者是一名 2026 屆碩士畢業生,歡迎聯絡合作或交流機會。
最新消息 📅
- [2026/03]:我們現已支持 CLI,並已在 Hugging Face 🤗 發佈我們的微調模型!
- [2026/01]:我們支持自由格式與模板生成,並支援 PPTX 匯出與離線模式!新增上下文管理功能以避免上下文溢出。
- [2025/12]:🔥 發佈 V2,重大改進 - 深度研究整合、自由形式視覺設計、自主資產創建、文字轉圖片生成,並有沙盒環境及 20+ 工具。
- [2025/09]:🛠️ 增加 MCP 伺服器支援 - 設定詳情請參見 MCP Server
- [2025/09]:🚀 發佈 v2,重大改進 - 詳情請參見 release notes
- [2025/08]:🎉 論文已被 EMNLP 2025 接收!
- [2025/05]:✨ 發佈 v1,核心功能與 🌟 突破:GitHub 點贊數達 1,000!詳情請參見 release notes
- [2025/01]:🔓 原始碼開源,實驗性代碼已存檔於 experiment release
使用方法 📖
[!重要]
不支援 Windows。如在 Windows 上,請使用 WSL。>
強烈建議先從 CLI 與最小任務開始,以確認依賴項與環境配置正確。
設定
若使用 CLI,pptagent onboard 可協助互動式創建與更新這些設定。若使用 Docker Compose 或源碼建構,則需手動準備:
cp deeppresenter/config.yaml.example deeppresenter/config.yaml
cp deeppresenter/mcp.json.example deeppresenter/mcp.json#### 提升品質的可選服務
以下服務可以顯著提升生成品質,尤其是在研究深度、PDF解析及視覺素材創建方面:
- Tavily:提升網頁搜尋品質。請至 tavily.com 申請 API 金鑰,然後於
deeppresenter/mcp.json設定TAVILY_API_KEY。 - MinerU:提升 PDF 解析品質。你可以至 mineru.net 申請 API 金鑰並於
deeppresenter/mcp.json設定MINERU_API_KEY,或是在本地部署 MinerU 並改設MINERU_API_URL。 - 文字轉圖像模型:提升圖像生成品質。在
deeppresenter/config.yaml設定t2i_model。
deeppresenter/config.yaml 設定 offline_mode: true,以避免載入如網頁搜尋等需網路的工具。更多可調整的變數可在 constants.py 找到。
1. 個人使用/OpenClaw 整合:CLI
[!NOTE]
在 macOS 上,CLI 可能會自動安裝多項本地依賴,包括 Homebrew、Node.js、Docker、poppler、Playwright 與 llama.cpp。>
在 Linux 上,你需自行準備環境。
若你想要最快速的本地安裝,或希望將 DeepPresenter 透過 CLI 插入 OpenClaw,請使用此模式。
# Install uv
curl -LsSf https://astral.sh/uv/install.sh | shFirst-time interactive setup
uvx pptagent onboardGenerate a presentation
uvx pptagent generate "Single Page with Title: Hello World" -o hello.pptxGenerate with attachments
uvx pptagent generate "Q4 Report" \
-f data.xlsx \
-f charts.pdf \
-p "10-12" \
-o report.pptx| 指令 | 說明 |
| ------------------- | ------------------------------------------------- |
| pptagent onboard | 互動式設定精靈 |
| pptagent generate | 產生簡報 |
| pptagent config | 查看目前設定 |
| pptagent reset | 重設設定 |
| pptagent serve | 啟動 CLI 使用的本地推論服務 |
2. 最小化設定/開發:從原始碼建置
如果你想在開發過程中擁有最少的抽象層並完全掌控相依性,請使用此模式。
uv pip install -e .
playwright install-deps
playwright install chromium
npm install --prefix deeppresenter/html2pptx
modelscope download forceless/fasttext-language-iddocker pull forceless/deeppresenter-sandbox
docker pull forceless/deeppresenter-host
docker tag forceless/deeppresenter-sandbox deeppresenter-sandbox
or build from dockerfile
docker build -t deeppresenter-sandbox -f deeppresenter/docker/SandBox.Dockerfile .
啟動應用程式:python webui.py3. 伺服器部署:Docker Compose
使用此模式可建立具有明確依賴關係的穩定伺服器環境。
# Pull the public images to avoid build from source
docker pull forceless/deeppresenter-sandbox
docker tag forceless/deeppresenter-sandbox deeppresenter-sandboxOr build from source
docker build -t deeppresenter-sandbox -f deeppresenter/docker/SandBox.Dockerfile .Start the host service
docker compose up -dThe service exposes the web UI on http://localhost:7861.
Case Study 💡
- #### Prompt: Please present the given document to me.










- #### Prompt: 请介绍小米 SU7 的外观和价格






- #### Prompt: 请制作一份高中课堂展示课件,主题为“解码立法过程:理解其对国际关系的影响”















貢獻者 🌟
|
Force1ess |
Puelloc |
hongyan |
Dnoob |
Sadahlu |
|
KurisuMakiseSame |
Angelen |
BrandonHu |
Eliot White |
EvolvedGhost |
|
ISCAS-zwl |
James Brown |
JunZhang |
Open AI Tx |
Sense_wang |
|
SuYao |
Zakir Jiwani |
Zhenyu |
lnennnn |
引用 🙏
如果您覺得本專案對您有幫助,請使用以下內容進行引用:
@inproceedings{zheng-etal-2025-pptagent,
title = "{PPTA}gent: Generating and Evaluating Presentations Beyond Text-to-Slides",
author = "Zheng, Hao and
Guan, Xinyan and
Kong, Hao and
Zhang, Wenkai and
Zheng, Jia and
Zhou, Weixiang and
Lin, Hongyu and
Lu, Yaojie and
Han, Xianpei and
Sun, Le",
editor = "Christodoulopoulos, Christos and
Chakraborty, Tanmoy and
Rose, Carolyn and
Peng, Violet",
booktitle = "Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing",
month = nov,
year = "2025",
address = "Suzhou, China",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.emnlp-main.728/",
doi = "10.18653/v1/2025.emnlp-main.728",
pages = "14413--14429",
ISBN = "979-8-89176-332-6",
abstract = "Automatically generating presentations from documents is a challenging task that requires accommodating content quality, visual appeal, and structural coherence. Existing methods primarily focus on improving and evaluating the content quality in isolation, overlooking visual appeal and structural coherence, which limits their practical applicability. To address these limitations, we propose PPTAgent, which comprehensively improves presentation generation through a two-stage, edit-based approach inspired by human workflows. PPTAgent first analyzes reference presentations to extract slide-level functional types and content schemas, then drafts an outline and iteratively generates editing actions based on selected reference slides to create new slides. To comprehensively evaluate the quality of generated presentations, we further introduce PPTEval, an evaluation framework that assesses presentations across three dimensions: Content, Design, and Coherence. Results demonstrate that PPTAgent significantly outperforms existing automatic presentation generation methods across all three dimensions."
}@misc{zheng2026deeppresenterenvironmentgroundedreflectionagentic,
title={DeepPresenter: Environment-Grounded Reflection for Agentic Presentation Generation},
author={Hao Zheng and Guozhao Mo and Xinru Yan and Qianhao Yuan and Wenkai Zhang and Xuanang Chen and Yaojie Lu and Hongyu Lin and Xianpei Han and Le Sun},
year={2026},
eprint={2602.22839},
archivePrefix={arXiv},
primaryClass={cs.AI},
url={https://arxiv.org/abs/2602.22839},
}
--- Tranlated By Open Ai Tx | Last indexed: 2026-04-09 ---