🌐 語言

English | 簡體中文 | 繁體中文 | 日本語 | 韓國語 | 印度語 | 泰語 | 法語 | 德語 | 西班牙語 | 義大利語 | 俄語 | 葡萄牙語 | 荷蘭語 | 波蘭語 | 阿拉伯語 | 波斯語 | 土耳其語 | 越南語 | 印尼語 | 阿薩姆語

聯絡方式 📫

本倉庫的主要貢獻者是一位將於 2026 年畢業的碩士生，歡迎聯絡合作或交流機會。

本倉庫的主要貢獻者是一名 2026 屆碩士畢業生，歡迎聯絡合作或交流機會。

使用方法 📖

[!重要]

不支援 Windows。如在 Windows 上，請使用 WSL。

強烈建議先從 CLI 與最小任務開始，以確認依賴項與環境配置正確。

設定

若使用 CLI，pptagent onboard 可協助互動式創建與更新這些設定。若使用 Docker Compose 或源碼建構，則需手動準備：

cp deeppresenter/config.yaml.example deeppresenter/config.yaml
cp deeppresenter/mcp.json.example deeppresenter/mcp.json

#### 提升品質的可選服務

以下服務可以顯著提升生成品質，尤其是在研究深度、PDF解析及視覺素材創建方面：

Tavily：提升網頁搜尋品質。請至 tavily.com 申請 API 金鑰，然後於 deeppresenter/mcp.json 設定 TAVILY_API_KEY。
MinerU：提升 PDF 解析品質。你可以至 mineru.net 申請 API 金鑰並於 deeppresenter/mcp.json 設定 MINERU_API_KEY，或是在本地部署 MinerU 並改設 MINERU_API_URL。
文字轉圖像模型：提升圖像生成品質。在 deeppresenter/config.yaml 設定 t2i_model。

若你希望完全離線運作，請在本地部署 MinerU 並於 deeppresenter/config.yaml 設定 offline_mode: true，以避免載入如網頁搜尋等需網路的工具。

更多可調整的變數可在 constants.py 找到。

1. 個人使用／OpenClaw 整合：CLI

[!NOTE]

在 macOS 上，CLI 可能會自動安裝多項本地依賴，包括 Homebrew、Node.js、Docker、poppler、Playwright 與 llama.cpp。

在 Linux 上，你需自行準備環境。

若你想要最快速的本地安裝，或希望將 DeepPresenter 透過 CLI 插入 OpenClaw，請使用此模式。

# Install uv
curl -LsSf https://astral.sh/uv/install.sh | sh
First-time interactive setup
uvx pptagent onboard
Generate a presentation
uvx pptagent generate "Single Page with Title: Hello World" -o hello.pptx
Generate with attachments
uvx pptagent generate "Q4 Report" \
  -f data.xlsx \
  -f charts.pdf \
  -p "10-12" \
  -o report.pptx

| 指令 | 說明 | | ------------------- | ------------------------------------------------- | | pptagent onboard | 互動式設定精靈 | | pptagent generate | 產生簡報 | | pptagent config | 查看目前設定 | | pptagent reset | 重設設定 | | pptagent serve | 啟動 CLI 使用的本地推論服務 |

2. 最小化設定／開發：從原始碼建置

如果你想在開發過程中擁有最少的抽象層並完全掌控相依性，請使用此模式。

uv pip install -e .
playwright install-deps
playwright install chromium
npm install --prefix deeppresenter/html2pptx
modelscope download forceless/fasttext-language-id
docker pull forceless/deeppresenter-sandbox
docker pull forceless/deeppresenter-host
docker tag forceless/deeppresenter-sandbox deeppresenter-sandbox
or build from dockerfile
docker build -t deeppresenter-sandbox -f deeppresenter/docker/SandBox.Dockerfile .

啟動應用程式：

python webui.py

3. 伺服器部署：Docker Compose

使用此模式可建立具有明確依賴關係的穩定伺服器環境。

# Pull the public images to avoid build from source
docker pull forceless/deeppresenter-sandbox
docker tag forceless/deeppresenter-sandbox deeppresenter-sandbox
Or build from source
docker build -t deeppresenter-sandbox -f deeppresenter/docker/SandBox.Dockerfile .
Start the host service
docker compose up -d

The service exposes the web UI on http://localhost:7861.

Case Study 💡

#### Prompt: Please present the given document to me.

#### Prompt: 请介绍小米 SU7 的外观和价格

#### Prompt: 请制作一份高中课堂展示课件，主题为“解码立法过程：理解其对国际关系的影响”

貢獻者 🌟

_Force1ess	_Puelloc	_hongyan	_Dnoob	_Sadahlu
_{KurisuMakiseSame}	_Angelen	_BrandonHu	_{Eliot White}	_EvolvedGhost
_ISCAS-zwl	_{James Brown}	_JunZhang	_{Open AI Tx}	_{Sense_wang}
_SuYao	_{Zakir Jiwani}	_Zhenyu	_lnennnn

引用 🙏

如果您覺得本專案對您有幫助，請使用以下內容進行引用：

@inproceedings{zheng-etal-2025-pptagent, title = "{PPTA}gent: Generating and Evaluating Presentations Beyond Text-to-Slides", author = "Zheng, Hao and Guan, Xinyan and Kong, Hao and Zhang, Wenkai and Zheng, Jia and Zhou, Weixiang and Lin, Hongyu and Lu, Yaojie and Han, Xianpei and Sun, Le", editor = "Christodoulopoulos, Christos and Chakraborty, Tanmoy and Rose, Carolyn and Peng, Violet", booktitle = "Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing", month = nov, year = "2025", address = "Suzhou, China", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/2025.emnlp-main.728/", doi = "10.18653/v1/2025.emnlp-main.728", pages = "14413--14429", ISBN = "979-8-89176-332-6", abstract = "Automatically generating presentations from documents is a challenging task that requires accommodating content quality, visual appeal, and structural coherence. Existing methods primarily focus on improving and evaluating the content quality in isolation, overlooking visual appeal and structural coherence, which limits their practical applicability. To address these limitations, we propose PPTAgent, which comprehensively improves presentation generation through a two-stage, edit-based approach inspired by human workflows. PPTAgent first analyzes reference presentations to extract slide-level functional types and content schemas, then drafts an outline and iteratively generates editing actions based on selected reference slides to create new slides. To comprehensively evaluate the quality of generated presentations, we further introduce PPTEval, an evaluation framework that assesses presentations across three dimensions: Content, Design, and Coherence. Results demonstrate that PPTAgent significantly outperforms existing automatic presentation generation methods across all three dimensions." }

@misc{zheng2026deeppresenterenvironmentgroundedreflectionagentic, title={DeepPresenter: Environment-Grounded Reflection for Agentic Presentation Generation}, author={Hao Zheng and Guozhao Mo and Xinru Yan and Qianhao Yuan and Wenkai Zhang and Xuanang Chen and Yaojie Lu and Hongyu Lin and Xianpei Han and Le Sun}, year={2026}, eprint={2602.22839}, archivePrefix={arXiv}, primaryClass={cs.AI}, url={https://arxiv.org/abs/2602.22839}, }

--- Tranlated By Open Ai Tx | Last indexed: 2026-04-09 ---

PPTAgent

聯絡方式 📫

最新消息 📅

使用方法 📖

設定

1. 個人使用／OpenClaw 整合：CLI

First-time interactive setup

Generate a presentation

Generate with attachments

2. 最小化設定／開發：從原始碼建置

or build from dockerfile

3. 伺服器部署：Docker Compose

Or build from source

Start the host service

Case Study 💡

貢獻者 🌟

引用 🙏