Web Analytics

PPTAgent

⭐ 3987 stars Traditional Chinese by icip-cas

🌐 語言

https://github.com/icip-cas/PPTAgent

聯絡方式 📫

本倉庫的主要貢獻者是一位將於 2026 年畢業的碩士生,歡迎聯絡合作或交流機會。
>
本倉庫的主要貢獻者是一名 2026 屆碩士畢業生,歡迎聯絡合作或交流機會。

最新消息 📅

使用方法 📖

[!重要]
不支援 Windows。如在 Windows 上,請使用 WSL。
>
強烈建議先從 CLI 與最小任務開始,以確認依賴項與環境配置正確。

設定

若使用 CLI,pptagent onboard 可協助互動式創建與更新這些設定。若使用 Docker Compose 或源碼建構,則需手動準備:

cp deeppresenter/config.yaml.example deeppresenter/config.yaml
cp deeppresenter/mcp.json.example deeppresenter/mcp.json

#### 提升品質的可選服務

以下服務可以顯著提升生成品質,尤其是在研究深度、PDF解析及視覺素材創建方面:

若你希望完全離線運作,請在本地部署 MinerU 並於 deeppresenter/config.yaml 設定 offline_mode: true,以避免載入如網頁搜尋等需網路的工具。

更多可調整的變數可在 constants.py 找到。

1. 個人使用/OpenClaw 整合:CLI

[!NOTE]
在 macOS 上,CLI 可能會自動安裝多項本地依賴,包括 Homebrew、Node.js、Docker、poppler、Playwright 與 llama.cpp。
>
在 Linux 上,你需自行準備環境。

若你想要最快速的本地安裝,或希望將 DeepPresenter 透過 CLI 插入 OpenClaw,請使用此模式。

# Install uv
curl -LsSf https://astral.sh/uv/install.sh | sh

First-time interactive setup

uvx pptagent onboard

Generate a presentation

uvx pptagent generate "Single Page with Title: Hello World" -o hello.pptx

Generate with attachments

uvx pptagent generate "Q4 Report" \ -f data.xlsx \ -f charts.pdf \ -p "10-12" \ -o report.pptx

| 指令 | 說明 | | ------------------- | ------------------------------------------------- | | pptagent onboard | 互動式設定精靈 | | pptagent generate | 產生簡報 | | pptagent config | 查看目前設定 | | pptagent reset | 重設設定 | | pptagent serve | 啟動 CLI 使用的本地推論服務 |

2. 最小化設定/開發:從原始碼建置

如果你想在開發過程中擁有最少的抽象層並完全掌控相依性,請使用此模式。

uv pip install -e .
playwright install-deps
playwright install chromium
npm install --prefix deeppresenter/html2pptx
modelscope download forceless/fasttext-language-id

docker pull forceless/deeppresenter-sandbox docker pull forceless/deeppresenter-host docker tag forceless/deeppresenter-sandbox deeppresenter-sandbox

or build from dockerfile

docker build -t deeppresenter-sandbox -f deeppresenter/docker/SandBox.Dockerfile .
啟動應用程式:

python webui.py

3. 伺服器部署:Docker Compose

使用此模式可建立具有明確依賴關係的穩定伺服器環境。

# Pull the public images to avoid build from source
docker pull forceless/deeppresenter-sandbox
docker tag forceless/deeppresenter-sandbox deeppresenter-sandbox

Or build from source

docker build -t deeppresenter-sandbox -f deeppresenter/docker/SandBox.Dockerfile .

Start the host service

docker compose up -d

The service exposes the web UI on http://localhost:7861.

Case Study 💡

图片1

图片2

图片3

图片4

图片5

图片6

图片7

图片8

图片9

图片10

图片1

图片2

图片3

图片4

图片5

图片6

图片1

图片2

图片3

图片4

图片5

图片6

图片7

图片8

图片9

图片10

图片11

图片12

图片13

图片14

圖片15


貢獻者 🌟

Force1ess/
Force1ess
Puelloc/
Puelloc
hongyan/
hongyan
Dnoob/
Dnoob
Sadahlu/
Sadahlu
KurisuMakiseSame/
KurisuMakiseSame
Angelen/
Angelen
BrandonHu/
BrandonHu
Eliot
Eliot White
EvolvedGhost/
EvolvedGhost
ISCAS-zwl/
ISCAS-zwl
James
James Brown
JunZhang/
JunZhang
Open
Open AI Tx
Sense_wang/
Sense_wang
SuYao/
SuYao
Zakir
Zakir Jiwani
Zhenyu/
Zhenyu
lnennnn/
lnennnn

Star History Chart

引用 🙏

如果您覺得本專案對您有幫助,請使用以下內容進行引用:

@inproceedings{zheng-etal-2025-pptagent,
    title = "{PPTA}gent: Generating and Evaluating Presentations Beyond Text-to-Slides",
    author = "Zheng, Hao  and
      Guan, Xinyan  and
      Kong, Hao  and
      Zhang, Wenkai  and
      Zheng, Jia  and
      Zhou, Weixiang  and
      Lin, Hongyu  and
      Lu, Yaojie  and
      Han, Xianpei  and
      Sun, Le",
    editor = "Christodoulopoulos, Christos  and
      Chakraborty, Tanmoy  and
      Rose, Carolyn  and
      Peng, Violet",
    booktitle = "Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing",
    month = nov,
    year = "2025",
    address = "Suzhou, China",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2025.emnlp-main.728/",
    doi = "10.18653/v1/2025.emnlp-main.728",
    pages = "14413--14429",
    ISBN = "979-8-89176-332-6",
    abstract = "Automatically generating presentations from documents is a challenging task that requires accommodating content quality, visual appeal, and structural coherence. Existing methods primarily focus on improving and evaluating the content quality in isolation, overlooking visual appeal and structural coherence, which limits their practical applicability. To address these limitations, we propose PPTAgent, which comprehensively improves presentation generation through a two-stage, edit-based approach inspired by human workflows. PPTAgent first analyzes reference presentations to extract slide-level functional types and content schemas, then drafts an outline and iteratively generates editing actions based on selected reference slides to create new slides. To comprehensively evaluate the quality of generated presentations, we further introduce PPTEval, an evaluation framework that assesses presentations across three dimensions: Content, Design, and Coherence. Results demonstrate that PPTAgent significantly outperforms existing automatic presentation generation methods across all three dimensions."
}

@misc{zheng2026deeppresenterenvironmentgroundedreflectionagentic, title={DeepPresenter: Environment-Grounded Reflection for Agentic Presentation Generation}, author={Hao Zheng and Guozhao Mo and Xinru Yan and Qianhao Yuan and Wenkai Zhang and Xuanang Chen and Yaojie Lu and Hongyu Lin and Xianpei Han and Le Sun}, year={2026}, eprint={2602.22839}, archivePrefix={arXiv}, primaryClass={cs.AI}, url={https://arxiv.org/abs/2602.22839}, }

--- Tranlated By Open Ai Tx | Last indexed: 2026-04-09 ---