🌐 言語

お問い合わせ 📫

このリポジトリの主な貢献者は2026年卒業予定の修士課程の学生です。コラボレーションや機会について、どうぞお気軽にご連絡ください。

本仓库的主要贡献者是一名 2026 届硕士毕业生，欢迎联系合作或交流机会。

ニュース 📅

[2026/03]: CLIをサポートし、ファインチューニング済みモデルをHugging Face 🤗で公開しました！
[2026/01]: フリーフォームおよびテンプレート生成がPPTXエクスポートとオフラインモードに対応しました！コンテキスト管理も追加され、コンテキストオーバーフローを防止します。
[2025/12]: 🔥 V2をリリースし大幅な改善を実現 - ディープリサーチ統合、フリーフォームビジュアルデザイン、自律的アセット生成、テキストから画像生成、サンドボックス＆20以上のツールを持つエージェント環境を搭載。
[2025/09]: 🛠️ MCPサーバーサポートを追加 - 設定詳細はMCP Serverをご覧ください
[2025/09]: 🚀 v2を大幅な改善とともにリリース - 詳細はリリースノートをご覧ください
[2025/08]: 🎉 論文がEMNLP 2025に採択されました！
[2025/05]: ✨ v1をリリースし、コア機能と🌟ブレークスルー：GitHubで1,000スターを達成！詳細はリリースノートをご覧ください
[2025/01]: 🔓 コードベースをオープンソース化し、実験的なコードはexperiment releaseにアーカイブ

使い方 📖

[!IMPORTANT]

Windowsはサポートされていません。Windowsをお使いの場合はWSLをご利用ください。

まずCLIと最小タスクから始めて、依存関係や環境構成が正しいことを確認することを強く推奨します。

設定

CLIを使用する場合は、pptagent onboardコマンドでこれらの設定を対話的に作成・更新できます。Docker Composeやソースからビルドする場合は、手動で準備する必要があります。

cp deeppresenter/config.yaml.example deeppresenter/config.yaml
cp deeppresenter/mcp.json.example deeppresenter/mcp.json

#### 品質向上のためのオプションサービス

以下のサービスは生成品質を顕著に向上させます。特にリサーチの深度、PDF解析、ビジュアル資産の作成で有効です。

Tavily：ウェブ検索の品質を向上させます。tavily.comでAPIキーを申請し、deeppresenter/mcp.json の TAVILY_API_KEY に設定してください。
MinerU：PDF解析の品質を向上させます。mineru.netでAPIキーを申請し、deeppresenter/mcp.json の MINERU_API_KEY に設定するか、MinerU をローカルにデプロイして MINERU_API_URL を設定してください。
テキストから画像モデル：画像生成の品質を向上させます。deeppresenter/config.yaml の t2i_model を設定してください。

完全にオフラインでの利用を希望する場合は、MinerU をローカルにデプロイし、deeppresenter/config.yaml の offline_mode: true を設定してウェブ検索などネットワーク依存のツールの読み込みを回避してください。

より多くの設定可能な変数は constants.py に記載されています。

1. 個人利用 / OpenClaw 連携：CLI

[!NOTE]

macOS では、CLI 実行時に Homebrew、Node.js、Docker、poppler、Playwright、llama.cpp などのローカル依存パッケージが自動でインストールされる場合があります。

Linux では、環境を自身で準備する必要があります。

ローカルで最速のセットアップを希望する場合や、CLIを通じてDeepPresenterをOpenClawに接続したい場合はこのモードを使用してください。

# Install uv
curl -LsSf https://astral.sh/uv/install.sh | sh
First-time interactive setup
uvx pptagent onboard
Generate a presentation
uvx pptagent generate "Single Page with Title: Hello World" -o hello.pptx
Generate with attachments
uvx pptagent generate "Q4 Report" \
  -f data.xlsx \
  -f charts.pdf \
  -p "10-12" \
  -o report.pptx

| コマンド | 説明 | | ---------------------- | -------------------------------------------------- | | pptagent onboard | 対話型設定ウィザード | | pptagent generate | プレゼンテーションの生成 | | pptagent config | 現在の設定の表示 | | pptagent reset | 設定のリセット | | pptagent serve | CLIで使用されるローカル推論サービスの起動 |

2. ミニマルセットアップ / 開発: ソースからのビルド

開発中に最小限の抽象化レイヤーと依存関係の完全な管理を希望する場合、このモードを使用してください。

uv pip install -e .
playwright install-deps
playwright install chromium
npm install --prefix deeppresenter/html2pptx
modelscope download forceless/fasttext-language-id
docker pull forceless/deeppresenter-sandbox
docker pull forceless/deeppresenter-host
docker tag forceless/deeppresenter-sandbox deeppresenter-sandbox
or build from dockerfile
docker build -t deeppresenter-sandbox -f deeppresenter/docker/SandBox.Dockerfile .

アプリを起動します：

python webui.py

3. サーバー展開：Docker Compose

このモードは、明確な依存関係を持つ安定したサーバー環境のために使用します。

# Pull the public images to avoid build from source
docker pull forceless/deeppresenter-sandbox
docker tag forceless/deeppresenter-sandbox deeppresenter-sandbox
Or build from source
docker build -t deeppresenter-sandbox -f deeppresenter/docker/SandBox.Dockerfile .
Start the host service
docker compose up -d

The service exposes the web UI on http://localhost:7861.

Case Study 💡

#### Prompt: Please present the given document to me.

#### Prompt: 请介绍小米 SU7 的外观和价格

#### Prompt: 请制作一份高中课堂展示课件，主题为“解码立法过程：理解其对国际关系的影响”

貢献者 🌟

_Force1ess	_Puelloc	_hongyan	_Dnoob	_Sadahlu
_{KurisuMakiseSame}	_Angelen	_BrandonHu	_{Eliot White}	_EvolvedGhost
_ISCAS-zwl	_{James Brown}	_JunZhang	_{Open AI Tx}	_{Sense_wang}
_SuYao	_{Zakir Jiwani}	_Zhenyu	_lnennnn

引用 🙏

このプロジェクトが役立つ場合は、以下の方法で引用してください。

@inproceedings{zheng-etal-2025-pptagent, title = "{PPTA}gent: Generating and Evaluating Presentations Beyond Text-to-Slides", author = "Zheng, Hao and Guan, Xinyan and Kong, Hao and Zhang, Wenkai and Zheng, Jia and Zhou, Weixiang and Lin, Hongyu and Lu, Yaojie and Han, Xianpei and Sun, Le", editor = "Christodoulopoulos, Christos and Chakraborty, Tanmoy and Rose, Carolyn and Peng, Violet", booktitle = "Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing", month = nov, year = "2025", address = "Suzhou, China", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/2025.emnlp-main.728/", doi = "10.18653/v1/2025.emnlp-main.728", pages = "14413--14429", ISBN = "979-8-89176-332-6", abstract = "Automatically generating presentations from documents is a challenging task that requires accommodating content quality, visual appeal, and structural coherence. Existing methods primarily focus on improving and evaluating the content quality in isolation, overlooking visual appeal and structural coherence, which limits their practical applicability. To address these limitations, we propose PPTAgent, which comprehensively improves presentation generation through a two-stage, edit-based approach inspired by human workflows. PPTAgent first analyzes reference presentations to extract slide-level functional types and content schemas, then drafts an outline and iteratively generates editing actions based on selected reference slides to create new slides. To comprehensively evaluate the quality of generated presentations, we further introduce PPTEval, an evaluation framework that assesses presentations across three dimensions: Content, Design, and Coherence. Results demonstrate that PPTAgent significantly outperforms existing automatic presentation generation methods across all three dimensions." }

@misc{zheng2026deeppresenterenvironmentgroundedreflectionagentic, title={DeepPresenter: Environment-Grounded Reflection for Agentic Presentation Generation}, author={Hao Zheng and Guozhao Mo and Xinru Yan and Qianhao Yuan and Wenkai Zhang and Xuanang Chen and Yaojie Lu and Hongyu Lin and Xianpei Han and Le Sun}, year={2026}, eprint={2602.22839}, archivePrefix={arXiv}, primaryClass={cs.AI}, url={https://arxiv.org/abs/2602.22839}, }

--- Tranlated By Open Ai Tx | Last indexed: 2026-04-09 ---