🌐 Taal

Contact 📫

De belangrijkste bijdrager aan deze repository is een masterstudent die afstudeert in 2026, neem gerust contact op voor samenwerking of kansen.

本仓库的主要贡献者是一名 2026 届硕士毕业生，欢迎联系合作或交流机会。

Nieuws 📅

[2026/03]: We ondersteunen nu CLI en hebben onze getrainde modellen uitgebracht op Hugging Face 🤗!
[2026/01]: We ondersteunen vrije vorm en sjabloongeneratie, PPTX-export en nu ook offline modus! Contextbeheer toegevoegd om context-overflow te voorkomen.
[2025/12]: 🔥 V2 uitgebracht met grote verbeteringen - Diepe Onderzoeksintegratie, Vrije Vorm Visueel Ontwerp, Autonome Asset Creatie, Tekst-naar-beeld Generatie en Agent-omgeving met sandbox & 20+ tools.
[2025/09]: 🛠️ MCP-serverondersteuning toegevoegd - zie MCP Server voor configuratiedetails
[2025/09]: 🚀 V2 uitgebracht met grote verbeteringen - zie release notes voor details
[2025/08]: 🎉 Artikel geaccepteerd voor EMNLP 2025!
[2025/05]: ✨ V1 uitgebracht met kernfunctionaliteit en 🌟 doorbraak: 1.000 sterren op GitHub bereikt! - zie release notes voor details
[2025/01]: 🔓 Codebase open-source gemaakt, met experimentele code gearchiveerd op experiment release

Gebruik 📖

[!IMPORTANT]

Windows wordt niet ondersteund. Gebruik WSL als u op Windows werkt.

We raden sterk aan te beginnen met de CLI en een minimale taak om te controleren of afhankelijkheden en de omgeving correct zijn geconfigureerd.

Configuratie

Als u de CLI gebruikt, kan pptagent onboard helpen deze configuraties interactief aan te maken en bij te werken. Als u Docker Compose gebruikt of vanaf de bron bouwt, moet u deze handmatig voorbereiden:

cp deeppresenter/config.yaml.example deeppresenter/config.yaml
cp deeppresenter/mcp.json.example deeppresenter/mcp.json

#### Optionele Diensten Die De Kwaliteit Verbeteren

De volgende diensten kunnen de generatiekwaliteit merkbaar verbeteren, vooral voor onderzoeksdiepte, PDF-parsing en het maken van visuele middelen:

Tavily: verbetert de kwaliteit van webzoekopdrachten. Vraag een API-sleutel aan op tavily.com, stel vervolgens TAVILY_API_KEY in in deeppresenter/mcp.json.
MinerU: verbetert de PDF-parseringskwaliteit. Vraag een API-sleutel aan op mineru.net en stel MINERU_API_KEY in in deeppresenter/mcp.json, of implementeer MinerU lokaal en stel in plaats daarvan MINERU_API_URL in.
Text-to-image model: verbetert de kwaliteit van afbeeldingsgeneratie. Configureer t2i_model in deeppresenter/config.yaml.

Als je een volledig offline installatie wilt, implementeer MinerU dan lokaal en stel offline_mode: true in in deeppresenter/config.yaml om te voorkomen dat netwerkafhankelijke tools zoals websearch worden geladen.

Meer configureerbare variabelen zijn te vinden in constants.py.

1. Persoonlijk Gebruik / OpenClaw-integratie: CLI

[!OPMERKING]

Op macOS kan de CLI automatisch verschillende lokale afhankelijkheden installeren, waaronder Homebrew, Node.js, Docker, poppler, Playwright en llama.cpp.

Op Linux dien je de omgeving zelf voor te bereiden.

Gebruik deze modus als je de snelste lokale installatie wilt of DeepPresenter via de CLI in OpenClaw wilt integreren.

# Install uv
curl -LsSf https://astral.sh/uv/install.sh | sh
First-time interactive setup
uvx pptagent onboard
Generate a presentation
uvx pptagent generate "Single Page with Title: Hello World" -o hello.pptx
Generate with attachments
uvx pptagent generate "Q4 Report" \
  -f data.xlsx \
  -f charts.pdf \
  -p "10-12" \
  -o report.pptx

| Commando | Beschrijving | | ----------------------- | ---------------------------------------------------- | | pptagent onboard | Interactieve configuratiewizard | | pptagent generate | Genereer presentaties | | pptagent config | Bekijk huidige configuratie | | pptagent reset | Reset configuratie | | pptagent serve | Start de lokale inference-service gebruikt door de CLI |

2. Minimale Setup / Ontwikkeling: Bouwen vanaf broncode

Gebruik deze modus als u de kleinste abstractielaag wilt en volledige controle over afhankelijkheden tijdens ontwikkeling.

uv pip install -e .
playwright install-deps
playwright install chromium
npm install --prefix deeppresenter/html2pptx
modelscope download forceless/fasttext-language-id
docker pull forceless/deeppresenter-sandbox
docker pull forceless/deeppresenter-host
docker tag forceless/deeppresenter-sandbox deeppresenter-sandbox
or build from dockerfile
docker build -t deeppresenter-sandbox -f deeppresenter/docker/SandBox.Dockerfile .

Start de app:

python webui.py

3. Serverimplementatie: Docker Compose

Gebruik deze modus voor een stabiele serveromgeving met expliciete afhankelijkheden.

# Pull the public images to avoid build from source
docker pull forceless/deeppresenter-sandbox
docker tag forceless/deeppresenter-sandbox deeppresenter-sandbox
Or build from source
docker build -t deeppresenter-sandbox -f deeppresenter/docker/SandBox.Dockerfile .
Start the host service
docker compose up -d

The service exposes the web UI on http://localhost:7861.

Case Study 💡

#### Prompt: Please present the given document to me.

#### Prompt: 请介绍小米 SU7 的外观和价格

#### Prompt: 请制作一份高中课堂展示课件，主题为“解码立法过程：理解其对国际关系的影响”

Afbeelding15

Bijdragers 🌟

_Force1ess	_Puelloc	_hongyan	_Dnoob	_Sadahlu
_{KurisuMakiseSame}	_Angelen	_BrandonHu	_{Eliot White}	_EvolvedGhost
_ISCAS-zwl	_{James Brown}	_JunZhang	_{Open AI Tx}	_{Sense_wang}
_SuYao	_{Zakir Jiwani}	_Zhenyu	_lnennnn

Referentie 🙏

Als je dit project nuttig vindt, gebruik dan het volgende om het te citeren:

@inproceedings{zheng-etal-2025-pptagent, title = "{PPTA}gent: Generating and Evaluating Presentations Beyond Text-to-Slides", author = "Zheng, Hao and Guan, Xinyan and Kong, Hao and Zhang, Wenkai and Zheng, Jia and Zhou, Weixiang and Lin, Hongyu and Lu, Yaojie and Han, Xianpei and Sun, Le", editor = "Christodoulopoulos, Christos and Chakraborty, Tanmoy and Rose, Carolyn and Peng, Violet", booktitle = "Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing", month = nov, year = "2025", address = "Suzhou, China", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/2025.emnlp-main.728/", doi = "10.18653/v1/2025.emnlp-main.728", pages = "14413--14429", ISBN = "979-8-89176-332-6", abstract = "Automatically generating presentations from documents is a challenging task that requires accommodating content quality, visual appeal, and structural coherence. Existing methods primarily focus on improving and evaluating the content quality in isolation, overlooking visual appeal and structural coherence, which limits their practical applicability. To address these limitations, we propose PPTAgent, which comprehensively improves presentation generation through a two-stage, edit-based approach inspired by human workflows. PPTAgent first analyzes reference presentations to extract slide-level functional types and content schemas, then drafts an outline and iteratively generates editing actions based on selected reference slides to create new slides. To comprehensively evaluate the quality of generated presentations, we further introduce PPTEval, an evaluation framework that assesses presentations across three dimensions: Content, Design, and Coherence. Results demonstrate that PPTAgent significantly outperforms existing automatic presentation generation methods across all three dimensions." }

@misc{zheng2026deeppresenterenvironmentgroundedreflectionagentic, title={DeepPresenter: Environment-Grounded Reflection for Agentic Presentation Generation}, author={Hao Zheng and Guozhao Mo and Xinru Yan and Qianhao Yuan and Wenkai Zhang and Xuanang Chen and Yaojie Lu and Hongyu Lin and Xianpei Han and Le Sun}, year={2026}, eprint={2602.22839}, archivePrefix={arXiv}, primaryClass={cs.AI}, url={https://arxiv.org/abs/2602.22839}, }

--- Tranlated By Open Ai Tx | Last indexed: 2026-04-09 ---