Meet II-Agent V1
Your Complete Open-Source Intelligent Assistant
#5 on Terminal Bench 2 • 100% Open Source • Apache-2.0 License
Core Workflows
Full-Stack Development
Complete web app scaffolding and iterative development. From initial setup to deployment, II-Agent handles the entire development lifecycle with intelligent code generation and optimization.
Slide Creation
Transform short briefs into polished presentations. Create professional slides and decks with intelligent content structuring, design suggestions, and automated formatting.
Deep/Fast Research
Comprehensive research capabilities through tight integration with II-Researcher. Conduct thorough investigations, analyze data, and generate detailed reports with our specialized research agent.
Image Generation
High-fidelity image generation with Nano Banana Pro, GPT Image 1.5, and Imagen 4.0. Visualize ideas, create assets, and iterate on designs without leaving your workflow.
Video Generation
Generate videos from text with Veo 3.1. Control start/end frames, resolution, aspect ratio, and audio integration.
Storybook Creation
Generate beautiful, fully illustrated picture books from a single prompt. Cohesive visual style, consistent characters, and structured narratives ready to export and share.
Key Features
Bring Your Own Key (BYOK)
Use your favorite LLMs with your own API keys. Full control over your AI models and costs.
Multi-Model Chat Support
Work across Gemini 3, Claude Sonnet/Opus 4.5, and GPT-5.2 in a single conversation thread with full context.
Plan Mode
Visualize project plans before code is written. Review the roadmap, modify requirements, and hit Build to execute it precisely.
Universal Connectors
Connect GitHub, Slack, Gmail, Google Workspace, Notion, Discord, Dropbox, Canva, and more for seamless context-aware assistance.
Custom Skills & MCP
Plug in any Model Context Protocol (MCP) servers your workflow requires. Connect GitHub repos to run your exact processes as one-click actions.
Live Editing
Design-first editor for refining outputs visually. Change fonts, adjust borders, tweak layout and spacing in real time.
Fast & Deep Research
Fast Research for quick answers in seconds. Deep Research for multi-step investigations with source triangulation and structured reports.
Document Skills
PDF extraction & creation, Excel formulas & charts, Word editing, PowerPoint manipulation, and form filling.
Browser Automation
Playwright-powered automation, form filling, screenshot capture, element interaction, and E2E testing.
Audio & Speech
New audio transcription feature for easier prompting on the go.
Data Analysis (REPL)
Execute code to analyze files, perform calculations, process data, and generate insights from datasets.
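As a rough illustration of the kind of snippet a REPL-style analysis session might run, here is a generic Python sketch. It is not II-Agent's own API: the sample data, column names, and the `summarize_revenue` helper are all hypothetical and chosen only for illustration.

```python
import csv
import io
import statistics

# Hypothetical sample data; in a real session this would come from an uploaded file.
SAMPLE_CSV = """region,revenue
north,120.0
south,80.0
north,100.0
south,60.0
"""


def summarize_revenue(csv_text):
    """Return the mean revenue per region from CSV text with 'region' and 'revenue' columns."""
    by_region = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        by_region.setdefault(row["region"], []).append(float(row["revenue"]))
    return {region: statistics.mean(values) for region, values in by_region.items()}


summary = summarize_revenue(SAMPLE_CSV)
print(summary)  # → {'north': 110.0, 'south': 70.0}
```

The sketch uses only the Python standard library, so it makes no assumptions about which packages the execution environment provides.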
20+ Style Presets
Cyberpunk, Low-Poly, Clay, Pixel Art, and more. Remove backgrounds, restore photos, enhance quality, and replace objects.
