Overview

EnConvo ships with 80+ built-in extensions organized into functional categories. Each extension provides specialized capabilities that can be used standalone, combined in workflows, or called by AI agents as tools.
All built-in extensions are open source and available on GitHub. You can customize or fork any extension to fit your needs.

AI & Chat

Core AI interaction extensions that power EnConvo’s conversational interface.
Extension | Description | Key Features
AI (chat_with_ai) | Multi-modal AI chat agent with tool use and knowledge base retrieval | Agent mode with planning, multi-LLM support, dynamic context awareness, customizable system prompts, post-actions
Agent (agent) | AI agent management with the Mavis personal assistant | Create, list, and message agents; configurable prompts, tools, and session handling; built-in Mavis assistant
Custom Bot (custom_bot) | Registry for user-created custom AI bots and agents | Dynamic command creation, personalized bot configurations
Writing Toolkit (writing_package) | AI-powered writing commands | Fix spelling/grammar, improve writing, summarize, explain, rewrite, tone adjustment, proofreading, dictation polishing
Web Search (internet_browsing) | Real-time web search with AI synthesis | Configurable search providers, LLM-synthesized cited answers

The AI extension is the primary chat interface. Use @ in SmartBar to select tools, and # to add context sources like screenshots, browser tabs, or clipboard content.

Language Models

Extensions for managing and connecting to AI model providers.
Extension | Description | Key Features
AI Model Providers (llm) | Unified LLM provider integrations for 20+ services | OpenAI, Anthropic, Google Gemini, Ollama, Groq, Mistral, DeepSeek, X.AI, Azure OpenAI, LM Studio, and more; streaming chat completion; model management APIs
Embedding Providers (embeddings_providers) | Text embedding for semantic search and knowledge base indexing | OpenAI, Voyage AI, Ollama, SiliconFlow, Enconvo Cloud; configurable concurrency and model selection
Reranker Providers (reranker_providers) | Document reranking for improved search relevance | Voyage AI, SiliconFlow, Enconvo Cloud reranking models
MLX (mlx_manage) | Apple Silicon local AI model management | Local LLM chat, vision-language models, TTS, STT via MLX; Hugging Face model downloads

Use the Enconvo Cloud Plan provider for zero-configuration access to multiple models. For privacy-sensitive work, configure Ollama or MLX for fully local inference.

Voice & Audio

Text-to-speech, speech recognition, transcription, and audio utilities.
Extension | Description | Key Features
TTS (tts) | Text-to-speech commands | Read aloud, text-to-audio file conversion, sound effects, SRT-to-audio, Gemini single/multi-speaker TTS
TTS Providers (tts_providers) | 11+ TTS provider backends | OpenAI, Microsoft Azure, ElevenLabs, MiniMax, Speechify, Google Cloud, xAI, macOS system, Edge TTS; voice listing and speed configuration
Dictation Model Providers (speech_recognize_providers) | 15+ real-time speech-to-text providers | Groq Whisper, OpenAI, Deepgram, AssemblyAI, ElevenLabs, Google Gemini, Microsoft Azure, NVIDIA Parakeet, Qwen ASR, local Whisper
Transcription Providers (transcription_providers) | File-based transcription for long-form audio | Same provider backends as dictation; optimized for longer recordings
Audio Utils (audio_utils) | Audio file compression | ffmpeg-based compression with configurable quality; batch processing of MP3, WAV, M4A, AAC, OGG, FLAC
My Recordings (my_recordings) | Recording management | Organize and access voice recordings

For the best free dictation experience, use Microsoft (Enconvo Cloud Plan) Free, which requires no API key.

Content & Knowledge

Extensions for managing knowledge, searching, reading web content, and crawling websites.
Extension | Description | Key Features
Knowledge Base (knowledge_base) | Document management and semantic search | Create knowledge bases, ingest files/text/webpages/audio/video, vector similarity search via LanceDB, personal memory system with automatic knowledge extraction
Search (search) | Unified search engine | Search commands, files, applications, skills, and conversations; fuzzy matching, context-aware recommendations, real-time index updates
Web Fetch (link_reader) | Fetch and extract content from URLs | Convert webpages to markdown/text, summarize pages using LLM
Link Reader Providers (link_reader_providers) | Web scraping backends | Local Fetch, Crawl4AI, Jina AI, Firecrawl, Enconvo Cloud
Website Crawler (website_crawler) | Multi-page website crawling | Configurable depth and page limits, output as text or saved to file
Website Crawler Providers (website_crawler_providers) | Crawler backends | Crawl4AI (local, free), Firecrawl (cloud API)
Web Search Providers (browsing_providers) | Web search backends for AI agents | Tavily, Bing, Google Custom Search, Exa, Gemini grounding
Document Loader Providers (document_loader_providers) | Document text extraction | Mistral OCR for structured text and markdown from documents and images

Combine Knowledge Base with Embedding Providers and Reranker Providers for a powerful RAG (Retrieval-Augmented Generation) pipeline. Ingest your documents, then ask questions in chat with knowledge base context enabled.
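
As a rough illustration of the retrieve-then-rerank flow that a RAG pipeline follows, the TypeScript sketch below ranks stored chunks by cosine similarity to a query vector and then reorders the top candidates with a second scoring function. It is a toy, in-memory stand-in: the names Chunk, retrieve, and rerank, the hand-written vectors, and the keyword scorer are all assumptions made for illustration, not EnConvo, LanceDB, or provider APIs.

```typescript
// Toy sketch of a retrieve-then-rerank pipeline. All names are hypothetical;
// real embeddings and reranking scores would come from provider extensions.
interface Chunk {
  text: string;
  vector: number[]; // embedding, hand-written here for illustration
}

// Cosine similarity between two vectors (0 when either has zero length).
function cosine(a: number[], b: number[]): number {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  a.forEach((x, i) => {
    const y = b[i] ?? 0;
    dot += x * y;
    normA += x * x;
  });
  b.forEach((y) => {
    normB += y * y;
  });
  return normA === 0 || normB === 0 ? 0 : dot / Math.sqrt(normA * normB);
}

// Stage 1: keep the k chunks whose vectors are closest to the query vector.
function retrieve(query: number[], chunks: Chunk[], k: number): Chunk[] {
  return chunks
    .slice()
    .sort((x, y) => cosine(query, y.vector) - cosine(query, x.vector))
    .slice(0, k);
}

// Stage 2: reorder the candidates with a second scorer and keep the best n.
function rerank(candidates: Chunk[], score: (text: string) => number, n: number): Chunk[] {
  return candidates
    .slice()
    .sort((x, y) => score(y.text) - score(x.text))
    .slice(0, n);
}

const chunks: Chunk[] = [
  { text: "Invoices are due within 30 days.", vector: [1, 0] },
  { text: "The office is closed on public holidays.", vector: [0, 1] },
  { text: "Refunds are issued within 14 days of a returned invoice.", vector: [0.9, 0.1] },
];

const queryVector = [1, 0]; // pretend embedding of a question about refunds
const candidates = retrieve(queryVector, chunks, 2);
const keywordScore = (text: string): number =>
  text.toLowerCase().includes("refund") ? 1 : 0;
const context = rerank(candidates, keywordScore, 1);
```

The two stages are kept separate on purpose: vector search is cheap and narrows the pool, while the second scorer can be slower and more precise because it only sees a handful of candidates.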

Image & Video

Image generation, video creation, compression, OCR, and media utilities.
Extension | Description | Key Features
Image Create (image_generation) | Generate and edit images from text prompts | Background removal, multiple output formats, batch generation
Image Generate Providers (image_generation_providers) | 7+ image generation backends | OpenAI DALL-E, Google Gemini, Fal.ai, Azure, Together AI, Straico; text-to-image and image editing
Video Generation (video_generation) | AI video creation | Text-to-video, image-to-video using Sora, Kling, Hailuo, Veo, Wan
Video Generate Providers (video_generation_providers) | Video generation backends | Enconvo Cloud (fal.ai), OpenAI Sora, Fal.ai direct
Image Compress (compress_image) | Image compression | Caesium engine, adjustable quality, batch compression, overwrite or output to folder
Video Utils (video_utils) | Video utilities | Compress video via ffmpeg, extract audio to MP3, download videos from YouTube/TikTok/Instagram/Reddit/Twitter via yt-dlp
OCR (ocr_action) | Text extraction from images | Screenshot OCR, silent clipboard copy, interactive SmartBar display
OCR Providers (ocr_providers) | OCR backends | Apple native Vision OCR, AI model-based OCR via LLM vision, Mistral OCR

Productivity

Workflow automation, scheduling, prompt management, and utility extensions.
Extension | Description | Key Features
Workflow (workflow) | Visual workflow automation engine | Node-based execution, manual/scheduled/file-change triggers, conditional branching, loop iteration, AI parameter generation, Jinja2 templating, run logging
Cron (cron) | Scheduled task management | Create, list, update, delete, toggle cron jobs; execute commands on recurring schedules; execution logs
Prompt (prompt) | Reusable prompt template system | Create, manage, execute prompt templates with variable substitution, diff highlighting, post-actions
Variables (variables) | System variable providers | Active app info, screenshots, clipboard, selected text, Finder paths, browser tabs, date/time for use in workflows and templates
Quick Links (quick_links) | Parameterized URL launcher | Open URLs in configurable browser; presets for Google, YouTube, Twitter/X search
Skills Manager (skills_manager) | Skill installation and updates | Install from Enconvo Store, GitHub, or ZIP; version checking; auto-install built-in skills
Enconvo (enconvo) | Core platform APIs | 30+ internal endpoints for extension management, command configuration, credential loading, cache operations, user profile

Communication

Email, messaging, and channel integrations.
Extension | Description | Key Features
Gmail (gmail) | Gmail API integration | Send, draft, read, search, delete emails; manage labels; batch operations
IM Channels (im_channels) | Messaging platform integration | Slack, Telegram, Discord, Feishu/Lark; send and read messages; channel listener service
Message Manager (message_manager) | Chat message persistence | SQLite + Drizzle ORM; store, query, paginate conversation messages across agents and sessions

Set up IM Channels to receive AI agent notifications directly in Slack or Telegram. Combine with Cron to schedule automated reports sent to your team channels.

Apple Integration

Native macOS application integrations using AppleScript and system APIs.
Extension | Description | Key Features
Apple Mail (apple_mail) | Apple Mail integration | Compose, read, manage emails; archive, trash, delete, move between mailboxes; toggle read status
Apple Reminders (apple_reminders) | Apple Reminders integration | Create, update, delete, list reminders; due dates, priorities, recurrence rules, location-based triggers
Shortcuts (apple_shortcuts) | Apple Shortcuts integration | Dynamically populated commands based on shortcuts available on the system
Apple Calendar (calender) | Apple Calendar integration | Full CRUD for events and calendars; recurrence rules, alarms, all-day events

These extensions work through native macOS APIs, so they require appropriate permissions. Grant access in System Settings when prompted.

System & Control

System utilities, file operations, window management, and computer control.
Extension | Description | Key Features
Mac Applications (application) | AI-powered app assistants | Context-aware prompts for 12+ apps: Finder, Calendar, Mail, Notes, Obsidian, Microsoft Office
File System (file_system) | File system tools for AI agents | Read, write, edit files; search with ripgrep (grep); find files by glob patterns
Window (window) | Window management | PopBar configuration, Mini SmartBar setup, display mode options
System Tools (system) | macOS system utilities | Toggle dark mode, hidden files, mute, full screen; web search; URL opening
Computer Control (computer_control) | Accessibility API control | Navigate UI trees, click elements, type text, screenshots, scroll; works with any accessible macOS app

Development & Automation

Code execution, browser automation, and protocol integrations.
Extension | Description | Key Features
Code Runner (code_runner) | Shell command execution | Bash/zsh commands with auto Python venv activation, configurable working directory, timeout handling
MCP (mcp) | Model Context Protocol client | Connection pooling, stdio/HTTP/SSE transports, OAuth authentication for MCP server tools
Browser Control (browser_control) | Browser automation | Navigate, click, type, screenshots, read page content, manage tabs; works with Chrome, Edge, Brave, Arc, Vivaldi, Opera, Firefox via Companion extension

Browser Control uses your existing browser sessions: all your login cookies and extensions work automatically. Install the Enconvo Companion browser extension for the best experience.

Translation

Multi-language translation with multiple provider backends.
Extension | Description | Key Features
Translate (translate) | Text and word translation | AI-powered LLM, DeepL, Google Translate, Youdao, Caiyun; auto language detection, dictionary mode, screenshot OCR translation, TTS playback
Translate Providers (translate_providers) | Translation backends | LLM-based, DeepL, Google Translate, Youdao, Caiyun, Enconvo Cloud

Other Utilities

Additional tools for export, credentials, YouTube, and more.
Extension | Description | Key Features
Exporter (exporter) | Export conversations and documents | Save as Markdown, PDF, or audio (MP3) via TTS
Credentials (credentials) | Centralized API key management | 40+ AI service providers; key validation; secure storage in macOS Keychain
YouTube (youtube) | YouTube tools | Extract video transcripts with timestamps and language selection; download videos/audio via yt-dlp
EnConvo WebApp (enconvo_webapp) | Management UI | Extension store, agent creation, workflow builder (React Flow), knowledge base management, MCP server installation

Extension Architecture

How Extensions Work

Every extension follows a consistent pattern:
  1. Package Definition: package.json declares commands, preferences, and metadata
  2. Command Handlers: TypeScript files in src/ export entry functions
  3. API Routes: Files in src/api/ are auto-discovered as HTTP-like endpoints
  4. Provider Pattern: Extensions with commandType: "provider" register capabilities that consumer extensions select
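
To make the pattern above more concrete, here is a minimal TypeScript sketch of how a manifest might be modeled and how provider commands could be picked out of it. Only commands, preferences, and commandType: "provider" come from the description above; the remaining field names, the providerCommandKeys helper, and the "settings" command are hypothetical, so check them against a real extension's package.json before relying on them.

```typescript
// Assumed manifest shape for illustration. Only `commands`, `preferences`,
// and `commandType: "provider"` are named in the text; the rest is hypothetical.
interface CommandDefinition {
  name: string;
  title: string;
  commandType?: string; // "provider" marks a command that registers a backend
}

interface PreferenceDefinition {
  name: string;
  type: string;
}

interface ExtensionManifest {
  name: string;
  commands: CommandDefinition[];
  preferences?: PreferenceDefinition[];
}

// Build "extensionName|commandName" keys for every provider command.
function providerCommandKeys(manifest: ExtensionManifest): string[] {
  return manifest.commands
    .filter((command) => command.commandType === "provider")
    .map((command) => `${manifest.name}|${command.name}`);
}

const exampleManifest: ExtensionManifest = {
  name: "translate_providers",
  commands: [
    { name: "deepl_translate_provider", title: "DeepL", commandType: "provider" },
    { name: "settings", title: "Settings" }, // hypothetical non-provider command
  ],
};

const providerKeys = providerCommandKeys(exampleManifest);
```

Under these assumptions, providerKeys holds a single entry, translate_providers|deepl_translate_provider, because the second command does not declare commandType: "provider".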

Command Key Format

Every command is addressed by a key in the format extensionName|commandName. For example:
  • chat_with_ai|chat — the main chat command
  • llm|open_ai — OpenAI LLM provider
  • gmail|send_email — Gmail send tool
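
The key format above is simple enough to handle with a pair of small helpers. The following TypeScript sketch splits a key into its two parts and builds one back; parseCommandKey and formatCommandKey are hypothetical names, not functions provided by EnConvo, and the strictness (exactly one separator, no empty parts) is an assumption.

```typescript
// Hypothetical helpers for the "extensionName|commandName" key format.
interface ParsedCommandKey {
  extensionName: string;
  commandName: string;
}

// Split a key into its parts; reject keys without exactly one "|" separator
// or with an empty part (assumed validation rules).
function parseCommandKey(key: string): ParsedCommandKey {
  const [extensionName, commandName, ...rest] = key.split("|");
  if (!extensionName || !commandName || rest.length > 0) {
    throw new Error(`Invalid command key: ${key}`);
  }
  return { extensionName, commandName };
}

// Join an extension name and a command name into a key.
function formatCommandKey(extensionName: string, commandName: string): string {
  return `${extensionName}|${commandName}`;
}

const parsedKey = parseCommandKey("gmail|send_email");
const rebuiltKey = formatCommandKey(parsedKey.extensionName, parsedKey.commandName);
```

Here parsedKey.extensionName is "gmail", parsedKey.commandName is "send_email", and rebuiltKey round-trips to the original string.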

Provider Selection

Many extensions use a provider pattern where you choose which backend to use:
  • chat_with_ai (consumer) → llm|open_ai (provider)
  • tts (consumer) → tts_providers|openai_tts_provider (provider)
  • translate (consumer) → translate_providers|deepl_translate_provider (provider)

Configure default providers in each extension’s settings, or let AI agents choose automatically.
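
One way to picture provider selection is a lookup from consumer extension to a default provider key, with an optional per-call override. The TypeScript sketch below uses the three consumer-to-provider pairs listed above as defaults; the defaultProviders table, the resolveProvider function, and the llm|example_provider key are hypothetical and do not reflect how EnConvo stores or resolves settings internally.

```typescript
// Hypothetical default-provider table keyed by consumer extension name.
// Values are command keys in "extensionName|commandName" form.
const defaultProviders: Record<string, string> = {
  chat_with_ai: "llm|open_ai",
  tts: "tts_providers|openai_tts_provider",
  translate: "translate_providers|deepl_translate_provider",
};

// Return the override when one is given, otherwise the configured default.
function resolveProvider(consumer: string, override?: string): string {
  if (override !== undefined) {
    return override;
  }
  const provider = defaultProviders[consumer];
  if (typeof provider !== "string") {
    throw new Error(`No provider configured for ${consumer}`);
  }
  return provider;
}

const ttsProvider = resolveProvider("tts");
// "llm|example_provider" is a made-up key used only to show the override path.
const chatProvider = resolveProvider("chat_with_ai", "llm|example_provider");
```

With these inputs, ttsProvider falls back to the default tts_providers|openai_tts_provider, while chatProvider takes the explicit override.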

Installing Additional Extensions

Beyond built-in extensions, you can install community extensions:
  1. Open Extension Store: Settings → Extensions → Browse, or use the / command in SmartBar
  2. Search or browse: Find extensions by name or category
  3. Install: Click Install; the extension is downloaded and available immediately
  4. Configure: Some extensions require API keys or preferences. Configure in Settings → Extensions → [Extension Name]

Next Steps

  • Develop Extensions: Create your own custom extensions
  • Extension API Reference: Complete API reference for extension development
  • Workflows: Combine extensions into automated workflows
  • Skills: Extend AI capabilities with task-specific skills