# 🤖 `cagent` 🤖

Agent Builder and Runtime by Docker Engineering
> A powerful, easy-to-use, customizable multi-agent runtime that orchestrates AI
> agents with specialized capabilities and tools, and the interactions between
> agents.

## ✨ What is `cagent`? ✨
`cagent` lets you create and run intelligent AI agents, where each agent has
specialized knowledge, tools and capabilities.
Think of it as allowing you to quickly build, share and run a team of virtual
experts that collaborate to solve complex problems for you.
And it's dead easy to use!
⚠️ Note: `cagent` is in active development; **breaking changes are to be
expected** ⚠️
### Your First Agent
Creating agents with `cagent` is straightforward: they are described in a short
`.yaml` file, like this example, [basic_agent.yaml](/examples/basic_agent.yaml):
```yaml
agents:
  root:
    model: openai/gpt-5-mini
    description: A helpful AI assistant
    instruction: |
      You are a knowledgeable assistant that helps users with various tasks.
      Be helpful, accurate, and concise in your responses.
```
Run it in a terminal with `cagent run basic_agent.yaml`.
Many more examples can be found [here](/examples/README.md)!
### Improving an agent with MCP tools
`cagent` supports MCP servers, enabling agents to use a wide variety of external
tools and services.
It supports three transport types: `stdio`, `http` and `sse`.
Giving an agent access to tools via MCP is a quick way to greatly improve its
capabilities, the quality of its results and its general usefulness.
Get started quickly with the [Docker MCP
Toolkit](https://docs.docker.com/ai/mcp-catalog-and-toolkit/toolkit/) and
[catalog](https://docs.docker.com/ai/mcp-catalog-and-toolkit/catalog/).
Here, we're giving the same basic agent from the example above access to a
**containerized** `duckduckgo` MCP server and its tools by using Docker's MCP
Gateway:
```yaml
agents:
  root:
    model: openai/gpt-5-mini
    description: A helpful AI assistant
    instruction: |
      You are a knowledgeable assistant that helps users with various tasks.
      Be helpful, accurate, and concise in your responses.
    toolsets:
      - type: mcp
        ref: docker:duckduckgo # stdio transport
```
When using a containerized server via the Docker MCP gateway, you can configure
any required settings/secrets/authentication using the [Docker MCP
Toolkit](https://docs.docker.com/ai/mcp-catalog-and-toolkit/toolkit/#example-use-the-github-official-mcp-server)
in Docker Desktop.
Aside from the containerized MCP servers the Docker MCP Gateway provides, any
standard MCP server can be used with cagent!
Here's an example similar to the above but adding `read_file` and `write_file`
tools from the `rust-mcp-filesystem` MCP server:
```yaml
agents:
  root:
    model: openai/gpt-5-mini
    description: A helpful AI assistant
    instruction: |
      You are a knowledgeable assistant that helps users with various tasks.
      Be helpful, accurate, and concise in your responses. Write your search results to disk.
    toolsets:
      - type: mcp
        ref: docker:duckduckgo
      - type: mcp
        command: rust-mcp-filesystem # installed with `cargo install rust-mcp-filesystem`
        args: ["--allow-write", "."]
        tools: ["read_file", "write_file"] # Optional: specific tools only
        env:
          - "RUST_LOG=debug"
```
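Remote MCP servers (using the `http` or `sse` transports mentioned above) can also be configured as toolsets. The sketch below is illustrative only: the `remote`, `url` and `transport_type` key names and the server URL are assumptions, not the confirmed schema:
```yaml
agents:
  root:
    model: openai/gpt-5-mini
    description: A helpful AI assistant
    toolsets:
      # hypothetical remote MCP server reached over streamable HTTP (or SSE)
      - type: mcp
        remote:
          url: https://mcp.example.com/mcp # placeholder URL
          transport_type: http # or "sse"
```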
See [the USAGE docs](./docs/USAGE.md#tool-configuration) for more detailed
information and examples.
### Exposing agents as MCP tools
`cagent` can expose agents as MCP tools via the `cagent mcp` command, allowing other MCP clients to use your agents.
Each agent in your configuration becomes an MCP tool with its description.
```bash
# Start MCP server with local file
cagent mcp ./examples/dev-team.yaml
# Or use an OCI artifact
cagent mcp agentcatalog/pirate
```
This exposes each agent as a tool (e.g., `root`, `designer`, `awesome_engineer`) that MCP clients can call:
```json
{
  "method": "tools/call",
  "params": {
    "name": "designer",
    "arguments": {
      "message": "Design a login page"
    }
  }
}
```
See [MCP Mode documentation](./docs/MCP-MODE.md) for detailed instructions on exposing your agents through MCP with Claude Desktop, Claude Code, and other MCP clients.
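For instance, an MCP client that launches servers over `stdio`, such as Claude Desktop, can be pointed straight at `cagent mcp`. Here is a minimal sketch of such a client configuration, assuming `cagent` is on your `PATH` and using a hypothetical absolute path to your agent file:
```json
{
  "mcpServers": {
    "cagent-dev-team": {
      "command": "cagent",
      "args": ["mcp", "/absolute/path/to/dev-team.yaml"]
    }
  }
}
```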
### 🎯 Key Features
- **🏗️ Multi-agent architecture** - Create specialized agents for different
domains.
- **🔧 Rich tool ecosystem** - Agents can use external tools and APIs via the
MCP protocol.
- **🔄 Smart delegation** - Agents can automatically route tasks to the most
suitable specialist.
- **📝 YAML configuration** - Declarative model and agent configuration.
- **💭 Advanced reasoning** - Built-in "think", "todo" and "memory" tools for
  complex problem-solving (see the sketch after this list).
- **🔍 RAG (Retrieval-Augmented Generation)** - Pluggable retrieval strategies
(BM25, chunked-embeddings, semantic-embeddings) with hybrid retrieval, result fusion and reranking support.
- **🌐 Multiple AI providers** - Support for OpenAI, Anthropic, Gemini, xAI,
Mistral, Nebius and [Docker Model
Runner](https://docs.docker.com/ai/model-runner/).
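
As a quick illustration of the built-in reasoning tools mentioned above, they are enabled per agent as toolsets. The sketch below assumes `think`, `todo` and `memory` toolset types and a `path` option for the memory store; treat these exact names as assumptions and see [USAGE.md](/docs/USAGE.md) for the supported configuration:
```yaml
agents:
  root:
    model: openai/gpt-5-mini
    description: A helpful AI assistant
    toolsets:
      - type: think  # structured scratchpad for step-by-step reasoning
      - type: todo   # task list the agent can manage
      - type: memory # persistent memory, stored at the given path (assumed option)
        path: ./agent_memory.db
```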
## 🚀 Quick Start 🚀
### Installation
#### Using Homebrew
Install `cagent` with a single command using [Homebrew](https://brew.sh/)!
```sh
$ brew install cagent
```
#### Using binary releases
[Prebuilt binaries](https://github.com/docker/cagent/releases) for Windows,
macOS and Linux can be found on the [releases
page](https://github.com/docker/cagent/releases) of the project's GitHub
repository.
Once you've downloaded the appropriate binary for your platform, you may need to
give it executable permissions. On macOS and Linux, this is done with the
following command:
```sh
# linux amd64 build example
chmod +x /path/to/downloads/cagent-linux-amd64
```
You can then rename the binary to `cagent` and configure your `PATH` to be able
to find it (configuration varies by platform).
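For example, on macOS or Linux you can move the renamed binary into a directory that is already on your `PATH` (the paths below are illustrative):
```sh
# linux amd64 build example; adjust paths for your platform
mv /path/to/downloads/cagent-linux-amd64 /usr/local/bin/cagent
# verify the binary is found
cagent --help
```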
### **Set your API keys**
Depending on the models you configure your agents to use, set the corresponding
provider API key. All of these keys are optional, but you will likely need at
least one of them:
```bash
# For OpenAI models
export OPENAI_API_KEY=your_api_key_here
# For Anthropic models
export ANTHROPIC_API_KEY=your_api_key_here
# For Gemini models
export GOOGLE_API_KEY=your_api_key_here
# For xAI models
export XAI_API_KEY=your_api_key_here
# For Nebius models
export NEBIUS_API_KEY=your_api_key_here
# For Mistral models
export MISTRAL_API_KEY=your_api_key_here
```
### Run Agents!
```bash
# Run an agent!
cagent run ./examples/pirate.yaml
# or specify a different starting agent from the config, useful for agent teams
cagent run ./examples/pirate.yaml -a root
# or run directly from an image reference; here we pull the pirate agent from the creek repository
cagent run creek/pirate
```
### Multi-agent team example
```yaml
agents:
  root:
    model: claude
    description: "Main coordinator agent that delegates tasks and manages workflow"
    instruction: |
      You are the root coordinator agent. Your job is to:
      1. Understand user requests and break them down into manageable tasks
      2. Delegate appropriate tasks to your helper agent
      3. Coordinate responses and ensure tasks are completed properly
      4. Provide final responses to the user

      When you receive a request, analyze what needs to be done and decide whether to:
      - Handle it yourself if it's simple
      - Delegate to the helper agent if it requires specific assistance
      - Break complex requests into multiple sub-tasks
    sub_agents: ["helper"]

  helper:
    model: claude
    description: "Assistant agent that helps with various tasks as directed by the root agent"
    instruction: |
      You are a helpful assistant agent. Your role is to:
      1. Complete specific tasks assigned by the root agent
      2. Provide detailed and accurate responses
      3. Ask for clarification if tasks are unclear
      4. Report back to the root agent with your results

      Focus on being thorough and helpful in whatever task you're given.

models:
  claude:
    provider: anthropic
    model: claude-sonnet-4-0
    max_tokens: 64000
```
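If you save this configuration as, say, `team.yaml` (a filename chosen here for illustration), you can start a session with the root coordinator, or target the helper agent directly using the `-a` flag shown earlier:
```bash
# start the team at the root coordinator
cagent run ./team.yaml
# or talk to the helper agent directly
cagent run ./team.yaml -a helper
```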
You'll find a curated list of agent examples, split into three categories,
[Basic](https://github.com/docker/cagent/tree/main/examples#basic-configurations),
[Advanced](https://github.com/docker/cagent/tree/main/examples#advanced-configurations)
and
[Multi-agent](https://github.com/docker/cagent/tree/main/examples#multi-agent-configurations),
in the `/examples/` directory.
### DMR (Docker Model Runner) provider options
When using the `dmr` provider, you can use the `provider_opts` key to pass
runtime-specific options (e.g. llama.cpp, vLLM) and configure speculative decoding:
```yaml
models:
  local-qwen:
    provider: dmr
    model: ai/qwen3
    max_tokens: 8192
    provider_opts:
      # general flags passed to the underlying model runtime
      runtime_flags: ["--ngl=33", "--repeat-penalty=1.2", ...] # or comma/space-separated string
      # speculative decoding for faster inference
      speculative_draft_model: ai/qwen3:1B
      speculative_num_tokens: 5
      speculative_acceptance_rate: 0.8
```
The default base_url `cagent` will use for DMR providers is
`http://localhost:12434/engines/llama.cpp/v1`. DMR itself might need to be
enabled via [Docker Desktop's
settings](https://docs.docker.com/ai/model-runner/get-started/#enable-dmr-in-docker-desktop)
on macOS and Windows, and via the command-line on [Docker CE on
Linux](https://docs.docker.com/ai/model-runner/get-started/#enable-dmr-in-docker-engine).
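If your Model Runner endpoint differs from that default (for example a different port or engine path), the model configuration can point at it explicitly. A minimal sketch, assuming a model-level `base_url` key matching the default mentioned above:
```yaml
models:
  local-qwen:
    provider: dmr
    model: ai/qwen3
    # assumed key: override the DMR endpoint instead of relying on the default
    base_url: http://localhost:12434/engines/llama.cpp/v1
```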
See the [DMR Provider documentation](docs/USAGE.md#dmr-docker-model-runner-provider-usage) for more details on runtime flags and speculative decoding options.
## RAG (Retrieval-Augmented Generation)
Give your agents access to your documents with cagent's modular RAG system. It supports multiple retrieval strategies that can be used individually or combined for hybrid search.
### Quick RAG Example
```yaml
models:
  embedder:
    provider: openai
    model: text-embedding-3-small

rag:
  my_knowledge_base:
    docs: [./documents, ./pdfs]
    strategies:
      - type: chunked-embeddings
        model: embedder
        threshold: 0.5
        chunking:
          size: 1000
          overlap: 100
    results:
      limit: 5

agents:
  root:
    model: openai/gpt-4o
    instruction: |
      You are an assistant with access to an internal knowledge base.
      Use the knowledge base to gather context before answering user questions
    rag: [my_knowledge_base]
```
### Hybrid Retrieval (Chunked-Embeddings + BM25)
Combine semantic search (chunked-embeddings) with keyword search (BM25) for best results:
```yaml
rag:
  hybrid_search:
    docs: [./shared_docs]
    strategies:
      - type: chunked-embeddings
        model: embedder
        threshold: 0.5
        limit: 20
        chunking:
          size: 1000
          overlap: 100
      - type: bm25
        k1: 1.5
        b: 0.75
        threshold: 0.3
        limit: 15
        chunking:
          size: 1000
          overlap: 100
    results:
      fusion:
        strategy: rrf # Reciprocal Rank Fusion
        k: 60
      deduplicate: true
      limit: 5

agents:
  root:
    model: openai/gpt-4o
    rag: [hybrid_search]
```
**Features:**
- **Multiple strategies**: Chunked embeddings, semantic embeddings, BM25 (keyword), or combinations
- **Parallel execution**: Strategies run concurrently for fast results
- **Pluggable fusion**: RRF, weighted, or max score combining
- **Result reranking**: Re-score results with specialized models for improved relevance
- **Per-strategy configuration**: Different thresholds, limits, and documents
- **Auto file watching**: Reindex automatically on file changes
### Result Reranking
Improve search quality by re-scoring retrieved results with a reranking model:
```yaml
rag:
  knowledge_base:
    docs: [./documents]
    strategies:
      - type: chunked-embeddings
        model: openai/text-embedding-3-small
        limit: 20 # Retrieve more candidates for reranking
    results:
      reranking:
        model: openai/gpt-4.1-mini # Any chat model or DMR reranker
        top_k: 10 # Only rerank the top 10 results (optional)
        threshold: 0.3 # Filter low-scoring results (optional)
        criteria: | # Domain-specific relevance guidance (optional; not used with DMR-specific reranking models)
          Prioritize recent documentation and practical examples.
          Documents from official sources are more relevant.
      limit: 5 # Final top results after reranking
```
**Supported providers:** DMR (native `/rerank` endpoint), OpenAI, Anthropic, Gemini (via structured outputs)
**Note:** Temperature defaults to 0.0 for more deterministic scoring when not explicitly set.
See the [RAG documentation in USAGE.md](docs/USAGE.md#rag-configuration) for complete details, examples, and debugging guides.
## Quickly generate agents and agent teams with `cagent new`
Using the command `cagent new`, you can quickly generate agents or multi-agent
teams from a single prompt!
`cagent` has a built-in agent dedicated to this task.
To use the feature, you must have an Anthropic, OpenAI or Google API key
available in your environment or specify a local model to run with DMR (Docker
Model Runner).
You can choose which provider and model get used by passing the `--model
provider/modelname` flag to `cagent new`.
If `--model` is unspecified, `cagent new` will automatically choose between
these three providers in order, based on the first API key it finds in your
environment:
```sh
export ANTHROPIC_API_KEY=your_api_key_here # first choice. default model claude-sonnet-4-0
export OPENAI_API_KEY=your_api_key_here # if anthropic key not set. default model gpt-5-mini
export GOOGLE_API_KEY=your_api_key_here # if anthropic and openai keys are not set. default model gemini-2.5-flash
```
`--max-tokens` can be specified to override the context limit used.
When using DMR, the default is 16k to limit memory usage. With all other
providers, the default is 64k.
`--max-iterations` can be specified to override how many times the agent is
allowed to loop when calling tools. When using DMR, the default is set to 20
(small local models are the most likely to get confused and loop endlessly).
For all other providers, the default is 0 (unlimited).
Examples of overriding the provider, model, context size and max iterations:
```sh
# Use GPT-5 via OpenAI
cagent new --model openai/gpt-5
# Use a local model (ai/gemma3-qat:12B) via DMR
cagent new --model dmr/ai/gemma3-qat:12B
# Override the max_tokens used during generation, default is 64k, 16k when using the dmr provider
cagent new --model openai/gpt-5-mini --max-tokens 32000
# Override max_iterations to limit how much the model can loop autonomously when tool calling
cagent new --model dmr/ai/gemma3n:2B-F16 --max-iterations 15
```
---
```
$ cagent new
------- Welcome to cagent! -------
(Ctrl+C to stop the agent and exit)
What should your agent/agent team do? (describe its purpose):
> I need an agent team that connects to <some-service> and does...
```
## Pushing and pulling agents from Docker Hub
### `cagent push`
Agent configurations can be packaged and shared on Docker Hub using the `cagent
push` command:
```sh
cagent push ./<agent-file>.yaml namespace/reponame
```
`cagent` will automatically build an OCI image and push it to the desired
repository using your Docker credentials.
### `cagent pull`
Pulling agents from Docker Hub is also just one `cagent pull` command away.
```sh
cagent pull creek/pirate
```
`cagent` will pull the image, extract the `.yaml` file and place it in your working
directory for ease of use.
`cagent run creek.yaml` will run your newly pulled agent.
## Usage
More details on the usage and configuration of `cagent` can be found in
[USAGE.md](/docs/USAGE.md)
## Telemetry
We track anonymous usage data to improve the tool. See
[TELEMETRY.md](/docs/TELEMETRY.md) for details.
## Contributing
Want to hack on `cagent`, or help us fix bugs and build out some features? 🔧
Read the information on how to build from source and contribute to the project
in [CONTRIBUTING.md](/docs/CONTRIBUTING.md).
## Dogfooding: using `cagent` to code on `cagent`
A smart way to improve `cagent`'s codebase and feature set is to do it with the
help of a `cagent` agent!
We have one that we use and that you should use too:
```sh
cd cagent
cagent run ./golang_developer.yaml
```
This agent is an _expert Golang developer specializing in the cagent multi-agent
AI system architecture_.
Ask it anything about `cagent`. It can be questions about the current code or
about improvements to the code. It can also fix issues and implement new
features!
## Share your feedback
We’d love to hear your thoughts on this project. You can find us on
[Slack](https://dockercommunity.slack.com/archives/C09DASHHRU4).
", Assign "at most 3 tags" to the expected json: {"id":"15468","tags":[]} "only from the tags list I provide: [{"id":77,"name":"3d"},{"id":89,"name":"agent"},{"id":17,"name":"ai"},{"id":54,"name":"algorithm"},{"id":24,"name":"api"},{"id":44,"name":"authentication"},{"id":3,"name":"aws"},{"id":27,"name":"backend"},{"id":60,"name":"benchmark"},{"id":72,"name":"best-practices"},{"id":39,"name":"bitcoin"},{"id":37,"name":"blockchain"},{"id":1,"name":"blog"},{"id":45,"name":"bundler"},{"id":58,"name":"cache"},{"id":21,"name":"chat"},{"id":49,"name":"cicd"},{"id":4,"name":"cli"},{"id":64,"name":"cloud-native"},{"id":48,"name":"cms"},{"id":61,"name":"compiler"},{"id":68,"name":"containerization"},{"id":92,"name":"crm"},{"id":34,"name":"data"},{"id":47,"name":"database"},{"id":8,"name":"declarative-gui "},{"id":9,"name":"deploy-tool"},{"id":53,"name":"desktop-app"},{"id":6,"name":"dev-exp-lib"},{"id":59,"name":"dev-tool"},{"id":13,"name":"ecommerce"},{"id":26,"name":"editor"},{"id":66,"name":"emulator"},{"id":62,"name":"filesystem"},{"id":80,"name":"finance"},{"id":15,"name":"firmware"},{"id":73,"name":"for-fun"},{"id":2,"name":"framework"},{"id":11,"name":"frontend"},{"id":22,"name":"game"},{"id":81,"name":"game-engine "},{"id":23,"name":"graphql"},{"id":84,"name":"gui"},{"id":91,"name":"http"},{"id":5,"name":"http-client"},{"id":51,"name":"iac"},{"id":30,"name":"ide"},{"id":78,"name":"iot"},{"id":40,"name":"json"},{"id":83,"name":"julian"},{"id":38,"name":"k8s"},{"id":31,"name":"language"},{"id":10,"name":"learning-resource"},{"id":33,"name":"lib"},{"id":41,"name":"linter"},{"id":28,"name":"lms"},{"id":16,"name":"logging"},{"id":76,"name":"low-code"},{"id":90,"name":"message-queue"},{"id":42,"name":"mobile-app"},{"id":18,"name":"monitoring"},{"id":36,"name":"networking"},{"id":7,"name":"node-version"},{"id":55,"name":"nosql"},{"id":57,"name":"observability"},{"id":46,"name":"orm"},{"id":52,"name":"os"},{"id":14,"name":"parser"},{"id":74,"name":"react"},{"id":82,"name":"real-time"},{"id":56,"name":"robot"},{"id":65,"name":"runtime"},{"id":32,"name":"sdk"},{"id":71,"name":"search"},{"id":63,"name":"secrets"},{"id":25,"name":"security"},{"id":85,"name":"server"},{"id":86,"name":"serverless"},{"id":70,"name":"storage"},{"id":75,"name":"system-design"},{"id":79,"name":"terminal"},{"id":29,"name":"testing"},{"id":12,"name":"ui"},{"id":50,"name":"ux"},{"id":88,"name":"video"},{"id":20,"name":"web-app"},{"id":35,"name":"web-server"},{"id":43,"name":"webassembly"},{"id":69,"name":"workflow"},{"id":87,"name":"yaml"}]" returns me the "expected json"