Native AI Agent (dfm agent)

DV Flow Manager ships a fully embedded, native AI agent that runs entirely within the dfm process; no external CLI tools are required. It is the default when no subprocess assistant (GitHub Copilot CLI, Codex CLI) is detected in PATH, and it can also be selected explicitly with -a native.

The native agent is powered by openai-agents + LiteLLM, so it can talk to any model provider LiteLLM supports, including GitHub Copilot, OpenAI, Anthropic, Google, Azure, Ollama, and any OpenAI-compatible endpoint.

Quick Start

# Auto-detect: uses native agent when no subprocess CLI is found
dfm agent

# Explicitly use the native agent
dfm agent -a native

# Specify a model
dfm agent -a native -m openai/gpt-4o

# With project context
dfm agent -a native MySkill MyPersona

Installing the Agent Dependencies

The agent dependencies are an optional extra that must be installed:

pip install dv-flow-mgr[agent]

Or, in a project managed by ivpm:

# Add to ivpm.yaml agent dep-set, then:
ivpm update
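
The exact dep-set stanza depends on your ivpm version and project layout; the following is only a sketch of the idea (entry names illustrative; consult the ivpm documentation for the precise schema):

# ivpm.yaml (illustrative sketch)
package:
  name: my-project
  dep-sets:
    - name: agent
      deps:
        - name: dv-flow-mgr[agent]
          src: pypi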

Provider Configuration

The native agent uses LiteLLM to talk to the underlying model provider. The model is selected in this priority order (see the example after the list):

  1. -m / --model CLI flag

  2. model: key in config file (~/.dfm/agent.yaml or .dfm/agent.yaml)

  3. DFM_MODEL environment variable

  4. DFM_PROVIDER environment variable (uses <provider>/gpt-4.1)

  5. Auto-detected from well-known API-key environment variables (see table below)

  6. Built-in default: github_copilot/gpt-4.1
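
For example, the CLI flag wins even when a lower-priority source is set:

# DFM_MODEL alone would select gpt-4o-mini...
export DFM_MODEL=openai/gpt-4o-mini

# ...but -m takes priority, so this session uses the Anthropic model
dfm agent -a native -m anthropic/claude-3-5-sonnet-20241022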

In most cases you only need to export the right key and the agent will figure out the provider automatically:

Environment variable             Auto-selected model
GITHUB_TOKEN                     github_copilot/gpt-4.1
OPENAI_API_KEY                   openai/gpt-4.1
ANTHROPIC_API_KEY                anthropic/claude-3-5-sonnet-20241022
GEMINI_API_KEY                   gemini/gemini-2.0-flash
AZURE_API_KEY + AZURE_API_BASE   azure/gpt-4o
OLLAMA_HOST                      ollama/llama3.2
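
With one of these keys exported, no -m flag is needed:

export ANTHROPIC_API_KEY=sk-ant-...
dfm agent -a native    # auto-selects anthropic/claude-3-5-sonnet-20241022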

Model names follow the LiteLLM convention <provider>/<model-name>.

GitHub Copilot

GitHub Copilot is the default provider. It uses your existing Copilot subscription — no separate API key is required.

Authentication is handled via the GitHub CLI token or a GITHUB_TOKEN environment variable. The first time you use a Copilot model, LiteLLM may trigger an OAuth device-flow prompt in the terminal.

# Export a GitHub personal access token (recommended for headless use)
export GITHUB_TOKEN=ghp_...

# Default Copilot model
dfm agent -a native

# Specific Copilot model
dfm agent -a native -m github_copilot/gpt-4.1
dfm agent -a native -m github_copilot/claude-3.7-sonnet
dfm agent -a native -m github_copilot/o3-mini

Available Copilot model names depend on your subscription; gpt-4.1, gpt-4o, claude-3.7-sonnet, and o3-mini are common options.

Config file shorthand:

# ~/.dfm/agent.yaml
model: github_copilot/gpt-4.1

OpenAI

export OPENAI_API_KEY=sk-...

# GPT-4o (recommended)
dfm agent -a native -m openai/gpt-4o

# GPT-4o-mini (faster, cheaper)
dfm agent -a native -m openai/gpt-4o-mini

# o1 reasoning model
dfm agent -a native -m openai/o1

Config file:

# ~/.dfm/agent.yaml
model: openai/gpt-4o

Anthropic Claude

export ANTHROPIC_API_KEY=sk-ant-...

dfm agent -a native -m anthropic/claude-3-5-sonnet-20241022
dfm agent -a native -m anthropic/claude-3-opus-20240229

Config file:

model: anthropic/claude-3-5-sonnet-20241022

Azure OpenAI

export AZURE_API_KEY=...
export AZURE_API_BASE=https://your-resource.openai.azure.com/
export AZURE_API_VERSION=2024-02-01

dfm agent -a native -m azure/your-deployment-name

Config file:

model: azure/your-deployment-name

Custom HTTP Headers and API Gateway Authentication

Some organisations route model requests through a proxy or API gateway that requires an auth token or subscription key in a custom HTTP header. Use model_settings.headers in the config file:

# .dfm/agent.yaml
model: openai/gpt-4o
model_settings:
  api_base:    https://llm-proxy.example.com
  api_key:     "${{ env.LLM_API_KEY }}"
  api_version: "2024-06-01"
  ssl_verify:  false
  headers:
    X-Auth-Token: "${{ env.LLM_AUTH_TOKEN }}"

The ${{ env.VAR }} syntax expands the named environment variable at config-load time, so the key never has to be stored in plain text:

export LLM_AUTH_TOKEN=my-secret-token
dfm agent -a native

All entries under model_settings are passed directly to the underlying LiteLLM acompletion() call:

Key           Description
api_base      Override the endpoint URL
api_key       Override the API key (can use ${{ env.VAR }})
api_version   API version string (required for Azure)
ssl_verify    Set to false to disable TLS certificate verification
headers       Dict of custom HTTP headers added to every request

Google Gemini

export GEMINI_API_KEY=...

dfm agent -a native -m gemini/gemini-1.5-pro
dfm agent -a native -m gemini/gemini-1.5-flash

OpenAI-Compatible Endpoints (vLLM, LM Studio, etc.)

Any server that implements the OpenAI chat-completions API can be used. Set OPENAI_API_BASE (or OPENAI_BASE_URL) to point at your server:

export OPENAI_API_BASE=http://my-server:8000/v1
export OPENAI_API_KEY=dummy          # required by LiteLLM even if unused

dfm agent -a native -m openai/my-deployed-model
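
For example, with a local vLLM server (the vllm serve invocation and model name below are illustrative; adjust them for your deployment):

# Stand up an OpenAI-compatible server with vLLM
vllm serve Qwen/Qwen2.5-Coder-7B-Instruct --port 8000

export OPENAI_API_BASE=http://localhost:8000/v1
export OPENAI_API_KEY=dummy
dfm agent -a native -m openai/Qwen/Qwen2.5-Coder-7B-Instruct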

Ollama (Local Models)

Ollama runs open-weight models locally. Install Ollama and pull a model, then point LiteLLM at it:

# Pull one or more models (Ollama serves on http://localhost:11434 by default)
ollama pull llama3.2
ollama pull qwen2.5-coder:7b

# Run via LiteLLM's ollama provider
dfm agent -a native -m ollama/llama3.2
dfm agent -a native -m ollama/qwen2.5-coder:7b

# Or set DFM_MODEL to avoid typing it every time
export DFM_MODEL=ollama/qwen2.5-coder:7b
dfm agent

Note

Smaller models (7B parameters and below) may struggle with complex multi-tool workflows. qwen2.5-coder:14b, llama3.1:8b, or mistral-nemo are reasonable minimum sizes for productive sessions.
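
For example, to step up from the 7B coder model pulled above:

ollama pull qwen2.5-coder:14b
dfm agent -a native -m ollama/qwen2.5-coder:14b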

Ollama on a remote host:

export OLLAMA_API_BASE=http://gpu-server:11434
dfm agent -a native -m ollama/llama3.2

Environment Variables Summary

Variable            Description
DFM_MODEL           Full LiteLLM model name, e.g. openai/gpt-4o
DFM_PROVIDER        Provider prefix only; model defaults to <provider>/gpt-4.1
GITHUB_TOKEN        GitHub personal access token; auto-selects github_copilot/gpt-4.1
OPENAI_API_KEY      OpenAI (or OpenAI-compatible) API key; auto-selects openai/gpt-4.1
OPENAI_API_BASE     Override base URL for OpenAI-compatible servers
ANTHROPIC_API_KEY   Anthropic API key; auto-selects anthropic/claude-3-5-sonnet-20241022
AZURE_API_KEY       Azure OpenAI key; with AZURE_API_BASE, auto-selects azure/gpt-4o
AZURE_API_BASE      Azure OpenAI endpoint URL
AZURE_API_VERSION   Azure OpenAI API version string
GEMINI_API_KEY      Google Gemini API key; auto-selects gemini/gemini-2.0-flash
OLLAMA_HOST         Ollama server URL; auto-selects ollama/llama3.2
OLLAMA_API_BASE     Alternative Ollama server URL (used by LiteLLM directly)

Config File

Create ~/.dfm/agent.yaml (user-wide) and/or .dfm/agent.yaml (project-local, takes precedence) to set persistent defaults:

# ~/.dfm/agent.yaml  or  .dfm/agent.yaml
model: github_copilot/gpt-4.1

# Tool approval mode: never | auto | write
#   never  – run all tools automatically (default)
#   auto   – prompt before shell/write tools
#   write  – alias for auto
approval_mode: never

# Enable agent tracing (writes JSONL to trace_dir)
trace: false
trace_dir: ~/.dfm/traces/

# Extra text appended to the generated system prompt
system_prompt_extra: |
    Always prefer minimal, targeted code changes.

# LiteLLM model-level settings (all optional)
model_settings:
  api_base:    https://llm-proxy.example.com  # override endpoint
  api_key:     "${{ env.LLM_API_KEY }}"       # or plain string
  api_version: "2024-06-01"                   # e.g. Azure API version
  ssl_verify:  false                           # disable TLS verification
  headers:                                    # custom HTTP request headers
    X-Auth-Token: "${{ env.LLM_AUTH_TOKEN }}"
    X-Custom-Header: some-value

# Additional MCP servers to start (advanced)
mcp_servers:
  - name: my-tool
    command: uvx
    args: [mcp-my-tool]

CLI flags always override config-file values.
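
For example, with model: github_copilot/gpt-4.1 set in agent.yaml, the following session still uses GPT-4o:

dfm agent -a native -m openai/gpt-4o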

Environment Variable References

Any string value in the config file can reference an environment variable using ${{ env.VAR_NAME }} syntax. This is evaluated at load time:

model_settings:
  api_key: "${{ env.OPENAI_API_KEY }}"
  headers:
    X-Auth-Token: "${{ env.LLM_AUTH_TOKEN }}"
    Authorization: "Bearer ${{ env.MY_BEARER_TOKEN }}"

If the referenced variable is not set, the value expands to an empty string and a warning is logged.
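
The substitution behaves roughly like the sketch below (a hypothetical helper shown for illustration, not the actual dfm implementation):

import os
import re

# Matches ${{ env.VAR_NAME }} references in config string values
_ENV_REF = re.compile(r"\$\{\{\s*env\.([A-Za-z_][A-Za-z0-9_]*)\s*\}\}")

def expand_env_refs(value: str) -> str:
    """Replace each ${{ env.VAR }} with the variable's value, or '' if unset."""
    def _sub(m: re.Match) -> str:
        name = m.group(1)
        if name not in os.environ:
            print(f"warning: environment variable {name} is not set")
        return os.environ.get(name, "")
    return _ENV_REF.sub(_sub, value)

# expand_env_refs("Bearer ${{ env.MY_BEARER_TOKEN }}") -> "Bearer <token value>"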

TUI Interaction

The native agent presents a Rich + prompt_toolkit terminal UI with streaming Markdown output and colour-coded tool-call panels.

Slash Commands

Command               Description
/help                 Show all slash commands
/model                Display the active model name
/tools                List all registered tools
/skills               List skills and personas defined in the project
/personas             Alias for /skills
/skill add <Name>     Hot-load a skill into the current session
/persona add <Name>   Hot-load a persona into the current session
/cost                 Show cumulative token usage for the session
/approval [mode]      Show or set the tool approval mode (never / auto / write)
/clear                Clear conversation history
/exit, /quit          Exit the agent

Keyboard Shortcuts

  • Ctrl+D — exit immediately (same as /exit)

  • Ctrl+C — cancel the current response; press again within 1 second to exit

  • Up / Down — navigate input history

Approval Mode

By default (never), all tool calls execute automatically. Set approval_mode: auto (or use --approval-mode auto) to be prompted before any shell_exec, write_file, or apply_patch call:

dfm agent --approval-mode auto

You can also change the mode mid-session:

> /approval auto
Approval mode set to: auto

> /approval never
Approval mode set to: never

Tracing

Enable detailed span tracing for debugging or auditing:

dfm agent --trace

Traces are written as JSONL to ~/.dfm/traces/trace_<timestamp>.jsonl.
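
Each line of the trace file is a single JSON span, so standard text tools work for inspection:

# Show the last few spans from the most recent trace
tail -n 5 "$(ls -t ~/.dfm/traces/trace_*.jsonl | head -1)"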

Available Tools

The native agent has access to two sets of built-in tools.

DFM Tools (blue panels)

These tools give the agent direct access to DV Flow Manager:

Tool                Description
dfm_show_tasks      List all tasks in the loaded project
dfm_show_task       Get detailed information about a specific task
dfm_show_packages   List imported packages
dfm_show_types      List available task types
dfm_show_skills     List skills and personas
dfm_context         Return the complete project context as JSON
dfm_validate        Validate the current flow definition
dfm_run_tasks       Execute one or more tasks
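
In an interactive session these surface naturally; a prompt like the following (illustrative) triggers dfm_show_tasks and dfm_validate, with the results shown in blue tool-call panels:

> list the tasks in this project and check that the flow definition is valid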

Coding Tools (yellow / green panels)

These general-purpose tools let the agent read and modify files:

Tool             Description
shell_exec       Run a shell command (yellow; requires approval in auto mode)
write_file       Write content to a file (yellow; requires approval in auto mode)
apply_patch      Apply a unified diff (yellow; requires approval in auto mode)
read_file        Read a file (green)
list_directory   List directory contents (green)
grep_search      Search file contents with a regex (green)

Subprocess Agents (Legacy)

The original subprocess-based agents (GitHub Copilot CLI, OpenAI Codex CLI) are still supported via --assistant:

# GitHub Copilot CLI (must be installed separately)
dfm agent -a copilot

# OpenAI Codex CLI (must be installed separately)
dfm agent -a codex

These agents communicate with DFM through a JSON result-file protocol and do not have the streaming TUI or direct tool access. They remain available for environments where the native agent dependencies cannot be installed.

Troubleshooting

No module named 'agents'

The agent optional dependencies are not installed.

pip install dv-flow-mgr[agent]

Authentication errors / 401 Unauthorized

Verify your API key is exported in the current shell:

echo $OPENAI_API_KEY      # should print your key
echo $GITHUB_TOKEN        # for Copilot

For GitHub Copilot, re-run gh auth login if the token has expired.

Rate limit reached

The agent will display a user-friendly message and the run_once path automatically retries with exponential backoff. In the TUI, wait a moment and re-submit your message.

Copilot OAuth prompt in headless environments

Set GITHUB_TOKEN explicitly to avoid the interactive OAuth device flow:

export GITHUB_TOKEN=$(gh auth token)
dfm agent -a native

Ollama model not responding

Ensure the Ollama server is running:

ollama serve &
ollama list           # confirm the model is pulled

Small models may time out on complex prompts. Try a larger model or simplify the query.

See Also