MCP Integration

archmax exposes your semantic models to AI agents through the Model Context Protocol (MCP). Agents discover available models, browse datasets and fields, and run scoped SQL queries.

While semantic models are stored internally as OSI YAML files, the MCP tools never return raw YAML. Instead, models are converted on-the-fly into a compressed markdown digest that uses 3–5× fewer tokens than the equivalent YAML while preserving all semantically relevant context. See Semantic Models — How Agents See Your Models for details on the digest format.

Endpoints

Each project has two MCP endpoints:

| Endpoint | Reads from | Use case |
| --- | --- | --- |
| `POST /mcp/<project-slug>/mcp` | Published build (`build/`) | Production: give this to external AI agents |
| `POST /mcp/<project-slug>/test/mcp` | Source files (`src/`), assembled on-the-fly | Development: test changes immediately without publishing |

The production endpoint serves the last published version of your semantic models. Changes you make in the editor are not visible to production agents until you click Publish.

The test endpoint assembles the model from source files on every request, so it always reflects your latest saved changes. The built-in Playground and batch test runner use this endpoint. You can also point an external agent at the test endpoint during development to iterate quickly.

Both endpoints accept JSON-RPC requests with tools/list and tools/call methods, and both require the same Bearer token authentication.

Authentication

All MCP requests require a Bearer token:

Authorization: Bearer <your-mcp-token>

Tokens are created in the admin UI under MCP Access. Each token has:

Scopes: which semantic models the token can access
Expiry: optional expiration date

The MCP Access table also surfaces per-token activity:

Last Used: relative time of the most recent call (hover for the absolute timestamp)
Events (30d): number of MCP calls the token made in the last 30 days

Monitoring Calls

The MCP Log page at /<projectId>/monitoring shows every tools/call made through the project’s MCP endpoint. A filter bar above the table lets you narrow the view by:

Tool — pick one of the tools that have actually been called (e.g. execute_query)
Status — All, Success only, or Errors only
Token — restrict to calls from a specific token
Date range — pick a start and end date; both ends are inclusive

Click any row to open a detail panel with the full input arguments and tool output. For execute_query errors and other tool calls that reference a semantic model, a Refine action opens the agent chat with a pre-filled prompt to improve the model.

Available Tools

| Tool | Description |
| --- | --- |
| `list_semantic_models` | List semantic models the token has access to |
| `get_semantic_model` | Get an overview of a model with datasets, relationships, and metrics |
| `get_datasets` | Get fields for one or more datasets with types, examples, enums, and instructions |
| `execute_query` | Run a read-only SQL query scoped to a semantic model’s VIEWs. Returns a `storedQueryId` by default. |
| `execute_stored_query` | Re-execute a previously stored query by ID, optionally with different parameters |
| `request_improvement` | Submit an improvement request for a semantic model |

Query Execution

The execute_query tool lets agents run SQL against your data through scoped per-model VIEWs. Instead of accessing raw tables, agents write SQL with bare dataset names — the correct scoped schema is resolved automatically via DuckDB’s search_path:

SELECT o.total_amount, c.name
FROM "orders" o
JOIN "customers" c
  ON o.customer_id = c.customer_id
WHERE o.created_at > '2024-01-01'
LIMIT 100

Agents should not add schema or catalog prefixes. The search_path ensures that "orders" resolves to the correct scoped VIEW for the requested model.
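To make the call shape concrete, here is a minimal sketch of the JSON-RPC payload an agent might send for the query above. The `params.name` / `params.arguments` envelope follows the MCP `tools/call` convention, but the argument keys `model` and `sql` and the model name `sales` are hypothetical: the real argument names are whatever the tool's input schema from `tools/list` reports.

```python
import json


def execute_query_call(model: str, sql: str, request_id: int = 1) -> dict:
    """Build a JSON-RPC 2.0 tools/call payload for execute_query.

    The argument keys "model" and "sql" are assumptions for illustration;
    consult the schema returned by tools/list for the actual names.
    """
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": "execute_query",
            "arguments": {"model": model, "sql": sql},
        },
    }


# Bare dataset names only: no schema or catalog prefixes.
SQL = """
SELECT o.total_amount, c.name
FROM "orders" o
JOIN "customers" c
  ON o.customer_id = c.customer_id
WHERE o.created_at > '2024-01-01'
LIMIT 100
""".strip()

payload = execute_query_call("sales", SQL)  # "sales" is a placeholder model
body = json.dumps(payload)  # this string is the HTTP POST body
```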

The VIEW that backs each dataset comes from the dataset’s view_query custom extension, which is authored by the modeller rather than auto-generated. The platform re-materialises the VIEWs on every execute_query call (CREATE OR REPLACE VIEW), so changes to view_query take effect on the very next call. If any dataset has no view_query, execute_query returns a clear error that names the offending datasets, and the model remains unqueryable until the modeller adds the missing view_query.

Security

Queries are validated to only allow SELECT, WITH, EXPLAIN, and DESCRIBE statements
Raw catalog references (e.g., shopify.public.orders) are rejected — use bare dataset names
Explicit _scope_ prefixes are rejected — names resolve automatically
Each query runs with DuckDB security hardening: external access disabled, resource limits, SET statements blocked
Results are capped at 1,000 rows, and each query times out after 30 seconds
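An agent or client can catch the most common rejections before sending a query. The sketch below is a rough client-side pre-flight check that mirrors the first three rules; it is not the platform's validator, which is not described here and is the only authority on what is accepted. The function name `preflight`, the regular expressions, and the choice to flag any occurrence of `_scope_` are all assumptions made for illustration.

```python
import re

ALLOWED_FIRST_KEYWORDS = {"SELECT", "WITH", "EXPLAIN", "DESCRIBE"}

# A bare or double-quoted identifier.
_IDENT = r'(?:"[^"]+"|[A-Za-z_]\w*)'
# Three-part dotted names such as shopify.public.orders.
CATALOG_REF = re.compile(rf"{_IDENT}\.{_IDENT}\.{_IDENT}")
SCOPE_PREFIX = re.compile(r"_scope_", re.IGNORECASE)
# Single-quoted SQL string literals, with '' as an escaped quote.
STRING_LITERAL = re.compile(r"'(?:[^']|'')*'")
FIRST_WORD = re.compile(r"\s*([A-Za-z]+)")


def preflight(sql: str) -> list[str]:
    """Return a list of likely rejection reasons (empty if none found)."""
    problems = []
    # Blank out string literals so dotted text inside them is ignored.
    stripped = STRING_LITERAL.sub("''", sql)
    match = FIRST_WORD.match(stripped)
    first = match.group(1).upper() if match else ""
    if first not in ALLOWED_FIRST_KEYWORDS:
        problems.append("statement must start with SELECT, WITH, EXPLAIN, or DESCRIBE")
    if CATALOG_REF.search(stripped):
        problems.append("raw catalog reference found; use bare dataset names")
    if SCOPE_PREFIX.search(stripped):
        problems.append("explicit _scope_ prefix found; names resolve automatically")
    return problems
```

A check like this only saves a round trip. It is deliberately simple (it does not parse SQL or handle comments), so a query that passes here can still be rejected by the server.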

The view layer is a logical access control built on top of the federated DuckDB instance — it is not a process or kernel sandbox. See Limits of the View Layer for what it does and does not protect against, and the operator checklist for how to combine it with upstream database privileges.

Client Configuration Examples

Claude Desktop

Add to your Claude Desktop MCP config:

{
  "mcpServers": {
    "archmax": {
      "url": "https://your-server/mcp/your-project/mcp",
      "headers": {
        "Authorization": "Bearer sk-your-token"
      }
    }
  }
}

Cursor

Add to your .cursor/mcp.json:

{
  "mcpServers": {
    "archmax": {
      "url": "https://your-server/mcp/your-project/mcp",
      "headers": {
        "Authorization": "Bearer sk-your-token"
      }
    }
  }
}

Rate Limiting

MCP requests are rate-limited per client IP. The default is 120 requests per 60-second window, configurable via MCP_RATE_LIMIT_MAX.
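To show what a limit of 120 requests per 60-second window means in practice, here is a minimal fixed-window counter keyed by client IP. This is only an illustration: the server's actual algorithm (fixed window, sliding window, or something else) is not specified here, and the class name `FixedWindowLimiter` and its parameters are invented for this sketch. A client could use the same logic to throttle itself and stay under the limit.

```python
import time


class FixedWindowLimiter:
    """Allow at most max_requests per window_seconds for each client IP."""

    def __init__(self, max_requests: int = 120, window_seconds: float = 60.0,
                 clock=time.monotonic):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self.clock = clock  # injectable so the limiter can be tested
        self._windows = {}  # client_ip -> (window_start, request_count)

    def allow(self, client_ip: str) -> bool:
        """Record one request and report whether it is within the limit."""
        now = self.clock()
        start, count = self._windows.get(client_ip, (now, 0))
        if now - start >= self.window_seconds:
            # The previous window has expired; start a fresh one.
            start, count = now, 0
        if count >= self.max_requests:
            self._windows[client_ip] = (start, count)
            return False
        self._windows[client_ip] = (start, count + 1)
        return True


# Usage with the documented defaults (120 requests per 60 seconds).
limiter = FixedWindowLimiter()
first_allowed = limiter.allow("203.0.113.7")  # example IP from a documentation range
```

Because the limit is applied per client IP, agents that share an egress address also share one budget.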