Configuration Reference

All settings are read from environment variables, or a .env file in the working directory (copy .env.example to .env to start).

Variable	Required	Default	Description
`ANTHROPIC_API_KEY`	One of these two	—	Anthropic API key
`OPENAI_API_KEY`	One of these two	—	OpenAI API key
`LLM_MODEL`	No	`claude-sonnet-4-5`	LLM model identifier
`LLM_PROVIDER`	No	Auto-detected	`anthropic`, `openai`, or any LiteLLM provider. Setting `OPENAI_API_KEY` alone does not switch the provider — also set `LLM_PROVIDER=litellm` and `LLM_MODEL=openai/<model>` to use OpenAI.
`DATABASE_URL`	No	—	Connection string for the database being queried, e.g. `postgresql+psycopg2://user:password@localhost:5432/mydb`
`SSL_MODE`	No	`prefer`	SSL mode for the source database connection
`SSL_CA_CERT`	No	—	Path to an SSL CA certificate bundle (e.g. for AWS RDS/Aurora with `verify-full`)
`QDRANT_URL`	No	`http://localhost:6333`	Qdrant URL. Required for `--embed`, the semantic cache, and document connectors.
`QDRANT_API_KEY`	No	—	Required if using Qdrant Cloud
`QDRANT_COLLECTION`	No	`nlqueries`	Qdrant collection name
`KB_PATH`	No	`~/.nlqueries/knowledge_base`	Local path for exported knowledge base files
`KB_REFRESH_INTERVAL`	No	`3600`	Seconds between auto-refresh of the KB (`0` disables)
`CONNECTORS_FILE`	No	`~/.nlqueries/connectors.yaml`	Path to the connector registry
`CAPSULES_DIR`	No	`~/.nlqueries/capsules`	Path to saved query capsules
`FEEDBACK_DIR`	No	`~/.nlqueries/feedback`	Path to feedback JSONL files
`NOTION_API_TOKEN`	Only for Notion sync	—	Notion integration token
`CONFLUENCE_URL` / `CONFLUENCE_USER` / `CONFLUENCE_API_TOKEN`	Only for Confluence sync	—	Confluence connection details
`HF_TOKEN`	No	—	Hugging Face token — avoids rate limits on the one-time embedding model download. See troubleshooting.md.
`EMBED_SERVER_PORT`	No	`8765`	Port the embedding daemon listens on
`LOG_LEVEL`	No	`INFO`	`DEBUG`, `INFO`, `WARNING`, `ERROR`
`OTEL_EXPORTER_OTLP_ENDPOINT`	No	—	OTLP endpoint for traces (disabled if unset)
`LLM_MODEL_FAST`	No	`claude-haiku-4-5-20251001` (Anthropic) / `openai/gpt-4o-mini` (OpenAI)	Smaller/cheaper model used for short-output auxiliary calls (intent classifier, follow-up resolver). Set to any LiteLLM-supported model string.
`QUERY_HISTORY_LIMIT`	No	`500`	Maximum number of useful queries returned by `process-history` after filtering. Override per-run with `--max-queries`.
`EMBED_BACKEND`	No	`torch`	Embedding backend for the embed-server daemon. `torch` uses sentence-transformers/PyTorch (no extra deps); `onnx` uses ONNX Runtime via `optimum[onnxruntime]` (faster cold start, no PyTorch).
`NLQ_SCHEMA_FORMAT`	No	`compact`	Schema format injected into the system prompt. `compact` uses M-Schema (`【Table】 …`) — fewer tokens; `verbose` uses full Markdown (`### Table: …`) — backward compatible.
`NLQ_SELF_CONSISTENCY`	No	`off`	Self-consistency mode. `off` — disabled; `hard` — run N parallel SQL candidates only for queries classified as hard; `all` — always run N candidates and pick the majority answer.
`NLQ_CACHE_ANSWER_THRESHOLD`	No	`0.97`	Cosine similarity threshold for the Tier 1 answer cache. Questions within this distance of a cached question get the cached answer directly.
`NLQ_CACHE_TEMPLATE_THRESHOLD`	No	`0.90`	Cosine similarity threshold for the Tier 2 template cache.
`NLQ_EXPLAIN_VALIDATION`	No	`false`	When `true`, runs `EXPLAIN` on the final generated SQL via the connector to validate query plans before returning an answer.

Windows note: ~ in default paths resolves to C:\Users\<YourUsername> in PowerShell. To set a variable for the current session use $env:VAR = "value"; to persist it, use System Properties → Environment Variables or add it to your PowerShell profile.

Docker Compose services and volumes

Running docker compose up from core/ starts:

Service	Port	Purpose
`qdrant`	6333 (REST), 6334 (gRPC)	Vector store for embeddings, semantic cache, document search
`nlqueries-core`	8080	MCP server + CLI engine

Volume	Persists
`qdrant-data`	Qdrant collections across restarts
`nlqueries-data`	Knowledge bases, connector config, capsules, and feedback (mounted at `/data/nlqueries` in the container)

Run CLI commands inside the container with docker exec -it nlqueries-core nlqueries <command> — when connecting to a database on your hos