Per-repo issue queues + per-LLM-backend connection pools
# repos.yaml
repos:
- repo: owner/repo-A
parallel_issues: 2 # process 2 issues at once
- repo: owner/repo-B
parallel_issues: 1 # sequential per repo
- repo: owner/repo-C
parallel_issues: false # also sequential
# config.yaml
llm:
pools:
ollama: 1 # only 1 call at a time
openai: 10 # plenty of capacity
anthropic: 5
opencode: 3
You could configure your feature pipeline to use OpenAI for PM/Architect (fast) and Ollama for Engineer (free, local). The Ollama pool ensures only one local call happens at a time even if 5 pipelines are running simultaneously.
agents:
product_manager: { backend: openai, model: gpt-4.1 }
architect: { backend: openai, model: gpt-4.1 }
engineer: { backend: ollama, model: qwen2.5-coder } ← joins Ollama pool
code_reviewer: { backend: openai, model: gpt-4.1-mini }
parallel_issues: N per repo entry
llm.pools.<backend>: N