Question 1

How was the benchmark conducted?

Accepted Answer

Codna ran head-to-head against OpenAI Codex CLI and Google Gemini CLI across 8 real bug-fix scenarios. Every fix was verified by the project's own tests. No synthetic tasks, no self-reported numbers.

Question 2

What does "verified" mean in these results?

Accepted Answer

A fix counts only when the test suite passes. Codna went 8 for 8. A fix that compiles but breaks tests is not counted.

Question 3

How many tokens does Codna actually use per fix?

Accepted Answer

Codna sends the AI agent an evidence bundle measured at roughly 600 tokens — 162x less context than reading the full repository. That translates directly to lower API costs, typically pennies per fix.

Question 4

Why is Codna faster than other AI coding agents?

Accepted Answer

The deterministic engine maps the dependency and blast-radius graph in about 60 ms using zero LLM tokens. The agent receives only the relevant evidence, so there is far less to process before a fix is produced.

Question 5

Does Codna scale beyond a single repository?

Accepted Answer

Yes. In measured testing, Codna mapped 130 repositories in 9.2 seconds, using zero tokens for the mapping step.

Question 6

What models did the competing tools use in the benchmark?

Accepted Answer

The benchmark compared Codna against the default configurations of OpenAI Codex CLI and Google Gemini CLI as available at time of testing. Codna is bring-your-own-key, so you choose the underlying model.

Metric	Codna	Cursor	Cline
Fixes verified	87/87	87/87	64/87
Accuracy	100%	100%	73.6%
Avg wall time	13.4s	22.6s	72.9s
Avg tokens / fix	16.2K	81.0K	64.8K
Avg cost / fix	$0.021	n/a	$0.139
Repo understanding	0 tokens	LLM context	LLM context

Più veloce. Più economico. Verificato.

Bug reali. Repo reali. Test superati.

Scenari di bug-fix

Repo scansionati

Meno contesto

Per verified fix

Frequently asked

Verifica nel tuo repo.