Language Support

Chronos' debugging performance across different programming languages.

Chronos-1 maintains consistently strong performance across multiple programming languages, each evaluated on real-world bugs.

Language-wise Accuracy

  • Python: 68.7% ± 2.1%
  • JavaScript: 64.2% ± 2.3%
  • Java: 63.9% ± 2.2%
  • Go: 66.8% ± 2.4%
  • C++: 61.2% ± 2.6%
  • Rust: 59.8% ± 2.7%

General-purpose models achieve 3.7–15.8% on the same datasets.

Chronos’ debugging-first training corpus enables deep understanding of:

  • language-specific idioms
  • framework patterns
  • runtime behavior differences
  • ecosystem-level failure modes