Component Impact Analysis

Ablation study showing how each Chronos-1 subsystem contributes to overall debugging performance.

A rigorous ablation study quantifies how each Chronos-1 subsystem affects debugging accuracy.

Full System Performance

  • Debug Success: 67.3%
  • Precision: 92%
  • Recall: 85%

Without Key Components

Without AGR (flat retrieval)

  • Debug Success: 28.7%
  • Precision: 42%
  • Recall: 31%

➡️ Major loss of multi-hop reasoning.

Without Persistent Debug Memory

  • Debug Success: 40.1%
  • Precision: 67%
  • Recall: 58%

➡️ Cannot learn from past bugs.

Without Orchestration Loop

  • Debug Success: 42.5%
  • Precision: 71%
  • Recall: 62%

➡️ No iterative fix refinement.

Without Multi-Code Association

  • Debug Success: 35.8%
  • Precision: 54%
  • Recall: 47%

➡️ Cannot connect related files across modules.

Without Execution Sandbox

  • Debug Success: 48.2%
  • Precision: 78%
  • Recall: 69%

➡️ No test-driven validation or regression defense.

Every subsystem plays a critical role, with AGR and multi-code association being the most essential for maintaining high debugging accuracy.