Component Impact Analysis
Ablation study showing how each Chronos-1 subsystem contributes to overall debugging performance.
A rigorous ablation study quantifies how each Chronos-1 subsystem affects debugging accuracy.
Full System Performance
- Debug Success: 67.3%
- Precision: 92%
- Recall: 85%
Without Key Components
Without AGR (flat retrieval)
- Debug Success: 28.7%
- Precision: 42%
- Recall: 31%
➡️ Major loss of multi-hop reasoning.
Without Persistent Debug Memory
- Debug Success: 40.1%
- Precision: 67%
- Recall: 58%
➡️ Cannot learn from past bugs.
Without Orchestration Loop
- Debug Success: 42.5%
- Precision: 71%
- Recall: 62%
➡️ No iterative fix refinement.
Without Multi-Code Association
- Debug Success: 35.8%
- Precision: 54%
- Recall: 47%
➡️ Cannot connect related files across modules.
Without Execution Sandbox
- Debug Success: 48.2%
- Precision: 78%
- Recall: 69%
➡️ No test-driven validation or regression defense.
Every subsystem plays a critical role, with AGR and multi-code association being the most essential for maintaining high debugging accuracy.