Benchmarks
Realistic benchmarks for financial AI.
We evaluate AI models on the tasks that matter most to financial institutions—using real data, realistic scenarios, and the metrics that are most relevant for the domain.
FinSpread-Bench
Updated yesterdayThe first public benchmark for agentic financial spreading. Evaluates how well AI systems extract, calculate, and reason across financial documents—like bank statements, tax returns, payslips, and financial spreads—in real-world decision scenarios.
Task types
- Extraction
- Cross-document reasoning
- Calculation
- Structured output
Data source
Anonymized data from Taktile co-development partners
Evaluation method
Automated metrics and expert human evaluation
Last updated
2026-03-04