Parsing
RealDoc-Bench
Measures whether parsers deliver accurate layouts, preserve reading order, and enable agents to correctly answer objective questions against real-world documents.
Q&A accuracy (Extend Parse 2.0): 95.7%
Layout F1 (Adjusted F1): 0.847
Q&A set (prompts across 581 docs): 1,359
