TaskQA Results — Exact-Match Accuracy (%)
Average over HTML, LaTeX, and Markdown representations · 700 questions
VLM-Image (avg)
VLM-Text (avg)
LLM-Text (avg)
Model
Image avg
Text avg
Bar width proportional to accuracy · max = 70%