Table 8

Bentham dataset results

| Model | CER | WER | WER BoW |
| --- | --- | --- | --- |
| The Text Titan I | 7.07% | 12.41% | 8.54% |
| Gpt-4o-mini-2024-07-18 | 9.48% | 15.09% | 13.16% |
| Gpt-4o-2024-08-06 | 16.62% | 20.73% | 18.89% |
| Claude-3-5-sonnet-20240620 | 10.97% | 14.46% | 12.24% |
| MinicpmV-2 6 | 11.76% | 17.24% | 13.91% |
| Qwen2-VL-7B | 8.01% | 12.94% | 11% |
| Pixtral-12B | 28.08% | 38.25% | 30.32% |
| InternVL2-8B | 76.81% | 95.92% | 81.67% |
| Phi-3-mini-128k-instruct | 32.03% | 41.85% | 38.73% |
Source(s): Authors’ own work
