Table 4

Wilcoxon T Tests on CLQA accuracy of 7 GPLLMs with and without CLKR in PCEQEs

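| No | GPLLM | CLKR | Average accuracy | Accuracy enhancement | z-statistic | p-value |
|---|---|---|---|---|---|---|
| 1 | Llama-2-70b | without | 0.283 | 28.3% | 4.197 | 0.000*** |
| | | with | 0.363 | | | |
| 2 | text-davinci-003 | without | 0.329 | 44.9% | 4.286 | 0.000*** |
| | | with | 0.476 | | | |
| 3 | GPT-3.5 Turbo | without | 0.349 | 36.3% | 4.287 | 0.000*** |
| | | with | 0.476 | | | |
| 4 | GPT-4 | without | 0.528 | 25.4% | 4.171 | 0.000*** |
| | | with | 0.663 | | | |
| 5 | ChatGLM2-6B | without | 0.430 | 11.1% | 3.729 | 0.000*** |
| | | with | 0.478 | | | |
| 6 | ERNIE-Bot-turbo | without | 0.419 | 10.2% | 3.429 | 0.002*** |
| | | with | 0.462 | | | |
| 7 | ERNIE-Bot 4.0 | without | 0.755 | 9.9% | 4.029 | 0.000*** |
| | | with | 0.830 | | | |
| | Average accuracy of 7 GPLLMs | without | 0.442 | 21.1% | NA | NA |
| | | with | 0.535 | | | |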

Note(s): *** denotes a confidence level above 99%

Source(s): Authors’ own work
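
The "Accuracy enhancement" column appears to be the relative gain in average accuracy, (with - without) / without. This is an inference from the reported numbers, not a definition given in the table. The sketch below recomputes that gain from the rounded accuracies in Table 4; the names `ACCURACY`, `relative_gain` and `gains_percent` are illustrative only.

```python
# Average accuracies copied from Table 4 as (without CLKR, with CLKR).
ACCURACY = {
    "Llama-2-70b": (0.283, 0.363),
    "text-davinci-003": (0.329, 0.476),
    "GPT-3.5 Turbo": (0.349, 0.476),
    "GPT-4": (0.528, 0.663),
    "ChatGLM2-6B": (0.430, 0.478),
    "ERNIE-Bot-turbo": (0.419, 0.462),
    "ERNIE-Bot 4.0": (0.755, 0.830),
}


def relative_gain(without, with_clkr):
    """Relative accuracy gain, ASSUMED to be (with - without) / without."""
    return (with_clkr - without) / without


def gains_percent(table):
    """Return the relative gain per model, in percent, rounded to one decimal."""
    return {
        model: round(100 * relative_gain(without, with_clkr), 1)
        for model, (without, with_clkr) in table.items()
    }


if __name__ == "__main__":
    for model, gain in gains_percent(ACCURACY).items():
        print(f"{model}: {gain}%")
```

Because the inputs are the three-decimal averages printed in the table, some recomputed gains differ from the published column by a tenth or two of a percentage point (for example text-davinci-003), presumably because the published figures were derived from unrounded accuracies.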
