Table 3 Performance of conventional and...

Table 3

Performance of conventional and proposed SER models on the evaluation part of IEMOCAP and the preprocessed BC2013 dataset (only the annotated part) with three emotions (angry, neutral, sad). The conventional model utilized Text, Text and PSD (prosodic factors), while the proposed SER model utilized Text, PSD, and PRM (prominence) as input.

Dataset	Input	Precision	Recall	F1
IEMOCAP	Text	0.551	0.562	0.554
	Text + PSD [29]	0.621	0.618	0.619
	Text+PSD+PRM	0.642	0.623	0.632
BC2013	Text	0.535	0.480	0.486
	Text + PSD [29]	0.552	0.518	0.523
	Text+PSD+PRM	0.562	0.536	0.543

Dataset	Input	Precision	Recall	F1
IEMOCAP	Text	0.551	0.562	0.554
	Text + PSD [29]	0.621	0.618	0.619
	Text+PSD+PRM	0.642	0.623	0.632
BC2013	Text	0.535	0.480	0.486
	Text + PSD [29]	0.552	0.518	0.523
	Text+PSD+PRM	0.562	0.536	0.543

[ViewLarge]