Performance of the conventional and proposed SER models on the evaluation part of IEMOCAP and on the annotated part of the preprocessed BC2013 dataset, with three emotions (angry, neutral, sad). The conventional models used either Text alone or Text and PSD (prosodic factors) as input, while the proposed SER model used Text, PSD, and PRM (prominence).