Validating NAYALex emotion lexicon: identifying diverse range of emotions for comprehensive personality analysis in social media posts

Atlı, Yakup; İlhan, Nagehan

doi:10.1108/IDD-04-2024-0063

Article navigation

Research Article| June 27 2025

Validating NAYALex emotion lexicon: identifying diverse range of emotions for comprehensive personality analysis in social media posts

Yakup Atlı;

Yakup Atlı

Republic of Turkey Ministry of National Education

, Ankara,

Turkey

Search for other works by this author on:

This Site

PubMed

Google Scholar

Nagehan İlhan

Department of Computer Engineering,

Harran University

, Sanliurfa,

Turkey

Nagehan İlhan can be contacted at: nagehanilhan@harran.edu.tr

Search for other works by this author on:

This Site

PubMed

Google Scholar

Author & Article Information

Nagehan İlhan can be contacted at: nagehanilhan@harran.edu.tr

Publisher: Emerald Publishing

Received: April 30 2024

Revision Received: September 15 2024

Revision Received: December 23 2024

Revision Received: March 28 2025

Revision Received: May 10 2025

Accepted: May 14 2025

Online ISSN: 2398-6255

Print ISSN: 2398-6247

2025

Emerald Publishing Limited

Licensed re-use rights only

Information Discovery and Delivery (2025)

https://doi.org/10.1108/IDD-04-2024-0063

Purpose

This study aims to validate the NAYALex emotion lexicon, a comprehensive lexicon containing 245,822 emotion−word relationships across 6,469 English words, each mapped to at least one of 38 distinct emotions. It addresses the critical gap in existing emotion lexicons like National Research Council (NRC), which are limited in capturing emotions reflecting personality traits.

Design/methodology/approach

This study uses a quantitative approach, conducting experiments on two data sets: one with 11,880 Instagram posts used as a test set, and another with 26,600 sentence emotion pairs evaluated by human judges as the validation set. The analysis incorporates machine learning algorithms, including Naive Bayes, support vector machines (SVM) and K-nearest neighbors, to assess the lexicon’s performance.

Findings

The results demonstrate that NAYALex achieves an average validation rate of 77% and outperforms existing lexicons by extracting approximately four times more emotions, with a 24.7% coverage rate compared to NRC’s 6.5%. Among the tested algorithms, SVM achieved the highest classification accuracy of 93% on the validation data set, confirming the lexicon’s applicability for personality analysis.

Originality/value

This research offers a novel contribution by introducing the most comprehensive emotion lexicon to date, significantly enhancing the capacity for emotion and personality trait analysis from text. The findings pave the way for advanced applications in computational personality profiling, social media analytics and future emotion-based research.

2025

Emerald Publishing Limited

Licensed re-use rights only

You do not currently have access to this content.

Don't already have an account? Register

Validating NAYALex emotion lexicon: identifying diverse range of emotions for comprehensive personality analysis in social media posts

Email Alerts

Cited By

Validating NAYALex emotion lexicon: identifying diverse range of emotions for comprehensive personality analysis in social media posts

Sign in

Client Account

ICE Member Sign In

Email Alerts

Suggested Reading

Related Chapters

Recommended for you

Cited By

Sharing Unavailable