Topic features for machine learning-based sentiment analysis in Indonesian tweets

Murfi, Hendri; Siagian, Furida Lusi; Satria, Yudi

doi:10.1108/IJICC-04-2018-0057


Research Article| January 09 2019


Hendri Murfi, Furida Lusi Siagian, Yudi Satria

Department of Mathematics, Universitas Indonesia, Depok, Indonesia

Hendri Murfi is the corresponding author and can be contacted at: hendri@ui.ac.id


Publisher: Emerald Publishing

Received: April 30 2018

Revision Received: July 02 2018

Accepted: July 10 2018

Online ISSN: 1756-3798

Print ISSN: 1756-378X

2019

Emerald Publishing Limited

Licensed re-use rights only

International Journal of Intelligent Computing and Cybernetics (2019) 12 (1): 70–81.

https://doi.org/10.1108/IJICC-04-2018-0057

Purpose

The purpose of this paper is to analyze topics as alternative features for sentiment analysis in Indonesian tweets.

Design/methodology/approach

Given Indonesian tweets, sentiment analysis starts by extracting features from the tweets. The features are either words or topics. The authors use non-negative matrix factorization to extract the topics and apply a support vector machine to classify each tweet into its sentiment class.
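The pipeline described above can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes scikit-learn, TF-IDF weighting for the word features, a linear SVM, and two topics, none of which are stated in the abstract. The toy tweets and labels are invented for the example.

```python
from sklearn.decomposition import NMF
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.svm import LinearSVC

# Hypothetical toy tweets and labels (invented, not from the paper's data sets).
tweets = [
    "calon ini bagus dan jujur",
    "saya suka program calon ini",
    "pemimpin yang hebat dan jujur",
    "calon ini buruk dan bohong",
    "saya benci janji palsu calon ini",
    "program yang buruk dan gagal",
]
labels = ["positive", "positive", "positive",
          "negative", "negative", "negative"]

# Word features: a TF-IDF weighted bag of words (assumed weighting scheme).
vectorizer = TfidfVectorizer()
word_features = vectorizer.fit_transform(tweets)

# Topic features: NMF approximates the tweet-by-word matrix X as W @ H.
# Each row of W holds one tweet's non-negative weights over the topics.
n_topics = 2  # assumed value; the abstract does not give the number of topics
nmf = NMF(n_components=n_topics, init="nndsvda", random_state=0, max_iter=500)
topic_features = nmf.fit_transform(word_features)

# Train one linear SVM per feature representation (kernel choice is assumed).
word_clf = LinearSVC().fit(word_features, labels)
topic_clf = LinearSVC().fit(topic_features, labels)

# Classify an unseen tweet through its topic representation.
new_tweet = ["calon ini jujur"]
new_topics = nmf.transform(vectorizer.transform(new_tweet))
predicted = topic_clf.predict(new_topics)[0]
print(predicted)
```

With real data, the accuracy of `word_clf` and `topic_clf` would be compared on held-out tweets, for example with cross-validation, rather than on a handful of training sentences.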

Findings

The authors analyze the accuracy using two-class and three-class sentiment analysis data sets. Both data sets concern sentiment toward candidates in the Indonesian presidential election. The experiments show that the standard word features give better accuracy than the topic features for the two-class sentiment analysis. Moreover, the topic features can slightly improve the accuracy of the standard word features. The topic features can also improve the accuracy of the standard word features for the three-class sentiment analysis.

Originality/value

The standard textual data representation for sentiment analysis using machine learning is the bag of words and its extensions, which are mainly created by natural language processing. This paper applies topics as novel features for machine learning-based sentiment analysis in Indonesian tweets.
