Summary of related works on SA
| Study | Languagea | Datab | L | S | P | Algorithmc | Nr classes | Data size |
|---|---|---|---|---|---|---|---|---|
| Pang et al. | ENG | IMDb | – | – | X | N-gram+NB, N-gram+SVM, N-gram+ME | 3 | 16K rev |
| Blitzer et al. | ENG | Am | – | – | – | SCL | 2 | 8K rev |
| Bolleaga et al. | ENG | Am | X | – | X | FE+L1LR | 2 | 76K |
| Ortigiosa et al. | SP | FB | – | – | X | L, NB, J48, SVM | 2 | 3K |
| Dos Santos and Gatti | ENG | IMDb, TW | – | – | – | WE+CNN, CE+CNN | 2 | 12K, 80 K |
| Kumar et al. | – | Am | – | – | – | NB,LR,SW | 2 | – |
| Tripathy et al. | ENG | IMDb | – | – | – | N-gram+NB, N-gram+SVM, N-gram+ME, N-gram SGD | 2 | 50K |
| Conneau et al. | ENG, CH | various* | – | – | – | Ce+VDCNN | 2–14 | 11M |
| Zola et al. | ENG, IT | Am, FB, TR, TW | – | X | X | We+NB, We+SVM, We+MLP, We+CNN | 2, 3 | 1.3M |
| Study | Languagea | Data | L | S | P | Algorithm | Nr classes | Data size |
|---|---|---|---|---|---|---|---|---|
| Pang | ENG | IMDb | – | – | X | N-gram+NB, N-gram+SVM, N-gram+ME | 3 | 16K rev |
| Blitzer | ENG | Am | – | – | – | SCL | 2 | 8K rev |
| Bolleaga | ENG | Am | X | – | X | FE+L1LR | 2 | 76K |
| Ortigiosa | SP | FB | – | – | X | L, NB, J48, SVM | 2 | 3K |
| Dos Santos and Gatti | ENG | IMDb, TW | – | – | – | WE+CNN, CE+CNN | 2 | 12K, 80 K |
| Kumar | – | Am | – | – | – | NB,LR,SW | 2 | – |
| Tripathy | ENG | IMDb | – | – | – | N-gram+NB, N-gram+SVM, N-gram+ME, N-gram SGD | 2 | 50K |
| Conneau | ENG, CH | various | – | – | – | Ce+VDCNN | 2–14 | 11M |
| Zola | ENG, IT | Am, FB, TR, TW | – | X | X | We+NB, We+SVM, We+MLP, We+CNN | 2, 3 | 1.3M |
Notes:
aLanguage – ENG: English; CH: Chinese; IT: Italian; and SP: Spanish
Data source type – Am: Amazon reviews; FB: Facebook; IMDb: movies reviews; TR: Tripadvisor reviews; and TW: Twitter
SA method – Ce: character embedding; CNN: convolutional neural network; FE: feature extraction; J48: decision tree; L: lexicon information; L1LR: L1 regularized logistic regression; LR: logistic regression; ME: maximum-entropy; MI: mutual information; NB: naive Bayes; SGD: stochastic gradient descent; SVM: support vector machine; SCL: structural correspondence learning; SW: SentiWordNet (Baccianella et al., 2010); VDCNN: very deep convolutional neural network; W2V: word to vec; and We: word embedding
The authors used different sources: news data, DBPedia, Yelp reviews, Yahoo Answer and Amazon reviews
Sharing content requires targeting cookies to be enabled. Please update your cookie preferences to use this feature.