Performance comparison (× 100) of six baseline word embedding models on 13 word similarity datasets
| Model | WS | WS-SIM | WS-REL | MC | RG | RW | MEN | Mturk287 | Mturk771 | YP | SimLex | Verb | SimVerb |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SGNS | 71.6 | 78.7 | 62.8 | 81.1 | 79.3 | 46.6 | 76.1 | 67.3 | 67.8 | 53.6 | 39.8 | 45.6 | 28.9 |
| CBOW | 64.3 | 74.0 | 53.4 | 74.7 | 81.3 | 43.3 | 72.4 | 67.4 | 63.6 | 41.6 | 37.2 | 40.9 | 24.5 |
| GloVe | 59.7 | 66.8 | 55.9 | 74.2 | 75.1 | 32.5 | 68.5 | 61.9 | 63.0 | 53.4 | 32.4 | 36.7 | 17.2 |
| FastText | 64.8 | 72.1 | 56.4 | 76.3 | 77.3 | 46.6 | 73.0 | 63.0 | 63.0 | 49.0 | 35.2 | 35.0 | 21.9 |
| ngram2vec | 74.2 | 81.5 | 67.8 | 85.7 | 79.5 | 45.0 | 75.1 | 66.5 | 66.5 | 56.4 | 42.5 | 47.8 | 32.1 |
| Dict2vec | 69.4 | 72.8 | 57.3 | 80.5 | 85.7 | 49.9 | 73.3 | 60.0 | 65.5 | 59.6 | 41.7 | 18.9 | 41.7 |