Base algorithm configurations
| Algorithm | Input data | Stemming | Other settings for 3-digit DDC | Other settings for full DDC |
|---|---|---|---|---|
| Lexical | Terms from DDC | Snowball | – | – |
| SVC | Libris train set | – | ngram = 2 | ngram = 2 min_df = 2 |
| fastText | Libris train set | – | wordNgrams = 2 minn = 5 maxn = 5 loss = softmax dim = 150 epoch = 50 lr = 0.4234 minCount = 4 | wordNgrams = 2 minn = 5 maxn = 5 loss = softmax dim = 150 epoch = 45 lr = 0.9740 minCount = 3 |
| Omikuji | Libris train set | Snowball | ngram = 2 cluster_balanced = False cluster_k = 100 max_depth = 3 | ngram = 2 cluster_balanced = False cluster_k = 100 max_depth = 3 |
| Algorithm | Input data | Stemming | Other settings for 3-digit DDC | Other settings for full DDC |
|---|---|---|---|---|
| Lexical | Terms from DDC | Snowball | – | – |
| SVC | Libris train set | – | ngram = 2 | ngram = 2 |
| fastText | Libris train set | – | wordNgrams = 2 | wordNgrams = 2 |
| Omikuji | Libris train set | Snowball | ngram = 2 | ngram = 2 |
Source(s): Authors’ own creation