Correlation Between the Number of Identified Skills Subcategories and the Number of Words Available in the Text Descriptions, 2010–2020.
| Number of Assigned Skills Subcategories | Applicant Empl. Spells: Number of Words (Mean) (1) | Applicant Empl. Spells: Words per Skills Category (2) | Vacancy Data: Number of Words (Mean) (3) | Vacancy Data: Words per Skills Category (4) |
|---|---|---|---|---|
| 0 | 5.5 | | 20.2 | |
| 1 | 9.8 | 9.8 | 30.0 | 30.0 |
| 2 | 18.3 | 9.1 | 41.3 | 20.7 |
| 3 | 30.7 | 10.2 | 54.8 | 18.3 |
| 4 | 46.3 | 11.6 | 67.1 | 16.8 |
| 5 | 65.8 | 13.2 | 79.8 | 16.0 |
| 6 | 90.4 | 15.1 | 95.8 | 16.0 |
| 7 | 120.4 | 17.2 | 114.5 | 16.4 |
| 8 | 157.5 | 19.7 | 150.3 | 18.8 |
| 9 | 217.9 | 24.2 | 187.3 | 20.8 |
| 10 | 467.0 | 46.7 | 311.6 | 31.2 |
| 11 | 333.0 | 30.3 | 143.0 | 13.0 |
| 12 | – | – | 524.0 | 43.7 |
| Correlation coefficient (number of words and assigned categories) | 0.632 | | 0.632 | |
Notes: Authors' calculations based on the BuscoJobs database. Columns (1) and (3) report the mean number of words needed to identify the given number of skills subcategories for the applicants' job spells and for vacancies, respectively. See Section 2 for the definition of the skills subcategories. To facilitate interpretation, columns (2) and (4) divide the respective mean number of words by the number of assigned skills subcategories. The results are based on the initial keywords and expressions and exclude synonyms; the conclusions do not change when synonyms are included. Text length refers to preprocessed text, from which stop words and similar elements have already been removed, so it is shorter than in the original version.
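The division described in the notes for columns (2) and (4) can be illustrated with a minimal Python sketch. The function and variable names here are hypothetical and not taken from the authors' code; the inputs are the rounded means printed in columns (1) and (3), so a recomputed ratio may differ from the published one in the last digit.

```python
# Mean number of words by count of assigned skills subcategories,
# copied from columns (1) and (3) of the table. Names are illustrative only.
APPLICANT_MEAN_WORDS = {
    0: 5.5, 1: 9.8, 2: 18.3, 3: 30.7, 4: 46.3, 5: 65.8,
    6: 90.4, 7: 120.4, 8: 157.5, 9: 217.9, 10: 467.0, 11: 333.0,
}
VACANCY_MEAN_WORDS = {
    0: 20.2, 1: 30.0, 2: 41.3, 3: 54.8, 4: 67.1, 5: 79.8, 6: 95.8,
    7: 114.5, 8: 150.3, 9: 187.3, 10: 311.6, 11: 143.0, 12: 524.0,
}


def words_per_category(mean_words, n_subcategories):
    """Divide the mean word count by the number of assigned subcategories.

    Returns None when no subcategory was assigned, since the ratio is
    undefined there (the table leaves those cells empty).
    """
    if n_subcategories == 0:
        return None
    return mean_words / n_subcategories


def ratio_column(mean_words_by_count):
    """Build a column (2)/(4)-style mapping from a column (1)/(3)-style one."""
    return {
        n: words_per_category(words, n)
        for n, words in sorted(mean_words_by_count.items())
    }


if __name__ == "__main__":
    for n, ratio in ratio_column(APPLICANT_MEAN_WORDS).items():
        shown = "" if ratio is None else f"{ratio:.1f}"
        print(n, shown)
```

The correlation coefficients in the last row cannot be recomputed from these group means; they presumably rest on the underlying text-level observations, which the table does not report.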