Table 3.

Correlation Between the Number of Identified Skills Subcategories and the Number of Words Available in the Text Descriptions, 2010–2020.

Applicant Empl. SpellsVacancy Data
Number of Words (Mean)Words per Skills CategoryNumber of Words (Mean)Words per Skills Category
Number of Assigned Skills Subcategories(1)(2)(3)(4)
05.5 20.2 
19.89.830.030.0
218.39.141.320.7
330.710.254.818.3
446.311.667.116.8
565.813.279.816.0
690.415.195.816.0
7120.417.2114.516.4
8157.519.7150.318.8
9217.924.2187.320.8
10467.046.7311.631.2
11333.030.3143.013.0
12524.043.7
Correlation coefficient (Number of words and assigned categories)
 0.6320.632

Notes: Authors' calculations based on BuscoJobs database. Columns (1) and (3) refer to the mean number of words needed to identify the given number of skills subcategories for the applicants' job spells and vacancies, respectively. See again Section 2 above for the definition of the skills subcategories. To facilitate interpretation, columns (2) and (4) divide the respective mean number of words by the number of assigned skills subcategories. The results are based on the initial keywords and expressions, while neglecting the synonyms. The conclusions do not change when including also the synonyms. Note also that the text length refers to preprocessed text, where stop words, etc., have already been removed, such that the text length is shorter than in the original version.

or Create an Account

Close Modal
Close Modal