The European Commission is tackling the digital skills gap and promoting projects and strategies to improve the level of digital skills in Europe. In line with the European efforts, ESCO is now releasing the digital label, which distinguishes the digital skills and knowledge competences that are part of the ESCO classification.
In the ESCO classification, digital skills and knowledge concepts are identified, described, and connected to occupations - all available in 28 languages. During 2022, a total of 1,201 digital concepts were identified, and starting from ESCO v1.1.1 they can be downloaded together with the ESCO classification, and accessed via the ESCO API. This blog post describes the methodology developed to label skills and knowledge concepts as digital, with a particular focus on the data science work.
What is an ESCO digital skill?
The ESCO classification stores information on almost 14,000 knowledge concepts, considered the outcome of a learning process, and skills, which are the ability to apply such knowledge. The idea of labelling skills and knowledge concepts was first applied with the release of ESCO v1.1.0, where the green label was introduced. Using labels to distinguish sub-groups of skills allowed users to benefit from the richness of ESCO, while avoiding getting lost in the large size of the dataset.
The work on digital skills in ESCO is based on a longer journey, as it started with the integration of the European Digital Competence Framework for Citizens (DigComp) in the skills pillar in ESCO v1.0.0. The framework represents a complete summary of the skills needed in the digital environment, but it does not have the granularity of ESCO skills - which are important in a digital setting, but too specific to be listed in line with the DigComp standard. For this reason, the DigComp definition of digital skills is now adopted to distinguish all the digital ESCO concepts, including those skills that are more granular, or specific to only one or few occupations.
Digital competence involves the confident, critical and responsible use of, and engagement with, digital technologies for learning, at work, and for participation in society. It includes information and data literacy, communication and collaboration, media literacy, digital content creation (including programming), safety (including digital well-being and competences related to cybersecurity), intellectual property related questions, problem solving and critical thinking. (Council Recommendation on Key Competences for Lifelong Learning, 22 May 2018, ST 9009 2018 INIT).
Following this definition, 1,201 ESCO skills and knowledge concepts are now labelled as digital. If you are curious about these skills, please look at the three options below. One is a digital skill, while the other two are non-digital ones. Pick the digital and check if you selected the correct one! You can also refresh to get more examples.
Labelling Digital Skills: A Collaborative Approach
One of the main challenges when labelling concepts is to ensure homogeneity across the labelled data. The standard process is to adopt one definition and ask validators to manually make judgements on a set of data. Such method may lead to inconsistencies in the validated dataset, mainly due to possible bias of validators, unspoken disagreements among validators, and simple human errors - which are not to be excluded when considering a large dataset such as the ESCO classification. Similarly, a fully machine learning-driven labelling would have pitfalls, mainly related to limitations in the training dataset. As a solution, the methodology adopted to label ESCO digital skills and knowledge concepts involved a combination of the independent work of human validators and Machine Learning (ML) algorithms.
1. Starting with the manual labelling activity, each concept is analysed based on its preferred term, non-preferred terms, and description. The labelling consists in verifying whether an ESCO concept should or should not be considered as digital, following the DigComp definition. Validators assign a score equal to 1 to digital concepts, 0 to non-digital ones. The result of the manual labelling is a list of concepts classified as digital by the experts.
2. The ML classifier work-strand can be further divided into three parts: the creation of the training and validation datasets, the training of ML models, and the labelling activity by the best-performing ML model. The ML algorithm then assigns a probability score (between 0 and 1) to estimate the likelihood of a concept to be considered digital. The result of the algorithmic work is a list of concepts labelled as digital by the ML classifier.
3. Finally, the two lists of ESCO digital skills are compared, where disagreements between the two are further analysed and accepted or rejected by experts, following the same labelling rules set for the manual labelling.
The graph below plots the distribution of the skills and knowledge concepts labelled as digital as a result of the manual labelling (yellow), ML labelling (red), and final revision (blue) activities. These are distributed following the probability scores as assigned by the ML model. Differences between the red and yellow bars are an approximate indicator of the quality improvements gained by combining the two labelling methods. For example, the first set of bars from the right side shows that the number of blue (final) digital skills is higher than the number of skills labelled manually, but is lower than the number of skills detected by the model.
The combination of the two methods allowed to identify skills that represented more doubtful cases, and hence needed additional discussion and manual revision. It also helped filling gaps due to human mistakes or imperfections in the training dataset.
The Digital Skills and Knowledge Concepts: Labelling the ESCO classification report provides detailed information concerning the methodology, results, and provides an overview of the possible use-cases of the digital concepts by ESCO implementers. Digital skills and knowledge concepts can be downloaded here, while the documentation for the ESCO API is available here.
If you are an ESCO implementer and want to share your feedback, please get in touch via email at EMPL-ESCO-SECRETARIAT@ec.europa.eu or use our hashtag #ESCO_EU.
Learn more:
Digital Skills and Knowledge Concepts: Labelling the ESCO classification
The Technical Report provides more information on this topic.