Who really is a data scientist? Analysis of requirements for data centred roles job market and their future

Academic entrepreneurship in theory and practice (2022)

Abstract
Data analysis and processing skills are currently required in a multitude of job offers and cover a wide variety of applications. Although these skills are shaped mostly by the development of new technologies, programming languages and libraries, they are a necessity in the digital economy and in entrepreneurship. Reports by large consulting companies such as Deloitte predict a sharp increase in demand for data science and AI roles, not only in the IT sector but across the entire economy. Three questions arise. First, what skillset do the innovators who use artificial intelligence and advanced analytics have? Second, what skills and requirements truly make a data scientist, and do they differ from those of data analysts, data engineers, or software developers and programmers? Third, what is the demand for these specialists, and are university programs educating future specialists in this field, or are the skills so new that they can be taught only through business practice? To answer these questions, this article applies machine learning techniques from Natural Language Processing (NLP) to extract and characterize the key skills that job offers require for data-centred roles. The research was carried out on a preprocessed sample of 72 thousand job offers from the IT sector posted in 2019. A linear SVM classifier was applied to extract the most distinguishing technical skills and to assess how feasible automated classification of job postings is. It achieved precision and recall of about 85% for the data analyst, data scientist and data engineer roles, and about 90% for Python developer roles.
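The classification step described in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: the toy postings, the whitespace tokenizer, the binary bag-of-words features, the one-vs-rest scheme, the plain subgradient-descent training loop, and the hyperparameters (`lr`, `lam`, `epochs`) are all assumptions made here for illustration. The sketch only shows how a linear SVM can both classify postings and expose, through its largest weights, the tokens that most distinguish a role.

```python
from collections import defaultdict

# Hypothetical toy postings (label, text); NOT the paper's 72-thousand-offer sample.
POSTINGS = [
    ("data scientist", "machine learning statistics python modelling"),
    ("data scientist", "deep learning statistics experiments python"),
    ("data engineer", "spark airflow pipelines sql python"),
    ("data engineer", "kafka spark etl pipelines sql"),
    ("data analyst", "excel tableau reporting sql dashboards"),
    ("data analyst", "excel reporting dashboards sql statistics"),
]


def tokenize(text):
    """Lower-case whitespace tokenizer; returns the set of distinct tokens."""
    return set(text.lower().split())


def train_one_vs_rest(postings, lr=0.1, lam=0.01, epochs=300):
    """Fit one linear SVM per label with full-batch subgradient descent
    on the L2-regularized hinge loss (an assumed, simplified training scheme)."""
    docs = [(label, tokenize(text)) for label, text in postings]
    labels = sorted({label for label, _ in docs})
    n = len(docs)
    model = {}
    for target in labels:
        w, b = defaultdict(float), 0.0
        for _ in range(epochs):
            grad_w, grad_b = defaultdict(float), 0.0
            for label, tokens in docs:
                y = 1.0 if label == target else -1.0
                score = sum(w[t] for t in tokens) + b
                if y * score < 1.0:  # margin violated, so the hinge loss is active
                    for t in tokens:
                        grad_w[t] += y / n
                    grad_b += y / n
            # Step towards lower hinge loss, with weight decay from the L2 penalty.
            for t in set(w) | set(grad_w):
                w[t] += lr * (grad_w[t] - lam * w[t])
            b += lr * grad_b
        model[target] = (dict(w), b)
    return model


def predict(model, text):
    """Return the label whose linear score is highest for the given text."""
    tokens = tokenize(text)

    def score(label):
        w, b = model[label]
        return sum(w.get(t, 0.0) for t in tokens) + b

    return max(model, key=score)


def top_tokens(model, label, k=3):
    """Tokens with the largest positive weight for a label, i.e. the most
    distinguishing tokens under this model."""
    w, _ = model[label]
    return sorted(w, key=w.get, reverse=True)[:k]


def precision_recall(y_true, y_pred, label):
    """Per-label precision and recall from parallel lists of labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == label and p == label for t, p in pairs)
    fp = sum(t != label and p == label for t, p in pairs)
    fn = sum(t == label and p != label for t, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall
```

A typical use is `model = train_one_vs_rest(POSTINGS)`, followed by `predict(model, "kafka spark etl pipelines")` to label an unseen posting and `top_tokens(model, "data engineer", 2)` to list the tokens that most separate that role from the others. On a real corpus the predictions would be scored with `precision_recall` on held-out postings rather than on the training data, and an established library implementation of a linear SVM with TF-IDF features would be the more usual choice.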