AVATAR-Automated Feature Wrangling for Machine Learning

Advances in Intelligent Data Analysis XIX, IDA 2021 (2021)

Abstract
A large share of the time invested in data science goes to manual data preparation, and much of that goes to transforming wrongly formatted columns into useful features. We present AVATAR, an algorithm that automatically learns programs to perform this type of feature wrangling. Instead of relying on users to guide the wrangling process, AVATAR uses the predictive performance of machine learning models directly to measure its progress during wrangling. Using datasets from Kaggle, we show that AVATAR improves raw data for prediction, and we compare it against human data scientists.
Keywords
Data wrangling, Program synthesis, Machine learning
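
The abstract's central idea, scoring candidate column transformations by how well a model predicts from the result, can be illustrated with a small sketch. This is not the authors' algorithm: the transformation names (`parse_number`, `strip_unit`, `string_length`), the single-threshold scorer, and the greedy per-column selection are all assumptions made for illustration, and training accuracy stands in for the held-out evaluation a real system would likely use.

```python
import re
from typing import Callable, Dict, List, Optional

# Hypothetical candidate transformations (not from the paper). Each maps a raw
# string column to a numeric column, or returns None when it does not apply.
def parse_number(col: List[str]) -> Optional[List[float]]:
    try:
        return [float(v) for v in col]
    except ValueError:
        return None

def strip_unit(col: List[str]) -> Optional[List[float]]:
    out = []
    for v in col:
        m = re.match(r"\s*(-?\d+(?:\.\d+)?)", v)
        if m is None:
            return None
        out.append(float(m.group(1)))
    return out

def string_length(col: List[str]) -> Optional[List[float]]:
    return [float(len(v)) for v in col]

TRANSFORMS: Dict[str, Callable[[List[str]], Optional[List[float]]]] = {
    "parse_number": parse_number,
    "strip_unit": strip_unit,
    "string_length": string_length,
}

def stump_accuracy(feature: List[float], labels: List[int]) -> float:
    """Training accuracy of the best single-threshold classifier on one feature.

    Assumption: a stand-in for the predictive-performance measure; it only
    handles binary 0/1 labels and does no held-out evaluation.
    """
    n = len(labels)
    best = 0.0
    for t in sorted(set(feature)):
        pred = [1 if x > t else 0 for x in feature]
        acc = sum(p == y for p, y in zip(pred, labels)) / n
        # Also consider the flipped rule (predict 1 when x <= t).
        best = max(best, acc, 1 - acc)
    return best

def wrangle(raw: Dict[str, List[str]], labels: List[int]) -> Dict[str, str]:
    """For each raw column, pick the applicable transformation with the best score."""
    chosen = {}
    for name, col in raw.items():
        scores = {}
        for tname, fn in TRANSFORMS.items():
            feature = fn(col)
            if feature is not None:
                scores[tname] = stump_accuracy(feature, labels)
        chosen[name] = max(scores, key=scores.get)
    return chosen

raw_data = {"weight": ["70kg", "82kg", "55kg", "91kg"]}
target = [0, 1, 0, 1]
selection = wrangle(raw_data, target)
print(selection)
```

In this toy input, `parse_number` does not apply because of the `kg` suffix, and `string_length` yields a constant feature, so the score alone steers the choice without any user guidance.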