Extended pre-processing pipeline for text classification: On the role of meta-feature representations, sparsification and selective sampling

Information Processing & Management (2020)

Abstract
• We propose and orchestrate new pre-processing steps for text classification pipelines.
• We explore meta-feature representations, sparsification and selective sampling.
• We provide thorough evaluations of the trade-offs between costs and effectiveness.
• Our final representations are more effective than word embeddings (up to 46%).
• Our processes induce large reductions in computational costs and memory consumption.
Keywords
Text classification pipelines, Pre-processing, Meta-features, Selective sampling, Sparsification, Experimental evaluation