Using Language Models on Low-end Hardware

Fabian Ziegner,Janos Borst,Andreas Niekler,Martin Potthast

CoRR（2023）

引用 0|浏览19

暂无评分

摘要

This paper evaluates the viability of using fixed language models for training text classification networks on low-end hardware. We combine language models with a CNN architecture and put together a comprehensive benchmark with 8 datasets covering single-label and multi-label classification of topic, sentiment, and genre. Our observations are distilled into a list of trade-offs, concluding that there are scenarios, where not fine-tuning a language model yields competitive effectiveness at faster training, requiring only a quarter of the memory compared to fine-tuning.

查看译文

关键词

language models,hardware,low-end

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要