AILS-NTUA at SemEval-2024 Task 9: Cracking Brain Teasers: Transformer Models for Lateral Thinking Puzzles
arxiv(2024)
摘要
In this paper, we outline our submission for the SemEval-2024 Task 9
competition: 'BRAINTEASER: A Novel Task Defying Common Sense'. We engage in
both sub-tasks: Sub-task A-Sentence Puzzle and Sub-task B-Word Puzzle. We
evaluate a plethora of pre-trained transformer-based language models of
different sizes through fine-tuning. Subsequently, we undertake an analysis of
their scores and responses to aid future researchers in understanding and
utilizing these models effectively. Our top-performing approaches secured
competitive positions on the competition leaderboard across both sub-tasks. In
the evaluation phase, our best submission attained an average accuracy score of
81.7
outperforming the best neural baseline (ChatGPT) by more than 20
respectively.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要