Rephrasing Natural Text Data with Different Languages and Quality Levels for Large Language Model Pre-TrainingMichael Pieler,Marco Bellagente,Hannah Teufel,Duy Phung,Nathan Cooper,Jonathan Tow, Paulo Rocha, Reshinth Adithyan,Zaid Alyafeai,Nikhil Pinnaparaju,Maksym Zhuravinskyi,Carlos RiquelmeCoRR(2024)引用 0|浏览2AI 理解论文溯源树样例生成溯源树,研究论文发展脉络Chat Paper正在生成论文摘要