Evaluating The Lexico-Grammatical Differences In The Writing Of Native And Non-Native Speakers Of English In Peer-Reviewed Medical Journals In The Field Of Pediatric Oncology: Creation Of The Genuine Index Scoring System

PLOS ONE (2017)

Abstract
Introduction
The predominance of English in scientific research has created hurdles for "non-native speakers" of English. Here we present a novel application of native language identification (NLI) for the assessment of medical-scientific writing. For this purpose, we created a novel classification system whereby scoring would be based solely on text features found to be distinctive among native English speakers (NS) within a given context. We dubbed this the "Genuine Index" (GI).

Methodology
This methodology was validated using a small set of journals in the field of pediatric oncology. Our dataset consisted of 5,907 abstracts, representing work from 77 countries. A support vector machine (SVM) was used to generate our model and for scoring.

Results
Accuracy, precision, and recall of the classification model were 93.3%, 93.7%, and 99.4%, respectively. Class-specific F-scores were 96.5% for NS and 39.8% for our benchmark class, Japan. Overall kappa was calculated to be 37.2%. We found significant differences between countries with respect to the GI score. A significant correlation was found between GI scores and two validated objective measures of writing proficiency and readability. Two sets of key terms and phrases differentiating NS and non-native writing were identified.

Conclusions
Our GI model was able to detect, with a high degree of reliability, subtle differences between the terms and phrasing used by native and non-native speakers in peer-reviewed journals in the field of pediatric oncology. In addition, L1 language transfer was found to be very likely to survive revision, especially in non-Western countries such as Japan. These findings show that even when the language used is technically correct, there may still be some phrasing or usage that impacts quality.
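
The Results above pair a high accuracy (93.3%) with a much lower kappa (37.2%). One common reason such figures diverge is class imbalance, although the abstract does not state the cause here. The sketch below shows how accuracy, precision, recall, F-score, and Cohen's kappa are computed from a two-class confusion matrix. The function name `binary_metrics` and the counts are hypothetical, invented purely for illustration; they are not the study's data or code.

```python
def binary_metrics(tp, fp, fn, tn):
    """Compute standard metrics from a 2x2 confusion matrix.

    tp, fp, fn, tn are counts of true positives, false positives,
    false negatives, and true negatives for the "positive" class.
    """
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)

    # Cohen's kappa: observed agreement corrected for the agreement
    # expected by chance, given the marginal class frequencies.
    expected_pos = ((tp + fp) / total) * ((tp + fn) / total)
    expected_neg = ((fn + tn) / total) * ((fp + tn) / total)
    expected = expected_pos + expected_neg
    kappa = (accuracy - expected) / (1 - expected)

    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "kappa": kappa,
    }


# Hypothetical, imbalanced counts (NOT taken from the paper):
# 95 positive-class items and 25 negative-class items.
example = binary_metrics(tp=90, fp=10, fn=5, tn=15)

for name, value in example.items():
    print(f"{name}: {value:.3f}")
```

With these invented counts the majority class dominates accuracy, while kappa is pulled down because much of the observed agreement would also be expected by chance.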