Data cleansing of the fire & rescue text corpus. The case study of correction of the misspellings and segmentation into sentences

FedCSIS (2014)

Abstract
The article presents a case study of applying data cleansing methods and segmentation procedures to correct and enhance the structure of a fire service domain corpus. We present our approach to correcting misspellings and the results obtained, as well as the method used to segment the corpus into sentences.
Keywords
fire service, text corpus, sentence segmentation procedure, misspelling correction, fires, data cleansing method, emergency services, data cleansing, segmentation, text analysis, misspellings, fire & rescue text corpus, dictionaries, databases, data analysis, semantics