Joint Learning for Non-standard Chinese Building Address Standardization*

Xue-feng Xi, Lei Wang, Encen Zou,Cheng Zeng,Baochuan Fu

2018 IEEE International Smart Cities Conference (ISC2)(2018)

引用 2|浏览25
暂无评分
摘要
Since there is no uniform specification for building address name in China, the same building address maybe has many different representations in Chinese natural language. The goal of the non-standard Chinese building address standardization task is to uniformly convert the non-standard building addresses from different social institutions to the standard building address defined by the public security organ, so that the spatial location information corresponding to the standard building address can be obtained. This plays an important role in the analysis and processing of big data in smart cities. Due to the large number of non-standard building addresses and the semantic ambiguity of addresses expressed in Chinese natural language, traditional methods based on string matching are difficult to meet the task requirements. To address these above problems, we propose an innovative joint learning approach based on hash map principle and word frequency theory for standardizing Chinese non-standard building addresses. Experimental results on the dataset constructed via crowdsourced technology show that approach has outstanding accuracy and adaptability to data from different sources.
更多
查看译文
关键词
Standards,Buildings,Dictionaries,Security,Task analysis,Urban areas
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要