Natural Language Processing and Big Data - An Ontology-Based Approach for Cross-Lingual Information Retrieval

Social Computing (2013)

Abstract
Extracting relevant information in a multilingual context from massive amounts of unstructured, structured and semi-structured data is a challenging task. Various theories have been developed and applied to ease access to multicultural and multilingual resources. This paper describes a methodology for developing an ontology-based Cross-Language Information Retrieval (CLIR) application and shows how Natural Language (NL) queries in any language can be translated by means of a knowledge-driven approach that maps natural language to formal language semi-automatically, thereby simplifying and improving human-computer interaction and communication. The outlined research activities are based on Lexicon-Grammar (LG), a method devised for natural language formalization, automatic textual analysis and parsing. Thanks to its main characteristics, LG is independent of factors that are critical for other approaches, namely interaction type (voice or keyboard-based), length of sentences and propositions, type of vocabulary used and restrictions due to users' idiolects. The feasibility of our knowledge-based methodological framework, which allows both data and metadata to be mapped, will be tested for CLIR by implementing an early domain-specific prototype system.
Keywords
domain-specific early prototype system, challenging task, big data, semi-structured data, multilingual context, cross-lingual information retrieval, formal language, ontology-based approach, natural language, natural language formalization, automatic textual analysis, multilingual resource, natural language processing, human-computer interaction, grammars, human computer interaction, formal languages, meta data