基本信息
浏览量:127
职业迁徙
个人简介
I'm generally interested in computer facilitated human problem solving. My current research interests are in information retrieval, in using computational modeling and techniques to facilitate users in their search, information processing and decision making process. I drive my research with a deep understanding of every aspect of the search process, drawing insights from retrieval theory, as well as areas such as user behavior and natural language understanding. I verify the insights with data analyses, sometimes in a large scale, and carry them out as statistical inference or structured retrieval models.
My thesis research is the first to quantitatively study the vocabulary mismatch problem in retrieval, which leads to effective ways of predicting whether a query term is likely to mismatch relevant documents, and a number of principled interventions that significantly improve retrieval using the mismatch predictions.
Another topic of interest is structured retrieval enabled by advanced query languages and diverse document structure. We develop and apply the structured retrieval capabilities of the Indri search engine of the Lemur project to problems ranging from Ad-hoc retrieval, relevance feedback, pseudo relevance feedback, to applications such as question answering, intelligent tutoring, XML retrieval, and information extraction. I also work on legal search and bio/medical/chemical patent search, which are structure heavy in their own ways.
During my time at CMU, I also worked on other cool research projects. I identified areas of the Read The Web knowledge base that need improved coverage and did focused crawls of the Web to fix such areas. I worked on crawl seeding, PageRank prioritization and language identification for the Hadoop based ClueWeb09 billion page crawl. And almost every summer, I would do a TREC retrieval evaluation task.
研究兴趣
论文共 33 篇作者统计合作学者相似作者
按年份排序按引用量排序主题筛选期刊级别筛选合作者筛选合作机构筛选
时间
引用量
主题
期刊级别
合作者
合作机构
Zhongguo yi xue ke xue yuan xue bao. Acta Academiae Medicinae Sinicaeno. 5 (2019): 581-588
SIGIR '12: The 35th International ACM SIGIR conference on research and development in Information Retrieval
Portland
Oregon
USA
August, 2012pp.515-524, (2012)
ACM SIGIR Forumno. 2 (2012): 117-118
EMNLP '11: Proceedings of the Conference on Empirical Methods in Natural Language Processingpp.1291-1300, (2011)
引用9浏览0EI引用
9
0
mag(2011)
引用23浏览0引用
23
0
加载更多
作者统计
合作学者
合作机构
D-Core
- 合作者
- 学生
- 导师
数据免责声明
页面数据均来自互联网公开来源、合作出版商和通过AI技术自动分析结果,我们不对页面数据的有效性、准确性、正确性、可靠性、完整性和及时性做出任何承诺和保证。若有疑问,可以通过电子邮件方式联系我们:report@aminer.cn