The TEXTURE Benchmark: Measuring Performance of Text Queries on a Relational DBMS.

VLDB '05: Proceedings of the 31st International Conference on Very Large Data Bases (2005)

Abstract
We introduce a benchmark called TEXTURE (TEXT Under RElations) to measure the relative strengths and weaknesses of combining text processing with a relational workload in an RDBMS. While the well-known TREC benchmarks focus on quality, we focus on efficiency. TEXTURE is a micro-benchmark for query workloads, and considers two central text support issues that previous benchmarks did not: (1) queries with relevance ranking, rather than those that just compute all answers, and (2) a richer mix of text and relational processing, reflecting the trend toward seamless integration. In developing this benchmark, we had to address the problem of generating large text collections that reflected the (performance) characteristics of a given "seed" collection; this is essential for a controlled study of specific data characteristics and their effects on performance. In addition to presenting the benchmark, with performance numbers for three commercial DBMSs, we present and validate a synthetic generator for populating text fields.
Keywords
central text support issue, large text collection, text field, text processing, performance number, previous benchmarks, relational processing, relational workload, well-known TREC benchmarks, commercial DBMSs, TEXTURE benchmark, relational DBMS, text query