Electrical Engineering and Computer Sciences

COLLEGE OF ENGINEERING

UC Berkeley

Computing Query Previews in the Flamenco System

Kevin Chen

EECS Department
University of California, Berkeley
Technical Report No. UCB/CSD-04-1303
January 2004

http://www.eecs.berkeley.edu/Pubs/TechRpts/2004/CSD-04-1303.pdf

The Flamenco system is a web search interface that allows users to browse large data sets using predefined hierarchical faceted metadata. It is built on top of a conventional relational database and currently scales to collections of several tens of thousands of items. In the current implementation, the system translates each user query into multiple SQL group-by commands in order to obtain query preview information for possible future queries. These group-by commands take up a significant fraction of the query processing time. In this note, we describe an optimization that allows us to speed up the group-by computations dramatically. Our ideas have some similarity to the work of Beyer and Ramakrishnan on computing iceberg data cubes.
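As a rough illustration of what "translating a user query into multiple SQL group-by commands" can look like, the sketch below issues one GROUP BY per facet to count how many matching items fall under each facet value. It is not taken from the report: the `item_facets` table, its columns, the sample rows, and the `build_demo_db` and `query_previews` helpers are all hypothetical, the facets are flat rather than hierarchical, and the optimization described in the report is not shown.

```python
import sqlite3


def build_demo_db():
    """Create a tiny in-memory collection (hypothetical schema and data)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE item_facets (item_id INTEGER, facet TEXT, value TEXT);
        INSERT INTO item_facets VALUES
            (1, 'medium', 'painting'),  (1, 'location', 'europe'),
            (2, 'medium', 'painting'),  (2, 'location', 'asia'),
            (3, 'medium', 'sculpture'), (3, 'location', 'europe');
        """
    )
    return conn


def query_previews(conn, facet, value, preview_facets):
    """Return per-facet item counts for the items matching facet = value.

    One GROUP BY statement is issued per preview facet, which is the
    straightforward (unoptimized) approach assumed here.
    """
    previews = {}
    for preview_facet in preview_facets:
        rows = conn.execute(
            """
            SELECT f.value, COUNT(DISTINCT f.item_id)
            FROM item_facets AS f
            JOIN item_facets AS sel ON sel.item_id = f.item_id
            WHERE sel.facet = ? AND sel.value = ? AND f.facet = ?
            GROUP BY f.value
            ORDER BY f.value
            """,
            (facet, value, preview_facet),
        ).fetchall()
        previews[preview_facet] = dict(rows)
    return previews


if __name__ == "__main__":
    conn = build_demo_db()
    # Preview counts a user would see after selecting medium = painting.
    print(query_previews(conn, "medium", "painting", ["location"]))
```

Because every preview facet costs a separate grouped scan of the matching items, the number of statements grows with the number of facets, which is consistent with the abstract's observation that the group-by commands dominate query processing time.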


BibTeX citation:

@techreport{Chen:CSD-04-1303,
    Author = {Chen, Kevin},
    Title = {Computing Query Previews in the Flamenco System},
    Institution = {EECS Department, University of California, Berkeley},
    Year = {2004},
    Month = {Jan},
    URL = {http://www.eecs.berkeley.edu/Pubs/TechRpts/2004/5369.html},
    Number = {UCB/CSD-04-1303},
    Abstract = {The Flamenco system is a web search interface that allows users to browse through large data sets using predefined hierarchical faceted metadata. It is built on top of a conventional relational database and currently scales to collections of several tens of thousands of items. In the current implementation, the system translates each user query into multiple SQL group-by commands in order to obtain query preview information for possible future queries. These group-by's take up a significant fraction of the query processing time. In this note, we describe an optimization that allows us to speed up the group-by computations dramatically. Our ideas have some similarity to the work of Beyer and Ramakrishnan on computing iceberg data cubes.}
}

EndNote citation:

%0 Report
%A Chen, Kevin
%T Computing Query Previews in the Flamenco System
%I EECS Department, University of California, Berkeley
%D 2004
%@ UCB/CSD-04-1303
%U http://www.eecs.berkeley.edu/Pubs/TechRpts/2004/5369.html
%F Chen:CSD-04-1303