
Synopses for Massive Data: Samples, Histograms, Wavelets, Sketches



Author(s):

Source:
    Journal: Foundations and Trends® in Databases
    ISSN Print: 1931-7883, ISSN Online: 1931-7891
    Publisher: Now Publishers
    Volume 4, Number 1-3
    Pages: 1-294 (294 pages)
DOI: 10.1561/1900000004

Abstract:

Methods for Approximate Query Processing (AQP) are essential for dealing with massive data. They are often the only means of providing interactive response times when exploring massive datasets, and are also needed to handle high-speed data streams. These methods proceed by computing a lossy, compact synopsis of the data, and then executing the query of interest against the synopsis rather than the entire dataset. We describe basic principles and recent developments in AQP. We focus on four key synopses: random samples, histograms, wavelets, and sketches. We consider issues such as accuracy, space and time efficiency, optimality, practicality, range of applicability, error bounds on query answers, and incremental maintenance. We also discuss the trade-offs among the different synopsis types.
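As an illustration of the synopsis idea described in the abstract, the following is a minimal sketch of one of the four synopsis types, a random sample, used to answer a count query approximately. It is not taken from the article: the function names `reservoir_sample` and `estimate_count`, the sample size, and the toy dataset are all assumptions made for this example. The sample is built in one pass with classical reservoir sampling, which also suits streams whose length is not known in advance.

```python
import random


def reservoir_sample(stream, k, rng):
    """Return (sample, n): a uniform random sample of up to k items and the
    number of items seen. Hypothetical helper, one pass over the stream."""
    sample = []
    n = 0
    for item in stream:
        n += 1
        if len(sample) < k:
            # Fill the reservoir with the first k items.
            sample.append(item)
        else:
            # Keep the new item with probability k / n by overwriting
            # a uniformly chosen slot.
            j = rng.randrange(n)
            if j < k:
                sample[j] = item
    return sample, n


def estimate_count(sample, n, predicate):
    """Estimate how many of the n original items satisfy the predicate by
    scaling up the fraction of matching items in the sample."""
    if not sample:
        return 0.0
    hits = sum(1 for x in sample if predicate(x))
    return n * hits / len(sample)


# Toy data (an assumption for this example): the integers 0..99999.
rng = random.Random(0)
data = range(100_000)

# Build the synopsis once, then run the query against it
# instead of against the full dataset.
sample, n = reservoir_sample(data, 1_000, rng)
approx = estimate_count(sample, n, lambda x: x % 10 == 0)
exact = sum(1 for x in data if x % 10 == 0)
print(f"approximate count: {approx:.0f}, exact count: {exact}")
```

The query touches only the 1,000 sampled items rather than all 100,000, at the cost of an answer that is correct only up to sampling error. A larger sample narrows that error and uses more space, which is one instance of the accuracy versus space trade-off the abstract mentions.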