A Pipelined Framework for Online Cleaning of Sensor Data Streams

Shawn R. Jeffery, Gustavo Alonso, Michael J. Franklin, Wei Hong and Jennifer Widom

EECS Department
University of California, Berkeley
Technical Report No. UCB/CSD-05-1413
September 2005

http://www2.eecs.berkeley.edu/Pubs/TechRpts/2005/CSD-05-1413.pdf

Data captured from the physical world through receptor devices such as wireless sensor networks and RFID readers tend to be unreliable and noisy. The data cleaning process for such data is not easily handled by standard data warehouse-oriented techniques, which do not take into account the strong temporal and spatial components of receptor data. Here we present Extensible receptor Stream Processing (ESP), an extensible framework for cleaning the data streams produced by physical receptor devices. ESP is a declarative query processing tool with a pipelined design that is easy to setup and configure for each receptor deployment. We validate the ESP platform through three real-world deployments using ESP to clean receptor data streams.


BibTeX citation:

@techreport{Jeffery:CSD-05-1413,
    Author = {Jeffery, Shawn R. and Alonso, Gustavo and Franklin, Michael J. and Hong, Wei and Widom, Jennifer},
    Title = {A Pipelined Framework for Online Cleaning of Sensor Data Streams},
    Institution = {EECS Department, University of California, Berkeley},
    Year = {2005},
    Month = {Sep},
    URL = {http://www2.eecs.berkeley.edu/Pubs/TechRpts/2005/6474.html},
    Number = {UCB/CSD-05-1413},
    Abstract = {Data captured from the physical world through receptor devices such as wireless sensor networks and RFID readers tend to be unreliable and noisy. The data cleaning process for such data is not easily handled by standard data warehouse-oriented techniques, which do not take into account the strong temporal and spatial components of receptor data. Here we present Extensible receptor Stream Processing (ESP), an extensible framework for cleaning the data streams produced by physical receptor devices. ESP is a declarative query processing tool with a pipelined design that is easy to setup and configure for each receptor deployment. We validate the ESP platform through three real-world deployments using ESP to clean receptor data streams.}
}

EndNote citation:

%0 Report
%A Jeffery, Shawn R.
%A Alonso, Gustavo
%A Franklin, Michael J.
%A Hong, Wei
%A Widom, Jennifer
%T A Pipelined Framework for Online Cleaning of Sensor Data Streams
%I EECS Department, University of California, Berkeley
%D 2005
%@ UCB/CSD-05-1413
%U http://www2.eecs.berkeley.edu/Pubs/TechRpts/2005/6474.html
%F Jeffery:CSD-05-1413