- Daniel J. Abadi, Don Carney, Ugur Cetintemel, Mitch Cherniack, Christian Convey, Sangdon Lee, Michael Stonebraker, Nesime Tatbul, and Stan Zdonik. Aurora: a new model and architecture for data stream management.. VLDB Journal 12(2): 120-139, August 2003. [PDF]. Read Sections 1-6. Note that this IS NOT the Aurora paper in the Red Book.
- "A Storm is coming: more details and plans for release", Twitter. [HTML]
Aurora is a "stream management system" for processing continuous queries over "streams" -- sequences of stock quotes, traces of network traffic, or runs of sensor data. Twitter Storm is a "stream processing engine"; unlike Aurora it doesn't predefine the set of operators but provides a more flexible way for users to write their own operators over streams.
As you read the papers, consider the following questions:
- What language constructs does Aurora introduce that are not in the relational model? How are those new language feature specially tailored to work with data streams?
- Do you think the Aurora idea of writing queries via "boxes and arrows" is a good one? What are the advantages / disadvantages of a graphical interface?
- How are Aurora and Twitter Storm different? How are they alike?