6.830/6.814: Database Systems
Spring 2021
Papers:

Aurora is a "stream management system" for processing continuous queries over "streams" -- sequences of stock quotes, traces of network traffic, or runs of sensor data. Twitter Storm is a "stream processing engine"; unlike Aurora, it doesn't predefine a set of operators but instead gives users a more flexible way to write their own operators over streams.

As you read the papers, consider the following questions:

  1. What language constructs does Aurora introduce that are not in the relational model? How are those new language features specially tailored to work with data streams?
  2. Do you think the Aurora idea of writing queries via "boxes and arrows" is a good one? What are the advantages / disadvantages of a graphical interface?
  3. How are Aurora and Twitter Storm different? How are they alike?