[Announcements | Schedule and Readings | Assignments and Quizzes | Syllabus/Policies ]
Reading Questions, Lecture 9 (10/6)
Papers: Data Mining Overview,
Data warehouses are an important class of database that is widely used to
store and query information about business processes (e.g., sales,
customers, etc.) Data warehouse workloads are "OLAP"-style aggregate analysis
Readings for class consist of a thorough overview as well as
one of the most
influential papers that presents a clever way to simultaneously
compute many aggregates
for warehouse-style OLAP workloads.
Questions to consider:
- What types of schemas are typically used in data warehouses?
Why? What advantages do those schemas have?
- For what types of
data is a bitmap index an efficient index? When are bitmap indices a
- How does the data cube operator improve upon the GROUP BY expression? What limitations
of group by does it address?
- What mechanisms does the data cube use to improve the performance of grouped aggregates
over multiple separate GROUP BY expressions?
Samuel Madden (madden at csail dot mit dot edu)
Last modified: Wed Oct 6 09:42:48 EDT 2004