- Jeffrey Dean and Sanjay Ghemawat. MapReduce: Simplified Data Processing on Large Clusters. In OSDI, 2004. [PDF]
This paper describes a programming model for large-scale data processing (called Map-Reduce) and a specific implementation of it on top of Google's clusters.
As you read the paper, consider the following questions:
- What kinds of failures can a search engine (or the Map-Reduce system) tolerate? What consistency guarantees are provided in the face of failures?