Information Retrieval: Implementing and Evaluating Search Engines        

Authors & Publisher

Stefan Büttcher, Google Inc.
Charles L. A. Clarke, Univ. of Waterloo
Gordon V. Cormack, Univ. of Waterloo
 
MIT Press, 2010
(publisher's website)

Getting the Book


About the Book

Information retrieval is the foundation for modern search engines. This textbook offers an introduction to the core topics underlying modern search technologies, including algorithms, data structures, indexing, retrieval, and evaluation. The emphasis is on implementation and experimentation; each chapter includes exercises and suggestions for student projects. Wumpus, a multi-user open-source information retrieval system developed by one of the authors and available online, provides model implementations and a basis for student work.
The modular structure of the book allows instructors to use it in a variety of graduate-level courses, including courses taught from a database systems implementation perspective, traditional information retrieval courses with a focus on IR theory, and courses covering the basics of Web retrieval. Additionally, professionals in computer science, computer engineering, and software engineering will find Information Retrieval a valuable reference.
After an introduction to the basics of information retrieval, the text covers three major topic areas — indexing, retrieval, and evaluation — in self-contained parts. The final part of the book draws on and extends the general material in the earlier parts, treating specific application areas, including parallel search engines, link analysis, crawling, and information retrieval over collections of XML documents. End-of-chapter references point to further reading; end-of-chapter exercises range from pencil and paper problems to substantial programming projects.

Table of Contents (Incl. Sample Chapters)

—   I  Foundations   —
—   II  Indexing   —
—   III  Retrieval and Ranking   —
—   IV  Evaluation   —
—   V  Applications and Extensions   —
—   VI  Appendix   —

Additional Resources