Modern Information Retrieval
- Ricardo Baeza-Yates
Depto. de Ciencias de la Computación
Universidad de Chile
- Berthier Ribeiro-Neto
- Depto. de Ciencia da Computação
Universidad Federal de Minas Gerais
Belo Horizonte, Brasil
With the collaboration of
Information retrieval (IR)
has changed considerably in the last years with the
expansion of the Web (World Wide Web) and the advent of modern and
inexpensive graphical user interfaces and mass storage devices.
As a result, traditional IR textbooks have become quite out-of-date
which has led to the introduction of new IR books recently.
Nevertheless, we believe that there is still great need of a book that
approaches the field in a
rigorous and complete way from a computer-science perspective (in opposition
to a user-centered perspective).
This book is an effort to partially fulfill this gap and
useful for a first course
on information retrieval as well as for a graduate course on the topic.
These WWW pages are not a digital version of the book, nor the complete
contents of it. Here you will find the preface, table of contents, glossary
and two chapters available for reading on-line. The printed
version can be ordered
directly from Addison-Wesley-Longman.
© Addison Wesley Longman Publishing
You can order this book on-line with
a secure form, or search other titles from A/W about
The book is composed of two portions which complement and balance each
other. The core portion includes 9 chapters authored or coauthored by the
designers of the book. The second portion, which is fully integrated
with the first, is formed by 6 state-of-the-art
chapters written by
leading researchers in their fields.
A same notation and glossary
are employed in all the chapters.
Thus, despite the fact that
several people contributed to the text,
this book is really much more a textbook than an edited
collection of chapters written by
separate authors. Further, contrary to a collection of chapters, the
contents and organization of this book have been carefully
designed by the main authors
to present a cohesive view of all the
important aspects of modern information
Full text of Chapters 1
(Introduction) and 10
(User Interfaces and Visualization) are
available on-line, as well as a table of contents, exercises and resources
for other chapters.
- Retrieval Evaluation
- Query Languages (with G. Navarro)
- Query Operations
- Text and Multimedia Languages and Properties
- Text Operations (with N. Ziviani)
- Indexing and Searching (with G. Navarro)
- Parallel and Distributed IR (by E. Brown)
User Interfaces and Visualization (by M. Hearst)
- Multimedia IR: Models and Languages (by E. Bertino, B. Catania and
- Multimedia IR: Indexing and Searching (by C. Faloutsos)
- Searching the Web
- Libraries and Bibliographical Systems
(by E. Rasmussen)
- Digital Libraries
(by E. Fox and O. Sornil)
- A practical collection of documents on cystic fibrosis, queries, and
relevant answers is available in text ASCII and XML formats here.
Perfection does not exist!
Page vii: should read "David Harrison".
Page 136: The correct meaning of MIDF is maximum inverse document frequency.
Thanks to Shoujie He, USA.
In chapter 5, Rocchio was spelled consistently wrong. However, the correct
spelling is in the appropriate references.
Page 195, Table 8.1, "right" and "left" should be switched. Thanks to Joanna
Page 210: j-next[j]+1 should be j-next[j]-1 (this typo was noted by
Shoujie He, USA).
Page 212: Note that the examples in the text and in figure 8.16 are not
the same. Thanks to Shoujie He, USA.
Page 213: All 0's and 1's in Figure 8.17 should be reversed (we originally
had also the Shift And description of the algorithm). Thanks to Satoru
Page 373: Change "That is, 9.3 Kb is the cut point between both
distributions" by "The cut point between both distributions is then larger
than 9.3 Kb (such that the whole probability adds to 1)".
Page 381: In the last formula, the first term of PR(a) should
be modified from q to q/P where P is the total number of pages.
Page 474, Reference 352: should read "H.S. Heaps".
© Addison Wesley Longman Publishing Co. Inc.
© This Web site was designed by Ricardo Baeza-Yates and Carlos Castillo
and is still under construction. We thank the comments of Berthier
Ribeiro-Neto and Marti Hearst.