Use Cover yellow

Understanding Search Engines
Second Edition
by Michael W. Berry and Murray Browne

Latest Update: September 7, 2004

The following is a chapter by chapter summary of what changes have been made in the 2nd edition of Understanding Search Engines. The first edition was published in 1999. The biggest difference in the new edition is changes in Chapter 7 and Chapter 8. In the previous edition, Chapter 7 was User Interface and Chapter 8 was Course Project.  In this new edition, Chapter 7 is a chapter on link structure-based algorithms such as PageRank and HITS. PageRank is the algorithm that is used by Google (which hardly existed when our first book was written). We moved and rewrote User Interface to Chapter 8 and took out the Course Project chapter completely.

Cover

The cover has basically the same design, except the background gray will be changed to yellow to distinguish the new edition.
The artwork on the terminal screen has been altered.  The dedication has changed slightly.

Preface



The Preface has been rewritten to reflect changes from the first edition. Another book, Baerza-Yates and Ribeiro-Neto's Modern Information Retrieval, has been added to a list of  IR literature.  The new chapter on link-structure algorithms is mentioned in a couple of sentences and the Acknowledgments reflects the people who helped with the second edition.
 Chapter 1: Introduction
Section 1.7 Search by Link Structure is a new and Section 1.8 User Interface was rewritten.
Chapter 2:
Document File Preparation
Section 2.1 Document Purification and Analysis was updated to reflect changes in how search engines handle  HTML documents.  Section 2.1.2 Validation was updated slightly to include HTML 4.0 syntax. Section 2.2 Manual Indexing was updated and rewritten to include information companies that were still involved with manual indexing. The two sidebars on the Major Commercial Search Engines were updated and rewritten. The URL for the Porter Stemmer was updated.


Chapter 3: Vector Space Models
The figure for number of web pages was updated to 4 billion pages. A couple minor errors in the matrix were also corrected.
Chapter 4: Matrix Decompositions
Minor text changes and footnotes added.
Chapter 5: Query Management
The text in Section 5.22 Natural Language Queries was slightly changed.
Chapter 6: Ranking and Relevance Feedback
No changes were made.
Chapter 7: Searching by Link Structure
A completely new chapter covering HITS and PageRank algorithms.
Chapter 8: User Interface
A complete rewrite of the User Interface chapter of the 1st edition.  Included is a look at Shneiderman and Plaisants new book on user interface. Also there was  a restructuring of the material.
Chapter 9: Further Reading
With the exception of a few paragraphs this chapter was rewritten also. Special mention was made of Langville and Meyer's upcoming book on link-structure algorithms. Table 9.1 was checked and is current. Table 9.2 was almost entirely redone.
Bibliography and Index
Several sources were dropped and approximately 25 new ones were added. In the previous edition, if a work was "in press" it was updated. There are more web sites than in the previous edition--a refection of the times.
Index was streamlined and updated.