Building Search Applications: Lucene, LingPipe, and Gate

Front Cover
Mustru Publishing, 2008 - Computers - 430 pages
0 Reviews
Lucene, LingPipe, and Gate are popular open source tools to build powerful search applications. Building Search Applications describes functions from Lucene that include indexing, searching, ranking, and spelling correction to build search engines. With this book you will learn to: Extract tokens from text using custom tokenizers and analyzers from Lucene, LingPipe, and Gate. Construct a search engine index with an optional backend database to manage large document collections. Explore the wide range of Lucene queries to search an index, understand the ranking algorithm for a query, and suggest spelling corrections. Find the names of people, places, and other entities in text using LingPipe and Gate. Categorize documents by topic using classifiers and build groups of self-organized documents using clustering algorithms from LingPipe. Create a Web crawler to scan the Web, Intranet, or desktop using Nutch. Track the sentiment of articles published on the Web with LingPipe.

What people are saying - Write a review

We haven't found any reviews in the usual places.

About the author (2008)

Manu Konchady (Oakton, VA) is a consultant working on open source text mining software. Previously, he worked at Mitre Corp. where he designed and developed software to mine the Internet. He received his Ph.D. in Information Technology from George Mason University and his articles have appeared in Dr. DobbAs Journal and Linux Journal.

Bibliographic information