Category: Book
Used starting at $48.21
New starting at $36.98
Buy it Lucene, LingPipe, and Gate are popular open source tools to build powerful search applications. Building Search Applications describes functions from Lucene that include indexing, searching, ranking, and spelling correction to build search engines. Use LingPipe and Gate to find the meaning of text to make search applications more useful.
With this book you will learn to:
- Extract tokens from text using custom tokenizers and analyzers from Lucene, LingPipe, and Gate.
- Construct a search engine index with an optional backend database to manage large document collections.
- Explore the wide range of Lucene queries to search an index, understand the ranking algorithm for a query, and suggest spelling corrections.
- Find the names of people, places, and other entities in text using LingPipe and Gate.
- Categorize documents by topic using classifiers and build groups of self-organized documents using clustering algorithms from LingPipe.
- Create a Web crawler to scan the Web, Intranet, or desktop using Nutch.
- Track the sentiment of articles published on the Web with LingPipe
- Detect plagiarism of documents using a registered document collection.
| Publisher | Mustru Publishing |
| ISBN | 0615204252 |
| Format | Paperback |
| Author | Manu Konchady |
| EAN | 9780615204253 |
| Label | Mustru Publishing |
| Edition | First |
| Dewey Decimal Number | 004 |
| Studio | Mustru Publishing |
| Number Of Pages | 448 |
| Title | Building Search Applications: Lucene, LingPipe, and Gate |
| Publication Date | 2008-06 |
| Manufacturer | Mustru Publishing |
Review by Gulli Antonino, 2009-08-09
Building Search Applications: Lucene, LingPipe, and Gate is a pretty good introduction to Information Retrieval with a lot of pragmatic examples. Based on Lucene, Gate and LingPipe. I recomend to add it to your library if you like Lucene and Nutch or if you need to maintain or create a medium scale search application.
Review by Songkran Thongsawang, 2009-05-14
This is a good book to create search application. However, it is not easy enough for newbies. You need to have some experience and familiar enough with Lucene and related packages.
Review by Fayyazuddin A. Syed, 2009-04-19
Unfortunately, there are not too many books written on the subject of Information Retrieval as it relates to Java programming, and thankfully, Mr.Konchady's contribution is an excellent resource. It provides a nice balance between the discussion of the theory of Information Retrieval, and providing concrete examples in Java, using Lucene, LingPipe, and Gate (API's for Information Retrieval used in Java). I personally had only heard of Lucene before coming across this book, and was very thankful to learn of the other two (LingPipe, and Gate) afterwards. The book shows the user how to use the above API's together when building an application, which is a great learning opportunity for the reader, because most tutorials available for Lucene, LingPipe, or Gate that you'll find online show you how to use that particular API only, instead of showing you how to use it in conjunction with others to truly harness the power of Information Retrieval with Natural Language Processing, together. The other very nice thing about this book is that the author also introduces the reader to other tools (such as Nutch, WordNet, etc.) that allows the user to provide advanced functionality, without "re-inventing the wheel". This book is a must read for anyone who is serious about learning to develop applications involving Information Retrieval.
Lucene in Action, Second Edition: Covers Apache Lucene 3.0
Used starting at $24.75
New starting at $28.95