Package net.nutch.indexer

Maintain Lucene full-text indexes.

See:
          Description

Interface Summary
IndexingFilter Extension point for indexing.
 

Class Summary
DeleteDuplicates Deletes duplicate documents in a set of Lucene indexes.
DeleteDuplicates.IndexedDoc The key used in sorting for duplicates.
DeleteDuplicates.IndexedDoc.ByHashDoc Order equal hashes by decreasing index and document.
DeleteDuplicates.IndexedDoc.ByHashScore Order equal hashes by decreasing score and increasing urlLen.
HighFreqTerms Lists the most frequent terms in an index.
IndexingFilters Creates and caches IndexingFilter implementing plugins.
IndexMerger Creates an index for the output corresponding to a single fetcher run.
IndexOptimizer  
IndexSegment Creates an index for the output corresponding to a single fetcher run.
NutchSimilarity Similarity implementatation used by Nutch indexing and search.
 

Exception Summary
IndexingException  
 

Package net.nutch.indexer Description

Maintain Lucene full-text indexes.



Copyright © 2004 The Nutch Organization.