Class Summary |
CrawlTool |
|
DistributedAnalysisTool |
DistributedAnalysisTool performs link analysis by reading
exclusively from an IWebDBReader and writing to
an IWebDBWriter. |
DumpSegment |
Dumps FetcherOutput, ParseData, and ParseText for every record in one segment. |
FetchListTool |
This class takes an IWebDBReader, computes a relevant subset,
and then emits the subset. |
FetchListTool.SortableScore |
SortableScore is simply a Float wrapped as a WritableComparable. |
LinkAnalysisTool |
LinkAnalysisTool performs link-analysis by using the
DistributedAnalysisTool. |
SegmentMergeTool |
This class cleans up accumulated segment data and merges it
into a single segment that contains no duplicates. |
UpdateDatabaseTool |
This class takes the output of the fetcher and updates the page and link
DBs accordingly. |
WebDBAdminTool |
The WebDBAdminTool is for Nutch administrators
who need special access to the webdb. |
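
The FetchListTool.SortableScore entry above describes a float that can be both serialized and compared. A minimal sketch of that idea follows. It is not the Nutch class: the name SortableScoreSketch, its methods, and the roundTrip helper are hypothetical, and it uses only java.io so that it runs without the Nutch jars. The write/readFields pair is an assumed shape, chosen to resemble Writable-style interfaces.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

// Hypothetical stand-in for a "writable, comparable float"; not the Nutch class.
final class SortableScoreSketch implements Comparable<SortableScoreSketch> {
    private float score;

    SortableScoreSketch() { }

    SortableScoreSketch(float score) { this.score = score; }

    float get() { return score; }

    // Serialize the score as a 4-byte IEEE 754 float.
    void write(DataOutput out) throws IOException { out.writeFloat(score); }

    // Overwrite this instance with a score read from the stream.
    void readFields(DataInput in) throws IOException { score = in.readFloat(); }

    // Ascending order by score; Float.compare gives a total order, NaN included.
    @Override
    public int compareTo(SortableScoreSketch other) {
        return Float.compare(score, other.score);
    }

    // Kept consistent with compareTo so sorted collections behave predictably.
    @Override
    public boolean equals(Object o) {
        return o instanceof SortableScoreSketch
            && Float.compare(score, ((SortableScoreSketch) o).score) == 0;
    }

    @Override
    public int hashCode() { return Float.hashCode(score); }

    // Example-only helper: serialize to bytes, then read back into a new instance.
    static SortableScoreSketch roundTrip(SortableScoreSketch in) {
        try {
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            in.write(new DataOutputStream(bytes));
            SortableScoreSketch copy = new SortableScoreSketch();
            copy.readFields(
                new DataInputStream(new ByteArrayInputStream(bytes.toByteArray())));
            return copy;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

Because the sketch implements Comparable, instances can be sorted directly with Arrays.sort or Collections.sort. Whether the real class sorts ascending or descending is not stated in this summary, so the ascending order here is an assumption.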