|
Class Summary |
| CrawlTool |
|
| DistributedAnalysisTool |
DistributedAnalysisTool performs link-analysis by reading
exclusively from a IWebDBReader, and writing to
an IWebDBWriter. |
| DumpSegment |
Dump FetcherOutput, ParseData and ParseText for every record in one segment. |
| FetchListTool |
This class takes an IWebDBReader, computes a relevant subset,
and then emits the subset. |
| FetchListTool.SortableScore |
SortableScore is just a WritableComparable Float! |
| LinkAnalysisTool |
LinkAnalysisTool performs link-analysis by using the
DistributedAnalysisTool. |
| SegmentMergeTool |
This class cleans up accumulated segments data, and merges them
into a single segment, with no duplicates in it. |
| UpdateDatabaseTool |
This class takes the output of the fetcher and updates the page and link
DBs accordingly. |
| WebDBAdminTool |
The WebDBAdminTool is for Nutch administrators
who need special access to the webdb. |