java.lang.Object
  net.nutch.tools.FetchListTool
This class takes an IWebDBReader, computes a relevant subset, and then emits the subset.
| Nested Class Summary | |
static class |
FetchListTool.SortableScore
SortableScore is just a WritableComparable Float! |
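The description above suggests a float score that can be both ordered and serialized. The following is a minimal sketch of that idea, not the Nutch class itself: `ScoreSketch`, its method names, the ascending sort order, and the `copyViaBytes` helper are all assumptions made for illustration, and it uses only `java.io` rather than Nutch's own `WritableComparable` interface.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

// Hypothetical stand-in for a "writable, comparable float".
// This is NOT net.nutch.tools.FetchListTool.SortableScore.
class ScoreSketch implements Comparable<ScoreSketch> {
    private float score;

    ScoreSketch() { this(0.0f); }

    ScoreSketch(float score) { this.score = score; }

    float getScore() { return score; }

    // Serialize the score as a 4-byte IEEE 754 float.
    void write(DataOutput out) throws IOException {
        out.writeFloat(score);
    }

    // Overwrite this object's score with the next float in the stream.
    void readFields(DataInput in) throws IOException {
        score = in.readFloat();
    }

    // Ascending numeric order (an assumption; the real class may differ).
    @Override
    public int compareTo(ScoreSketch other) {
        return Float.compare(score, other.score);
    }

    @Override
    public boolean equals(Object o) {
        return o instanceof ScoreSketch
                && Float.compare(score, ((ScoreSketch) o).score) == 0;
    }

    @Override
    public int hashCode() {
        return Float.hashCode(score);
    }

    // Illustrative helper: round-trip a score through an in-memory buffer.
    static ScoreSketch copyViaBytes(ScoreSketch original) {
        try {
            ByteArrayOutputStream buffer = new ByteArrayOutputStream();
            original.write(new DataOutputStream(buffer));
            ScoreSketch copy = new ScoreSketch();
            copy.readFields(new DataInputStream(
                    new ByteArrayInputStream(buffer.toByteArray())));
            return copy;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

Keeping `equals` and `hashCode` consistent with `compareTo` lets such a value be used safely as a key in sorted collections.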
| Field Summary | |
static Logger |
LOG
|
| Constructor Summary | |
FetchListTool(File dbDir,
boolean refetchOnly,
boolean anchorOptimize,
float cutoffScore,
int seed)
FetchListTool takes a page db, and emits a RECNO-based subset of it. |
|
| Method Summary | |
void |
emitFetchList(File segmentDir,
long topN,
long curTime)
Spit out the fetchlist to a BDB at the indicated filename. |
void |
emitMultipleLists(File dir,
int numLists,
long topN,
long curTime)
Spit out several fetchlists, so that we can fetch across several machines. |
static void |
main(String[] argv)
Generate a fetchlist from the pagedb and linkdb |
| Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
public static final Logger LOG
| Constructor Detail |
public FetchListTool(File dbDir,
boolean refetchOnly,
boolean anchorOptimize,
float cutoffScore,
int seed)
throws IOException,
FileNotFoundException
| Method Detail |
public void emitMultipleLists(File dir,
int numLists,
long topN,
long curTime)
throws IOException
Throws: IOException
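The page does not say how entries are divided among the `numLists` fetchlists. As one possible illustration of splitting work across several machines, the sketch below deals entries out round-robin so that list sizes differ by at most one. The class `FetchListSplitSketch` and its `split` method are hypothetical and are not part of Nutch; the actual tool may partition differently.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not how FetchListTool is known to work.
class FetchListSplitSketch {

    // Distribute entries round-robin across numLists output lists.
    static <T> List<List<T>> split(List<T> entries, int numLists) {
        if (numLists <= 0) {
            throw new IllegalArgumentException("numLists must be positive");
        }
        List<List<T>> lists = new ArrayList<>();
        for (int i = 0; i < numLists; i++) {
            lists.add(new ArrayList<>());
        }
        for (int i = 0; i < entries.size(); i++) {
            // Entry i goes to list (i mod numLists).
            lists.get(i % numLists).add(entries.get(i));
        }
        return lists;
    }
}
```

Each resulting list could then be written to its own segment directory and fetched by a separate machine.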
public void emitFetchList(File segmentDir,
long topN,
long curTime)
throws IOException
Throws: IOException
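The parameter names `cutoffScore` (constructor) and `topN` (this method) hint at a score threshold followed by a cap on the number of entries, but the page does not document their exact semantics. The sketch below shows one plausible reading under that assumption: drop scores below the cutoff, then keep the `topN` highest. `TopNSketch` and `select` are hypothetical names, and the real selection logic may differ.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Hypothetical illustration of "cutoff, then top N"; not the Nutch implementation.
class TopNSketch {

    // Keep scores >= cutoffScore, then return at most topN of them, highest first.
    static List<Float> select(List<Float> scores, float cutoffScore, long topN) {
        List<Float> kept = new ArrayList<>();
        for (Float s : scores) {
            if (s >= cutoffScore) {
                kept.add(s);
            }
        }
        kept.sort(Comparator.reverseOrder());
        // Clamp the limit so a negative or oversized topN cannot break subList.
        int limit = (int) Math.max(0L, Math.min(topN, (long) kept.size()));
        return new ArrayList<>(kept.subList(0, limit));
    }
}
```

Sorting the whole list is the simplest approach; for a large page database a bounded priority queue would avoid holding every candidate in memory.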
public static void main(String[] argv)
throws IOException,
FileNotFoundException
Throws: IOException, FileNotFoundException