net.nutch.indexer
Class DeleteDuplicates.IndexedDoc.ByHashScore

java.lang.Object
  extended bynet.nutch.io.WritableComparator
      extended bynet.nutch.indexer.DeleteDuplicates.IndexedDoc.ByHashScore
Enclosing class:
DeleteDuplicates.IndexedDoc

public static class DeleteDuplicates.IndexedDoc.ByHashScore
extends WritableComparator

Order equal hashes by decreasing score and increasing urlLen.


Constructor Summary
DeleteDuplicates.IndexedDoc.ByHashScore()
           
 
Method Summary
 int compare(byte[] b1, int s1, int l1, byte[] b2, int s2, int l2)
          Optimization hook.
 
Methods inherited from class net.nutch.io.WritableComparator
compare, compareBytes, getKeyClass, newKey, readFloat, readInt, readLong, readUnsignedShort
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

DeleteDuplicates.IndexedDoc.ByHashScore

public DeleteDuplicates.IndexedDoc.ByHashScore()
Method Detail

compare

public int compare(byte[] b1,
                   int s1,
                   int l1,
                   byte[] b2,
                   int s2,
                   int l2)
Description copied from class: WritableComparator
Optimization hook. Override this to make SequenceFile.Sorter's scream.

The default implementation reads the data into two WritableComparables (using Writable.readFields(DataInput), then calls WritableComparator.compare(WritableComparable,WritableComparable).

Overrides:
compare in class WritableComparator


Copyright © 2004 The Nutch Organization.