net.nutch.parse.html
Class RobotsMetaProcessor
java.lang.Object
net.nutch.parse.html.RobotsMetaProcessor
- public class RobotsMetaProcessor
- extends Object
Class for parsing META Directives from DOM trees. This class
currently handles Robots META directives (all, none, nofollow,
noindex), finding BASE HREF tags, and HTTP-EQUIV no-cache
instructions.
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
RobotsMetaProcessor
public RobotsMetaProcessor()
getRobotsMetaDirectives
public static final void getRobotsMetaDirectives(RobotsMetaProcessor.RobotsMetaIndicator robotsMeta,
Node node,
URL currURL)
- Sets the indicators in
robotsMeta
to appropriate
values, based on any META tags found under the given
node
.
Copyright © 2004 The Nutch Organization.