Uses of Class
net.nutch.protocol.Content

Packages that use Content
net.nutch.analysis.lang Text document language identifier. 
net.nutch.parse   
net.nutch.parse.html An HTML document parsing plugin. 
net.nutch.parse.msword A Word document parsing plugin. 
net.nutch.parse.pdf A pdf parsing plugin. 
net.nutch.parse.text A plain text parsing plugin. 
net.nutch.protocol   
net.nutch.protocol.file Protocol plugin which supports retrieving local file resources. 
net.nutch.protocol.ftp Protocol plugin which supports retrieving documents via the ftp protocol. 
net.nutch.protocol.http Protocol plugin which supports retrieving documents via the http protocol. 
org.creativecommons.nutch Sample plugins that parse and index Creative Commons medadata. 
 

Uses of Content in net.nutch.analysis.lang
 

Methods in net.nutch.analysis.lang with parameters of type Content
 Parse HTMLLanguageParser.filter(Content content, Parse parse, DocumentFragment doc)
          Adds metadata or otherwise modifies a parse of an HTML document, given the DOM tree of a page.
 

Uses of Content in net.nutch.parse
 

Methods in net.nutch.parse with parameters of type Content
 Parse Parser.getParse(Content c)
          Creates the parse for some content.
static Parse HtmlParseFilters.filter(Content content, Parse parse, DocumentFragment doc)
          Run all defined filters.
 Parse HtmlParseFilter.filter(Content content, Parse parse, DocumentFragment doc)
          Adds metadata or otherwise modifies a parse of HTML content, given the DOM tree of a page.
 

Uses of Content in net.nutch.parse.html
 

Methods in net.nutch.parse.html with parameters of type Content
 Parse HtmlParser.getParse(Content content)
           
 

Uses of Content in net.nutch.parse.msword
 

Methods in net.nutch.parse.msword with parameters of type Content
 Parse MSWordParser.getParse(Content content)
           
 

Uses of Content in net.nutch.parse.pdf
 

Methods in net.nutch.parse.pdf with parameters of type Content
 Parse PdfParser.getParse(Content content)
           
 

Uses of Content in net.nutch.parse.text
 

Methods in net.nutch.parse.text with parameters of type Content
 Parse TextParser.getParse(Content content)
           
 

Uses of Content in net.nutch.protocol
 

Methods in net.nutch.protocol that return Content
 Content Protocol.getContent(String url)
          Returns the Content for a url.
static Content Content.read(DataInput in)
           
 

Uses of Content in net.nutch.protocol.file
 

Methods in net.nutch.protocol.file that return Content
 Content FileResponse.toContent()
           
 Content File.getContent(String urlString)
           
 

Uses of Content in net.nutch.protocol.ftp
 

Methods in net.nutch.protocol.ftp that return Content
 Content FtpResponse.toContent()
           
 Content Ftp.getContent(String urlString)
           
 

Uses of Content in net.nutch.protocol.http
 

Methods in net.nutch.protocol.http that return Content
 Content HttpResponse.toContent()
           
 Content Http.getContent(String urlString)
           
 

Uses of Content in org.creativecommons.nutch
 

Methods in org.creativecommons.nutch with parameters of type Content
 Parse CCParseFilter.filter(Content content, Parse parse, DocumentFragment doc)
          Adds metadata or otherwise modifies a parse of an HTML document, given the DOM tree of a page.
 



Copyright © 2004 The Nutch Organization.