Nutch Wiki: TaskList
-- LukeBaker - 23 Jan 2005

This page lists development tasks and suggestions for Nutch.

Fetching

HTTP Improvements

These two could come from the Jakarta HTTPClient work started by AndyHedges. I've implemented NTLM authentication, and it won't be hard to add Basic authentication as well; I just have to figure out the best way to store credentials in Nutch's XML config files.
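One possible way to store the credentials, sketched here only as an illustration: keep them as ordinary name/value properties and build the Basic `Authorization` header from them. The property names `http.auth.basic.username` and `http.auth.basic.password`, the `<nutch-conf>` root element, and the class `CredentialConfigSketch` are assumptions made for this example, not existing Nutch settings or code. The sketch uses only the JDK, not HTTPClient.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.Base64;
import java.util.HashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical helper: not part of Nutch or HTTPClient.
class CredentialConfigSketch {

    // Collects every <property><name>..</name><value>..</value></property> pair into a map.
    static Map<String, String> readProperties(String xml) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
            Map<String, String> props = new HashMap<>();
            NodeList nodes = doc.getElementsByTagName("property");
            for (int i = 0; i < nodes.getLength(); i++) {
                Element p = (Element) nodes.item(i);
                String name = p.getElementsByTagName("name").item(0).getTextContent().trim();
                String value = p.getElementsByTagName("value").item(0).getTextContent();
                props.put(name, value);
            }
            return props;
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse config XML", e);
        }
    }

    // Builds the value of an HTTP Basic "Authorization" header: "Basic " + base64(user:password).
    static String basicAuthHeader(String user, String password) {
        String pair = user + ":" + password;
        return "Basic " + Base64.getEncoder().encodeToString(pair.getBytes(StandardCharsets.UTF_8));
    }

    public static void main(String[] args) {
        // Assumed layout; the property names below are invented for this example.
        String xml =
              "<nutch-conf>"
            + "<property><name>http.auth.basic.username</name><value>crawler</value></property>"
            + "<property><name>http.auth.basic.password</name><value>secret</value></property>"
            + "</nutch-conf>";
        Map<String, String> props = readProperties(xml);
        System.out.println("Authorization: " + basicAuthHeader(
                props.get("http.auth.basic.username"),
                props.get("http.auth.basic.password")));
    }
}
```

A plain property like this keeps the password in clear text in the config file, so file permissions would matter. Per-host or per-realm credentials would need a richer layout than a single username/password pair.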

I've modified Hedges' code to use a single HTTPClient object with multiple connection objects, so cookies should work fine. I'll look into whether the client can also check Last-Modified, but wouldn't that need changes to the fetcher as well?

-- KenMeltsner - 04 Feb 2005
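On the Last-Modified question: in plain HTTP terms the client side is a conditional GET. The request carries an `If-Modified-Since` header, and a `304 Not Modified` response means the stored copy is still current. That does suggest the fetcher would have to remember a timestamp per URL and pass it in. The sketch below shows only that logic with the JDK; `ConditionalFetchSketch` and its methods are hypothetical names, not Nutch or HTTPClient code.

```java
import java.time.Instant;
import java.time.ZoneOffset;
import java.time.format.DateTimeFormatter;
import java.util.Locale;

// Hypothetical helper: not part of Nutch or HTTPClient.
class ConditionalFetchSketch {

    // HTTP date format with a two-digit day, e.g. "Thu, 01 Jan 1970 00:00:00 GMT".
    private static final DateTimeFormatter HTTP_DATE =
            DateTimeFormatter.ofPattern("EEE, dd MMM yyyy HH:mm:ss 'GMT'", Locale.US)
                    .withZone(ZoneOffset.UTC);

    // Value for the If-Modified-Since request header, built from the
    // timestamp the fetcher stored for this URL on the previous fetch.
    static String ifModifiedSince(Instant storedTimestamp) {
        return HTTP_DATE.format(storedTimestamp);
    }

    // 304 means the server has nothing newer, so the stored content can be reused.
    static boolean contentChanged(int statusCode) {
        return statusCode != 304;
    }

    public static void main(String[] args) {
        Instant previousFetch = Instant.ofEpochSecond(1107475200L); // example timestamp only
        System.out.println("If-Modified-Since: " + ifModifiedSince(previousFetch));
        System.out.println("Re-parse on 200? " + contentChanged(200));
        System.out.println("Re-parse on 304? " + contentChanged(304));
    }
}
```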

Searching

Result Serving


Revision r1.3 - 11 Feb 2005 - 12:03 GMT - AndyHedges
Copyright © 1999-2003 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.