HTML Parser Home Page

Package org.htmlparser

The basic API classes used by most users of the HTML parser. Parser is the most important class in this package.


Interface Summary
Node  
NodeFilter Implement this interface to select particular nodes.
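
To illustrate the idea behind a node filter, here is a minimal, self-contained sketch. It does not use the library's own types: SimpleNode, SimpleFilter, and select are hypothetical stand-ins, and the assumption that the filter is a single predicate method (called accept here) is made only for illustration.

```java
import java.util.ArrayList;
import java.util.List;

public class FilterDemo {
    /** Hypothetical stand-in for a parsed node; much simpler than the real Node interface. */
    interface SimpleNode {
        String text();
    }

    /** Hypothetical stand-in for a filter: return true to keep the node. */
    interface SimpleFilter {
        boolean accept(SimpleNode node);
    }

    /** Returns, in order, only the nodes the filter accepts. */
    static List<SimpleNode> select(List<SimpleNode> nodes, SimpleFilter filter) {
        List<SimpleNode> kept = new ArrayList<>();
        for (SimpleNode node : nodes) {
            if (filter.accept(node)) {
                kept.add(node);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        List<SimpleNode> nodes = new ArrayList<>();
        nodes.add(() -> "<p>");
        nodes.add(() -> "Hello");
        nodes.add(() -> "</p>");

        // Keep only chunks that look like tags.
        SimpleFilter tagsOnly = node -> node.text().startsWith("<");
        for (SimpleNode node : select(nodes, tagsOnly)) {
            System.out.println(node.text());
        }
    }
}
```

Keeping the selection rule behind a small interface means the traversal code stays the same while callers supply different rules.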
 

Class Summary
AbstractNode The base class for all node types (tags, string elements, and so on); it implements the Node interface.
Parser The main entry point for users: use it either to obtain an iterator over an HTML page or to parse the page directly and print the results.
Typical usage of the parser is as follows:
[1] Create a parser object, passing the URL and a feedback object to the parser.
[2] Enumerate the elements from the parser object.
Note that parsing happens on demand, as you enumerate.
PrototypicalNodeFactory A node factory based on the prototype pattern.
RemarkNode Represents a remark (an HTML comment) in the document.
StringNode Represents normal text in the HTML document.
StringNodeFactory  
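
The on-demand behaviour noted in the Parser entry can be sketched with a toy iterator. This is not the library's implementation or API: LazyNodes and its parsedCount field are hypothetical names, and the splitting rule (everything between '<' and the next '>' is one chunk) is a deliberate simplification that real HTML would break. It shows only that no work is done until the caller asks for the next element.

```java
import java.util.Iterator;
import java.util.NoSuchElementException;

public class LazyNodeDemo {
    /** Hypothetical lazy iterator: cuts markup into tag and text chunks, one per call to next(). */
    static final class LazyNodes implements Iterator<String> {
        private final String html;
        private int pos = 0;
        int parsedCount = 0; // number of chunks actually parsed so far

        LazyNodes(String html) {
            this.html = html;
        }

        @Override
        public boolean hasNext() {
            return pos < html.length();
        }

        @Override
        public String next() {
            if (!hasNext()) {
                throw new NoSuchElementException();
            }
            int end;
            if (html.charAt(pos) == '<') {
                // Tag-like chunk: runs through the next '>' (or to the end if unclosed).
                int close = html.indexOf('>', pos);
                end = (close < 0) ? html.length() : close + 1;
            } else {
                // Text chunk: runs up to the next '<' (or to the end).
                int open = html.indexOf('<', pos);
                end = (open < 0) ? html.length() : open;
            }
            String chunk = html.substring(pos, end);
            pos = end;
            parsedCount++;
            return chunk;
        }
    }

    public static void main(String[] args) {
        LazyNodes nodes = new LazyNodes("<p>Hello</p>");
        // Nothing has been parsed yet; each next() call parses exactly one chunk.
        while (nodes.hasNext()) {
            System.out.println(nodes.next());
        }
    }
}
```

Because the work happens inside next(), a caller that stops enumerating early never pays for the rest of the page.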
 



© 2004 Somik Raha
Mar 14, 2004

HTML Parser is an open source library released under LGPL.