HTML Parser Home Page

Uses of Class
org.htmlparser.util.ParserException

Packages that use ParserException
org.htmlparser The basic API classes which will be used by most users when working with the html parser (the Parser class is the most important one in this). 
org.htmlparser.beans The beans package contains Java Beans that can integrate within IDEs. 
org.htmlparser.lexer The lexer package is the base level I/O subsystem. 
org.htmlparser.lexer.nodes The nodes package are the lexemes returned by the base level I/O subsystem. 
org.htmlparser.lexerapplications.thumbelina Extract the images behind thumbnail images. 
org.htmlparser.nodeDecorators The nodeDecorators package contains classes that use the Decorator pattern. 
org.htmlparser.parserapplications Developers and users alike should try out the applications in this package. 
org.htmlparser.scanners The scanners package contains classes responsible for the tertiary identification of tags. 
org.htmlparser.tags The tags package contains tag types that are created mostly by the scanners. 
org.htmlparser.util Code which can be reused by many classes, is located in this package. 
 

Uses of ParserException in org.htmlparser
 

Methods in org.htmlparser that throw ParserException
 Node PrototypicalNodeFactory.createTagNode(Page page, int start, int end, Vector attributes)
          Create a new tag node.
 void Parser.setConnection(URLConnection connection)
          Set the connection for this parser.
 void Parser.setURL(String url)
          Set the URL for this parser.
 void Parser.setEncoding(String encoding)
          Set the encoding for the page this parser is reading from.
 NodeIterator Parser.elements()
          Returns an iterator (enumeration) to the html nodes.
 void Parser.parse(NodeFilter filter)
          Parse the given resource, using the filter provided.
static URLConnection Parser.openConnection(URL url, ParserFeedback feedback)
          Opens a connection using the given url.
static URLConnection Parser.openConnection(String string, ParserFeedback feedback)
          Opens a connection based on a given string.
 void Parser.visitAllNodesWith(NodeVisitor visitor)
           
 void Parser.setInputHTML(String inputHTML)
          Initializes the parser with the given input HTML String.
 NodeList Parser.extractAllNodesThatMatch(NodeFilter filter)
          Extract all nodes matching the given filter.
 Node[] Parser.extractAllNodesThatAre(Class nodeType)
          Convenience method to extract all nodes of a given class type.
 void Node.doSemanticAction()
          Perform the meaning of this tag.
 void AbstractNode.doSemanticAction()
          Perform the meaning of this tag.
 

Constructors in org.htmlparser that throw ParserException
Parser(URLConnection connection, ParserFeedback fb)
          Constructor for custom HTTP access.
Parser(String resourceLocn, ParserFeedback feedback)
          Creates a Parser object with the location of the resource (URL or file) You would typically create a DefaultHTMLParserFeedback object and pass it in.
Parser(String resourceLocn)
          Creates a Parser object with the location of the resource (URL or file).
Parser(URLConnection connection)
          Constructor for non-standard access.
 

Uses of ParserException in org.htmlparser.beans
 

Methods in org.htmlparser.beans that throw ParserException
protected  String StringBean.extractStrings()
          Extract the text from a page.
protected  URL[] LinkBean.extractLinks(String url)
           
 

Uses of ParserException in org.htmlparser.lexer
 

Methods in org.htmlparser.lexer that throw ParserException
 Node Lexer.nextNode()
          Get the next node from the source.
 Node Lexer.nextNode(boolean quotesmart)
          Get the next node from the source.
protected  void Lexer.scanJIS(Cursor cursor)
          Advance the cursor through a JIS escape sequence.
protected  Node Lexer.parseString(Cursor cursor, boolean quotesmart)
          Parse a string node.
protected  Node Lexer.makeString(Cursor cursor)
          Create a string node based on the current cursor and the one provided.
protected  Node Lexer.parseTag(Cursor cursor)
          Parse a tag.
protected  Node Lexer.makeTag(Cursor cursor, Vector attributes)
          Create a tag node based on the current cursor and the one provided.
protected  Node Lexer.parseRemark(Cursor cursor, boolean quotesmart)
          Parse a comment.
protected  Node Lexer.makeRemark(Cursor cursor)
          Create a remark node based on the current cursor and the one provided.
protected  Node Lexer.parseJsp(Cursor cursor)
          Parse a java server page node.
static void Lexer.main(String[] args)
          Mainline for command line operation
 void Page.setConnection(URLConnection connection)
          Set the URLConnection to be used by this page.
 char Page.getCharacter(Cursor cursor)
          Read the character at the cursor position.
 void Page.setEncoding(String character_set)
          Begins reading from the source with the given character set.
 

Constructors in org.htmlparser.lexer that throw ParserException
Lexer(URLConnection connection)
          Creates a new instance of a Lexer.
Page(URLConnection connection)
          Construct a page reading from a URL connection.
 

Uses of ParserException in org.htmlparser.lexer.nodes
 

Methods in org.htmlparser.lexer.nodes that throw ParserException
 Node NodeFactory.createStringNode(Page page, int start, int end)
          Create a new string node.
 Node NodeFactory.createRemarkNode(Page page, int start, int end)
          Create a new remark node.
 Node NodeFactory.createTagNode(Page page, int start, int end, Vector attributes)
          Create a new tag node.
 

Uses of ParserException in org.htmlparser.lexerapplications.thumbelina
 

Methods in org.htmlparser.lexerapplications.thumbelina that throw ParserException
protected  URL[][] Thumbelina.extractImageLinks(Lexer lexer, URL docbase)
          Get the links of an element of a document.
 

Uses of ParserException in org.htmlparser.nodeDecorators
 

Methods in org.htmlparser.nodeDecorators that throw ParserException
 void AbstractNodeDecorator.doSemanticAction()
           
 

Uses of ParserException in org.htmlparser.parserapplications
 

Methods in org.htmlparser.parserapplications that throw ParserException
 String StringExtractor.extractStrings(boolean links)
          Extract the text from a page.
protected  boolean SiteCapturer.isHtml(String link)
          Returns true if the link contains text/html content.
protected  void SiteCapturer.process(NodeFilter filter)
          Process a single page.
 

Uses of ParserException in org.htmlparser.scanners
 

Methods in org.htmlparser.scanners that throw ParserException
 Tag StyleScanner.scan(Tag tag, Lexer lexer, NodeList stack)
          Scan for style definitions.
 Tag ScriptScanner.scan(Tag tag, Lexer lexer, NodeList stack)
          Scan for script.
static String ScriptDecoder.Decode(Page page, Cursor cursor)
          Decode script encoded by the Microsoft obfuscator.
 Tag CompositeTagScanner.scan(Tag tag, Lexer lexer, NodeList stack)
          Collect the children.
protected  void CompositeTagScanner.finishTag(CompositeTag tag, Lexer lexer)
          Finish off a tag.
 Tag Scanner.scan(Tag tag, Lexer lexer, NodeList stack)
          Scan the tag.
 Tag TagScanner.scan(Tag tag, Lexer lexer, NodeList stack)
          Scan the tag.
 

Uses of ParserException in org.htmlparser.tags
 

Methods in org.htmlparser.tags that throw ParserException
 void MetaTag.doSemanticAction()
          Check for a charset directive, and if found, set the charset for the page.
 void BaseHrefTag.doSemanticAction()
          Perform the meaning of this tag.
 

Uses of ParserException in org.htmlparser.util
 

Subclasses of ParserException in org.htmlparser.util
 class EncodingChangeException
          The encoding is changed invalidating already scanned characters.
 

Methods in org.htmlparser.util with parameters of type ParserException
static void FeedbackManager.error(String message, ParserException e)
           
 void DefaultParserFeedback.error(String message, ParserException exception)
          Print an error message.
 void ParserFeedback.error(String message, ParserException e)
           
 

Methods in org.htmlparser.util that throw ParserException
 Node PeekingIteratorImpl.peek()
          Deprecated.  
 boolean PeekingIteratorImpl.hasMoreNodes()
          Deprecated. Check if more nodes are available.
 Node PeekingIteratorImpl.nextNode()
          Deprecated. Get the next node.
 Node PeekingIterator.peek()
          Deprecated. Fetch a node without consuming it.
 boolean IteratorImpl.hasMoreNodes()
          Check if more nodes are available.
 Node IteratorImpl.nextNode()
          Get the next node.
 boolean NodeIterator.hasMoreNodes()
          Check if more nodes are available.
 Node NodeIterator.nextNode()
          Get the next node.
 


© 2004 Somik Raha
Mar 14, 2004

HTML Parser is an open source library released under LGPL.
SourceForge.net