|
HTML Parser Home Page | ||||||||||
| PREV NEXT | FRAMES NO FRAMES | ||||||||||
| Packages that use ParserException | |
| org.htmlparser | The basic API classes which will be used by most users when working with the html parser (the Parser class is the most important one in this). |
| org.htmlparser.beans | The beans package contains Java Beans that can integrate within IDEs. |
| org.htmlparser.lexer | The lexer package is the base level I/O subsystem. |
| org.htmlparser.lexer.nodes | The nodes package are the lexemes returned by the base level I/O subsystem. |
| org.htmlparser.lexerapplications.thumbelina | Extract the images behind thumbnail images. |
| org.htmlparser.nodeDecorators | The nodeDecorators package contains classes that use the Decorator pattern. |
| org.htmlparser.parserapplications | Developers and users alike should try out the applications in this package. |
| org.htmlparser.scanners | The scanners package contains classes responsible for the tertiary identification of tags. |
| org.htmlparser.tags | The tags package contains tag types that are created mostly by the scanners. |
| org.htmlparser.util | Code which can be reused by many classes, is located in this package. |
| Uses of ParserException in org.htmlparser |
| Methods in org.htmlparser that throw ParserException | |
Node |
PrototypicalNodeFactory.createTagNode(Page page,
int start,
int end,
Vector attributes)
Create a new tag node. |
void |
Parser.setConnection(URLConnection connection)
Set the connection for this parser. |
void |
Parser.setURL(String url)
Set the URL for this parser. |
void |
Parser.setEncoding(String encoding)
Set the encoding for the page this parser is reading from. |
NodeIterator |
Parser.elements()
Returns an iterator (enumeration) to the html nodes. |
void |
Parser.parse(NodeFilter filter)
Parse the given resource, using the filter provided. |
static URLConnection |
Parser.openConnection(URL url,
ParserFeedback feedback)
Opens a connection using the given url. |
static URLConnection |
Parser.openConnection(String string,
ParserFeedback feedback)
Opens a connection based on a given string. |
void |
Parser.visitAllNodesWith(NodeVisitor visitor)
|
void |
Parser.setInputHTML(String inputHTML)
Initializes the parser with the given input HTML String. |
NodeList |
Parser.extractAllNodesThatMatch(NodeFilter filter)
Extract all nodes matching the given filter. |
Node[] |
Parser.extractAllNodesThatAre(Class nodeType)
Convenience method to extract all nodes of a given class type. |
void |
Node.doSemanticAction()
Perform the meaning of this tag. |
void |
AbstractNode.doSemanticAction()
Perform the meaning of this tag. |
| Constructors in org.htmlparser that throw ParserException | |
Parser(URLConnection connection,
ParserFeedback fb)
Constructor for custom HTTP access. |
|
Parser(String resourceLocn,
ParserFeedback feedback)
Creates a Parser object with the location of the resource (URL or file) You would typically create a DefaultHTMLParserFeedback object and pass it in. |
|
Parser(String resourceLocn)
Creates a Parser object with the location of the resource (URL or file). |
|
Parser(URLConnection connection)
Constructor for non-standard access. |
|
| Uses of ParserException in org.htmlparser.beans |
| Methods in org.htmlparser.beans that throw ParserException | |
protected String |
StringBean.extractStrings()
Extract the text from a page. |
protected URL[] |
LinkBean.extractLinks(String url)
|
| Uses of ParserException in org.htmlparser.lexer |
| Methods in org.htmlparser.lexer that throw ParserException | |
Node |
Lexer.nextNode()
Get the next node from the source. |
Node |
Lexer.nextNode(boolean quotesmart)
Get the next node from the source. |
protected void |
Lexer.scanJIS(Cursor cursor)
Advance the cursor through a JIS escape sequence. |
protected Node |
Lexer.parseString(Cursor cursor,
boolean quotesmart)
Parse a string node. |
protected Node |
Lexer.makeString(Cursor cursor)
Create a string node based on the current cursor and the one provided. |
protected Node |
Lexer.parseTag(Cursor cursor)
Parse a tag. |
protected Node |
Lexer.makeTag(Cursor cursor,
Vector attributes)
Create a tag node based on the current cursor and the one provided. |
protected Node |
Lexer.parseRemark(Cursor cursor,
boolean quotesmart)
Parse a comment. |
protected Node |
Lexer.makeRemark(Cursor cursor)
Create a remark node based on the current cursor and the one provided. |
protected Node |
Lexer.parseJsp(Cursor cursor)
Parse a java server page node. |
static void |
Lexer.main(String[] args)
Mainline for command line operation |
void |
Page.setConnection(URLConnection connection)
Set the URLConnection to be used by this page. |
char |
Page.getCharacter(Cursor cursor)
Read the character at the cursor position. |
void |
Page.setEncoding(String character_set)
Begins reading from the source with the given character set. |
| Constructors in org.htmlparser.lexer that throw ParserException | |
Lexer(URLConnection connection)
Creates a new instance of a Lexer. |
|
Page(URLConnection connection)
Construct a page reading from a URL connection. |
|
| Uses of ParserException in org.htmlparser.lexer.nodes |
| Methods in org.htmlparser.lexer.nodes that throw ParserException | |
Node |
NodeFactory.createStringNode(Page page,
int start,
int end)
Create a new string node. |
Node |
NodeFactory.createRemarkNode(Page page,
int start,
int end)
Create a new remark node. |
Node |
NodeFactory.createTagNode(Page page,
int start,
int end,
Vector attributes)
Create a new tag node. |
| Uses of ParserException in org.htmlparser.lexerapplications.thumbelina |
| Methods in org.htmlparser.lexerapplications.thumbelina that throw ParserException | |
protected URL[][] |
Thumbelina.extractImageLinks(Lexer lexer,
URL docbase)
Get the links of an element of a document. |
| Uses of ParserException in org.htmlparser.nodeDecorators |
| Methods in org.htmlparser.nodeDecorators that throw ParserException | |
void |
AbstractNodeDecorator.doSemanticAction()
|
| Uses of ParserException in org.htmlparser.parserapplications |
| Methods in org.htmlparser.parserapplications that throw ParserException | |
String |
StringExtractor.extractStrings(boolean links)
Extract the text from a page. |
protected boolean |
SiteCapturer.isHtml(String link)
Returns true if the link contains text/html content. |
protected void |
SiteCapturer.process(NodeFilter filter)
Process a single page. |
| Uses of ParserException in org.htmlparser.scanners |
| Methods in org.htmlparser.scanners that throw ParserException | |
Tag |
StyleScanner.scan(Tag tag,
Lexer lexer,
NodeList stack)
Scan for style definitions. |
Tag |
ScriptScanner.scan(Tag tag,
Lexer lexer,
NodeList stack)
Scan for script. |
static String |
ScriptDecoder.Decode(Page page,
Cursor cursor)
Decode script encoded by the Microsoft obfuscator. |
Tag |
CompositeTagScanner.scan(Tag tag,
Lexer lexer,
NodeList stack)
Collect the children. |
protected void |
CompositeTagScanner.finishTag(CompositeTag tag,
Lexer lexer)
Finish off a tag. |
Tag |
Scanner.scan(Tag tag,
Lexer lexer,
NodeList stack)
Scan the tag. |
Tag |
TagScanner.scan(Tag tag,
Lexer lexer,
NodeList stack)
Scan the tag. |
| Uses of ParserException in org.htmlparser.tags |
| Methods in org.htmlparser.tags that throw ParserException | |
void |
MetaTag.doSemanticAction()
Check for a charset directive, and if found, set the charset for the page. |
void |
BaseHrefTag.doSemanticAction()
Perform the meaning of this tag. |
| Uses of ParserException in org.htmlparser.util |
| Subclasses of ParserException in org.htmlparser.util | |
class |
EncodingChangeException
The encoding is changed invalidating already scanned characters. |
| Methods in org.htmlparser.util with parameters of type ParserException | |
static void |
FeedbackManager.error(String message,
ParserException e)
|
void |
DefaultParserFeedback.error(String message,
ParserException exception)
Print an error message. |
void |
ParserFeedback.error(String message,
ParserException e)
|
| Methods in org.htmlparser.util that throw ParserException | |
Node |
PeekingIteratorImpl.peek()
Deprecated. |
boolean |
PeekingIteratorImpl.hasMoreNodes()
Deprecated. Check if more nodes are available. |
Node |
PeekingIteratorImpl.nextNode()
Deprecated. Get the next node. |
Node |
PeekingIterator.peek()
Deprecated. Fetch a node without consuming it. |
boolean |
IteratorImpl.hasMoreNodes()
Check if more nodes are available. |
Node |
IteratorImpl.nextNode()
Get the next node. |
boolean |
NodeIterator.hasMoreNodes()
Check if more nodes are available. |
Node |
NodeIterator.nextNode()
Get the next node. |
|
© 2004 Somik Raha Mar 14, 2004
|
||||||||||
| PREV NEXT | FRAMES NO FRAMES | ||||||||||