HTML Parser Home Page

org.htmlparser.visitors
Class TextExtractingVisitor

java.lang.Object
  extended byorg.htmlparser.visitors.NodeVisitor
      extended byorg.htmlparser.visitors.TextExtractingVisitor

public class TextExtractingVisitor
extends NodeVisitor

Extracts text from a web page. Usage: Parser parser = new Parser(...); TextExtractingVisitor visitor = new TextExtractingVisitor(); parser.visitAllNodesWith(visitor); String textInPage = visitor.getExtractedText();


Constructor Summary
TextExtractingVisitor()
           
 
Method Summary
 String getExtractedText()
           
 void visitEndTag(Tag tag)
           
 void visitStringNode(StringNode stringNode)
           
 void visitTag(Tag tag)
           
 
Methods inherited from class org.htmlparser.visitors.NodeVisitor
beginParsing, finishedParsing, shouldRecurseChildren, shouldRecurseSelf, visitImageTag, visitLinkTag, visitRemarkNode, visitTitleTag
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

TextExtractingVisitor

public TextExtractingVisitor()
Method Detail

getExtractedText

public String getExtractedText()

visitStringNode

public void visitStringNode(StringNode stringNode)
Overrides:
visitStringNode in class NodeVisitor

visitTag

public void visitTag(Tag tag)
Overrides:
visitTag in class NodeVisitor

visitEndTag

public void visitEndTag(Tag tag)
Overrides:
visitEndTag in class NodeVisitor

© 2004 Somik Raha
Mar 14, 2004

HTML Parser is an open source library released under LGPL.
SourceForge.net