Package | Description |
---|---|
org.htmlparser |
The basic API classes which will be used by most developers when working with
the HTML Parser.
|
org.htmlparser.lexerapplications.thumbelina |
Extract the images behind thumbnail images.
|
org.htmlparser.scanners |
The scanners package contains classes responsible for the tertiary
identification of tags.
|
org.htmlparser.util |
Code which can be reused by many classes, is located in this package.
|
Modifier and Type | Field and Description |
---|---|
protected Lexer |
Parser.mLexer
The html lexer associated with this parser.
|
Modifier and Type | Method and Description |
---|---|
Lexer |
Parser.getLexer()
Returns the lexer associated with the parser
|
Modifier and Type | Method and Description |
---|---|
void |
Parser.setLexer(Lexer lexer)
Set the lexer for this parser.
|
Constructor and Description |
---|
Parser(Lexer lexer)
Construct a parser using the provided lexer.
|
Parser(Lexer lexer,
ParserFeedback fb)
Construct a parser using the provided lexer and feedback object.
|
Modifier and Type | Method and Description |
---|---|
protected java.net.URL[][] |
Thumbelina.extractImageLinks(Lexer lexer,
java.net.URL docbase)
Get the links of an element of a document.
|
Modifier and Type | Method and Description |
---|---|
protected Tag |
CompositeTagScanner.createVirtualEndTag(Tag tag,
Lexer lexer,
Page page,
int position)
Creates an end tag with the same name as the given tag.
|
protected void |
CompositeTagScanner.finishTag(Tag tag,
Lexer lexer)
Finish off a tag.
|
Tag |
ScriptScanner.scan(Tag tag,
Lexer lexer,
NodeList stack)
Scan for script.
|
Tag |
Scanner.scan(Tag tag,
Lexer lexer,
NodeList stack)
Scan the tag.
|
Tag |
CompositeTagScanner.scan(Tag tag,
Lexer lexer,
NodeList stack)
Collect the children.
|
Tag |
StyleScanner.scan(Tag tag,
Lexer lexer,
NodeList stack)
Scan for style definitions.
|
Tag |
TagScanner.scan(Tag tag,
Lexer lexer,
NodeList stack)
Scan the tag.
|
Constructor and Description |
---|
IteratorImpl(Lexer lexer,
ParserFeedback fb) |
HTML Parser is an open source library released under LGPL.