Package org.basex.build.html
Class HtmlParser
java.lang.Object
org.basex.core.jobs.Job
org.basex.build.Parser
org.basex.build.SingleParser
org.basex.build.xml.XMLParser
org.basex.build.html.HtmlParser
This class uses TagSoup to convert HTML input to well-formed XML.
If TagSoup is not found in the classpath, the original document is passed on.
TagSoup was written by John Cowan and is released under the Apache 2.0 License:
http://home.ccil.org/~cowan/XML/tagsoup/.
- Author:
- BaseX Team 2005-21, BSD License, Christian Gruen
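The pass-through behavior described above (convert only if TagSoup is on the classpath, otherwise hand the original document on) can be illustrated with a minimal, self-contained sketch. It does not use BaseX or TagSoup code: the class FallbackSketch, its methods, and the placeholder converter are hypothetical and exist only for illustration. The only name taken from TagSoup is its parser class, org.ccil.cowan.tagsoup.Parser. How HtmlParser performs the check internally is not stated on this page.

```java
import java.util.function.UnaryOperator;

/** Hypothetical sketch of "convert if a library is present, else pass through". */
final class FallbackSketch {

  /** Returns true if the named class can be loaded from the classpath. */
  static boolean onClasspath(String className) {
    try {
      // 'false' avoids running static initializers of the probed class
      Class.forName(className, false, FallbackSketch.class.getClassLoader());
      return true;
    } catch (ClassNotFoundException | LinkageError e) {
      return false;
    }
  }

  /**
   * Applies the converter only if the required class is present;
   * otherwise the input is returned unchanged.
   */
  static String convertOrPassThrough(String input, String requiredClass,
      UnaryOperator<String> converter) {
    return onClasspath(requiredClass) ? converter.apply(input) : input;
  }

  public static void main(String[] args) {
    String html = "<p>unclosed paragraph";
    // The lambda is a stand-in, not a real TagSoup conversion
    String result = convertOrPassThrough(html,
        "org.ccil.cowan.tagsoup.Parser", s -> "<p>unclosed paragraph</p>");
    System.out.println(result);
  }
}
```

Without TagSoup on the classpath, the sketch prints the input unchanged; with it, the placeholder converter is applied.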
Field Summary
Fields inherited from class org.basex.build.SingleParser
builder
Constructor Summary
Constructors
Constructor                                                   Description
HtmlParser(IO source, MainOptions opts)                       Constructor.
HtmlParser(IO source, MainOptions opts, HtmlOptions hopts)    Constructor.
Method Summary
Methods inherited from class org.basex.build.xml.XMLParser
close, detailedInfo, parse, progressInfo
Methods inherited from class org.basex.build.SingleParser
parse
Methods inherited from class org.basex.build.Parser
emptyParser, info, singleParser, target, xmlParser
Constructor Details
HtmlParser
Constructor.
- Parameters:
- source - document source
- opts - database options
- Throws:
- IOException - I/O exception
HtmlParser
Constructor.
- Parameters:
- source - document source
- opts - database options
- hopts - HTML options
- Throws:
- IOException - I/O exception
Method Details
available
public static boolean available()
Checks if TagSoup is available.
- Returns:
- result of check
parser
Returns the name of the parser, or an empty string.
- Returns:
- name of parser
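The contract of the two methods above (a boolean availability check, and a name that degrades to an empty string) can be mirrored in a small stand-alone sketch. ParserProbe and its constructor arguments are hypothetical, and the display name passed in is an assumption: this page does not say which string parser() actually returns. The sketch resolves the class once and answers both queries from the cached result.

```java
/** Hypothetical sketch of an available()/parser() pair backed by one classpath probe. */
final class ParserProbe {
  /** Display name reported when the implementation is present (assumed value). */
  private final String name;
  /** Resolved implementation class, or null if it could not be loaded. */
  private final Class<?> impl;

  ParserProbe(String name, String className) {
    this.name = name;
    Class<?> found = null;
    try {
      found = Class.forName(className);
    } catch (ClassNotFoundException | LinkageError e) {
      // Class is not usable; leave 'found' as null
    }
    this.impl = found;
  }

  /** Returns true if the implementation class was found. */
  boolean available() {
    return impl != null;
  }

  /** Returns the parser name, or an empty string if it is unavailable. */
  String parser() {
    return available() ? name : "";
  }

  public static void main(String[] args) {
    ParserProbe probe = new ParserProbe("TagSoup", "org.ccil.cowan.tagsoup.Parser");
    System.out.println("available: " + probe.available());
    System.out.println("parser: '" + probe.parser() + "'");
  }
}
```

Returning an empty string instead of null lets callers concatenate or log the name without a null check.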