Package org.htmlcleaner

Interface Summary
BaseToken Base token interface.
ITagInfoProvider Provides a set of TagInfo instances.
 

Class Summary
CommandLine Command line usage class.
CommentToken HTML comment token.
CompactXmlSerializer Compact XML serializer - creates the resulting XML by stripping whitespace.
ContentToken HTML text token.
DoctypeToken HTML doctype token.
EndTagToken HTML tag end token.
HtmlCleaner Main HtmlCleaner class.
HtmlTagProvider Default HTML tag info provider.
HtmlTokenizer Main HTML tokenizer.
PrettyXmlSerializer Pretty XML serializer - creates the resulting XML with indented lines.
SimpleXmlSerializer Simple XML serializer - creates the resulting XML without indented lines.
SpecialEntities Contains a map of the special entities used in HTML and their Unicode code points.
TagInfo Contains information about a single HTML tag.
TagNode XML tag node - it is produced during the cleaning process, when all start and end tokens are removed and replaced by instances of TagNode.
TagToken HTML tag token - descendants are the start token (TagNode) and the end token (EndTagToken).
Utils Common utilities.
XmlSerializer Abstract XML serializer - contains common logic for descendants.