Whats new in version 3.3:
Bug Fixes:
• CharacterReference.decode() does not decode entities containing digits ½ ¼ ¾ ¹ ² ³ ∴
• SourceCompactor does not respect TEXTAREA
• Renderer output incorrect when constructed with an Element object.
• Renderer output of font decoration on block boundaries incorrect.
• Segment.getAllStartTags(name) and Segment.getFirstElement(name) do not work if the argument contains upper case characters.
• The end delimiter of a common server tag inside an escaped server tag is falsely recognised as the end delimiter of the escaped tag.
• Segment.getStyleURISegments() now includes style element content as well as style attribute values.
• Segment.getURIAttributes() now includes the archive attributes of object and applet elements.
• Comments no longer recognised inside script elements during full sequential parse. Previously they were recognised for compatibility with major browsers but modern browser behaviour has changed.
• Changed the log level of all parsing e...
Publisher review:Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
Operating system:Mac OS X