Hi,
On Jul 4, 2012, at 12:41 PM, Arioch wrote:
which HTML parsers do u have onboard ?
Jericho, JTidy, whatever ?
Please don't cross post. And don't use JIRA as a place to get answers! It's
there to report issues…
We use our own parse based on SAX in the Rendering module.
We also use htmlcleaner to clean up HTML.
Thanks
-Vincent