[xwiki-users] Xwiki.com API stability and Class/Object model
Pablo Oliveira
pablo.oliveira at enst.fr
Thu Apr 12 16:07:02 CEST 2007
On Apr 06, THOMAS, BRIAN M (ATTSI) wrote :
> From: Sergiu Dumitriu [mailto:sergiu.dumitriu at gmail.com]
> Sent: Thursday, April 05, 2007 4:16 PM
> To: xwiki-users at objectweb.org
> Subject: Re: [xwiki-users] Xwiki.com API stability and
> Class/Object model
>
> On 4/4/07, THOMAS, BRIAN M (ATTSI) <bt0008 at att.com> wrote:
>
>
> The only reason I haven't already made a start of it is
> that I haven't
> found an HTML DOM parser. Is there one in the myriad of
> libraries that
> come with XWiki?
>
>
>
>
> What do you mean by "HTML DOM parser"? You can use any DOM
> parser as long as it's well formed XML, and it should be.
>
>
> --
> http://purl.org/net/sergiu
>
>
> Unfortunately, it isn't:
>
> Nested exception: org.xml.sax.SAXParseException: The declaration for the
> entity "HTML.Version" must end with '>'.
>
>
>
> This exception is thrown regardless of which of the javadoc pages I
> use...
Just my two cents:
you might have a look at TagSoup (http://home.ccil.org/~cowan/XML/tagsoup/) or
JTidy (http://jtidy.sourceforge.net/) which I think is distributed
already as part of XWiki, those should help you when dealing with non
xml-valid HTML.
Pablo
More information about the users
mailing list