[xwiki-users] Xwiki.com API stability and Class/Object model

Pablo Oliveira pablo.oliveira at enst.fr
Thu Apr 12 16:07:02 CEST 2007


On Apr 06, THOMAS, BRIAN M (ATTSI) wrote :
 
> 	From: Sergiu Dumitriu [mailto:sergiu.dumitriu at gmail.com] 
> 	Sent: Thursday, April 05, 2007 4:16 PM
> 	To: xwiki-users at objectweb.org
> 	Subject: Re: [xwiki-users] Xwiki.com API stability and
> Class/Object model
> 
> 	On 4/4/07, THOMAS, BRIAN M (ATTSI) <bt0008 at att.com> wrote: 
> 
> 
> 		The only reason I haven't already made a start of it is
> that I haven't
> 		found an HTML DOM parser.  Is there one in the myriad of
> libraries that
> 		come with XWiki?
> 		
> 		
> 
> 
> 	What do you mean by "HTML DOM parser"? You can use any DOM
> parser as long as it's well formed XML, and it should be. 
> 	
> 	
> 	-- 
> 	http://purl.org/net/sergiu  
> 	 
> 
>  Unfortunately, it isn't:
>  
> Nested exception: org.xml.sax.SAXParseException: The declaration for the
> entity "HTML.Version" must end with '>'.
> 
>  
> 
> This exception is thrown regardless of which of the javadoc pages I
> use...

Just my two cents:
you might have a look at TagSoup (http://home.ccil.org/~cowan/XML/tagsoup/) or
JTidy (http://jtidy.sourceforge.net/) which I think is distributed
already as part of XWiki, those should help you when dealing with non
xml-valid HTML.

Pablo




More information about the users mailing list