Hi,
I have build a web app which can provide documents conversion service, both
through web page or RESTful web service. All the documents are converted to
html format.
Now, I want to clean the result html. As discussed before, I go to JTidy[1].
Hense, I find that the last release of JTidy is published at August 1, 2001.
I wonder if there is some more active project for html code cleaning, or
JTidy is just sufficient?
After clean the html, I will write some parser to convert html to xwiki
syntax.
All I have done is a web app which can run standalone. It accept input file,
convert it to format file or purge xwiki syntax code, and output the
result. I'm confused about how to develop it as a xwiki plugin. I know how
to develop a xwiki plugin. As office import need upload file, output the
result, I just don't know how to integrate it in to xwiki. Any suggestion
will be very appreciated.
Thanks.
[
1]http://jtidy.sourceforge.net/index.html
--
Sincerely,
Wang Ning