On Aug 19, 2008, at 6:47 PM, Wang Ning wrote:
Hi Vincent,
AFAIK, converting XHTML to XWiki syntax is hard because it is still
undecided how many HTML elements should be rendered in XWiki syntax,
for example tables, images, and the {style} macro. Furthermore, the
xhtmlparser is buggy for elements such as <img>, <link>, and lists. I
will try my best to make the conversion work, but as a fallback option
I want to develop a feature that just converts an office document to
XHTML. The user can then use convert2html to get the XHTML and paste
it into an XWiki page that uses XWiki syntax 1.0. After we finish the
new rendering and the xhtmlparser, the user can switch to
convert2xwikisyntax.
convert2html is easy, but there are some things to handle:
* clean the HTML (htmlcleaner)
* remove the <style> element in <head> (htmlcleaner)
* remove empty links (htmlcleaner)
* remove <p> inside <li> (jdom)
* replace <img> with {img} (jdom)
* replace <a> links with [] (jdom)
* convert ppt and odp presentations to a single page
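
The tree-rewriting steps in the list above can be sketched roughly as
follows. This is only an illustration in Python using the standard
library's ElementTree, not the htmlcleaner/jdom code itself. It assumes
the input has already been cleaned into well-formed XHTML without
namespaces, and that the XWiki 1.0 forms are {image:src} and
[label>href]; both forms and all function names here are assumptions.
The ppt/odp step is not covered.

```python
import xml.etree.ElementTree as ET


def _append_text(parent, idx, text):
    """Attach `text` just before child position `idx` of `parent`."""
    if not text:
        return
    if idx == 0:
        parent.text = (parent.text or "") + text
    else:
        prev = parent[idx - 1]
        prev.tail = (prev.tail or "") + text


def _replace_with_text(parent, child, text):
    """Swap `child` for plain text, keeping the text that followed it."""
    idx = list(parent).index(child)
    parent.remove(child)
    _append_text(parent, idx, text + (child.tail or ""))


def _unwrap(parent, child):
    """Drop `child` but keep its text and children in the same position."""
    idx = list(parent).index(child)
    kids = list(child)
    parent.remove(child)
    _append_text(parent, idx, child.text)
    for offset, kid in enumerate(kids):
        parent.insert(idx + offset, kid)
    _append_text(parent, idx + len(kids), child.tail)


def _pairs(root, tag):
    """Snapshot (parent, child) pairs so the tree can be edited safely."""
    return [(p, c) for p in root.iter() for c in list(p) if c.tag == tag]


def convert(xhtml):
    """Apply the cleanup steps to well-formed XHTML; return the tree root."""
    root = ET.fromstring(xhtml)

    # remove the <style> elements in <head>
    head = root.find("head")
    if head is not None:
        for style in head.findall("style"):
            head.remove(style)

    # remove <p> inside <li>, keeping the paragraph content
    for parent, p in _pairs(root, "p"):
        if parent.tag == "li":
            _unwrap(parent, p)

    # replace <img> with the assumed {image:src} form
    for parent, img in _pairs(root, "img"):
        _replace_with_text(parent, img, "{image:%s}" % img.get("src", ""))

    # drop empty links; replace the rest with the assumed [label>href] form
    for parent, a in _pairs(root, "a"):
        label = "".join(a.itertext()).strip()
        href = a.get("href", "")
        if not label:
            text = ""
        elif href:
            text = "[%s>%s]" % (label, href)
        else:
            text = label
        _replace_with_text(parent, a, text)

    return root
```

The result can be serialized with ET.tostring(root, encoding="unicode").
Note that ElementTree escapes ">" in text as "&gt;" when serializing, so
a real implementation would need to decide how the link separator
should be emitted.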
All these steps are easy to implement. convert2html may not be
perfect, but it can cover most conversion requirements for now. As I
am running up against the deadline, I think a workable feature is
useful. However, I will keep working on the final goal, "convert to
XWiki syntax 2.0".
WDYT?
I agree. Both conversion to XHTML and to wiki syntax should be pursued.
Thanks
-Vincent