Currently this plugin can convert the attachment file which is a office
document to html code.
This project use jodconverter[1] as the conversion framework which leverage
the openoffice power.
Please read the README with the code to test.
This officeimporter plugin is simple and do a simply tidy html work. don't
concern picture handler, xwiki syntax conversion. However, I'll add those
feature in the future.
Now I have to write my thesis in the day time, so I only can work on that
project after 6 pm and the weekend. Hense sorry for the delay.
Todo List
1. clean html with htmlclean[2]. Furthermore, remove the office document
special html code. remove <html> <body> <script>. html also have some
content that conflict with xwiki syntax like [notlink].
2. handle pictures in the office file. Maybe upload the images to xwiki as
the attachments. Hense I need to go over how xwiki address attachment.
Replace <src> with {image:}
3. convert html to xwiki. As Vincent said, I should read the html parser and
wikimodel. I think this can be a option for user.
4. provide the interface for inputstream and outputstream. Hense this plugin
can be used in the xwiki code.
5. intergrate it to wysiwyg editor. product a feature to convert local
office file to xwiki page. This should use the xwiki upload feature.
Now I'm a little confused that I afraid I can't finish all the task above
before the middle exam. What should I do? Which task to pick as first
priority? Any suggestion will be very appreciated.
Thanks.
[1]
http://www.artofsolving.com/opensource/jodconverter
[2]
http://htmlcleaner.sourceforge.net/
--
Sincerely,
Wang Ning