Page 1 of 1

Can Oxygen batch convert DOCX to HTML or XHTML?

Posted: Fri Nov 08, 2013 11:31 pm
by jjoensuu
Hi all

I wanted to ask following questions but it is not necessary to give me detailed instructions. At the moment I am looking for enough information to put me on the right direction.

1. Can Oxygen be used to batch convert docx documents to some html or xhtml format?
2. Does this require using Eclipse with Oxygen?
3. Is the process capable of extracting any files embedded in docx files?

Any information on this would be appreciated!

Thanks

Re: Can Oxygen batch convert DOCX to HTML or XHTML?

Posted: Mon Nov 11, 2013 11:55 am
by Radu
Hi,

Please see some answers below:
1. Can Oxygen be used to batch convert docx documents to some html or xhtml format?
We do not have any out of the box solution for this.

Oxygen comes bundled with a plugin which could be used to convert DOCX to DITA (which can then be published as XHTML):

http://www.oxygenxml.com/forum/post2818 ... ita#p28188

Potentially this plugin could be used to batch process a bunch of DOCX documents to DITA.
You may not obtain exactly the same styling you had in the Word document.

Another approach would be to copy the content in the Word document, then create in Oxygen an XHTML document, open it in the Author visual editing mode and paste in it, Oxygen will attempt to convert the content to valid XHTML.
2. Does this require using Eclipse with Oxygen?
Oxygen comes either as a standalone editor or as an Eclipse plugin, they both have similar functionality.
3. Is the process capable of extracting any files embedded in docx files?
DOCX files are ZIP-like archives. If you open a DOCX document in Oxygen you can see it in the Archive Browser and see the files which are present in it.

Regards,
Radu

Re: Can Oxygen batch convert DOCX to HTML or XHTML?

Posted: Mon Nov 11, 2013 4:36 pm
by jjoensuu
Thank you Radu!