[oXygen-user] Customizing Batch Document conversion from Word to DITA

Jirka Kosek jirka at kosek.cz
Mon Feb 12 08:14:59 CST 2024


Hi,

we are using Batch Document converting Add-on on one project to convert 
Word files into DITA content. I'm wondering if there is any way how to 
customize conversion process in a more complex way than just by mapping 
Word styles into HTML elements that are later mapped into DITA (as 
described at 
https://www.oxygenxml.com/doc/versions/26.0/ug-editor/topics/batch-converter-addon.html).

Examples of things we would like to customize:

* Grouping of generated DITA elements. For example Word figure with 
caption is by default converted to two paragraphs -- one with image and 
the second with caption. We can map Caption style to "figure > 
figcaption" but this will generate DITA figure only with title, image 
itself will be in the previous paragraph. If we could run simple XSLT on 
the result it should be possible to automatically fix such output to 
create valid and more semantically rich DITA.

* We need to create DITA Bookmap not plain DITA Map from one Word file. 
So having ability to run custom XSLT that woudl transform map into 
bookmap would help us a lot.

Of course we can implement this as an additional post-processing step 
but if there is some existing integration point I've missed it would be 
much easier for users just to invoke conversion from the menu.

Many thanks in advance,

				Jirka

-- 
------------------------------------------------------------------
   Jirka Kosek      e-mail: jirka at kosek.cz      http://xmlguru.cz
------------------------------------------------------------------
      Professional XML and Web consulting and training services
DocBook/DITA customization, custom XSLT/XSL-FO document processing
------------------------------------------------------------------
     Bringing you XML Prague conference    http://xmlprague.cz
------------------------------------------------------------------
-------------- next part --------------
A non-text attachment was scrubbed...
Name: OpenPGP_signature.asc
Type: application/pgp-signature
Size: 203 bytes
Desc: OpenPGP digital signature
URL: <http://www.oxygenxml.com/pipermail/oxygen-user/attachments/20240212/fb2c8b85/attachment.sig>


More information about the oXygen-user mailing list