Edit online

Migrating Various Document Formats to and from DITA

When organizations decide to use DITA for structuring, developing, managing, or publishing content, they usually already have content written in other formats and need to convert it to DITA. There are a variety of possibilities for a conversion to DITA, depending on the original format of the content.

Migration from other formats to DITA is rarely perfect and manual changes may need to be made to the converted content, but the methods described below should help you find the best approach for your particular case.

Migrating Microsoft Office and Other Similar Types of Documents to DITA

There are various possibilities for migrating content from Microsoft Office® (and other Office-type formats) to DITA. For details, see Migrating MS Office Documents to DITA.

Migrating Google Docs to DITA

There are several possibilities to convert Google Docs to DITA:
  • Copy the content from Google Docs and paste it in an open DITA topic in Author mode. The Smart Paste functionality will attempt to convert the content to DITA.
  • Save the Google document as OpenDocumentFormat (ODF), then open it in the free LibreOffice application and save it as DocBook. Next, open the DocBook document in Oxygen XML Author Eclipse plugin and run the built-in transformation scenario called DocBook to DITA.

In all cases, you may need to make some manual adjustments in the resulting documents for elements that couldn't be mapped.

Migrating HTML Content to DITA

There are several possibilities to convert HTML content to DITA:

  • Copy the HTML content and paste it in an open DITA topic in Author mode. The Smart Paste functionality will attempt to convert the content to DITA.
  • Convert the HTML file to XHTML by selecting File > Import/Convert > HTML File to XHTML. Then, open the XHTML file and use one of the XHTML to DITA Transformation Scenarios to convert the content to DITA.

Migrating Unstructured FrameMaker to DITA

There is a blog post that details various possibilities for converting Unstructured FrameMaker content to DITA: Migrating Unstructured FrameMaker to DITA.

Migrating MadCap Content to DITA

This open-source project contains such a stylesheet that attempts to convert a Flare project to DITA XML along with instructions on how to use it. As an alternative, some recent MadCap versions seem to have facilities to export content directly to DITA.

Migrating Other Formats to DITA

You may find third-party applications (such as Pandoc) that can convert your content to HTML or to some kind of XML format like DocBook. Once you have HTML or DocBook content, you can convert them to DITA using one of the methods described above.

Migrate from DITA to Confluence and Other Formats

There are various possible methods available for converting DITA content to Confluence and other formats (such as Microsoft Word or HTML). For details and ideas for some of the possible methods, see the DITA to Confluence blog post.

Resources