Getting Started

Importing Microsoft Word

Importing from Microsoft Word is notoriously difficult because Word is not really a structured format, so the results depend heavily on the quality of the input. The most important factor is usually whether Word styles have been applied consistently.
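One way to get a rough picture of how consistently styles were applied is to count the paragraph styles in the file before importing. The sketch below is not part of Paligo; it assumes the document is saved as .docx (a zip archive containing word/document.xml), and the function name count_paragraph_styles is made up for illustration. It reports style IDs as stored in the file, which can differ slightly from the names shown in the Word interface.

```python
import zipfile
import xml.etree.ElementTree as ET
from collections import Counter

# WordprocessingML namespace used inside word/document.xml.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def count_paragraph_styles(docx_path):
    """Return a Counter mapping paragraph style IDs to paragraph counts.

    Hypothetical helper for illustration; assumes a .docx file.
    """
    with zipfile.ZipFile(docx_path) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))

    counts = Counter()
    for paragraph in root.iter(f"{{{W_NS}}}p"):
        style = paragraph.find(f"{{{W_NS}}}pPr/{{{W_NS}}}pStyle")
        # Paragraphs without an explicit style use the document default.
        if style is not None:
            style_id = style.get(f"{{{W_NS}}}val")
        else:
            style_id = "(default)"
        counts[style_id] += 1
    return counts
```

Calling count_paragraph_styles("manual.docx").most_common() lists the styles from most to least used. A document where nearly everything falls under "(default)", with few or no heading styles, is likely to need cleanup in Word first.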

There are three ways to import Microsoft Word content into Paligo. The first is the direct Import Wizard, which can handle most well-structured Word documents. The second is to use the (optional) Oxygen XML Editor with the Paligo plugin. The third is a pre-conversion to XML. For large documents, the third method is usually preferable.

Method 1: Direct Import Wizard

The simplest method is to import directly with the Import Wizard. Zip the file (one file per zip, not multiple files in one zip), then select Import content in the context menu of the Content Manager.
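If you have many Word files to prepare, the one-file-per-zip requirement can be scripted. The following is a minimal sketch, not a Paligo tool: the function name zip_each_document and the folder layout are hypothetical, and it assumes the files use the .docx extension and that each file should sit at the root of its archive.

```python
import zipfile
from pathlib import Path


def zip_each_document(source_dir, output_dir):
    """Write one .zip per .docx in source_dir and return the archive paths.

    Hypothetical helper for illustration; assumes .docx files.
    """
    source = Path(source_dir)
    output = Path(output_dir)
    output.mkdir(parents=True, exist_ok=True)

    archives = []
    for docx in sorted(source.glob("*.docx")):
        # Skip the "~$..." lock files Word creates while a document is open.
        if docx.name.startswith("~$"):
            continue
        archive_path = output / f"{docx.stem}.zip"
        with zipfile.ZipFile(archive_path, "w", zipfile.ZIP_DEFLATED) as archive:
            # Store the file at the archive root, without its folder path.
            archive.write(docx, arcname=docx.name)
        archives.append(archive_path)
    return archives
```

For example, zip_each_document("word-files", "ready-to-import") produces one archive per document, and each archive can then be imported separately.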

Method 2: Using Oxygen XML Editor

  1. In Oxygen, create a DocBook 5 Article, by selecting File > New, and then selecting the proper DocBook template.

  2. Remove the first "sect1" element.

  3. Insert a new section element by pressing Enter on your keyboard and selecting Section in the element list. This section will not actually be used, and we'll remove it at the end, but it is needed because of a quirk in Oxygen.

  4. Place the cursor inside the section element.

  5. Save the document under any name you choose.

    Saving it with a name is important; otherwise, images will not be saved properly.

  6. Copy the text from your Word document. Leave out the table of contents and similar elements, since they won't be needed.

  7. Paste the content into the section element in Oxygen.

    You will get a warning saying that the content needs to be placed inside the closest Article element. Go ahead and accept that.

  8. When you paste the content, it will be automatically converted to the proper XML elements.

  9. Remove the empty section tag. It will be at the bottom or at the top of the document.

  10. Save the document again.

  11. Finally, import the resulting DocBook document using the Import Wizard.
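Before importing, it can be useful to confirm that the placeholder section from step 3 was really removed. The sketch below is one possible check, not something Oxygen or Paligo provides: the function name count_empty_sections is hypothetical, it assumes the file uses the standard DocBook 5 namespace, and it treats a section as empty when it contains no text and no child elements other than a title.

```python
import xml.etree.ElementTree as ET

# Standard DocBook 5 namespace.
DOCBOOK_NS = "http://docbook.org/ns/docbook"


def count_empty_sections(docbook_path):
    """Count section elements with no text and no children besides a title.

    Hypothetical helper for illustration; the definition of "empty"
    used here is an assumption.
    """
    root = ET.parse(docbook_path).getroot()
    title_tag = f"{{{DOCBOOK_NS}}}title"

    empty = 0
    for section in root.iter(f"{{{DOCBOOK_NS}}}section"):
        only_titles = all(child.tag == title_tag for child in section)
        no_text = not "".join(section.itertext()).strip()
        if only_titles and no_text:
            empty += 1
    return empty
```

A result of 0 suggests the placeholder is gone. Any other value means at least one empty section is still in the file and is worth removing in Oxygen before running the Import Wizard.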

Method 3: Importing by pre-converting to XML

The third method involves a pre-conversion to XML using a script package. It is slightly more complex but gives good results, and it is especially useful for large documents because it is very fast and can be tuned further to fit your content.

Depending on the complexity of your content and of mapping its structure, there may be a charge for this conversion.

If you would like to use this method instead of the procedures above, contact Paligo support to inquire about it.
