Microsoft Word API

WorkplaceX provides an API to read Microsoft Word (*.docx) documents. Users do not have to learn new text-processing tools, and there is no need to copy and paste text from Word documents into a CMS. Content is rendered to the front end in a highly structured way and is always valid HTML5. Unconventional text and image formatting is ignored and rendered as standard formatting. The parser detects:

  • Text
  • Image
  • Hyperlink
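
To illustrate what detecting these three element types involves, here is a minimal sketch in Python using only the standard library. It is not the WorkplaceX parser. It relies on the fact that a *.docx file is a ZIP archive whose main part is word/document.xml; the function name scan_docx and the (kind, value) result shape are assumptions made for this example.

```python
import zipfile
import xml.etree.ElementTree as ET

# XML namespaces used inside word/document.xml.
W = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"
A = "{http://schemas.openxmlformats.org/drawingml/2006/main}"
R = "{http://schemas.openxmlformats.org/officeDocument/2006/relationships}"


def scan_docx(source):
    """Return (kind, value) tuples for text, images and hyperlinks.

    source is a path or a binary file-like object holding a .docx file.
    Images and hyperlinks are reported by relationship id (for example
    "rId5"); resolving ids to targets is left out of this sketch.
    """
    with zipfile.ZipFile(source) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))

    found = []
    for element in root.iter():  # iterates in document order
        if element.tag == W + "t" and element.text:
            found.append(("text", element.text))
        elif element.tag == A + "blip":
            found.append(("image", element.get(R + "embed")))
        elif element.tag == W + "hyperlink":
            found.append(("hyperlink", element.get(R + "id")))
    return found
```

The text of a hyperlink appears as a separate "text" entry right after its "hyperlink" entry, because the link text is stored in runs nested inside the hyperlink element.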

The following paragraph styles are detected:

  • Heading 1
  • Heading 2
  • Normal
  • List Paragraph
  • Caption, if it immediately follows an image.
  • Code, for source code documentation.

Microsoft paragraph style formatting detected by WorkplaceX

All other paragraph styles fall back to “Normal”. Review comments can stay in the document; they are not rendered in the application.
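
The fallback rule can be sketched in a few lines of Python. This is an illustration only, not WorkplaceX code: the names DETECTED_STYLES and normalize_style are assumptions, and the rule that a caption must follow an image is not handled here.

```python
# Paragraph styles listed in this document as detected.
DETECTED_STYLES = (
    "Heading 1",
    "Heading 2",
    "Normal",
    "List Paragraph",
    "Caption",
    "Code",
)


def normalize_style(style_name):
    """Return style_name if it is a detected style, otherwise "Normal"."""
    if style_name in DETECTED_STYLES:
        return style_name
    return "Normal"
```

For example, normalize_style("Title") gives "Normal", while normalize_style("Heading 2") is returned unchanged.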

Image with caption

Microsoft Word (*.docx) files can contain images with captions. The parser detects them and associates each caption with its image.

Add a caption to an image.
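
One way to picture the association is the following Python sketch. It is a sketch under stated assumptions, not the WorkplaceX implementation: the function name attach_captions and the (kind, value) input shape are made up for the example, and treating a caption that does not directly follow an image as a normal paragraph is an assumption as well.

```python
def attach_captions(blocks):
    """Attach each caption to the image immediately before it.

    blocks is a list of (kind, value) tuples in document order, where
    kind is "image", "caption" or any other paragraph kind.
    """
    result = []
    for kind, value in blocks:
        previous = result[-1] if result else None
        follows_free_image = (
            previous is not None
            and previous["kind"] == "image"
            and previous["caption"] is None
        )
        if kind == "caption" and follows_free_image:
            previous["caption"] = value
        elif kind == "image":
            result.append({"kind": "image", "value": value, "caption": None})
        elif kind == "caption":
            # Assumption: a stray caption is rendered like a normal paragraph.
            result.append({"kind": "normal", "value": value})
        else:
            result.append({"kind": kind, "value": value})
    return result
```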

Image with HTML style attribute

WorkplaceX can pass HTML style information through to the rendered image directly. The style is written at the beginning of the caption. Some examples:

  • Style=”width:33%; float:right;” (shows a small image on the right)
  • Style=”max-width:480px;” (the image is never displayed wider than 480 pixels)
  • Style=”width:;” (no scaling is applied to the image, for example for icons)
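
Splitting such a caption into its style prefix and the visible caption text might look like the Python sketch below. The function name split_caption and the exact matching rules are assumptions for illustration; how WorkplaceX parses the prefix is not documented here. The pattern accepts straight quotes as well as the curly quotes that Word inserts automatically.

```python
import re

# Straight double quote plus the curly variants Word may substitute.
_QUOTES = "\"\u201c\u201d\u201e"
_STYLE_PREFIX = re.compile(
    r"^\s*style\s*=\s*[" + _QUOTES + r"]([^" + _QUOTES + r"]*)[" + _QUOTES + r"]\s*",
    re.IGNORECASE,
)


def split_caption(caption):
    """Return (style, text); style is None if there is no Style= prefix."""
    match = _STYLE_PREFIX.match(caption)
    if match is None:
        return None, caption
    return match.group(1).strip(), caption[match.end():]
```

A caption without a prefix is returned unchanged with a style of None, so the renderer can apply its default image formatting.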

Source code documentation

Select the source code in the Microsoft Word document and set its paragraph style to “Code”. This section is rendered with syntax coloring by Google Code Prettify and looks like this:

using System;
namespace ConsoleApplication {
    class Program {
        static void Main(string[] args) {
            Console.WriteLine("Hello world!");
        }
    }
}
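
Google Code Prettify colors elements that carry the CSS class prettyprint. The following Python sketch shows how consecutive “Code” paragraphs could be turned into such an element. The function name render_code_block is an assumption, and the markup WorkplaceX actually emits may differ.

```python
import html


def render_code_block(lines):
    """Join Code paragraphs into one escaped <pre class="prettyprint">."""
    # Escape &, < and > so the source code is shown, not interpreted as HTML.
    escaped = html.escape("\n".join(lines), quote=False)
    return '<pre class="prettyprint">' + escaped + "</pre>"
```

Escaping matters here because source code often contains characters such as < and &, which would otherwise break the HTML5 validity of the page.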

Custom formatting

Does your company use Word documents with custom formatting, such as multi-language content? Feel free to contact us.

(C) 2015 by; Framework Version=a1.461