About |

Convert Word documents to HTML markup!
…of course, it works for any document, not just MS Word 😉
Paste the document you want to convert in the Word Editor, then switch to HTML view using the big tabs at the top of the page.
Switch to HTML view and test the cleaning options.
Clean the dirty markup with the big button or also apply the cleaning features one-by-one with the icons.
Cleaning Options:
– remove style attributes from each tag: <span style=”color: red;”>highlighted</span>
– delete empty tags which do not contain anything: <span>Hello<span></span> World</span>
– clear successive non-breaking spaces and leave only one instance: <p> </p>
– get rid of tag attributes, except the href of anchors and src of images: <img src=”/icon.png” alt=”icon” width=”52″ height=”52″ id=”pic52″ />
– strip classes and IDs: <img id=”pic52″ class=”right” src=”/icon.png” alt=”icon” width=”52″ />
– wipe off elements that contain only a character: <p> </p>
– dislodge HTML comments: <!– comment –>
– convert the document to plain text: <p><span class=”header”>text</span></p>

Undo, new page, indent markup, compress, activate encoding
|