Jan 11 2008

Getting Rid of XML from HTML Files Editted by MicroSoft Word

Published by Robert Fischer at 7:38 am under Uncategorized

So, at my current gig, we’re using Microsoft Word to edit FIT files. For an introduction to FIT, check out my post “FIT, AntFit, and FitNesse: Test-Based Communication Tools“. For a conversation on using version controlled files editted in MSWord vs. FitNesse, check out Todd and I talking over here.

Anyway, we ran into a bit of a problem. The newest version of MSWord, in a fit of helpfulness, will inject namespaced XML into your HTML. Some of this will cause awkwardness and ugliness in the source, some of it will inject arbitrary additional rows, and some of it will cause other random errors. After joking around that we could get it to stop by finding the “Turn the Suck Off” switch, my coworker (Steve Hupert) actually went in and found that switch.

Here’s a picture of the “Tools > Options” screen, and the “Embed Smart Tags” switch that needs to be off:
Smart Tags

Once that’s off, things went really well.

Popularity: 3% [?]

Trackback URI | Comments RSS

Leave a Reply

Green Web Hosting! This site hosted by DreamHost.