Serge Boucher
 
Philip Greenspun's Homepage : Community member
A member of the Philip Greenspun's Homepage community since December 30, 2002
If you were to log in, you'd be able to get more information on your fellow community member.
Static Page Comments
- December 30, 2002, on Converting from Microsoft Word to HTML: 
I have a vaguely db-backed personal web site on which I had to append some MSWord documents. The easiest solution I found is to open the Word file in OpenOffice Writer, (http://www.openoffice.org) save as HTML then edit the file in Emacs.
OpenOffice does generate a load of crap in the html file, but it's nowhere as bad as Word 2000. Most of my work is done by a rather ugly Emacs Macro (which should really be a Lisp procedure, but that will have to wait until I actually learn Lisp) to replace-regexp a couple of tags, namely : 
- delete SPAN ("?SPAN[^>]*>" -> "")
- remove attributes from p and h tags ("]*>" -> "")
The macro also add calls to my header and footer scripts and edits the header to use external CSS stylesheets.
Overall it works rather well and I can get .doc files up really fast. I still have to correct a few things by hand but with something more involved than my macro (by a better programmer) I think that wouldn't even be necessary.
By the way O... 
 
 
philg@mit.edu