Extract text from MS-Word files, trying to preserve as many special
printable characters as possible. Catdoc doesn't attempt to analyze
Word file formatting; it just extracts the readable text. It is known
to support formats up to Word 97.
http://freshmeat.net/projects/catdoc/
a)
wget -nv \
http://tierra.dyndns.org:81/cygwin/catdoc/catdoc-0.93.3-1-src.tar.bz2 \
http://tierra.dyndns.org:81/cygwin/catdoc/catdoc-0.93.3-1.tar.bz2 \
http://tierra.dyndns.org:81/cygwin/catdoc/setup.hint
b) or use this
mkdir catdoc ; cd catdoc
wget -q -O - http://tierra.dyndns.org:81/cygwin/catdoc/get.sh | sh
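If you'd rather not pipe a remote script straight into sh, here is a rough sketch that builds the same three URLs as in (a) and lets you hand them to wget yourself. The function name catdoc_urls and its two optional arguments (base URL and version-release string) are made up for illustration; they are not part of the package or of get.sh.

```shell
#!/bin/sh
# Hypothetical helper (not part of catdoc or get.sh): print the three
# download URLs listed in option (a), one per line.
# $1 = base URL (assumed default: the location given above)
# $2 = version-release string (assumed default: 0.93.3-1)
catdoc_urls() {
    base=${1:-http://tierra.dyndns.org:81/cygwin/catdoc}
    ver=${2:-0.93.3-1}
    for f in "catdoc-$ver-src.tar.bz2" "catdoc-$ver.tar.bz2" setup.hint
    do
        printf '%s/%s\n' "$base" "$f"
    done
}
```

Usage, assuming wget is installed: check the list with "catdoc_urls", then download with "catdoc_urls | wget -nv -i -" (wget reads the URL list from standard input when given "-i -").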
Jari