ITP: pstotext -- Extract text from PostScript and PDF files
Jari Aalto
jari.aalto@cante.net
Tue Oct 9 15:29:00 GMT 2012
wget --recursive --no-host-directories --cut-dirs=3 \
http://cante.net/~jaalto/tmp/cygwin/pstotext/pstotext-1.9-1-src.tar.bz2 \
http://cante.net/~jaalto/tmp/cygwin/pstotext/pstotext-1.9-1.tar.bz2 \
http://cante.net/~jaalto/tmp/cygwin/pstotext/setup.hint
# To check packaging
cd pstotext
tar -xf *-src.tar.bz2
./*.sh --color --verbose all
Included in Debian:
http://packages.debian.org/pstotext
Notes:
Program uses ghostscript for processing.
Jari
[ setup.hint ]
sdesc: "Extract text from PostScript and PDF files"
ldesc: "Extract text (in the ISO 8859-1 character set) from a PostScript or
PDF (Portable Document Format) file. Thus, pstotext is similar to the
ps2ascii program that comes with ghostscript. The output of pstotext
is however better than that of ps2ascii, because pstotext deals better
with punctuation and ligatures."
category: Utils
requires: ghostscript
More information about the Cygwin-apps
mailing list