ITP: pstotext -- Extract text from PostScript and PDF files

Jari Aalto jari.aalto@cante.net
Tue Oct 9 15:29:00 GMT 2012


    wget --recursive --no-host-directories --cut-dirs=3 \
        http://cante.net/~jaalto/tmp/cygwin/pstotext/pstotext-1.9-1-src.tar.bz2 \
        http://cante.net/~jaalto/tmp/cygwin/pstotext/pstotext-1.9-1.tar.bz2 \
        http://cante.net/~jaalto/tmp/cygwin/pstotext/setup.hint

    # To check packaging

    cd pstotext
    tar -xf *-src.tar.bz2
    ./*.sh --color --verbose all

Included in Debian:

    http://packages.debian.org/pstotext

Notes:

    The program uses Ghostscript for processing.

Jari

[ setup.hint ]

sdesc: "Extract text from PostScript and PDF files"
ldesc: "Extract text (in the ISO 8859-1 character set) from a PostScript or
PDF (Portable Document Format) file. Thus, pstotext is similar to the
ps2ascii program that comes with ghostscript. The output of pstotext
is however better than that of ps2ascii, because pstotext deals better
with punctuation and ligatures."
category: Utils
requires: ghostscript
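
For reviewers, a quick sanity check of the hint file could look like the sketch below. The function name check_setup_hint is hypothetical and not part of the package or of any Cygwin tooling. It only checks that the four keys used in the hint above are present. It does not validate their values, and it makes no claim about which keys the setup program actually requires.

```shell
# Hypothetical helper (not part of the package): report which of the
# keys used in the setup.hint above are absent from a given file.
check_setup_hint() {
    hint_file=$1
    missing=""
    for key in sdesc ldesc category requires; do
        # Each key is expected at the start of a line, followed by a colon.
        if ! grep -q "^${key}:" "$hint_file"; then
            missing="${missing}${missing:+ }${key}"
        fi
    done
    if [ -n "$missing" ]; then
        echo "missing: $missing"
        return 1
    fi
    echo "ok"
}
```

Run it as "check_setup_hint setup.hint" in the directory created by the wget step. A non-zero exit status means at least one of the keys was not found.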


