- abiword
- poppler-utils
- http://faceted.wordpress.com/2010/07/11/how-to-extract-text-from-pdf-files-using-poppler-and-gocr-on-ubuntu/
- pdftotext foo.pdf
- pdfimages foo.pdf foo-page
- for i in *pbm; do echo gocr $i; gocr $i > $i.txt; done (OCR)
- tesseract-ocr
- Adobe Acrobat Pro
No comments:
Post a Comment