convert PDF to txt file in progress

Good morning everyone forum.
I need a help.
I have to develop a routine in progress, for converting a PDF file to a file in TXT format.
For later import into the system.
How do I Convert this? There is a FREE application that I can run on my progress application?
How do I do that? What parameters do I use?

thanks


Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.
rrubio's picture

Batch mode - pdftotext

I did something like this sometime ago. Your best bet is to run something like www.pdftotext.net in batch mode via os-command. e.g os-command('c:\programs\pdftotext your_filename.pdf'). It does consume a bit memory, so test with one pdf file to start with.

I'll update this response if i manage to find my code. When you google make sure to use "pdf to text batch mode" there are a few options out there.


Thank you for feedback

Thank you for feedback


Do not expect exact transformation

Carlos,

PDF to text is possible depending on the characteristics of the PDF security model you use, images , etc. of the pdf file

These tools are as OCR, try to do the " best pissible".

The information obtained should be validated in some way before you upload.

I'd done this before and the results are quite unsatisfactory.

Take care.