On Thu, 02 Jun 2016 23:51:03 -0700, dharmeshbhalodiya18 wrote:
> Hi,
>
> i have numbers of pdfs for row data in tabular format inside each pdf.
> is there any solution for this for reading pdf and directly send data
> into database? because i already convert it into excel but formatting of
> data is not properly converted into it. and also want some automated
> solution for this because huge numbers of pdfs which can not manually
> insert into database. around 20K pdf files. is ther any proper way to do
> this?
>
> Thanks
Use pdftotext or such to extract the available text from the PDF to
see if it's in any kind of shape that can be used or reformatted
for import.