pdf extractor
-
I'm trying to extract a pdf file but give me this error
-
Try adding an writeline above the ReadPDF, and check you are actually giving it a pdf file ?
-
yes is finding the pdf file
-
Does it work with other pdf files ? or is all pdf files broken for you ?
-
Already found the problem it was the pdf
By chance there is a tutorial to extract pdf to excel?
-
No. Not sure that would make any sense either. When grabbing the text of out a pdf, it has no structure, and excel is about structed data.
Maybe you are trying to parse an invoice ? In that case you have 2 options. you can use string manipulation to find the different fields you are interested in ( is going to be hard and super annoying ) or you can look into some of the 3rd party Document Process solution out there. OpenRPA has activities to work with Rossum.AI, and using OpenFlow you can easily integrate with Abby, aws textract (not designed for invoice, but is still useable ), Google invoice Parser, LarcAI, etc. -