    pdf extractor

    General Discussion
    • J
      João Jesus last edited by

      I'm trying to extract text from a PDF file, but it gives me this error:

      [Image: 4011c9c5-499d-4489-9c09-5063f16d0a49-image.png]

      • Allan Zimmermann
        Allan Zimmermann @João Jesus last edited by

        Try adding a WriteLine above the ReadPDF, and check that you are actually giving it a PDF file.
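
The same check can be sketched outside OpenRPA. This is a minimal illustration in Python, not an OpenRPA activity: `looks_like_pdf` is a hypothetical helper name, and it only tests that the file exists and begins with the `%PDF-` signature, so a file that passes can still be damaged further in.

```python
from pathlib import Path


def looks_like_pdf(path):
    """Return True if the path is a file that starts with the PDF signature."""
    p = Path(path)
    if not p.is_file():
        return False
    with p.open("rb") as fh:
        # A well-formed PDF begins with "%PDF-" followed by a version number.
        # This is a strict check: some readers tolerate junk before the header.
        return fh.read(5) == b"%PDF-"
```

Printing the path and the result of this check just before the read step shows whether the workflow is really being handed a PDF.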

        • J
          João Jesus @Allan Zimmermann last edited by

          @allan-zimmermann

          Yes, it is finding the PDF file.

          • Allan Zimmermann
            Allan Zimmermann @João Jesus last edited by

            Does it work with other PDF files, or are all PDF files broken for you?

            • J
              João Jesus @Allan Zimmermann last edited by

              @allan-zimmermann

              I already found the problem: it was the PDF.

              By any chance, is there a tutorial on extracting a PDF to Excel?

              • Allan Zimmermann
                Allan Zimmermann @João Jesus last edited by

                No, and I'm not sure that would make sense either. When you grab the text out of a PDF, it has no structure, and Excel is about structured data.
                Maybe you are trying to parse an invoice? In that case you have two options. You can use string manipulation to find the fields you are interested in (this is going to be hard and super annoying), or you can look into one of the third-party document processing solutions out there. OpenRPA has activities for working with Rossum.AI, and with OpenFlow you can easily integrate with ABBYY, AWS Textract (not designed for invoices, but still usable), Google Invoice Parser, LarcAI, etc.
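
To give a feel for the string-manipulation option, here is a rough sketch in Python using regular expressions. Everything in it is an assumption made for illustration: the sample text, the field names, and the patterns are invented, and real invoice text extracted from a PDF is usually far messier and differs per vendor.

```python
import re

# Hypothetical text as it might come out of a PDF; real layouts vary widely.
SAMPLE_TEXT = """ACME Corp
Invoice No: INV-1042
Date: 2023-05-17
Total: 1,250.00 EUR
"""

# Illustrative patterns only; each vendor layout usually needs its own set.
FIELD_PATTERNS = {
    "invoice_number": r"Invoice No[:.]?\s*(\S+)",
    "date": r"Date:\s*(\d{4}-\d{2}-\d{2})",
    "total": r"Total:\s*([\d.,]+)",
}


def extract_fields(text):
    """Return a dict of field name -> captured value, or None when not found."""
    fields = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = re.search(pattern, text, flags=re.IGNORECASE)
        fields[name] = match.group(1) if match else None
    return fields


fields = extract_fields(SAMPLE_TEXT)
```

Once the fields are in a dictionary like this, they are structured and could be written out as one spreadsheet row per invoice. The fragile part is the patterns, which is why this approach gets annoying as soon as layouts differ.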
