    Split PDF file on content

    General Discussion
    pdf extract
    • Philippe Amice

      Hello,

      I am looking for a solution that can split a PDF into several PDFs based on words contained in it (the file holds invoices that each span one or more pages).

      Ideally, I should also be able to rename the resulting PDFs using values contained in each invoice (company, customer account, ...). Is this possible with OpenIAP, or do I have to use another solution?

      Thank you

      • Bill Xiao @Philippe Amice

        @philippe-amice I think you can use Python for this kind of PDF processing. For example, PyPDF2 can split and merge PDF files.
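As a rough illustration of that suggestion, here is a minimal sketch. It assumes the `pypdf` package (the maintained successor to PyPDF2) and assumes that every invoice ends on the page whose extracted text contains a marker word such as "total". The function names, the marker and the output file names are hypothetical and not part of OpenIAP.

```python
from pathlib import Path


def group_pages(page_texts, marker="total"):
    """Group page indices into documents.

    Assumption: a document ends on the first page whose text contains `marker`.
    """
    groups, current = [], []
    for index, text in enumerate(page_texts):
        current.append(index)
        if marker.lower() in text.lower():
            groups.append(current)
            current = []
    if current:  # trailing pages without the marker form a final document
        groups.append(current)
    return groups


def split_pdf(source, out_dir, marker="total"):
    """Write one PDF per page group and return the paths that were written."""
    # Imported here so group_pages stays usable without pypdf installed.
    from pypdf import PdfReader, PdfWriter

    reader = PdfReader(source)
    texts = [page.extract_text() or "" for page in reader.pages]
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    paths = []
    for number, pages in enumerate(group_pages(texts, marker), start=1):
        writer = PdfWriter()
        for index in pages:
            writer.add_page(reader.pages[index])
        target = out_dir / f"invoice_{number:03d}.pdf"  # hypothetical naming scheme
        with open(target, "wb") as handle:
            writer.write(handle)
        paths.append(target)
    return paths
```

A plain substring match also fires on words like "subtotal", and it only works on PDFs with a text layer, so the marker would likely need tuning for real invoices. No coordinates are needed for this approach, since it looks only at the extracted text of each page.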

        • Allan Zimmermann @Philippe Amice

          Try this
          New Workflow6.xaml

          • Philippe Amice @Allan Zimmermann

            @allan-zimmermann Thanks for your example, but it cuts my file at every page, and I have invoices that span two pages. How can I tell it to break on a word, e.g. "total"? Do I have to specify coordinates? Finally, for naming the PDFs, can I extract data contained in them? Thanks again.

            • Allan Zimmermann @Philippe Amice

              Ah, I misread your post.
              Sorry, I don't know. Maybe try asking the people behind iTextSharp, or have a look at some of the Python libraries @Bill-Xiao suggested.
              While creating my original workflow I also looked at using Node-RED. I got it working inside Node-RED using pdf-lib, but decided that creating a simple XAML file was easier than explaining how to enable modules and use require in function nodes inside Node-RED. But if you have very specific requirements, that might be worth the hassle anyway.
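For the naming part of the question, one possible approach along the Python route is to run regular expressions over the text extracted from each invoice. The sketch below uses only the standard library. The labels "Company:" and "Customer account:" are assumptions about the invoice layout, and `FIELD_PATTERNS` and `invoice_filename` are hypothetical names.

```python
import re

# Assumed invoice labels; adjust the patterns to the real layout.
FIELD_PATTERNS = {
    "company": re.compile(r"Company\s*:\s*(.+)", re.IGNORECASE),
    "account": re.compile(r"Customer account\s*:\s*(\S+)", re.IGNORECASE),
}


def invoice_filename(text, fallback="invoice"):
    """Build a file name from fields found in the invoice text."""
    parts = []
    for pattern in FIELD_PATTERNS.values():
        match = pattern.search(text)
        if match:
            # Replace anything that is not a letter or digit so the name is file-system safe.
            value = re.sub(r"[^A-Za-z0-9]+", "_", match.group(1).strip()).strip("_")
            if value:
                parts.append(value)
    return ("_".join(parts) or fallback) + ".pdf"


print(invoice_filename("Company: Acme Ltd.\nCustomer account: C-1042\nTotal: 10"))
# prints "Acme_Ltd_C_1042.pdf"
```

The text passed in would come from whatever extraction step is used for splitting. If a field is missing, the function falls back to a generic name rather than failing.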
