Merge Pdf Files Using PDFBox

11,660

Call PDFMergerUtility.appendDocument(PDDocument destination, PDDocument source) instead of merge. To load from a byte array, call PDDocument.load(array).

Btw: I'd discourage to merge PDDocument objects that you created yourself, this fails if you have font subsets (see issue PDFBOX-3243). Better save them (e.g. in a ByteArrayOutputStream) and reload and then merge. To get a byte array from a ByteArrayOutputStream, use ByteArrayOutputStream.tobytearray().

Share:
11,660
Raushan
Author by

Raushan

Updated on June 14, 2022

Comments

  • Raushan
    Raushan almost 2 years

    I have to merge two pdf Files using PdfBox of Apache. I have taken physical pdf files to do so. Below is the code:

    PDFMergerUtility ut = new PDFMergerUtility();
    ut.addSource(path1);
    ut.addSource(path2);
    ut.setDestinationFileName(path3);
    ut.mergeDocuments(MemoryUsageSetting.setupMainMemoryOnly());
    

    Files are merged perfectly but then have some constraints as below:

    1. I am creating 1st file in the code itself,so it is PDDocument object.
    2. The file which I have to merge with 1st file is in byte array format.
    3. I don't need to save the merged files but need convert it as byte array.

    Please anyone help me achieve this requirement.

  • Trinimon
    Trinimon about 7 years
    do you have an example for the ByteArrayOutputStream approach? I tried to save two docs into two streams, combined the arrays and reloaded the combined bytes into a new document. However, it didn't seem to work.
  • Tilman Hausherr
    Tilman Hausherr about 7 years
    ByteArrayOutputStream baos = new ByteArrayOutputStream(); doc.save(baos); doc.close(); doc = PDDocument.load(baos.toByteArray());
  • Trinimon
    Trinimon about 7 years
    So you meant one should save and reload the self-made PDDocument? ... and your comment was not about merging documents by combining ByteArrayOutputStream's. Right?
  • Tilman Hausherr
    Tilman Hausherr about 7 years
    Yes, you should save and reload the self-made PDDocument before merging.