Download image from S3 bucket to Lambda temp folder (Node.js)

21,178

Solution 1

You can get the image using the aws s3 api, then write it to the tmp folder using fs.

var params = {   Bucket: "BUCKET_NAME",   Key: "OBJECT_KEY" };  

s3.getObject(params, function(err, data){   if (err) {
    console.error(err.code, "-", err.message);
    return callback(err);   }

  fs.writeFile('/tmp/filename', data.Body, function(err){
    if(err)
      console.log(err.code, "-", err.message);

    return callback(err);   
  }); 
});

Out of curiousity, why do you need to write the file in order to attach it? It seems kind of redundant to write the file to disk so that you can then read it from disk

Solution 2

If you're writing it straight to the filesystem you can also do it with streams. It may be a little faster/more memory friendly, especially in a memory-constrained environment like Lambda.

var fs = require('fs');
var path = require('path');

var params = {
    Bucket: "mybucket",
    Key: "image.png"
};

var tempFileName = path.join('/tmp', 'downloadedimage.png');
var tempFile = fs.createWriteStream(tempFileName);

s3.getObject(params).createReadStream().pipe(tempFile);

Solution 3

// Using NodeJS version 10.0 or later and promises

const fsPromise = require('fs').promises;

try {
    const params = {
        Bucket: 's3Bucket',
        Key: 'file.txt',
    };

    const data = await s3.getObject(params).promise();

    await fsPromise.writeFile('/tmp/file.txt', data.Body);

} catch(err) {
    console.log(err);
}
Share:
21,178
JBM
Author by

JBM

Updated on July 06, 2020

Comments

  • JBM
    JBM almost 4 years

    Good day guys.

    I have a simple question: How do I download an image from a S3 bucket to Lambda function temp folder for processing? Basically, I need to attach it to an email (this I can do when testing locally).

    I have tried:

    s3.download_file(bucket, key, '/tmp/image.png')
    

    as well as (not sure which parameters will help me get the job done):

    s3.getObject(params, (err, data) => {
        if (err) {
            console.log(err);
            const message = `Error getting object ${key} from bucket ${bucket}.`;
            console.log(message);
            callback(message);
        } else {
    
            console.log('CONTENT TYPE:', data.ContentType);
            callback(null, data.ContentType);
        }
    });
    

    Like I said, simple question, which for some reason I can't find a solution for.

    Thanks!

  • JBM
    JBM over 7 years
    I am using SendGrid, so it seems mandatory for the file to exist on the local disk to attach it. Are you saying that this is not necessarily required?
  • Jonathan Seed
    Jonathan Seed over 7 years
    I am not familiar at all with SendGrid so I am not sure, but my thoughts were that you would download it into memory, thought this could be an issue depending on file size.
  • V. Samma
    V. Samma over 7 years
    @JonathanSeed I am actually having this problem that I am reading a 150-200mb text file with getObject and this makes my Lambda function to reach its maximum memory limit. Is there a workaround or why is the memory limit so low?
  • Jonathan Seed
    Jonathan Seed over 7 years
    @V.Samma You can configure the memory for the lambda function under Advanced Settings in the console. I believe the default value is 128 mb.
  • V. Samma
    V. Samma over 7 years
    @JonathanSeed I may not have been clear enough. By reaching its maximum memory limit I meant that I have already set the maximum memory for my lambda function, which is 1536MB and this limit is reached when my lambda function tries to read in 2 files (one a few KB-s and one 150-200MB-s) and then concatenate them as string values and write the result back to S3.
  • Cinn
    Cinn about 5 years
    do you know if createReadStream() make aws-sdk firing multiple GET calls or it does only one and then stream the data? I am concerned about the cost of this solution
  • Vikas Satpute
    Vikas Satpute about 4 years
    what is path in this?
  • Seafish
    Seafish about 4 years
    @Cinn I believe it should fire just one GET call. It does the same thing as a regular getObject but just exposes the underlying stream
  • Seafish
    Seafish about 4 years
    @VikasSatpute I edited to add path, thanks for calling that out
  • mishsx
    mishsx almost 4 years
    Please add some details to your answer
  • alanwilter
    alanwilter almost 2 years
    This answer is wrong: SyntaxError: await is only valid in async functions and the top level bodies of modules