How to manage log in session through headless chrome?

43,143

Solution 1

In puppeter you have access to the session cookies through page.cookies().

So once you log in, you could get every cookie and save it in a json file:

const fs = require(fs);
const cookiesFilePath = 'cookies.json';
// Save Session Cookies
const cookiesObject = await page.cookies()
// Write cookies to temp file to be used in other profile pages
fs.writeFile(cookiesFilePath, JSON.stringify(cookiesObject),
 function(err) { 
  if (err) {
  console.log('The file could not be written.', err)
  }
  console.log('Session has been successfully saved')
})

Then, on your next iteration right before using page.goto() you can call page.setCookie() to load the cookies from the file one by one:

const previousSession = fs.existsSync(cookiesFilePath)
if (previousSession) {
  // If file exist load the cookies
  const cookiesString = fs.readFileSync(cookiesFilePath);
  const parsedCookies = JSON.parse(cookiesString);
  if (parsedCookies.length !== 0) {
    for (let cookie of parsedCookies) {
      await page.setCookie(cookie)
    }
    console.log('Session has been loaded in the browser')
  }
}

Checkout the docs:

Solution 2

There is an option to save user data using the userDataDir option when launching puppeteer. This stores the session and other things related to launching chrome.

puppeteer.launch({
  userDataDir: "./user_data"
});

It doesn't go into great detail but here's a link to the docs for it: https://pptr.dev/#?product=Puppeteer&version=v1.6.1&show=api-puppeteerlaunchoptions

Solution 3

For a version of the above solution that actually works and doesn't rely on jsonfile (instead using the more standard fs) check this out:

Setup:

const fs = require('fs');
const cookiesPath = "cookies.txt";

Reading the cookies (put this code first):

// If the cookies file exists, read the cookies.
const previousSession = fs.existsSync(cookiesPath)
if (previousSession) {
  const content = fs.readFileSync(cookiesPath);
  const cookiesArr = JSON.parse(content);
  if (cookiesArr.length !== 0) {
    for (let cookie of cookiesArr) {
      await page.setCookie(cookie)
    }
    console.log('Session has been loaded in the browser')
  }
}

Writing the cookies:

// Write Cookies
const cookiesObject = await page.cookies()
fs.writeFileSync(cookiesPath, JSON.stringify(cookiesObject));
console.log('Session has been saved to ' + cookiesPath);
Share:
43,143
Admin
Author by

Admin

Updated on July 05, 2022

Comments

  • Admin
    Admin almost 2 years

    I want to create a scraper that:

    1. opens a headless browser,
    2. goes to a url,
    3. logs in (there is steam oauth),
    4. fills some inputs,
    5. and clicks 2 buttons.

    My problem is that every new instance of headless browser clears my login session, and then I need to login again and again...

    How to save it through instances? (using puppeteer with headless chrome)

    Or how can I open already logged in chrome headless instance? (if I have already logged in in my main chrome window)

  • Zeeshan Chawdhary
    Zeeshan Chawdhary over 5 years
    jsonfile does not seem to work when headless: false, the documentation says "Note: this module cannot be used in the browser."
  • r1si
    r1si over 5 years
    fileExistSync is not a valid function... need to use : stackoverflow.com/questions/4482686/…
  • r1si
    r1si over 5 years
    This is a better solution, keep cookie and local storage
  • Rafael Mejía
    Rafael Mejía about 5 years
    This is the easiest way to persist the session, though you might end up storing more data than what you need. Just launching a browser with this configuration creates a folder containing ~3mb of data. If storage is a concern, you might want to consider @Ecovirtual solution. Otherwise, this is perfect.
  • Ghyath Darwish
    Ghyath Darwish almost 5 years
    Good answer but this take more disk space, can I specify the cookies only to save into this folder??
  • Sinosaurus
    Sinosaurus over 4 years
    I use it not't ok, what should i do?
  • jamis0n
    jamis0n over 4 years
    Does this approach store SESSION cookies? Semantically, the session ends when you close the browser, so I could see the argument for not persisting session cookies between puppeteer page instances this way.
  • Rick Gladwin
    Rick Gladwin almost 4 years
    Here's a version-agnostic link to the Puppeteer docs for launch options, since the version update has killed the old link: pptr.dev/…
  • EcoVirtual
    EcoVirtual over 3 years
    Just updated to use Node's "fs" instead of external dependency for writing and reading files.
  • Yuri Santos
    Yuri Santos almost 3 years
    this is the better way to preserve a session without know details like cookies, session storage, localstorage, web db, etc.