How do I find when an URL was first indexed by Google?

34,515

Solution 1

To know the age of an URL you can follow this link by replacing www.example.com by the URL you want:

https://www.google.com/search?tbs=cdr%3A1%2Ccd_min%3A1%2F1%2F2000&q=site%3Ahttp%3A%2F%2Fwww.example.com&safe=active&gws_rd=ssl

For example, here's the result from Google for the Meta site of Stack Overflow: enter image description here

Otherwise, the Wayback machine is also a good solution but less precise from my experience.

Solution 2

Zistoloen found a way to have Google display the date when it first indexed the content of the page. I'm adding it to my answer as well because I think I can explain it more clearly.

  1. Search Google for something that brings up the page you want as a result
  2. Use "Search Tools"
  3. Select "Custom Range..." from the "Any time" drop down
  4. Put in a large date range such as 1/1/1900 to 1/1/2020

Google will then show the date that it discovered the content that is on the page in the search result.

first indexed

If the page gets updated with new content, Google also updates this date. So it is more of a "first indexed this content" date rather than "first indexed this URL" date.


The Google cache for a page shows when the page was last indexed. You can see that the Stack Exchange home page was last indexed today:

enter image description here


Another option is using the Internet Archive's Wayback machine. That shows you what a page looked like in the past. You can figure out about when the pages were first published. Both Google and the Internet Archive crawl and use the page shortly after it is first published.

Solution 3

There may not be any way to find out when an arbitrary web page was first indexed by Google — certainly I don't know of any way to do so. It's possible that Google simply does not store that information, since there's no real reason why they'd need to. Besides, even if they do store this information, they really have no particular reason to make it freely available to third parties.

(If it's your own page, and you have access to your old webserver access logs, it's easy — just search the logs for the first visit from Googlebot to that page. But otherwise there may be no way to tell for sure.)


In any case, the method described by Zistoloen and Stephen Ostermiller in their answers does not generally reveal the date when a particular URL was first indexed by Google. Rather, it shows the date that Google thinks the content at the URL was published or last updated on, and is often based on Google's more or less reliable attempts to "sniff" dates from the page content itself.

In this video, Google's Matt Cutts touches briefly on how these dates are chosen. For convenience, I've transcribed the relevant piece of the video (approximately from 2:09 to 2:22) below:

"... often you'll see the date, as we infer it, or when we first saw it, whenever we crawled that page, or if we can find it somewhere on the page, and we can extract that date, you'll see that right at the very beginning of the snippet."

For pages like blog posts, wiki pages or Stack Exchange questions, where the software running site automatically reports an accurate creation / modification date on the page itself, the date reported by Google is likely to match it. For other types of pages, though, Google's date sniffer has to work harder, and it doesn't always get it right (whatever "right" may mean, in this context).

In particular, these dates are basically useless for determining how long ago a page was indexed, for two reasons:

  • If a page was modified recently, and the modification date is displayed prominently on the page, Google may pick it up as "the date" of the page, even if the modification was completely trivial.

    For example, this rather old wiki page (which archive.org first indexed in 2003) is currently datestamped by Google as being from November 10, 2014 — the date at which it was most recently edited, as shown at the bottom of the page. The change that happened on that date? Just removing a single link from the bottom of the page.

  • Conversely, Google seems to be happy to accept very old "publication dates" if it finds them on the page — even those that predate the launch of the World Wide Web.

    For example, this page on an old programming contest is dated by Google to September 15, 1986 — actually the date of the event described on the page. Similarly, this page documenting a student strike in 1970 is dated by Google to May 10, 1970 (the date of one of the scanned documents on the page), and, even more absurdly, this Linux manual page is dated by Google to November 4, 1989 (a random example date used on the page).

    You can find plenty more such examples by using the custom date range search described by Stephen and Zistoloen, but setting the upper end of the range to, say, August 6, 1991.

Share:
34,515

Related videos on Youtube

matcheek
Author by

matcheek

Updated on September 18, 2022

Comments

  • matcheek
    matcheek over 1 year

    How do I find out when a particular URL was first indexed by Google? I'd prefer a solution that works even for competitors' URLs that are not owned by me.

    • Admin
      Admin over 9 years
      It's quite possible that the answer is "no". Google may not even store this information (since there's no particular reason why they'd have to), or, even if they do, they may not expose it to third parties.
    • Admin
      Admin over 9 years
      Like others have mentioned, you can't get this information. If you have access to the server logs, you could see when it was first crawled, but even that doesn't mean that it was indexed then.
  • Stephen Ostermiller
    Stephen Ostermiller over 9 years
    Is that feature of Google available in English as well? You linked to the French version.
  • MDMoore313
    MDMoore313 over 9 years
    @StephenOstermiller Just change the tld from .fr to .com.
  • Sathiya Kumar V M
    Sathiya Kumar V M over 9 years
    @Zistoloen +1 for your answer but I have doubt in your query. As like you said I searched for www.stackexchange.com using your example query but it's not showing any result. I searched by removing www also at this time it's showing result for date Dec 1, 2014 which is very recent date that can't be the age of the URL. Am I searching incorrectly? or missing anything?
  • Stephen Ostermiller
    Stephen Ostermiller over 9 years
    I've "stolen" this answer and added it to my answer with a better explanation. This is a great trick, Zistoloen.
  • Sathiya Kumar V M
    Sathiya Kumar V M over 9 years
    Will your first option give exact result for all URLs? I searched for bing.com as like you explained but not getting first indexed date of bing.com. Sorry if I'm wrong?
  • Stephen Ostermiller
    Stephen Ostermiller over 9 years
    It appears to be accurate as long as the content on the page hasn't changed. Google may reset this date if the page gets an overhaul. My homepage is listed as Feb 1, 2002 even though an internal page is listed as Feb 1, 2001. The home page was redesigned around that time whereas the internal page hasn't changed substantially since 2001.
  • joosthoek
    joosthoek over 9 years
    @Stephen & Zistoloen: That is not generally the date when the page was first indexed by Google. Based on some testing, the date shown via this method appears to be either pulled from the content of the page (if Google thinks it sees something that looks like a "published on" or "last modified on" date) or, if no such date is found in the content, based on the date when Google last observed a (substantial?) change to the page. Of course, if the page was never changed after its first publication, this might happen to be the same as its publication date, but there's no guarantee of that.
  • Stephen Ostermiller
    Stephen Ostermiller over 9 years
    The page changes do appear to have to be "substantial" to get the date changed. I've made many minor changes to my homepage since 2002, but the date that Google has for it was the last time I did a major rewrite.
  • joosthoek
    joosthoek over 9 years
    @Stephen: That may not always be the case; see the examples in my new answer below.
  • Sathiya Kumar V M
    Sathiya Kumar V M over 9 years
    Well in your answer you mainly opposed the answers of Stephen and Zistoloen but I don't find any proper answer for OP's question "How do I find when an URL was first indexed by Google?" in your answer!!
  • joosthoek
    joosthoek over 9 years
    There may not be any answer, other that "you can't" or "nobody knows how". But you're right, let me edit that into my answer.
  • Thebluefish
    Thebluefish over 9 years
    @SathiyaKumar Bing.com gives the date Mar 19, 2014 for me using this method. Do note that anything not indexed by google (via robots.txt or otherwise) obviously won't be shown via this way.
  • HappyFace
    HappyFace almost 3 years
    Duplicate answer.