Skip to Main Content (press enter)

Archive for August, 2004

Case studies, blog postings, research papers, tools info and eBook notes

Google Cache adds Crawl Date

The Google Cache improves. A useful addition to the information stored in the Google Cache and subsequently published to the web for our consumption, is the precise date and time that the page being viewed was crawled by the Googlebot. Knowing when a site’s pages are crawled is very useful when trying to accelerate the indexing of new content, and to understand about the optimum times for content changes.

Many sites enjoy ‘daily indexing’ in view of the frequency of their content changes and this fact is more obviously displayed beneath their entry in the SERPs by a date attribute appearing after the page URL the file size, e.g. www.johnston.co.uk/ – 19k – 31 Jul 2004. For those pages that do not see the crawler every day, and this is often the case with pages that are targeted for improved performance in the SERPs, then understanding the pattern of crawling is very helpful.

To see examples of this, make a search, such as for a Google Consultant and scan down the list to find results that do not have a date attribute in the entry on the page. At the time of writing, the third entry down for Ring John qualifies. Click on the Cached link to view the Google Cache of this page. At the end of the very top line of the information Google presents is the following: ‘as retrieved on 13 Jun 2004 03:29:07 GMT’.

The Google Cache always was a useful tool, it has now just got significantly more so.

(Filed in Blog, August 2nd, 2004)