List of Googlebot IPs

Sep 11

List of Googlebot IPs

google.txt: I don’t know how well this is kept up-to-date, but it purports to be a list of all the IPs the Googlebot comes from.

This is of interest after the last post, because Google says you can detect the Googlebot User Agent and bypass login pages to index content behind subscription walls. But it also means that anyone can bypass your login page as well by changing their User Agent.

But, if this is an accurate list of Googlebot IPs, then you could detect both the User Agent and the IP. The trick, of course, is to make sure you have a an up-to-date list of IPs. I imagine it changes a lot, but of the 281 lines in this file, several are Class C subnets, which would encompass ~250 other IPs, so the actual number of potential IPs in this list is into the thousands.


Comments

by Sennafan,   September 12, 2008 10:49 AM  

The folks over at IP delivery (John Heard), they are probably the most aggressive in keeping track of all major search engine spider IP addresses. They use the IP addresses always save for their IP delivery software which is pretty potent and powerful stuff.

They have a subscription-based service where you can receive immediate alerts when new IP addresses from search engines (such as Google) are detected.



Add Comment