Tag Archive

Apple Confirms Their Web Crawler: Applebot

Published on 2015/05/06 By admin

Apple finally confirms they have a web crawler, primarily used for Siri and Spotlight Suggestions. The post Apple Confirms Their Web Crawler: Applebot appeared first on Search Engine Land.

How Search Engines Process Links

Published on 2015/04/13 By admin

Ever wondered how search engines crawl, analyze, index, and rank pages? Columnist Jenny Halasz has created a helpful primer on the link graph to answer these questions. The post How Search Engines Process Links appeared first on Search Engine Land

Google Now Supports Crawling & Indexing Locale-Adaptive Web Pages

Published on 2015/01/28 By admin

Google adds support for web pages that dynamically change their content based on IP origin or language settings. The post Google Now Supports Crawling & Indexing Locale-Adaptive Web Pages appeared first on Search Engine Land.

Google Mobile Search Descriptions Now Say If You’re Blocking Google

Published on 2015/01/27 By admin

Yet another reason not to block JavaScript, CSS, etc from Google.

Google Orders Terminator Robots Not To Kill Founders Brin & Page

Published on 2014/07/03 By admin

How do you stop the unstoppable killer Terminators if you’re not Sarah Connor? Google does it with a simple text file. People have noted today that Google has a special “robot.txt” file that pokes fun at stopping the Terminators.

Robots.txt Celebrates 20 Years Of Blocking Search Engines

Published on 2014/06/30 By admin

Today is the 20th anniversary of the robots.txt directive being available for webmasters to block search engines from crawling their pages. The robots.txt was created by Martijn Koster in 1994 while he was working at Nexor after having issues with crawlers hitting his sites too hard.

Google Fixing Reverse DNS GoogleBot Verification

Published on 2014/04/03 By admin

Google has confirmed that there are some GoogleBot useragent spiders not properly passing verification protocol. Savvy webmasters noticed that GoogleBot over the .249.70.0 /24 IP range was not returning the proper reverse DNS verification details. The response given was “no such host is..

Google’s Matt Cutts: NoFollow Attributes On Internal Links Don’t Hurt But Generally Don’t Do It

Published on 2013/09/30 By admin

In a video, Google’s head of search spam Matt Cutts published today an answer to the question, “Should I use rel=”nofollow” on internal links to a login page?” Matt Cutts basically said you shouldn’t, but said it won’t hurt you if you did. Matt said,… Please visit Search Engine Land for the full article.

Snowden Petition Blocked From Google? Like All Petitions, It Won’t Be When It Gets Enough Signatures

Published on 2013/06/28 By admin

Search for “edward snowden petition” to find the petition filed through the White House’s Petition site, and you’ll see something odd. The petition has no description, because the White House won’t let Google crawl the page. But it’s not a move against Snowden,..

Twitter Opens Up To More Crawling, But Do Search Engines Want Its Search Results In Theirs?

Published on 2012/09/27 By admin

Twitter recently updated its robots.txt file and, though the change opens up millions of pages to being crawled, there’s no guarantee that the main search engines want what Twitter is offering. The Sociable seems to have been the first to notice Twitter’s robots.txt changes, which now… Please visit Search Engine Land for the full article