Google Loves a Good Scrape

Look how old this is!
I post at SearchCommander.com now, and this post was published 13 years 8 months 13 days ago. This industry changes FAST, so blindly following the advice here *may not* be a good idea! If you're at all unsure, feel free to hit me up on Twitter and ask.

If you look at the Wikipedia definition of a scraper website, it says

“A scraper site is a spam website that copies all of its content from other websites”.

Well Google has a new project, that in my opinion, is basically just a well done Google scraper.

Over the past dozen years or so, the advent of RSS feeds and other automated content distribution technology has led to the development of hundreds of scripts, software programs, products and plugins, that all seek to locate, and regenerate content, in order to blog, post, or otherwise auto-magically display content that comes from other sources.

I’ve always maintained (granted, it’s often been only to myself while submitting a reinclusion request) that a well organized scraper site CAN have some actual value.

I’ve also long believed that augmenting the original content with “fed” or “scraped” content on the same page can have a positive impact on your search rankings.

Google on the other hand, has long made it their mission to put the words “scraped content” into the same category as “paid links” (i.e. Evil) , and they have always publicly discouraged the practice, claiming to be trying to get rid of scraped pages from the index even if they’ve rewarded it behind the scenes with rankings and big Adsense checks.

Last week I found out about a new Google project, and when I saw it I was stunned – I think it’s nothing but a scraper engine!

On the one hand, it is fun to play with and I do see the value in it. On the other hand, it seems sort of hypocritical. Why is it okay for them to do it, but if I want to do it on my site I’m a “bad guy”?

Here’s a quick video below, and although the site is now shut down, you can read about what was called Google WYDL here.