Content Scraping: What It Is And What To Do About It

As content marketing continues to become a force to be reckoned with, so does content scraping.

Content scraping is when a third party copies the content from your website or blog and reposts it without consent – and often without attribution. Surprisingly, there are benefits to content scraping, and opinion differs on whether it’s a good or bad thing.

On the pro side is Joan Muschamp, founder and CEO of LemonZest Marketing, who says, “The nature of the blog is that it is internet content, and designed for sharing. Blogs are a form of social media and by definition are meant to be shared to benefit social reach.”

On the other side is Bethany Gonzalez Moreno, founder of B.EcoChic.“Stealing content does not help build a business,” she says. “And it does not provide any real value to your reader or customer.”

So what are the pros and cons of getting your own content scraped?

Pro: scraping can broaden your reach: When a scraper posts your content, you or your company may gain visibility with a niche audience you would not have reached otherwise.  Of course, this only applies if the scraper gives you attribution and your internal links are intact.

Pro: it can help your SEO: Scraped content can increase your website or blog’s SEO. By linking internally to other posts on your blog, you create free backlinks, which will drive traffic to your site and increase your SEO. 

Con: it can also hurt your SEO: If the scraped article ranks higher than your original version, that’s bad for your Google ranking.

Con: decreased brand awareness: Scraping can cause Google to have difficulty deciphering which article is the original. For smaller companies this could be crippling.

“As a company that relies on organic search traffic, we are constantly battling content thieves. Scraped content can result in fewer visitors coming to our site and thus fewer purchases are made,” says Leslie Handmaker, senior marketing manager at Next Day Flyers,.

Con: lack of attribution: Scrapers often remove the author or company’s name from the content, negating thehard work you put into creating it and, in some cases, opening you up to damaging accusations of plagiarizing your own content.

Don’t want your content scraped? Here are six ways to catch it as it happens and protect yourself from it:

  1. Google Alerts: HARO’s Peter Shankman, suggests taking one line from every blog post and creating a Google alert. When the whole post is stolen, an alert will be triggered.
  2. Canonical Links: Add the rel=”canonical” tag to your content and your site will get credit for the post. Google will also see the tag and the scraper site could potentially get penalized.
  3. Copyscape: This service provides a free plagiarism checker to find copies of your content online.
  4. CAPTCHA: Most scraping is propagated by bots. CAPTCHA requires a human to type in a few jumbled letters and numbers. This helps to ensure that your content does not get scraped by a computer and will also reduce the amount of spam on your site.

    Image via: http://bit.ly/QqaGmq

  5. Pinging: Pinging alerts search engines that your content has been uploaded before including your post or article on an RSS feed. Various services offer this capability, such as Ping-O-Matic.
  6. Internal Links: Adding internal links to your posts will create trackbacks if a scraper tries to steal your content.

If you realize your content has already been stolen, first contact the website owner/scraper and ask them to take your content down. If the scraper ignores you or refuses to take down your stolen content, you can file a DMCA (Digital Millennium Copyright Act) with their host. Be warned: this is time-consuming. It’s easier to take preventative measures like points 2 and 4 above.

 

Image: Caseorganic (Creative Commons)

Did you like the post?

1 Star2 Stars3 Stars4 Stars5 Stars (No Ratings Yet)
Loading ... Loading ...
  • Joan Muschamp

    I must add that I also said that the reuse of content ALWAYS must include an attribution, and anything otherwise is not appropriate. While I do not use automated scrapers personally, nor do I recommend them, there can be benefits if the attribution and backlinks are present.

    I certainly do not wish to be associated with or viewed as participating or condoning what I would consider completely inappropriate.
    Joan Muschamp
    LemonZest Marketing

  • Brian D. Meeks

    I’ve had this happen to several guest posts I’ve written for Spin Sucks. The pros and cons didn’t really apply to me, because the piece wasn’t for my site, but it made me really mad. It ruined my entire day and most of the night as I plotted ways to right this horrible wrong. In the end, I ran out of steam and realized that the blogger, who was in Japan, had a crappy site and probably didn’t get much traffic anyway. Still, I did wish them ill.

    Stealing is wrong. It is wrong with pictures. (No, putting up a link to a picture and saying CC 2.0, doesn’t mean you didn’t steal it or won’t be sued) It is wrong with words. It makes people who work hard to write and take photographs think you are a terrible person who was never loved by their parents and probably have a bad odor about you.

    I say, don’t scrape.

  • Ponypress

    I think using content others have been creating without proper attribution is plain stealing. Content curation is the new, big thing in internet marketing, so that many wp plugins and software is being produced to copy content into someone’s blog easily. If it is used as a plus for your readers, enhanced by your own content and commentary and attributed correctly, it usually gives your visitors a better experience: you distill, they can consume faster. On the other side, if it is only pasted with a new headline and crappy affiliate links, it may be really harming the original writer’s reputation in a certain sense.
    So, curation yes, scraping no.

    • http://twitter.com/scott_benson Scott Benson

      I think you hit it on the head with asking;
      “does this content improve my visitor’s experience?”

      I’m a little surprised by the level of disgust for
      the scrapper sites – though maybe I’m desensitized due to the nature of my work
      (SEO) – and just to play devil’s advocate, is there any sense of flattery that
      your content was deemed scrape-worthy? J

  • Pingback: Content Scraping and What to Do About It | Digital Brand Marketing | Scoop.it

  • Pingback: Are Blog Stats Like the Skinny Model on Your Fridge? | Something Wagging This Way Comes

  • Pingback: Speeding is Against the Law |

  • Pingback: seo services