Skip to main content

What Is Different Between Copying And Scraping?

We see a few spam blogs, in Blogger Help Forum: Get Help with an Issue, that are classified for having scraped content.

Some nervous blog owners, who routinely copy content as part of their blog posts, want to know if their blogs are vulnerable to "scraped content" spam classifications.

I suspect that there are several issues to consider, regarding use of copied content in your blog.

To discuss "copying" vs "scraping", IMHO, you need to look at at least 3 details.

  1. Intent. Do you intend to do something useful with the copied content - or are you just looking to "bulk up" your blog?
  2. Permission. Do you have permission to copy (or is the content public)?
  3. Ratio. Does the original content in your blog vastly outweigh the copied content?

Intent.

You can start with copied content - but a blog with any future has to have some source of its own content - unique, plus informative and interesting.

If you are going to copy another blog or website, use the copied content to illustrate or to reinforce a key point in your blog post. Don't build your post around somebody else's content - start with your own content, and add relevant samples of other blogs and websites.

You can design a blog like a high school term paper, if you wish. Just remember, if you can Google it, and scrape into your blog, Google can find it - as can your teacher.

Permission.

If you copy content, make sure that you have permission of the owner. Be careful who you get permission from.

The blog or website that displays the content may not actually be the owner. Find out the legal owner of the content that interests you, and get permission from that person.

And if the owner is kind enough to give you permission to copy, and provides guidelines, observe the guidelines. Be polite - not presumptuous, when you interpret someone's permissions.

Ratio.

In 2013, Matt Cutts of Google estimated that somewhere between 25% to 30% of the content on the web is duplicative. That does not mean that your blog should use 25% copied content.

If you have a small blog, and are just starting out, a 10% (copied) to 90% (original) is a much better goal. In this blog, I would like to think I have more like 5% to 95%.

I, personally, don't mind if people copy my content - as long as they include the links in the posts.

You will get better search engine reputation with your own content. If you copy content from blogs and websites with better reputation, guess which blog / website has better chance of being linked? More original content == more content to index == better reputation == better chance of being listed, with duplicated content.

The more original content you publish, that gets indexed, the better chance you have of getting readers. The less original content in the blog, the greater the chance the blog will be classified, as a spam host. It's that simple.

Comments

Popular posts from this blog

Embedded Comments And Main Page View

The option to display comments, embedded below the post, was made a blog option relatively recently. This was a long requested feature - and many bloggers added it to their blogs, as soon as the option was presented to us. Some blog owners like this feature so much, that they request it to be visible when the blog is opened, in main page view. I would like all comments, and the comment form, to be shown underneath the relevant post, automatically, for everyone to read without clicking on the number of comments link. And this is not how embedded comments work.

What's The URL Of My Blog?

We see the plea for help, periodically I need the URL of my blog, so I can give it to my friends. Help! Who's buried in Grant's Tomb, after all? No Chuck, be polite. OK, OK. The title of this blog is "The Real Blogger Status", and the title of this post is "What's The URL Of My Blog?".

With Following, Anonymous Followers Can't Be Blocked

As people become used to Blogger Following as just another tool to connect people, they start to think about the implications . And we see questions like How do I block someone who's been following my blog secretly? I couldn't see her in my Followers list (hence I couldn't use the "Block this user" link), but I have looked at her profile and could see that she's Following my blog. Following, when you look at the bottom line, is no more than a feed subscription and an icon (possibly) displayed on your blog, and linking back to the profile of the Follower in question. If someone Follows your blog anonymously, all that they get is a subscription to the blog feed. If you publish a feed from your blog, and if the feed is open to anybody (which, right now, is the case ), then it's open to everybody. If someone wants to use Following to subscribe to the feed, you can't stop this. You can't block it before, or after, the fact. You can't Block w