tag:blogger.com,1999:blog-24069595.post2586904379446111157..comments2024-03-27T04:17:20.550-07:00Comments on The Real Blogger Status: Don't Backup Your Blogs, By Duplicating ThemNitecruzrhttp://www.blogger.com/profile/08069634565746003311noreply@blogger.comBlogger8125tag:blogger.com,1999:blog-24069595.post-65346566112978644022015-02-08T08:53:03.166-08:002015-02-08T08:53:03.166-08:00Gracey,
My suspicion is that Google maintains two...Gracey,<br /><br />My suspicion is that Google maintains two (at least) groups of bots. "GoogleBot" (I'll call it "GoogleContentBot") indexes all "indexable" websites, for Google Search. "GoogleSpamBot" indexes ALL Blogger blogs ("indexable" AND "non-indexable"), for Blogger Spam Classification.<br /><br />If we were to do a Venn diagram of the GoogleContentBot and GoogleSpamBot targets, we would see very slight overlap.<br /><br />We SHOULD be able to track both GoogleContentBot and GoogleSpamBot as they index our blogs. If there is a "safe" period for having duplicated content, it would be just after "GoogleSpamBot" makes its most recent pass through each blog (but before the next pass).<br /><br />But these are simply my musings, on a rainy Sunday morn.Nitecruzrhttps://www.blogger.com/profile/08069634565746003311noreply@blogger.comtag:blogger.com,1999:blog-24069595.post-25468087725115845442015-02-08T06:36:41.916-08:002015-02-08T06:36:41.916-08:00Gracey,
I don't want to mislead anybody - the...Gracey,<br /><br />I don't want to mislead anybody - there is no "non-backup blog" policy.<br /><br />You can have a backup blog, if you want. The issue is that both the original and backup blogs are vulnerable to <a href="http://blogging.nitecruzr.net/2009/11/attack-of-clones.html" rel="nofollow">spam classification, as clones</a>.<br /><br />If your blogs are classified as spam, you will have to go through <a href="http://blogging.nitecruzr.net/2015/01/spam-review-requires-triage.html" rel="nofollow">the review process</a>, to get them restored.<br /><br />You won't enjoy the long-term effects from <a href="http://blogging.nitecruzr.net/2013/06/hacking-malware-spam-classification-and.html" rel="nofollow">being in the review</a> - even if you get your blogs restored - so for everybody's sake, I start out by saying "Don't duplicate your blogs!".<br /><br /><a href="http://blogging.nitecruzr.net/2009/11/attack-of-clones.html" rel="nofollow">http://blogging.nitecruzr.net/2009/11/attack-of-clones.html</a><br /><br /><a href="http://blogging.nitecruzr.net/2015/01/spam-review-requires-triage.html" rel="nofollow">http://blogging.nitecruzr.net/2015/01/spam-review-requires-triage.html</a><br /><br /><a href="http://blogging.nitecruzr.net/2013/06/hacking-malware-spam-classification-and.html" rel="nofollow">http://blogging.nitecruzr.net/2013/06/hacking-malware-spam-classification-and.html</a>Nitecruzrhttps://www.blogger.com/profile/08069634565746003311noreply@blogger.comtag:blogger.com,1999:blog-24069595.post-11628558653932819202015-02-08T05:22:03.760-08:002015-02-08T05:22:03.760-08:00Yes, I've read some of those which is why I wa...Yes, I've read some of those, which is why I was wondering if there were some sort of "safe" time frame for doing such a thing.<br /><br />Right now, my work blog is empty save for a single post that's a draft - and it doesn't have any content published on the active blog, so the situation doesn't apply right 
now.<br /><br />I'm also a little confused - if your robots.txt file is set to disallow crawling of the blog, and Google's bots usually respect that file, how would they know if the content is the same or not?<br /><br />Or, is it based on publishing a whole bunch of posts at one time, whether they're the same or not?<br /><br />That's what would happen if I backed up any of my blogs using the XML, then deleted my blog, opened a new blog with a new URL, and uploaded the XML file. Would I still have problems in doing that?<br /><br />If so, it almost makes no sense to use the backup XML file.<br />Nonihttps://www.blogger.com/profile/02197914508039719085noreply@blogger.comtag:blogger.com,1999:blog-24069595.post-70649903024988114772015-02-07T15:52:15.447-08:002015-02-07T15:52:15.447-08:00Gracey,
All very reasonable, except for one detai...Gracey,<br /><br />All very reasonable, except for one detail.<br /><br />"Private, Non-Crawlable" may not apply to spam classification. I occasionally have blog owners in the forums, reporting a private blog - or one with "robots.txt" or meta tags supposedly blocking crawlers, yet <a href="http://blogging.nitecruzr.net/2014/11/static-blogs-and-spam-classification.html" rel="nofollow">deleted by Blogger for SPAM</a>.<br /><br /><a href="http://blogging.nitecruzr.net/2014/11/static-blogs-and-spam-classification.html" rel="nofollow">http://blogging.nitecruzr.net/2014/11/static-blogs-and-spam-classification.html</a>Nitecruzrhttps://www.blogger.com/profile/08069634565746003311noreply@blogger.comtag:blogger.com,1999:blog-24069595.post-62712104140826667692015-02-07T06:27:25.792-08:002015-02-07T06:27:25.792-08:00While I understand the spam policy, the issue I ha...While I understand the spam policy, the issue I have with this "non-backup blog" policy is simple.<br /><br />When making a complete template or custom design change, the easiest way to do so without affecting the current blog is to use a secondary, private, non-crawlable blog.<br /><br />Using that "work" blog, I uploaded the XML file to the private one so I could fiddle with the layout, colours and sizing I wanted to use or change to. This gave me a chance to see how it would affect certain contents already on my blog (and how much work would be involved in changing it), and make sure I had links set up correctly, and nothing was amiss, before actually making these changes to my active blog. 
It made the transition faster and simpler with less interruption for visitors.<br /><br />Initially, I kept that "work" blog, but after reading this, I deleted all the posts except one that is a draft.<br /><br />Since this sort of thing can't be done offline, a work blog (not meant as a backup, but a place to fiddle with changes or try different coding or scripting) would have been an ideal way.<br /><br />I guess my question would be ... how long can you use a "work blog" with some duplicated contents (I never published all the posts, but deleted all of those anyways too) before it might be hit with a spam designation?<br /><br />It took me about 2 weeks to get everything right before initiating and completing the changes to the active blog.Nonihttps://www.blogger.com/profile/02197914508039719085noreply@blogger.comtag:blogger.com,1999:blog-24069595.post-36166561554870035162015-01-14T16:11:53.295-08:002015-01-14T16:11:53.295-08:00This comment has been removed by the author.Angelina Lenahanhttps://www.blogger.com/profile/02969225046386812994noreply@blogger.comtag:blogger.com,1999:blog-24069595.post-13424336860144354262015-01-14T08:25:39.477-08:002015-01-14T08:25:39.477-08:00Angel,
That is a good question. To discuss "...Angel,<br /><br />That is a good question. To discuss "copying" vs "scraping", IMHO, you need to look at 3 details, at least.<br />1. Intent. Do you intend to do something useful with the copied content - or are you just looking to "bulk up" your blog?<br />2. Permission. Do you have permission to copy (or is the content public)?<br />3. Ratio. Does the original content in your blog vastly outweigh the copied content? I use a 90% original to 10% copied ratio, as a starting point. I would like to think that my blog is more like 95% to 5%.<br /><br />To use the example of "scientists", one famous scientist (Einstein?) said "Every successful scientist stands on the shoulders of every previous scientist." Science, as well as Theology, blatantly copies content - they just use established rules.<br /><br />I bet if we do enough Googling, we can find a mutually respected website which contains "etiquette" or "laws" which discuss this in a more formal manner. We may even find references that are used by Google Legal, when they make the final verdict on an accused blog owner.<br /><br />A question to consider though. Who "owns" the Christian Bible, or the Muslim Koran? Not who has "copyrighted" the various editions - or profits from reprinting something that they, themselves did not write.Nitecruzrhttps://www.blogger.com/profile/08069634565746003311noreply@blogger.comtag:blogger.com,1999:blog-24069595.post-37115044740193170432015-01-14T06:43:27.950-08:002015-01-14T06:43:27.950-08:00This comment has been removed by the author.Angelina Lenahanhttps://www.blogger.com/profile/02969225046386812994noreply@blogger.com