Skip to main content

Blogs To Have Automatically Generated Sitemaps

Last week, Blogger gave us a feature that various blog owners have asked about, for many years.

The previous sitemap, based on the blog posts feed, has been replaced by an automatically generated, dedicated sitemap. You can see one, for this blog, as an example.

Accompanying the new sitemap will be an updated "robots.txt" file.

The new sitemap will be very simple.

http://blogging.nitecruzr.net/sitemap.xml

The sitemap will include 2 data elements / post.

  • Post URL
  • Last updated date / time (UTC).

The new sitemaps offer interesting diagnostic possibilities, for various blog problems.

By eliminating the posts newsfeed, sitemap access becomes much cleaner.

With these data elements now available without requiring searching through the post content in the newsfeed, any process which indexes or searches, using any of these data elements, will be much simpler - and be more stable, when run.

My suspicion is that several Blogger / Google features, no longer immediately requiring the blog feed in indexing, will be much more usable. Blogs which use dynamic templates, the Reading List, and search engine indexing, will eventually benefit.

Accompanying the new sitemap, which will index posts, will be a sitemap for static pages. You can see a pages sitemap, for this blog, as an example.

http://blogging.nitecruzr.net/sitemap-pages.xml

The pages sitemap appears to have 2 data elements / static page.

  • Page URL
  • Published date / time (UTC).

You will see the new sitemap specified in the "robots.txt" file.


Check the "robots.txt" file on your blog. When the sitemap is installed on your blog, you will see the change.

If you're unfamiliar with the concept, you may read my other posts in this blog - or the Webmaster Tools Help: Learn about sitemaps. Now, we can do other things with the blog feed, without impeding indexing. Possibly, even private blogs can now be indexed.

Large sitemaps will be broken into pages.

Any sitemap with over 150 entries (pages or posts) will be broken into pages - 150 entries / sitemap page, automatically.

Examine the posts sitemap, for this blog - as of 5 June, 2016.

http://blogging.nitecruzr.net/sitemap.xml

<?xml version='1.0' encoding='UTF-8'?><sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"><sitemap><loc>http://blogging.nitecruzr.net/sitemap.xml?page=1</loc></sitemap><sitemap><loc>http://blogging.nitecruzr.net/sitemap.xml?page=2</loc></sitemap><sitemap><loc>http://blogging.nitecruzr.net/sitemap.xml?page=3</loc></sitemap><sitemap><loc>http://blogging.nitecruzr.net/sitemap.xml?page=4</loc></sitemap><sitemap><loc>http://blogging.nitecruzr.net/sitemap.xml?page=5</loc></sitemap><sitemap><loc>http://blogging.nitecruzr.net/sitemap.xml?page=6</loc></sitemap><sitemap><loc>http://blogging.nitecruzr.net/sitemap.xml?page=7</loc></sitemap><sitemap><loc>http://blogging.nitecruzr.net/sitemap.xml?page=8</loc></sitemap><sitemap><loc>http://blogging.nitecruzr.net/sitemap.xml?page=9</loc></sitemap><sitemap><loc>http://blogging.nitecruzr.net/sitemap.xml?page=10</loc></sitemap><sitemap><loc>http://blogging.nitecruzr.net/sitemap.xml?page=11</loc></sitemap><sitemap><loc>http://blogging.nitecruzr.net/sitemap.xml?page=12</loc></sitemap><sitemap><loc>http://blogging.nitecruzr.net/sitemap.xml?page=13</loc></sitemap><sitemap><loc>http://blogging.nitecruzr.net/sitemap.xml?page=14</loc></sitemap><sitemap><loc>http://blogging.nitecruzr.net/sitemap.xml?page=15</loc></sitemap><sitemap><loc>http://blogging.nitecruzr.net/sitemap.xml?page=16</loc></sitemap><sitemap><loc>http://blogging.nitecruzr.net/sitemap.xml?page=17</loc></sitemap><sitemap><loc>http://blogging.nitecruzr.net/sitemap.xml?page=18</loc></sitemap></sitemapindex>

The most current posts will be listed on Page 1.


http://blogging.nitecruzr.net/sitemap.xml?page=1

The sitemap will have a limited size.

The sitemap will provide a maximum of 3,000 entries - 20 pages at 150 posts / page. As new posts are published to the blog, they will be added, automatically. Hopefully, not too many blogs will have 3,000 posts published, before the blog is indexed.

Since the announcement was made, I have added maybe a dozen posts to this blog. I just looked at Page 1 of the sitemap for this blog, and this post is now, there - 5 minutes after this post was published. You may, or may not, see the same update promptness on your blog.

The old sitemap is now not needed.

Both the old and new sitemaps index the same post complement - the old "sitemap" (posts feed) simply contains irrelevant content - the post material.

Let's compare the old sitemap, with the new, using either the content itself or an HTTP trace pair. Click on two of the links below, and compare the results.

The old sitemap:

The old sitemap URL:

http://blogging.nitecruzr.net/feeds/posts/default?redirect=false

The old sitemap HTTP trace:

http://www.rexswain.com/cgi-bin/httpview.cgi?url=http://blogging.nitecruzr.net/feeds/posts/default%3Fredirect%3Dfalse&uag=Mozilla/5.0+(X11%3B+CrOS+armv7l+7978.74.0)+AppleWebKit/537.36+(KHTML,+like+Gecko)+Chrome/50.0.2661.103+Safari/537.36&ref=http://www.rexswain.com/httpview.html&aen=&req=GET&ver=1.1&fmt=TXT

The new sitemap:

The new sitemap URL:

http://blogging.nitecruzr.net/sitemap.xml

The new sitemap HTTP trace:

http://www.rexswain.com/cgi-bin/httpview.cgi?url=http://blogging.nitecruzr.net/sitemap.xml&uag=Mozilla/5.0+(X11%3B+CrOS+armv7l+7978.74.0)+AppleWebKit/537.36+(KHTML,+like+Gecko)+Chrome/50.0.2661.103+Safari/537.36&ref=http://www.rexswain.com/httpview.html&aen=&req=GET&ver=1.1&fmt=TXT

The old and new sitemaps index the same content. The old sitemap simply includes all of the post content, as blog feed - and the new sitemap includes only search engine useful data. Some of the processes that read sitemaps will simply be able to digest the new sitemap easier - they simply walk the sitemap, to index the posts themselves.

Comments

Popular posts from this blog

Embedded Comments And Main Page View

The option to display comments, embedded below the post, was made a blog option relatively recently. This was a long requested feature - and many bloggers added it to their blogs, as soon as the option was presented to us. Some blog owners like this feature so much, that they request it to be visible when the blog is opened, in main page view. I would like all comments, and the comment form, to be shown underneath the relevant post, automatically, for everyone to read without clicking on the number of comments link. And this is not how embedded comments work.

What's The URL Of My Blog?

We see the plea for help, periodically I need the URL of my blog, so I can give it to my friends. Help! Who's buried in Grant's Tomb, after all? No Chuck, be polite. OK, OK. The title of this blog is "The Real Blogger Status", and the title of this post is "What's The URL Of My Blog?".

With Following, Anonymous Followers Can't Be Blocked

As people become used to Blogger Following as just another tool to connect people, they start to think about the implications . And we see questions like How do I block someone who's been following my blog secretly? I couldn't see her in my Followers list (hence I couldn't use the "Block this user" link), but I have looked at her profile and could see that she's Following my blog. Following, when you look at the bottom line, is no more than a feed subscription and an icon (possibly) displayed on your blog, and linking back to the profile of the Follower in question. If someone Follows your blog anonymously, all that they get is a subscription to the blog feed. If you publish a feed from your blog, and if the feed is open to anybody (which, right now, is the case ), then it's open to everybody. If someone wants to use Following to subscribe to the feed, you can't stop this. You can't block it before, or after, the fact. You can't Block w