<\/span>Another SEO myth – the page “spider” and the whole document render myth<\/span><\/h3>Judging by how people describe crawling and “spidering” the web, a large share of newcomers to SEO, as well as experienced practitioners, seem to believe that Google downloads a whole document, parses it in one go, reads it like a person in a “bot-browser”, and then “clicks around the page like a virtual person”. This is absolute nonsense and just a common human habit called “anthropomorphism.” Google doesn’t work that way. Google might render the parts of a site that require scripts in order to grab text, but it starts with whatever text and links it finds in a page and then updates its crawl lists.<\/p>
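To make the "start with whatever text and links it finds" idea concrete, here is a minimal sketch in Python. It pulls links out of raw HTML without rendering anything and adds the unseen ones to a crawl list. The names `LinkExtractor`, `discover_links` and `known_urls` are hypothetical, and this only illustrates the idea; it is not how Googlebot is implemented.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    # Collects href targets from <a> tags in raw HTML; no scripts are run.
    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != 'a':
            return
        for name, value in attrs:
            if name == 'href' and value:
                # Resolve relative links against the page URL.
                self.links.append(urljoin(self.base_url, value))


def discover_links(base_url, html, known_urls):
    # Returns URLs not seen before and records them in known_urls (a set).
    parser = LinkExtractor(base_url)
    parser.feed(html)
    new_urls = []
    for url in parser.links:
        if url not in known_urls:
            known_urls.add(url)
            new_urls.append(url)
    return new_urls
```

Feeding a page's HTML to `discover_links` returns only the URLs that were not already in the set, which is roughly the "update crawler lists" step described above.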
Some Googlebots are XML listeners that dispatch a crawler when a new page is added; this is how Caffeine works (think CNN and other news sites). Pages with organic traffic are refreshed automatically.<\/p>
<\/span>How Google Crawling works<\/span><\/h2>Google keeps lists of URLs to crawl: live pages it already serves, URLs reported by listening agents, and newly discovered URLs. That’s where most crawling comes from.<\/p>
Google doesn’t sit there going from sitemap to sitemap and crawling every page. For the most part, I’d say 90% of sitemaps don’t have a listener because they don’t get traffic.<\/p>
Instead, URLs are crawled in triage: the top 1% of the web is crawled every hour (source: Matt Cutts), then the middle of the web (the next 9%), and then the base of the pyramid (the remaining 90%) at ever longer intervals that approach almost never. This 90% is page-level, not just domain-level.<\/p>
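The triage split can be sketched as a simple tier lookup. This is a toy model in Python: the 1% / 9% / 90% cut-offs and the hourly top tier come from the text above, while the other two intervals, the tier names and the `percentile` input are placeholders I made up for illustration.

```python
from datetime import datetime, timedelta

# Only the hourly top tier is stated in the text; the other two intervals
# are hypothetical placeholders.
REFRESH_INTERVALS = {
    'top': timedelta(hours=1),
    'middle': timedelta(days=2),
    'base': timedelta(days=60),
}


def assign_tier(percentile):
    # percentile: 0.0 is the most authoritative page, 1.0 the least.
    if percentile < 0.01:
        return 'top'
    if percentile < 0.10:
        return 'middle'
    return 'base'


def is_due(percentile, last_crawled, now):
    # A page is due when its tier's refresh interval has elapsed.
    return now - last_crawled >= REFRESH_INTERVALS[assign_tier(percentile)]
```

With these placeholder numbers, a page in the top 1% is due again after an hour, while a page at the base of the pyramid waits two months.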
<\/span>Crawlers and Indexing<\/span><\/h2>Crawling isn’t that separate from indexing, because crawlers also record where each link was found and what kind of link it is (301, external, internal, etc.). Indexing and crawling happen at the same time, and together they are “search”: Google doesn’t search anything by the time the user gets to it. It just dumps an index, which is a list of URLs that it found, indexed and sorted WHEN they were found. A page’s position is based on how much authority it has for that index, and this is updated with every incoming link found as the bots and indexers parse whole or partial documents.<\/p>
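Here is a toy model of that claim in Python: all the work happens when a page is crawled, and a query is just a lookup in a list that is already built. The `ToyIndex` class and its use of an incoming-link count as "authority" are my own simplifications, not Google's actual data structures.

```python
from collections import defaultdict


class ToyIndex:
    def __init__(self):
        # term -> set of URLs whose text contains that term
        self.postings = defaultdict(set)
        # URL -> incoming links seen so far (a crude stand-in for authority)
        self.authority = defaultdict(int)

    def add_page(self, url, text, outlinks):
        # Called at crawl time: index the text and credit every linked page.
        for term in text.lower().split():
            self.postings[term].add(url)
        for target in outlinks:
            if target != url:
                self.authority[target] += 1

    def lookup(self, term):
        # Called at query time: no crawling, only a sorted read of the index.
        urls = self.postings.get(term.lower(), set())
        return sorted(urls, key=lambda u: (-self.authority.get(u, 0), u))
```

Every call to `add_page` can change the order that `lookup` returns, which mirrors the point that positions are updated as new incoming links are found.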
<\/span>How to test this in GSC<\/span><\/h2>Inside your Google Search Console (GSC), go to Pages and look at your page index history – you can see the oldest pages and how long it’s been since they were crawled.<\/p>
Then go to your Performance Overview and look at the top 5-10 pages with the most clicks in the last week. Inspect each page and see when it was indexed. Pages with more than 10 clicks a week are likely to be crawled more frequently and automatically.<\/p>
Everything else is just ignored.<\/p>
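If you copy those numbers out of GSC by hand, a few lines of Python can do the comparison. This is only a sketch: the `pages` records (with `url`, `clicks` and `last_crawled` fields) are a hypothetical format you would fill in yourself, and the 10-click threshold is the one mentioned above.

```python
from datetime import date
from statistics import median


def crawl_age_by_traffic(pages, today, click_threshold=10):
    # Split pages by weekly clicks and compare how stale their last crawl is.
    busy, quiet = [], []
    for page in pages:
        age_days = (today - page['last_crawled']).days
        if page['clicks'] > click_threshold:
            busy.append(age_days)
        else:
            quiet.append(age_days)
    return {
        'busy_median_days': median(busy) if busy else None,
        'quiet_median_days': median(quiet) if quiet else None,
    }
```

If the idea holds for your site, the median crawl age of the busy pages should be much lower than that of the quiet ones.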
<\/span>What XML Sitemaps really do<\/span><\/h3>Sitemaps are just a way for your CMS to issue you a receipt of what you published and for Google to let you know precisely which of those pages it indexed. A sitemap is not a control list. Google’s list of URLs comes from:<\/p>
- Chrome URLs accessed (not from user behavior, just from users visiting new URLs)<\/li>
- Discovery via indexing other pages
- This is why, if your blog is discovered through a link from CNN, you will go to “page 1”<\/li><\/ol><\/li>
- Discovery in internal links<\/li>
- Via site refresh<\/li>
- Via a listener or repeat crawl (like Caffeine) if you are on Google News or Discover<\/li>
- Via triage via XML<\/li><\/ol>
Sitemaps don’t actually account for most of where Google spends its crawling time, which is on new pages.<\/p>
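The "receipt" view of a sitemap can be sketched in Python: read the URLs your CMS says it published and work out what share of them appear in a set of indexed URLs. The namespace is the standard one from the sitemaps.org protocol; the `indexed_urls` set is an assumption, something you would assemble yourself from GSC.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = {'sm': 'http://www.sitemaps.org/schemas/sitemap/0.9'}


def sitemap_urls(xml_text):
    # Returns every <loc> value listed in a <urlset> sitemap.
    root = ET.fromstring(xml_text)
    locs = root.findall('sm:url/sm:loc', SITEMAP_NS)
    return [loc.text.strip() for loc in locs if loc.text]


def indexed_share(xml_text, indexed_urls):
    # Fraction of published URLs that are present in indexed_urls.
    published = sitemap_urls(xml_text)
    if not published:
        return 0.0
    hits = sum(1 for url in published if url in indexed_urls)
    return hits / len(published)
```

The result is a number between 0 and 1, the fraction of what you think you published that is actually indexed.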
<\/span>Google Crawler Triage<\/span><\/h3>- New content from highly authoritative sites<\/li>
- Function Specific crawlers
- Caffeine
- News, Discover, Products<\/li><\/ol><\/li><\/ol><\/li>
- Refresh
- Hourly – Highly authoritative domains,
- lastmod trusted = true<\/li><\/ol><\/li>
- Daily – Tier 2<\/li>
- Monthly – Tier 3
- Daily and monthly mean every other day or month, not every dayXML<\/li><\/ol><\/li><\/ol><\/li><\/ol>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<\/div>\n\t\t","protected":false},"excerpt":{"rendered":"
Sitemaps are just a document that tells you % of pages you THINK you published are indexed, that’s all. The XML sitemap does very little. This comes from building up Google more than what it is – as Gary Illyes keeps pointing out – it’s incredibly basic. SEO Myth One: Sitemaps “Tell” Google what to […]<\/p>\n","protected":false},"author":4,"featured_media":5970,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[278],"tags":[],"yoast_head":"\n
Fact vs Fiction: Sitemap Optimization and SEO<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/primaryposition.com\/blog\/seo-sitemap-optimization\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Fact vs Fiction: Sitemap Optimization and SEO\" \/>\n<meta property=\"og:description\" content=\"Sitemaps are just a document that tells you % of pages you THINK you published are indexed, that’s all. The XML sitemap does very little. This comes from building up Google more that what it is – as Gary Ylles keeps pointing out – its incredibly basic. SEO Myth One: Sitemaps “Tell” Google what to […]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/primaryposition.com\/blog\/seo-sitemap-optimization\/\" \/>\n<meta property=\"og:site_name\" content=\"Primary Position SEO NYC\" \/>\n<meta property=\"article:published_time\" content=\"2024-08-31T14:42:58+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-12-07T22:02:10+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/primaryposition.com\/wp-content\/uploads\/2024\/08\/seo-sitemap-optimization-scaled.jpeg\" \/>\n\t<meta property=\"og:image:width\" content=\"2560\" \/>\n\t<meta property=\"og:image:height\" content=\"1536\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"David Quaid\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"David Quaid\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/primaryposition.com\/blog\/seo-sitemap-optimization\/\",\"url\":\"https:\/\/primaryposition.com\/blog\/seo-sitemap-optimization\/\",\"name\":\"Fact vs Fiction: Sitemap Optimization and SEO\",\"isPartOf\":{\"@id\":\"https:\/\/primaryposition.com\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/primaryposition.com\/blog\/seo-sitemap-optimization\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/primaryposition.com\/blog\/seo-sitemap-optimization\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/primaryposition.com\/wp-content\/uploads\/2024\/08\/seo-sitemap-optimization-scaled.jpeg\",\"datePublished\":\"2024-08-31T14:42:58+00:00\",\"dateModified\":\"2024-12-07T22:02:10+00:00\",\"author\":{\"@id\":\"https:\/\/primaryposition.com\/#\/schema\/person\/a8055f39ec47ce1130cbe034e06b9485\"},\"breadcrumb\":{\"@id\":\"https:\/\/primaryposition.com\/blog\/seo-sitemap-optimization\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/primaryposition.com\/blog\/seo-sitemap-optimization\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/primaryposition.com\/blog\/seo-sitemap-optimization\/#primaryimage\",\"url\":\"https:\/\/primaryposition.com\/wp-content\/uploads\/2024\/08\/seo-sitemap-optimization-scaled.jpeg\",\"contentUrl\":\"https:\/\/primaryposition.com\/wp-content\/uploads\/2024\/08\/seo-sitemap-optimization-scaled.jpeg\",\"width\":2560,\"height\":1536,\"caption\":\"SEO Sitemap and 
Optimization\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/primaryposition.com\/blog\/seo-sitemap-optimization\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/primaryposition.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Fact vs Fiction: Sitemap Optimization and SEO\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/primaryposition.com\/#website\",\"url\":\"https:\/\/primaryposition.com\/\",\"name\":\"Primary Position SEO\",\"description\":\"Primary Position SEO NYC\",\"alternateName\":\"Primary Position\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/primaryposition.com\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/primaryposition.com\/#\/schema\/person\/a8055f39ec47ce1130cbe034e06b9485\",\"name\":\"David Quaid\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/primaryposition.com\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/fe181e443ce16ca856846a66835a2296?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/fe181e443ce16ca856846a66835a2296?s=96&d=mm&r=g\",\"caption\":\"David Quaid\"},\"url\":\"https:\/\/primaryposition.com\/blog\/author\/david-quaid\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Fact vs Fiction: Sitemap Optimization and SEO","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/primaryposition.com\/blog\/seo-sitemap-optimization\/","og_locale":"en_US","og_type":"article","og_title":"Fact vs Fiction: Sitemap Optimization and SEO","og_description":"Sitemaps are just a document that tells you % of pages you THINK you published are indexed, that’s all. The XML sitemap does very little. This comes from building up Google more that what it is – as Gary Ylles keeps pointing out – its incredibly basic. SEO Myth One: Sitemaps “Tell” Google what to […]","og_url":"https:\/\/primaryposition.com\/blog\/seo-sitemap-optimization\/","og_site_name":"Primary Position SEO NYC","article_published_time":"2024-08-31T14:42:58+00:00","article_modified_time":"2024-12-07T22:02:10+00:00","og_image":[{"width":2560,"height":1536,"url":"https:\/\/primaryposition.com\/wp-content\/uploads\/2024\/08\/seo-sitemap-optimization-scaled.jpeg","type":"image\/jpeg"}],"author":"David Quaid","twitter_card":"summary_large_image","twitter_misc":{"Written by":"David Quaid","Est. 
reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/primaryposition.com\/blog\/seo-sitemap-optimization\/","url":"https:\/\/primaryposition.com\/blog\/seo-sitemap-optimization\/","name":"Fact vs Fiction: Sitemap Optimization and SEO","isPartOf":{"@id":"https:\/\/primaryposition.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/primaryposition.com\/blog\/seo-sitemap-optimization\/#primaryimage"},"image":{"@id":"https:\/\/primaryposition.com\/blog\/seo-sitemap-optimization\/#primaryimage"},"thumbnailUrl":"https:\/\/primaryposition.com\/wp-content\/uploads\/2024\/08\/seo-sitemap-optimization-scaled.jpeg","datePublished":"2024-08-31T14:42:58+00:00","dateModified":"2024-12-07T22:02:10+00:00","author":{"@id":"https:\/\/primaryposition.com\/#\/schema\/person\/a8055f39ec47ce1130cbe034e06b9485"},"breadcrumb":{"@id":"https:\/\/primaryposition.com\/blog\/seo-sitemap-optimization\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/primaryposition.com\/blog\/seo-sitemap-optimization\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/primaryposition.com\/blog\/seo-sitemap-optimization\/#primaryimage","url":"https:\/\/primaryposition.com\/wp-content\/uploads\/2024\/08\/seo-sitemap-optimization-scaled.jpeg","contentUrl":"https:\/\/primaryposition.com\/wp-content\/uploads\/2024\/08\/seo-sitemap-optimization-scaled.jpeg","width":2560,"height":1536,"caption":"SEO Sitemap and Optimization"},{"@type":"BreadcrumbList","@id":"https:\/\/primaryposition.com\/blog\/seo-sitemap-optimization\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/primaryposition.com\/"},{"@type":"ListItem","position":2,"name":"Fact vs Fiction: Sitemap Optimization and SEO"}]},{"@type":"WebSite","@id":"https:\/\/primaryposition.com\/#website","url":"https:\/\/primaryposition.com\/","name":"Primary Position SEO","description":"Primary 
Position SEO NYC","alternateName":"Primary Position","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/primaryposition.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/primaryposition.com\/#\/schema\/person\/a8055f39ec47ce1130cbe034e06b9485","name":"David Quaid","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/primaryposition.com\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/fe181e443ce16ca856846a66835a2296?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/fe181e443ce16ca856846a66835a2296?s=96&d=mm&r=g","caption":"David Quaid"},"url":"https:\/\/primaryposition.com\/blog\/author\/david-quaid\/"}]}},"_links":{"self":[{"href":"https:\/\/primaryposition.com\/wp-json\/wp\/v2\/posts\/5969"}],"collection":[{"href":"https:\/\/primaryposition.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/primaryposition.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/primaryposition.com\/wp-json\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/primaryposition.com\/wp-json\/wp\/v2\/comments?post=5969"}],"version-history":[{"count":0,"href":"https:\/\/primaryposition.com\/wp-json\/wp\/v2\/posts\/5969\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/primaryposition.com\/wp-json\/wp\/v2\/media\/5970"}],"wp:attachment":[{"href":"https:\/\/primaryposition.com\/wp-json\/wp\/v2\/media?parent=5969"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/primaryposition.com\/wp-json\/wp\/v2\/categories?post=5969"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/primaryposition.com\/wp-json\/wp\/v2\/tags?post=5969"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}