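An XML sitemap is a roadmap of your website handed directly to search engines. Google discovers most pages by following links, but a sitemap helps it find pages that are new, buried deep in the site, or poorly linked. It does not guarantee that every listed page is crawled or indexed. A sitemap can also carry metadata for each URL. Google uses the last modified date (lastmod) when the value is consistently accurate, and it ignores the priority and changefreq fields.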
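To make the format concrete, here is a minimal sketch that builds a sitemap with Python's standard library. The function name build_sitemap and the example.com URLs are illustrative assumptions, not part of any particular tool. Each entry gets a loc element and an optional lastmod.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(entries):
    """Return sitemap XML for (url, lastmod) pairs.

    lastmod is a 'YYYY-MM-DD' string, or None to omit the element.
    build_sitemap is a hypothetical helper written for this example.
    """
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for loc, lastmod in entries:
        url = ET.SubElement(urlset, "url")
        # ElementTree escapes characters such as & when serializing.
        ET.SubElement(url, "loc").text = loc
        if lastmod:
            ET.SubElement(url, "lastmod").text = lastmod
    body = ET.tostring(urlset, encoding="unicode")
    return '<?xml version="1.0" encoding="UTF-8"?>\n' + body


if __name__ == "__main__":
    print(build_sitemap([
        ("https://example.com/", "2024-01-15"),
        ("https://example.com/about", None),
    ]))
```

Only set lastmod when you can supply the real modification date. A value that changes on every build tells crawlers nothing.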
What to Include
Include only canonical, indexable, high-quality URLs. Exclude: noindex pages, paginated pages (page 2+), filter/facet navigation URLs, duplicate content URLs, thin pages, and URLs that redirect. A clean 500-URL sitemap is more useful than a bloated 5,000-URL one: every URL it lists is a page you actually want indexed, and problems are easier to spot in the reports.
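The exclusion rules above can be sketched as a single filter function. Everything named here is an assumption made for illustration: the Page record, the ?page=N pagination scheme, the facet parameter names, and the 150-word threshold for a thin page. Adjust each to match how your own site works.

```python
from dataclasses import dataclass
from urllib.parse import parse_qs, urlsplit

# Hypothetical values; replace with your site's own conventions.
FACET_PARAMS = {"sort", "filter", "color", "size"}
MIN_WORDS = 150


@dataclass
class Page:
    url: str
    canonical: str
    status: int = 200
    noindex: bool = False
    word_count: int = 0


def belongs_in_sitemap(page):
    """Return True only for canonical, indexable, non-thin URLs."""
    if page.status != 200:  # redirects and error pages
        return False
    if page.noindex:
        return False
    if page.canonical != page.url:  # duplicate of another URL
        return False
    query = parse_qs(urlsplit(page.url).query, keep_blank_values=True)
    page_number = query.get("page", ["1"])[0]
    if page_number.isdigit() and int(page_number) > 1:  # page 2+
        return False
    if FACET_PARAMS.intersection(query):  # filter/facet navigation
        return False
    return page.word_count >= MIN_WORDS  # drop thin pages
```

Running a crawl export through a filter like this before generating the sitemap keeps the file limited to pages you want indexed.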
Sitemap Size Limits
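Each sitemap file can hold at most 50,000 URLs and must be no larger than 50 MB uncompressed. For large sites: create multiple sitemaps by content type (pages, blog posts, products) and reference them in a sitemap index file at sitemap.xml. A sitemap index can itself list up to 50,000 sitemaps.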
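Splitting by content type and by the 50,000-URL limit can be sketched as follows. The file naming pattern (sitemap-TYPE-N.xml) and the function names are assumptions chosen for this example. The sketch checks only the URL count, so the byte-size limit would still need a separate check on the written files.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
MAX_URLS = 50_000  # per-file limit from the sitemap protocol


def plan_sitemaps(urls_by_type, max_urls=MAX_URLS):
    """Map hypothetical file names to URL lists of at most max_urls each."""
    files = {}
    for content_type, urls in urls_by_type.items():
        for start in range(0, len(urls), max_urls):
            number = start // max_urls + 1
            name = f"sitemap-{content_type}-{number}.xml"
            files[name] = urls[start:start + max_urls]
    return files


def build_index(sitemap_urls):
    """Return sitemap index XML that references each child sitemap."""
    root = ET.Element("sitemapindex", xmlns=SITEMAP_NS)
    for loc in sitemap_urls:
        entry = ET.SubElement(root, "sitemap")
        ET.SubElement(entry, "loc").text = loc
    body = ET.tostring(root, encoding="unicode")
    return '<?xml version="1.0" encoding="UTF-8"?>\n' + body
```

For instance, 120,001 product URLs would be planned as three files, and the index served at sitemap.xml would list the full URL of each one.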
Generating Sitemaps
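WordPress: the Yoast SEO or All in One SEO plugins generate a sitemap for you. Shopify: generated automatically at /sitemap.xml. Next.js: the next-sitemap package generates one as part of the build once it is configured. Check your sitemap URL monthly to confirm that auto-generation still works and that recently published pages appear in it.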
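The monthly check can be partly automated. The sketch below, with hypothetical function names, fetches a sitemap and reports whether it is a URL set or an index, how many entries it lists, and the most recent lastmod. It assumes an uncompressed XML response and that all lastmod values share one date format, so they can be compared as strings.

```python
import urllib.request
import xml.etree.ElementTree as ET

NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def summarize_sitemap(xml_bytes):
    """Parse sitemap XML; raises ET.ParseError if it is malformed."""
    root = ET.fromstring(xml_bytes)
    kind = root.tag.rsplit("}", 1)[-1]  # 'urlset' or 'sitemapindex'
    locs = [el.text for el in root.findall(".//sm:loc", NS) if el.text]
    lastmods = [
        el.text.strip()
        for el in root.findall(".//sm:lastmod", NS)
        if el.text
    ]
    return {
        "kind": kind,
        "count": len(locs),
        # String comparison; valid only if the formats match.
        "latest_lastmod": max(lastmods, default=None),
    }


def fetch_sitemap(url, timeout=10):
    """Download the raw sitemap bytes (no gzip handling)."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read()


if __name__ == "__main__":
    # Replace with your own sitemap URL.
    print(summarize_sitemap(fetch_sitemap("https://example.com/sitemap.xml")))
```

A count that drops sharply, or a latest lastmod that stops advancing after you publish, suggests the generator has stopped running.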
Submitting to Google
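Search Console → Sitemaps → Enter sitemap URL → Submit. Review the Sitemaps report weekly for the fetch status, the number of discovered URLs, and any errors reported for the file. To compare discovered against indexed URL counts, open the page indexing report for that sitemap.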