Sitemap Analyzer
Analyze sitemap links, priorities, and crawl hints to improve technical SEO quality
Tip: Use URL mode for live fetch or XML mode for local/manual analysis.
Fetch, parse, and analyze XML sitemaps to check URL structure, lastmod dates, priorities, and crawlability. Identify issues that could affect how search engines discover your content.
XML Sitemap Structure: Index Files, URL Sets, and Crawl Signals
An XML sitemap is a file that lists URLs on your website along with optional metadata to help search engine crawlers discover and prioritize your content. The Sitemap Protocol (sitemaps.org) defines two file types: URL set sitemaps (urlset) containing up to 50,000 individual page URLs, and sitemap index files (sitemapindex) that point to multiple sub-sitemaps for large sites.
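As a rough illustration of the two file types, the sketch below reads the root element to tell a URL set from a sitemap index and collects the loc values. The function name classify_sitemap is hypothetical and is not this tool's implementation; it uses only Python's standard library.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the Sitemap Protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def classify_sitemap(xml_text: str) -> tuple[str, list[str]]:
    """Return (kind, locs), where kind is 'urlset' or 'sitemapindex'.

    Hypothetical helper for illustration only.
    """
    root = ET.fromstring(xml_text)
    # ElementTree expands namespaced tags to "{namespace}localname".
    kind = root.tag.rsplit("}", 1)[-1]
    if kind not in ("urlset", "sitemapindex"):
        raise ValueError(f"unexpected root element: {kind}")
    # URL sets wrap each entry in <url>; index files use <sitemap>.
    child = "url" if kind == "urlset" else "sitemap"
    path = f"{{{SITEMAP_NS}}}{child}/{{{SITEMAP_NS}}}loc"
    locs = [loc.text.strip() for loc in root.iterfind(path) if loc.text]
    return kind, locs
```

For a sitemap index, the returned locs are the sub-sitemap URLs to fetch next; for a URL set, they are the page URLs themselves.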
Each URL entry can include three optional metadata fields. lastmod is a date in W3C Datetime format (a profile of ISO 8601) indicating when the page was last modified; crawlers use it to prioritize recrawling updated content. changefreq is a hint about update frequency (always, hourly, daily, weekly, monthly, yearly, or never). priority is a value from 0.0 to 1.0 indicating relative importance within your own site; it is not a ranking signal. Google has stated that it ignores changefreq and priority in favor of its own crawl scheduling, but it does use lastmod when the value is accurately and consistently maintained.
Common sitemap issues include: listing non-canonical URLs (for example, URLs that redirect elsewhere instead of the canonical version of the page), listing noindex pages (which should be excluded since they won't be indexed), missing lastmod dates for dynamic content, exceeding the limit of 50,000 URLs or 50 MB uncompressed per file, and broken XML that fails to parse.
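The size-related checks in that list are simple to express in code. The following is a minimal sketch, assuming the uncompressed sitemap is already in memory as bytes; check_limits is a hypothetical name, and the 50 MB figure is taken as 50 × 1024 × 1024 bytes.

```python
import xml.etree.ElementTree as ET

MAX_URLS = 50_000
MAX_BYTES = 50 * 1024 * 1024  # 50 MB, measured on the uncompressed file


def check_limits(xml_bytes: bytes) -> list[str]:
    """Return a list of human-readable problems (empty if none found)."""
    problems = []
    if len(xml_bytes) > MAX_BYTES:
        problems.append(f"file is {len(xml_bytes)} bytes, over the 50 MB limit")
    try:
        root = ET.fromstring(xml_bytes)
    except ET.ParseError as exc:
        # Broken XML: nothing else can be checked reliably.
        return problems + [f"XML does not parse: {exc}"]
    # Count <url> elements regardless of namespace prefix.
    url_count = sum(1 for el in root.iter() if el.tag.rsplit("}", 1)[-1] == "url")
    if url_count > MAX_URLS:
        problems.append(f"file lists {url_count} URLs, over the 50,000 limit")
    return problems
```

Canonical and noindex problems cannot be detected from the XML alone, since they depend on what each listed page returns when fetched.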
Audit sitemap for included noindex pages
Detect pages in your sitemap that carry noindex directives. Listing a page for discovery while blocking it from the index sends crawlers contradictory signals about your intent.
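One way such a check could work is sketched below, assuming the page HTML and response headers have already been fetched. The helper has_noindex is hypothetical and simplified: it looks at the robots and googlebot meta tags and at a plain X-Robots-Tag header, and it does not handle user-agent-scoped header values.

```python
from html.parser import HTMLParser

# "none" is shorthand for "noindex, nofollow".
BLOCKING = {"noindex", "none"}


class _RobotsMetaParser(HTMLParser):
    """Collects directives from <meta name="robots"> style tags."""

    def __init__(self):
        super().__init__()
        self.directives = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = {k.lower(): (v or "") for k, v in attrs}
        if attr.get("name", "").lower() in ("robots", "googlebot"):
            content = attr.get("content", "")
            self.directives.extend(d.strip().lower() for d in content.split(","))


def has_noindex(html: str, headers: dict[str, str]) -> bool:
    """True if the meta tags or the X-Robots-Tag header block indexing."""
    parser = _RobotsMetaParser()
    parser.feed(html)
    header_value = next(
        (v for k, v in headers.items() if k.lower() == "x-robots-tag"), ""
    )
    header_directives = [d.strip().lower() for d in header_value.split(",")]
    return bool(BLOCKING.intersection(parser.directives + header_directives))
```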
Check lastmod date accuracy
Verify that lastmod dates in the sitemap reflect actual content change dates, not just sitemap generation timestamps.
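A simple heuristic for spotting generation timestamps is to check whether nearly every entry shares the same lastmod value. The sketch below is an assumption about how one might do this, not a description of this tool's logic; the function name and the 0.9 threshold are illustrative.

```python
from collections import Counter


def lastmod_looks_generated(lastmods: list[str], threshold: float = 0.9) -> bool:
    """True when a single value accounts for at least `threshold` of entries.

    Heuristic only: a site that really did update every page at once
    would also trigger it.
    """
    values = [v for v in lastmods if v]
    if len(values) < 2:
        return False  # too few values to say anything
    _, top_count = Counter(values).most_common(1)[0]
    return top_count / len(values) >= threshold
```

A true result is a prompt to compare a few lastmod values against the pages' actual change history, not proof of a problem.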
Analyze URL count and sitemap structure
Count total URLs across all sub-sitemaps in an index and check for any individual files approaching the 50,000 URL limit.
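A per-file count across an index can be sketched as follows. The fetch argument is a caller-supplied function that returns the uncompressed XML bytes for a URL; it and the name count_urls are assumptions made for illustration, so the example stays independent of any particular HTTP library.

```python
import xml.etree.ElementTree as ET
from typing import Callable


def _local(tag: str) -> str:
    """Strip the "{namespace}" prefix ElementTree adds to tag names."""
    return tag.rsplit("}", 1)[-1]


def count_urls(start_url: str, fetch: Callable[[str], bytes]) -> dict[str, int]:
    """Map each URL set file reachable from start_url to its <url> count."""
    counts: dict[str, int] = {}
    seen: set[str] = set()
    pending = [start_url]
    while pending:
        url = pending.pop()
        if url in seen:
            continue  # guard against index files that reference each other
        seen.add(url)
        root = ET.fromstring(fetch(url))
        if _local(root.tag) == "sitemapindex":
            # Queue every sub-sitemap listed in the index.
            pending.extend(
                el.text.strip()
                for el in root.iter()
                if _local(el.tag) == "loc" and el.text
            )
        else:
            counts[url] = sum(1 for el in root if _local(el.tag) == "url")
    return counts
```

Summing the values gives the site-wide total, and any file whose count is close to 50,000 is a candidate for splitting.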
Map sitemap coverage for a content indexing audit
Use sitemap analysis to identify which sections of a site are included for indexing and which are absent.
1. Enter your sitemap URL
Paste the full URL to your sitemap.xml or sitemap index file. The tool will fetch the XML directly and handle both sitemap index and URL set formats.
2. Review URL and metadata analysis
See total URL count, date range of lastmod values, priority distribution, and any validation warnings or errors detected in the sitemap structure.
3. Export URL list
Download the full list of sitemap URLs as CSV for use in crawl audits, content inventories, or further analysis.
Sitemap index and URL set support
Handles both top-level sitemap index files and individual URL set sitemaps, recursively fetching all sub-sitemaps.
Gzip compressed sitemap support
Transparently decompresses .xml.gz sitemap files, which are commonly used by large sites to reduce bandwidth.
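Transparent decompression can be approximated by checking for the two-byte gzip signature rather than trusting the file extension. This is a minimal sketch with a hypothetical helper name, not this tool's code.

```python
import gzip

GZIP_MAGIC = b"\x1f\x8b"  # first two bytes of every gzip stream


def maybe_decompress(payload: bytes) -> bytes:
    """Return decompressed bytes if payload is gzip, else payload unchanged."""
    if payload[:2] == GZIP_MAGIC:
        return gzip.decompress(payload)
    return payload
```

Sniffing the content also covers servers that deliver a .xml.gz file already decompressed, in which case the payload passes through untouched.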
Validation warnings
Flags common issues: malformed XML, URLs exceeding 2,048 characters, invalid lastmod date formats, and priority values outside 0.0-1.0.
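The per-entry checks listed above could look roughly like this. The function validate_entry and its messages are hypothetical; the regular expression checks only the shape of a W3C Datetime value and does not reject impossible dates such as month 13.

```python
import re
from typing import Optional

# Shapes allowed by W3C Datetime: YYYY, YYYY-MM, YYYY-MM-DD, or a full
# timestamp with a required time zone designator (Z or +hh:mm / -hh:mm).
W3C_DATETIME = re.compile(
    r"^\d{4}(-\d{2}(-\d{2}(T\d{2}:\d{2}(:\d{2}(\.\d+)?)?(Z|[+-]\d{2}:\d{2}))?)?)?$"
)
MAX_LOC_LENGTH = 2048


def validate_entry(
    loc: str, lastmod: Optional[str] = None, priority: Optional[str] = None
) -> list[str]:
    """Return warnings for one sitemap entry (empty list if it looks fine)."""
    warnings = []
    if len(loc) > MAX_LOC_LENGTH:
        warnings.append("URL exceeds 2,048 characters")
    if lastmod is not None and not W3C_DATETIME.match(lastmod.strip()):
        warnings.append(f"invalid lastmod format: {lastmod!r}")
    if priority is not None:
        try:
            value = float(priority)
        except ValueError:
            warnings.append(f"priority is not a number: {priority!r}")
        else:
            if not 0.0 <= value <= 1.0:
                warnings.append(f"priority outside 0.0-1.0: {priority!r}")
    return warnings
```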
CSV export
Export the complete URL list with all metadata fields for use in spreadsheets, crawl monitoring, or content audit workflows.
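An export of this kind can be reproduced with Python's csv module. The column order and the to_csv name below are assumptions for illustration and may not match the file this tool produces.

```python
import csv
import io

# Assumed column order; one column per Sitemap Protocol field.
FIELDS = ["loc", "lastmod", "changefreq", "priority"]


def to_csv(entries: list[dict]) -> str:
    """Serialize sitemap entries to CSV text, leaving missing fields blank."""
    buf = io.StringIO(newline="")
    writer = csv.DictWriter(
        buf, fieldnames=FIELDS, restval="", extrasaction="ignore"
    )
    writer.writeheader()
    writer.writerows(entries)
    return buf.getvalue()
```

Missing optional fields become empty cells, so every row has the same number of columns when opened in a spreadsheet.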