Home Features About Support Blog
On-Page SEO +
Technical SEO +
SERP & Content +
Get the Chrome Extension — Free
Free SEO Tool

Sitemap Analyzer

Validate XML sitemaps, check URL health status, detect duplicates, and get a comprehensive sitemap health score with actionable recommendations.

Last updated: April 2026
Max URLs:

How the Sitemap Analyzer Works

Enter a domain or sitemap URL to start. The tool automatically discovers your XML sitemap by checking robots.txt and common sitemap paths. It then parses the sitemap structure, validates XML syntax and tag values, and checks the HTTP status of every URL.

The health score (0-100) reflects XML validity, URL health, tag completeness, and duplicate detection. Each issue is classified as an error or warning with specific fix recommendations.

What Gets Checked

The analyzer validates XML namespace declarations, encoding, and schema compliance. It checks every <lastmod> for ISO 8601 format, every <changefreq> against allowed values, and every <priority> for the 0.0-1.0 range. It detects duplicate URLs across multiple sitemaps and verifies the 50,000 URL / 50MB size limits.

URL health checks reveal broken links, redirect chains, and server errors hiding in your sitemap. The tool groups results by status code so you can quickly find and fix problems.

More Tools

FAQ

What is an XML sitemap? +
An XML sitemap is a file that lists URLs on your website to help search engines discover and crawl your pages more efficiently. It can include metadata like the last modification date, change frequency, and relative priority of each URL.
How many URLs can a sitemap have? +
A single XML sitemap can contain up to 50,000 URLs and must not exceed 50MB (uncompressed). For larger sites, use a sitemap index file that references multiple individual sitemaps.
Does Google use lastmod and changefreq? +
Google primarily uses the lastmod tag when it accurately reflects real content changes. Google has stated that changefreq and priority are largely ignored. However, keeping lastmod accurate helps Google prioritize crawling recently updated pages.
Why are there non-200 URLs in my sitemap? +
Sitemaps should only contain URLs that return a 200 status code and are indexable. URLs that redirect (301/302), return errors (404/500), or are blocked by robots.txt waste crawl budget and should be removed from the sitemap.
What is a sitemap index? +
A sitemap index file is an XML file that references multiple individual sitemaps. It allows you to organize sitemaps by content type or section (e.g., posts, pages, products) and is required when your site has more than 50,000 URLs.
Sitemap monitoring on every page

Lumina shows indexation status, meta tags, and crawlability automatically in your browser.

Add Lumina to Chrome — Free