Skip to main content

Sitemap Check

Scan any URL to find and inspect its XML sitemap, discover all indexed pages, and check last-modified dates.

Why check your sitemap?

Find hidden sitemap files

Discover sitemap.xml, sitemap indexes, and entries declared in robots.txt — even when they are not linked from the homepage.

Audit indexed URLs

Browse every URL listed in the sitemap with last-modified dates, change frequency, and priority values in one place.

Catch setup issues early

Spot missing sitemaps, empty files, broken XML, or stale lastmod dates before search engines waste crawl budget on bad signals.

What is an XML sitemap and why does it matter?

An XML sitemap is a machine-readable list of URLs you want search engines to discover and crawl. It does not guarantee indexing, but it helps crawlers find pages — especially new content, large sites, and pages with few internal links.

Where do sitemaps live?

Most sites serve /sitemap.xml at the domain root. Larger sites use a sitemap index that points to multiple child sitemaps. Search engines also read the Sitemap: directive in robots.txt. If none of these exist, crawlers rely only on links they find while browsing.

How does the sitemap check work?

Enter any URL and we fetch robots.txt, probe common sitemap paths, parse XML sitemap and sitemap index files, and list every URL entry with optional lastmod, changefreq, and priority metadata.

Sitemap vs. robots.txt

robots.txt tells crawlers what they may fetch. A sitemap tells them which URLs exist and when they changed. You need both for a complete technical SEO baseline — robots.txt for crawl rules, sitemap for discovery.

Common sitemap mistakes

Including noindex URLs, listing redirects or 404 pages, stale lastmod dates on every URL, oversized single files (split into indexes above ~50k URLs), or forgetting to submit the sitemap in Google Search Console and Bing Webmaster Tools.

Validate Open Graph tags for individual pages with our Open Graph scanner.

Sitemap FAQ

Enter a URL above to inspect a live sitemap, or read the sections below for structure and best-practice tips.

sitemap.xml is an XML file listing URLs on your site. Search engines use it to discover pages they might otherwise miss. It can include lastmod, changefreq, and priority hints — though crawlers treat these as suggestions, not commands.