Site map

A sitemap is a list of pages of a web site within a domain.

There are three primary kinds of sitemap:

  • Sitemaps used during the planning of a website by its designers
  • Human-visible listings, typically hierarchical, of the pages on a site
  • Structured listings intended for web crawlers such as search engines

Types of sitemaps

A sitemap of what links from the English Wikipedia's Main Page
Sitemap of Google in 2006

Sitemaps may be addressed to users or to software.

Many sites have user-visible sitemaps which present a systematic view, typically hierarchical, of the site. These are intended to help visitors find specific pages, and can also be used by crawlers. They also act as a navigation aid[1] by providing an overview of a site's content at a single glance. Alphabetically organized sitemaps, sometimes called site indexes, are a different approach.

For use by search engines and other crawlers, there is a structured format, the XML Sitemap, which lists the pages in a site, their relative importance, and how often they are updated.[2] This is pointed to from the robots.txt file and is typically called sitemap.xml. The structured format is particularly important for websites which include pages that are not accessible through links from other pages, but only through the site's search tools or by dynamic construction of URLs in JavaScript.

XML sitemaps

Google introduced the Sitemaps protocol so web developers can publish lists of links from across their sites. The basic premise is that some sites have a large number of dynamic pages that are only available through the use of forms and user entries. The Sitemap files contain URLs to these pages so that web crawlers can find them. Bing, Google, Yahoo and Ask now jointly support the Sitemaps protocol.

Since the major search engines use the same protocol,[3] having a Sitemap lets them have the updated page information. Sitemaps do not guarantee all links will be crawled, and being crawled does not guarantee indexing.[4] Google Webmaster Tools allow a website owner to upload a sitemap that Google will crawl, or they can accomplish the same thing with the robots.txt file.[5]

XML Sitemaps have replaced the older method of "submitting to search engines" by filling out a form on the search engine's submission page. Now, web developers submit a Sitemap directly, or wait for search engines to find it.[6] Regularly, submitting an updated sitemap when new pages are published may allow search engines to find and index those pages more quickly than it would by finding the pages on its own.

Sample

Below is an example of a validated XML sitemap for a simple three-page website. Sitemaps are a useful tool for making sites searchable, particularly those written in non-HTML languages.

<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url>
    <loc>http://www.example.net/?id=who</loc>
    <lastmod>2009-09-22</lastmod>
    <changefreq>monthly</changefreq>
    <priority>0.8</priority>
  </url>
  <url>
    <loc>http://www.example.net/?id=what</loc>
    <lastmod>2009-09-22</lastmod>
    <changefreq>monthly</changefreq>
    <priority>0.5</priority>
  </url>
  <url>
    <loc>http://www.example.net/?id=how</loc>
    <lastmod>2009-09-22</lastmod>
    <changefreq>monthly</changefreq>
    <priority>0.5</priority>
  </url>
</urlset>

See also

References

  1. Sitemap Usability Jakob Nielsen's Alertbox, August 12, 2008
  2. Nadik, Tessa (2023-02-09). "What Is A Sitemap? Do I Need One?". Search Engine Journal. Retrieved 2023-09-16.
  3. "Google, Yahoo!, Microsoft Standardize Against Google Sitemap Protocol". Oreilly. Retrieved 2012-07-24.
  4. Joint announcement from Google, Yahoo, and Bing supporting Sitemaps
  5. "Submitting Sitemaps". Google Inc. Retrieved 2012-07-06.
  6. "How to Manage XML Sitemaps for eCommerce Websites | What You Need to Know". 2022-09-26. Retrieved 2022-11-07.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.