What It Is & How It Works

19 May

Canonicalization is the process that search engines use to determine the main version of a page. That is the page that will be indexed and shown to users. The chosen version is canonical, and ranking signals like links will consolidate to that page. This process is sometimes referred to as standardization or normalization.

According to Google Webmaster Trends Analyst Gary Illyes, ~60% of the internet is duplicate content.

Google’s crawling process is highly focused on removing duplication because 60% of the internet is duplicate 🤯 @methode #seodaydk pic.twitter.com/OJ9OkP74DU

— Lily Ray 😏 (@lilyraynyc) March 30, 2022

Canonicalization is complex and often misunderstood. I don’t think most of the duplicates are nefarious. It’s mostly going to be technical issues that cause them. We’ll look at this more in a bit. I’m going to talk about how the canonicalization process works, as well as the following:

A lot of different signals go into the canonicalization process. These include:

Duplicates
Canonical link elements
Sitemap URLs
Internal links
Redirects

Google looks at all the different signals and weighs them to determine what the canonical version should be. That’s the version of the page it will index and what it usually shows to users.

Weighing scale. "URL in Sitemap" and "Duplicate content" on lighter side; "Internal Links" and "Canonical URL" on heaver side

A potential scenario when Google decides on the canonical based on internal links and the canonical URL.

Duplicates

With duplicate content, Google will pick a canonical version to index. All the eligible pages form a cluster of pages, and the signals that go to the pages in that cluster will consolidate at the chosen canonical. That canonical may even change over time.

Flowchart showing process of duplication detection

Source: ahrefs.com, originally published on 2022-05-19 18:11:22

No Comments

Mobile App Development, SEO, Social Media, Web Development

Recent Posts

Recent Comments

Connect with B2 Web Studios

Get B2 news, tips and the latest trends on web, mobile and digital marketing

Main Menu

What We Do

Latest Blog Posts

Responsive Web Design

E-Commerce Solutions

Mobile App Development

SEO & SEM

Social Media Management

Support & Maintenance

Web Hosting

Mobile App Development, SEO, Social Media, Web Development

What It Is & How It Works

19 May

Duplicates

1. It prefers HTTPS pages over HTTP pages.

2. It prefers shorter URLs over longer URLs.

Canonical link element

Sitemap URLs

Internal links

Redirects

How to check the canonical

Mistake #1. Blocking the canonicalized URL via robots.txt

Mistake #2. Setting the canonicalized URL to “noindex”

Mistake #3. Setting a 4XX HTTP status code for the canonicalized URL

Mistake #4. Canonicalizing all paginated pages to the root page

Mistake #5. Using the URL removal tool in Google Search Console for canonicalization

Mistake #6. Not keeping canonicalization signals consistent

Mistake #7. Not using canonical tags with hreflang

Mistake #8. Having multiple rel=canonical tags

Mistake #9. Rel=canonical in the <body>

Final thoughts

Recent Posts

Here’s Why Brands Are

How to Successfully Use

Are Affiliate Links Bad

The Current State of Goo

Why Your Instagram Profi

Recent Comments

Connect with B2 Web Studios

Get B2 news, tips and the latest trends on web, mobile and digital marketing

Main Menu

What We Do

Latest Blog Posts

Here’s Why Brands Are Taking Instagram Influencers on Vacation

How to Successfully Use Social Media: A Small Business Guide for Beginners [Infographic]

Are Affiliate Links Bad for SEO?