jason@position1seo.com
0141 846 0114
Free SEO Audit
logo

Martin Splitt Explains How Google Selects Canonical Pages

who offers the best small business seo services in the uk

Martin Splitt recently told providers of SEO services how Google distinguishes duplicate content and web pages, as well as how they decide which canonical pages are to be included in the search engine results pages (SERPs). This information gave many small business SEO service providers important insights on how the Google algorithm works when it comes to canonicalisation.

In a podcast, Splitt explained that there 20 different signals which are weighted in order to detect the canonical page. He also went into detail about why machine learning is used to adjust the weights.

Splitt first stated how websites are crawled and how documents are indexed. Then, he goes into detail about how Google detects and identifies canonicalisation and page duplicates.

He said that they collect the signals first, then detect the duplicate pages by clustering them all together. Then, they will find a leader page for all these pages, and to do, so, they must reduce the content into a checksum or hash, and compare it with other checksums.

By making clusters of duplicate pages, it makes the task much faster and easier instead of checking thousands of words.

One reason why Google reduces content into a checksum is that they do not want to spend too much time and resources scanning the whole text. So, they calculate several kinds of checksums about the textual content of the page before comparing it with other checksums.

When it comes to exact duplicates and near-duplicates, Splitt says Google’s algorithms can catch both, such as those that are capable of detecting duplicates and then removing the boilerplate from pages. This means that their algorithms detect if the checksums are fairly similar or identical to each other before bringing them together in a duplicate cluster.

Once all the duplicates form one big cluster, Google selects only one document to display in the SERP.

Providers of SEO services may wonder why they avoid showing duplicate web pages in the SERP. This is so that Google can avoid showing the same content across many search results – which is one thing that users dislike. Moreover, doing so saves storage space in the index.

The hardest part is choosing the leader of the cluster, which is why they use more than twenty signals to select which web page to show as canonical from the group of duplicates.

These signals are like factors that help determine which page among the duplicates is the best one to show in the SERP. For instance, one signal is the webpage content. It could also be the PageRank – the higher the rank, the more chances the webpage will show.

Each signal has its own weight, and Google calculates and adjusts these weights. Google uses machine learning to adjust signal weights, making sure everything is accurate compared to doing things manually.

As for redirects, they are usually given a heavier weight compared to http/https URL signals. Splitt explains that any redirects must be higher in weight instead of http/https because the users will eventually see the redirect target. Because of this, Google does not include the redirect source in the SERP.

Canonical links are essential for businesses and small business SEO services because they specify which link is to be shown to users in the SERP. Moreover, search engines do not like duplicate content, and canonical tags help them identify which page should be ranked or shown to the users.

Here at Position1SEO, we make sure that your website is filled with high-quality content that is both authoritative and compelling. If you choose to work with us, you can be assured to get unique content that engages your users and effectively promotes your products and services.

Work with our SEO professionals today! Send us an email at office@position1seo.co.uk or call us on 0141 846 0114.

seo services 3
Author: Jason Ferry
Jason Ferry is an SEO specialist based in the United Kingdom. Taking pride in his many years of experience in the search engine optimisation industry, he has honed his skills in various digital marketing processes. From keyword marketing, website auditing, and link building campaigns, to social media monitoring, he is well-versed in all of them. Jason Ferry’s excellent skills in SEO, combined with his vast experience, makes him one of the best professionals to work with in the industry today. Not only does he guarantee outstanding output for everyone he works with, but also values a deep relationship with them as well.

Related  Posts

seo audit agency blog04
In the ever-evolving digital landscape, search engine optimisation (SEO) remains a critical component for enhanced online visibility and business growth. Conducting comprehensive SEO audits is essential to identify opportunities for improvement and to devise effective strategies. Choosing the right SEO audit agency is crucial in ensuring that your website is optimally set up for search […]
seo page audit blog
In the rapidly evolving digital landscape, businesses need to ensure that their websites remain relevant and competitive. One crucial way to achieve this is through regular SEO page audits. These audits help identify areas of improvement, enhance user experience, and maintain high search engine rankings. In this blog, we'll explore why regular SEO page audits […]
technical seo specialist blog02
In today's digital landscape, having a robust online presence is crucial for any business aiming to succeed. While many focus on engaging content and eye-catching design, the underlying technical structure of a website is often overlooked. This is where a Technical SEO Specialist comes into play, ensuring that your website not only appeals to visitors […]