What is duplicate content?
Duplicate content refers to blocks of content that appear on multiple URLs, either within the same domain or across different domains. Google does not want to serve the same content twice, so it attempts to identify the canonical version and may suppress duplicates in results.
How duplicate content hurts rankings
Duplicate content rarely results in a penalty unless it is clearly manipulative. More commonly, it causes ranking dilution — multiple pages competing for the same query split the signals that would otherwise concentrate on one page.
Internal duplication often occurs due to CMS settings (mobile versions, printer-friendly pages, session IDs in URLs, tag and category pages with overlapping content). External duplication occurs when content is syndicated or scraped.
Example
Example
An e-commerce site with 10 color variants of a product each on their own URL, all with identical product descriptions, has duplicate content. Canonicalizing to the main variant URL resolves this.
Frequently asked questions
Does Google penalize duplicate content?
Rarely. The practical damage is dilution: multiple URLs splitting signals that should concentrate on one page, and crawl budget wasted on near-copies. Penalties apply only to deliberately manipulative duplication at scale.
How do I fix duplicate content?
Pick the canonical version, then align every signal to it: canonical tags on variants, 301 redirects where variants should not exist, and internal links pointing only at the chosen URL.