Duplicate Content - Google Webmaster Office-hrs
I found +John Mueller's duplicate content presentation yesterday to be exceptionally useful, especially as I am frequently asked to do SEO audits & pretty much without exception find &/or have to resolve many of the issues described here. So I've put together these time-stamped notes in a Google+ friendly format & believe they may be useful to you webmasters as well.

1:00 Duplicate content affects pretty much all sites
▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
Search recommended reading on this topic:
● Duplicate content - Search Console goo.gl/2saV8y
● Duplicate content & multi site issue goo.gl/uHRV6t
● Demystifying "dup. content penalty" goo.gl/9RQqu8
These were useful articles found from said search.

1:45 What's duplicate content?
▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
● Exact same page, or same piece of content
● www / non-www / http / https / index.html / ?=...
● Separate mobile URLs, printer-friendly URLs, etc.
● Tag pages, press releases, syndicated content, descriptions, etc.
● Every website can have these things!

3:22 Not duplicate content
▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
● Translations
● Different pages with same title & description
● Content in apps
● Localized content... sometimes

4:30 Websearch simplified
Scheduler ↓ ↓
↖ URLs ← Parser → Indexing → Index → Search

5:41 Handling duplicate content during crawling
▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
● Duplicates waste resources, "crawl-budget"/time
-> New/changed content takes longer to be picked up.
● The URL parameter handling tool helps!
● Don't use robots.txt for this!
● We have smart systems too
● Not a penalty!

6:40 Duplicate content during indexing
▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
● Duplicates are a waste of storage & resources
● If whole page is duplicate, we just keep one copy
● Tricky: localization, same page for 2 countries?
● Not a penalty!

7:32 Duplicate content in search results
▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
● Duplicates are confusing to users, we just show one
● Often visible as "...we have omitted some entries..."
● Not a penalty!

8:23 Duplicate content problems
▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
● Unnecessary crawling
● Harder to track metrics
● We might pick "un-preferred" URL to show

10:00 What about penalties?!
▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
● Scraper sites
● Content spinning based on other sites
(auto translations, rewriting, etc.)
● Doorway pages / sites
These are spam. Follow Webmaster Guidelines.

11:02 Recognizing duplicate content
▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
● Are we showing the other URL in search?
● Search Console / Duplicate titles & descriptions
● Crawling much more than your site has content?

12:18 Affiliate / shared or syndicated content
▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
● Make pages stand on their own; provide unique value
● Noindex un-resolvable duplicate content
● Some kinds of duplication can't be avoided; that's normal

13:35 Bad practices for duplicate content
▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
● Don't use robots.txt to block duplicate content.
● Don't artificially rewrite content.
● Don't use the URL removal tools.

16:06 Fixing duplicate content
▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
● Be consistent. "...is the mother of all good SEO"
Use a single URL (sitemap, canonical, hreflang, etc.)
● Avoid unnecessary URL variations (in CMS, etc.)
● Use 301 redirects where possible. Use canonical...
● Use Search Console (preferred domain, URL parameters)
● Use geotargeting & hreflang, where relevant.

19:00 Begins Q&A on duplicate content
Note: useful questions & discussion on this topic continue for the rest of the hour, which I recommend viewing for follow-up.

#DuplicateContent #SEO #WebmasterCentral
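
Addendum to the 1:45 & 16:06 notes: the URL variations listed there (www / non-www, http / https, index.html, extra parameters) can be collapsed to a single preferred URL with a few lines of code, which is handy during an audit for grouping crawled URLs that are really the same page. This is only a rough sketch and not something shown in the presentation. The preferred scheme & host, the example.com domain, the list of ignored parameters and the function names are all my own assumptions for illustration.

```python
from collections import defaultdict
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Assumptions for illustration only: every site has its own preferred
# scheme/host and its own set of parameters that never change the content.
PREFERRED_SCHEME = "https"
PREFERRED_HOST = "www.example.com"
IGNORED_PARAMS = {"utm_source", "utm_medium", "utm_campaign", "sessionid"}
INDEX_FILES = ("index.html", "index.htm", "index.php")


def _strip_www(host: str) -> str:
    return host[4:] if host.startswith("www.") else host


def preferred_url(url: str) -> str:
    """Collapse common variations of a URL into one preferred form."""
    parts = urlsplit(url)
    host = (parts.hostname or "").lower()  # note: any port is ignored here

    # Leave URLs on other sites untouched.
    if _strip_www(host) != _strip_www(PREFERRED_HOST):
        return url

    # /shop/index.html and /shop/ are the same page.
    path = parts.path or "/"
    for name in INDEX_FILES:
        if path.endswith("/" + name):
            path = path[: -len(name)]
            break

    # Drop parameters that do not change content; sort the rest so that
    # ?a=1&b=2 and ?b=2&a=1 end up identical.
    kept = sorted(
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key.lower() not in IGNORED_PARAMS
    )

    # The fragment (#...) is dropped as well.
    return urlunsplit((PREFERRED_SCHEME, PREFERRED_HOST, path, urlencode(kept), ""))


def group_duplicates(urls):
    """Map each preferred URL to the crawled variants that collapse into it."""
    groups = defaultdict(list)
    for url in urls:
        groups[preferred_url(url)].append(url)
    return dict(groups)
```

Any group with more than one entry is a candidate for a 301 redirect or a rel=canonical pointing at the preferred URL, as per the 16:06 notes. Which parameters are really safe to ignore has to be checked per site.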