How do i prevent Google and Moz from counting pages as duplicates?
-
I have 130,000 profiles on my site. When not Connected to them they have very few differences. So a bot - not logged in, etc, will see a login form and "Connect to Profilename"
MOZ and Google call the links the same, even though theyre unique such as
example.com/id/328/name-of-this-group
example.com/id/87323/name-of-a-different-group
So how do i separate them? Can I use Schema or something to help identify that these are profile pages, or that the content on them should be ignored as its help text, etc?
Take facebook - each facebook profile for a name renders simple results:
https://www.facebook.com/public/John-Smith
https://www.facebook.com/family/Smith/
Would that be duplicate data if facebook had a "Why to join" article on all of those pages?
-
What about this idea:
We can flesh out profiles with Data, demographics, and contact info. No one cares about it, so we leave it off.
We can also customize it by a list of names that are connected, for those that have registrants
So 2 options: throw the demo info up on each, giving some unique content.
and or
Throw up member first names last init of those registered in them, then only index them if they have members?
However, 80% of our traffic comes from these "duplicate" pages.
-
Yes - we need the directories to be found in google
These profile pages are places in an organization to register at. Our brand name contains 3 utterly generic words, so the only thing showing up on radar are these profile names
Of course, removing it is a "solution" but no one hands a fat person a butcher knife and says "just cut it off"
I need to shape the content to be unique. I think its our "pitch" text that has more characters than the profile.
-
Yes, adding noindex to all profile pages will solev any current or future issues you might have. There is no point in having those pages into the index if the "actual" content is anyway invisibile. There is no point keeping over 100k pages in the index with only boilerplate on them.
You should no index all profiles asap - there is no value from an organic point of view there and if you do lose some traffic from those - the risk - reward (risk - losing some traffic / reward - keeping your domain overall safe) is without question towards the reward side.
Cheers.
-
One solution would be to not index the directory that has the profiles. Do you get many visits from organic search to these pages?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Magento 1.9 SEO. I have product pages with identical On Page SEO score in the 90's. Some pull up Google page 1 some won't pull up at all. I am searching for the exact title on that page.
I have a website built on Magento 1.9. There are approximately 290,000 part numbers on the site. I am sampling Google SERP results. About 20% of the keywords show up on page 1 position 5 thru 10. 80% don't show up at all. When I do a MOZ page score I get high 80's to 90's. A page score of 89 on one part # may show up on page one, An identical page score on a different part # can't be found on Google. I am searching for the exact part # in the page title. Any thoughts on what may be going on? This seems to me like a Magento SEO issue.
Intermediate & Advanced SEO | | CTOPDS0 -
Google is indexing wrong page for search terms not on that page
I’m having a problem … the wrong page is indexing with Google, for search phrases “not on that page”. Explained … On a website I developed, I have four products. For example sake, we’ll say these four products are: Sneakers (search phrase: sneakers) Boots (search phrase: boots) Sandals (search phrase: sandals) High heels (search phrase: high heels) Error: What is going “wrong” is … When the search phrase “high heels” is indexed by Google, my “Sneakers” page is being indexed instead (and ranking very well, like #2). The page that SHOULD be indexing, is the “High heels” page (not the sneakers page – this is the wrong search phrase, and it’s not even on that product page – not in URL, not in H1 tags, not in title, not in page text – nowhere, except for in the top navigation link). Clue #1 … this same error is ALSO happening for my other search phrases, in exactly the same manner. i.e. … the search phrase “sandals” is ALSO resulting in my “Sneakers” page being indexed, by Google. Clue #2 … this error is NOT happening with Bing (the proper pages are correctly indexing with the proper search phrases, in Bing). Note 1: MOZ has given all my product pages an “A” ranking, for optimization. Note 2: This is a WordPress website. Note 3: I had recently migrated (3 months ago) most of this new website’s page content (but not the “Sneakers” page – this page is new) from an old, existing website (not mine), which had been indexing OK for these search phrases. Note 4: 301 redirects were used, for all of the OLD website pages, to the new website. I have tried everything I can think of to fix this, over a period of more than 30 days. Nothing has worked. I think the “clues” (it indexes properly in Bing) are useful, but I need help. Thoughts?
Intermediate & Advanced SEO | | MG_Lomb_SEO0 -
Google W.M.T Missing Meta Title on AJax Pages... Weird!!
Hey Mozers, I was looking through my Google Web Masters Tool under HTML Improvements. It looks like I have 2,200 pages missing Meta Titles and I was about to lose it thinking HOW COULD THIS HAPPEN! I came to realize that the pages were "Ajax Pages". This is specifically a checkprice pop up and I dont want this page crawled by google. It looks like to google I have over 2k pages missing Meta Titles and they are all "check price pop ups". How would you suggest I block this. I thought about going the easy route and removing the subfolder and putting it in the Robots.txt document and I'm scared of that because we use AJax for a bunch of calls. I'm also scared of putting in the head <metaname="robots" =="" noindex,nofollow"="">because it requires hard coding</metaname="robots"> I Know i'm not the first to come across this issue, Any Ideas??
Intermediate & Advanced SEO | | rpaiva0 -
Google serving wrong page...
Hi, When you Google: "Los Angeles divorce attorney", you will see this site on the 5th page of the SERPS: www.berenjifamilylaw.com/blog/. For some reason, Google is serving the BLOG page as opposed to the homepage. This has been going on now for several weeks. Any tips on how to fix this? Obviously, the Homepage is more relevant and has more links going to it, so not sure why it's happening. Would you just leave it alone? Would you use robots.txt to block Google from crawling the BLOG post page? Thanks.
Intermediate & Advanced SEO | | mrodriguez14400 -
Is it a problem that Google's index shows paginated page urls, even with canonical tags in place?
Since Google shows more pages indexed than makes sense, I used Google's API and some other means to get everything Google has in its index for a site I'm working on. The results bring up a couple of oddities. It shows a lot of urls to the same page, but with different tracking code.The url with tracking code always follows a question mark and could look like: http://www.MozExampleURL.com?tracking-example http://www.MozExampleURL.com?another-tracking-examle http://www.MozExampleURL.com?tracking-example-3 etc So, the only thing that distinguishes one url from the next is a tracking url. On these pages, canonical tags are in place as: <link rel="canonical<a class="attribute-value">l</a>" href="http://www.MozExampleURL.com" /> So, why does the index have urls that are only different in terms of tracking urls? I would think it would ignore everything, starting with the question mark. The index also shows paginated pages. I would think it should show the one canonical url and leave it at that. Is this a problem about which something should be done? Best... Darcy
Intermediate & Advanced SEO | | 945010 -
Joomla Duplicate Page content fix for mailto component?
Hi, I am currently working on my site and have the following duplicate page content issues: My Uni Essays http://www.myuniessays.co.uk/component/mailto/?tmpl=component&template=it_university&link=2631849e33 My Uni Essays http://www.myuniessays.co.uk/component/mailto/?tmpl=component&template=it_university&link=2edd30f8c6 This happens 15 times Any ideas on how to fix this please? Thank you
Intermediate & Advanced SEO | | grays01800 -
Does having a page that ends with ? cause duplicate content?
I am working on a site that has lots of dynamic parameters. So lets say we have www.example.com/page?parameter=1 When the page has no parameters you can still end up at www.example.com/page? Should I redirect this to www.example.com/page/ ? Im not sure if Google ignores this, or if these pages need to be dealt with. Thanks
Intermediate & Advanced SEO | | MarloSchneider0 -
SEOMoz mistaking image pages as duplicate content
I'm getting duplicate content errors, but it's for pages with high-res images on them. Each page has a different, high-res image on it. But SEOMoz keeps telling me it's duplicate content, even though the images are different (and named different). Is this something I can ignore or will Google see it the same way too?
Intermediate & Advanced SEO | | JHT0