Sudden Increase In Number of Pages Indexed By Google Webmaster When No New Pages Added
-
Greetings MOZ Community:
On June 14th Google Webmaster tools indicated an increase in the number of indexed pages, going from 676 to 851 pages. New pages had been added to the domain in the previous month. The number of pages blocked by robots increased at that time from 332 (June 1st) to 551 June 22nd), yet the number of indexed pages still increased to 851.
The following changes occurred between June 5th and June 15th:
-A new redesigned version of the site was launched on June 4th, with some links to social media and blog removed on some pages, but with no new URLs added. The design platform was and is Wordpress.
-Google GTM code was added to the site.
-An exception was made by our hosting company to ModSecurity on our server (for i-frames) to allow GTM to function.
In the last ten days my web traffic has decline about 15%, however the quality of traffic has declined enormously and the number of new inquiries we get is off by around 65%. Click through rates have declined from about 2.55 pages to about 2 pages.
Obviously this is not a good situation.
My SEO provider, a reputable firm endorsed by MOZ, believes the extra 175 pages indexed by Google, pages that do not offer much content, may be causing the ranking decline.
My developer is examining the issue. They think there may be some tie in with the installation of GTM. They are noticing an additional issue, the sites Contact Us form will not work if the GTM script is enabled. They find it curious that both issues occurred around the same time.
Our domain is www.nyc-officespace-leader. Does anyone have any idea why these extra pages are appearing and how they can be removed? Anyone have experience with GTM causing issues with this?
Thanks everyone!!!
Alan -
Yes, and I appreciate it!
Alan -
I did what I asked you to do.
-
-
-
- in my first post and repeated frequently.
-
-
-
-
Hi Egol:
How did you locate this duplicate or re-published content?
Obviously what you have pointed out is a major source of concern so I ran Copyscape search this afternoon for duplicate content and did not locate any the URLs you mention in the "this", "this" link above. It appears you entered the URL of the blog post in Google's search bar. Would that work? This method would be pretty slow going with 600 URLs.
Thanks,
Alan -
Those are the 448 URLs from your website that have been filtered.
You should find garbage in them like shown below.
Have you done what I have suggested three times above? Do that if you want to identify the problem pages.
-
www.nyc-officespace-leader.com/wp-content/plugins/...
A description for this result is not available because of this site's robots.txt – learn more.
-
www.nyc-officespace-leader.com/wp-content/plugins/...
A description for this result is not available because of this site's robots.txt – learn more.
-
www.nyc-officespace-leader.com/wp-content/plugins/...
A description for this result is not available because of this site's robots.txt – learn more.
-
-
Hi Egol:
Thanks for the suggestion.
When I click on _ repeat the search with the omitted results included _I get 448 results not the entire 859 results. Seems very strange. Some of these URLS have light content but I don't believe they are dups. I don't see any content outside our website when I click this.
Am I doing something wrong? I would think the total of 859 would appear not 447 URLs.
Thanks!!
Alan -
I don't know. You should ask someone who knows a lot about canonicalization.
Did you drill down through all of those indexed pages to see if you can identify all of them?
I've suggested it twice.
-
Hi Egol:
In the content of launching an upgraded site, could the canonicalization have implemented incorrectly? That could account for 175 pages sudden new content as the thin content has been there for some time.
I am particularly suspicious regarding canonicalization as there was an issue involving multi page URLs of property listings when the site was migrated from Drupal to Wordpress last Summer.
Thoughts?
Thanks, Alan
-
Apparently infitter24.rssing.com/chan-13023009/all is poaching my content, taking my original content and adding it to there site. I am not quiet sure what to do about that.
You can have an attorney demand that they stop, you can file DMCA complaints. Be careful
**However it does not explain the sudden appearance of the 175 pages on Googles index **
-
Do this query: site:www.nyc-officespace-leader.com
-
Start drilling down the SERPs. One page at a time. Look for content that you didn't make. Look for duplicates.
-
Get a spreadsheet that has all of your URLs. Drill down through the SERPs checking every one of them. Can you account for your pagination. You have a lot of it and that type of page is usually rubbish in the index. Combine, canonicalize, or get rid of them.
-
-
Hi Egol:
Thanks so much for taking the time for your thorough response!!
Apparently infitter24.rssing.com/chan-13023009/all is poaching my content, taking my original content and adding it to there site. I am not quiet sure what to do about that.
You have pointed out something very useful and I appreciate it and will act upon it. However it does not explain the sudden appearance of the 175 pages on Googles index that did not appear at the end of May and somehow coincided with uploading of the new version of our website in early June. Any ideas???
Thanks,
Alan -
-
Do this query: site:www.nyc-officespace-leader.com
-
Start drilling down the SERPs. One page at a time. Look for content that you didn't make. Look for duplicates.
-
When you drill down about 44 pages you will find this...
In order to show you the most relevant results, we have omitted some entries very similar to the 440 already displayed.
If you like, you can repeat the search with the omitted results included.The bad stuff is usually behind that link. Google doesn't want to show that stuff to people. It could be thin, it could be duplicate, it could be spammy, they just might not like it.
- Find out what is in there.
Possible problems that I see....
I see dupe content like this and this. Either your guys are grabbin' somebodyelse's content or they are grabbin' yours. Can get you in trouble with Panda. You need original and unique. Anything that is not original and unique should be deleted, noindexed or rewritten.
A lot of these pages are really skimpy. Think content can get you into trouble with Panda. Anything that is skimpy should be deleted, noindexed or beefed up.
I see multiple links to tags on lots of these posts. That can cause duplicate content problems.
The tag pages are paginated with just a few pages on each. These can generate extra pages that are low value, suck up your linkjuice or compound duplicate content problems.
You have archive pages, and category pages and more pagination problems.
-
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How can I distinguish new visitors from existing (customers) in Google Analytics to attain an avg # of new visitor traffic per day/week?
i do marketing for a business software site where we have hundreds of clients and each account has on avg 100 users. I am having a very challenging time to attempting to figure out the real number of unique traffic that our site receives. **(what's creating the issue is that we have thousands of user accounts where our users log-in via our site to access our app/platform). Would love help with this! Christian
Reporting & Analytics | | Sundance_Kidd0 -
Google Analytics Organic Accuracy?
Hi, On a cool summers day, I was quenching my thirst for SEO Knowledge in the depths of Google analytics, When I stumbled upon a most troublesome sight. I was Browsing the data inside Acquisition->Keywords->Organic and comparing them to Acquisition->Search Engine Optimization->Landing Pages, to find the results quite questionable.. Since Keywords->Organic measure the visits from the organic results, and Search Engine Optimization->Landing Pages measures the clicks onto one of my landing pages from Google, I assumed they should be the same or very close, (besides Keywords->Organic having visits from other search engines) but they were not the same, as a matter of fact it was telling me that Keywords->Organic visits = 1,474, 27.19% of total(5,422) and Search Engine Optimization->Landing Pages, clicks = 1,548 154.80% of total (1000). Even worst the Search Engine Optimization -> Queries clicks = 1,123, 160.43% of total(700). I have been searching, and come up empty handed in the answer so does anyone know why Google is showing me different results and strange numbers that don't make sense? Any information is greatly appreciated?
Reporting & Analytics | | KBB_Digital0 -
Google Analytics Content Experiments
Has anyone else found that Google Analytics Content Experiments seems to quite quickly favor the best performing variant in an experiment, and then show that variant many times more often than other/s - not split the traffic evenly? What is Google's thinking behind 'optimizing' during an experiment? It seems odd to me.
Reporting & Analytics | | David_ODonnell0 -
Google WebMasters Tool - Preferred Domain
I just added Google Analytics to my wordpress site with Google Analytics by YOAST. I then added Google WebMaster tools through via verify through google analytics account. I then tried to set a preferred domain. I chose the non www. version; however, google wanted me to verify ownership of both versions in order to set a preferred domain. I then added the www. version of my domain. I was able to set the non-www. version to my preferred domain. Now, there are two example.com's in my webmaster tools. I have 10 sites. I intend to replicate this process on all of my sites. Do I have to leave the non-preferred version of my sites in the google webmaster? Can I delete it after I have set my preferred version? If I delete the non-preferred version will it delete my setting on the preferred version because it is now no longer verified (saved)?
Reporting & Analytics | | JML11791 -
Google Analytics Campaigns
I need the help of a smart Mozzer. In Google Analytics: Traffic Sources>Sources>Campaigns all the results shown are from RSS. Can anyone help me with why RSS results would be displayed in Campaigns?
Reporting & Analytics | | waynekolenchuk0 -
How to Track Google Local Places in Google Analytics?
I have read many articles on how to track google local places through google analytics. Each article I have read show a different way of setting up google analytics and using tags in google local places. Wondering if anyone as up to date information on this and what would be the best practice to track data from google local lisitngs in google analytics Thanks Arthur
Reporting & Analytics | | VivaArturo0 -
Google Product Feed help
Hello, We are working with a particular service that uses our Google product feed to pull information. We wanted to add product info such as dimensions and hex colors for each product to better service the third party we are using, but we not sure if this is something Google supports in their feed. Would it cause any problems if we added that info in? Thanks for any help! Also, my knowledge regarding a Google product feed is lacking so if the question is confusing or I didn't give enough info on our situation just let me know and I'll do what I can to better explain.
Reporting & Analytics | | ClaytonKendall0 -
How do I find out how well a page converts in Analytics
Hello All, I am looking to find out how well a page converts in Analytics. A simple request you would thing, but no! First off let me list what I don't want to know: I don't want to know the conversion rate of a product I don't want to know the conversion rate based on the landing page What I want to know is how many people click to add the product to their basket on a particular page (which I understand is not strictly the conversion rate, but whatever). So the ways I have tried unsuccessfully are: The Analytics overlay (the in page Analytics thing) - this doesn't work because the "Add to Basket" button is not a link, it is an input. The Navigation Summary - this doesn't work because most of the time the /shopping_cart.php URL doesn't come up in the short list, and if you search for the URL in the search box beneath the percentages get all skewed. The most obvious solution would be Event tracking but I can't get that implemented in the short term. So does anyone have the answer to this most curious of conundrums? Thanking you in advance, Rich
Reporting & Analytics | | tonyatfat0