Should I Remove This Subdirectory From Google?
-
On my site, I have a subdirectory.
It posts articles from a bunch of websites that my readers are interested in & links back to all of those sites. There is no original content in it.
There are over 1700 indexed pages in this subdirectory. The rest of my site has about 500 (all original content).
The search engine traffic for this subdirectory only accounts for 3.9% of my site's overall visits.
Should I consider removing this subdirectory? Could all the duplicate content be hurting the rankings of my legit pages?
What do you all think?
-
Upon further review, it looks like I'm getting about 150-200 search visits a week to the articles within that subdirectory... I'm gonna keep it up.
It's easy enough to figure out which posts are getting the most search traffic, but it is going to be crazy removing only the ones which do not get that traffic...
If I see a huge Panda-like drop-off in search traffic, then I'll pull the trigger. Until then, we will stick with it.
Thanks for the advice.
-
It's tough - I do think it boils down to the numbers. If you were talking about 1,700 pages and the rest of your site had 50K indexed pages, I'd probably say not to worry about it (unless, as @Dejan said, you experienced a traffic drop or other problems). When the rest of your site is 500 pages, though, I'd start to worry, especially with Panda updates hitting sites with too much copied content (even if legitimately syndicated).
The fact that you're linking back does help (you're not trying to claim these are your articles), but if these pages, which are more than 75% of your index, only represent 4% of visits, I'd really start to question their usefulness and whether they're worth the potential SEO risk.
There might be a partial solution - you could NOINDEX a large chunk of the pages, but leave 50-100 of the articles, if those account for 90%+ of the traffic you're getting. Of course, that's going to take some analysis and is a bit trickier to implement, but it could let you keep most of that 3.9%.
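For what it's worth, the analysis part could be sketched roughly like this in Python. This is only an illustration, not a definitive method: it assumes you can export search visits per URL from your analytics, and the names `pages_to_keep`, `robots_meta`, and the sample URLs are made up for the example. The `noindex, follow` value is the standard robots meta directive for keeping a page out of the index while still letting its links be followed.

```python
def pages_to_keep(visits_by_url, coverage=0.90):
    """Return the top pages, by search visits, that together reach
    roughly `coverage` of the subdirectory's search traffic."""
    total = sum(visits_by_url.values())
    if total == 0:
        return []
    ranked = sorted(visits_by_url.items(), key=lambda kv: kv[1], reverse=True)
    keep, running = [], 0
    for url, visits in ranked:
        if running >= coverage * total:
            break  # the pages kept so far already cover the target share
        keep.append(url)
        running += visits
    return keep


def robots_meta(url, keep):
    """Return the robots meta tag to emit in the page head for `url`."""
    if url in keep:
        return '<meta name="robots" content="index, follow">'
    return '<meta name="robots" content="noindex, follow">'


# Hypothetical export: search visits per article URL.
sample = {"/feed/a": 80, "/feed/b": 15, "/feed/c": 3, "/feed/d": 2}
kept = pages_to_keep(sample)
```

With the made-up numbers above, the two busiest articles stay indexed and the rest would get the NOINDEX tag. In practice you'd probably want to look at a few months of data before deciding which pages make the cut.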
I'd also see where that traffic is coming from - if it's 3.9% of total traffic, but only 0.9% of search traffic (mostly direct visits, bookmarks, etc.), then you've got even less to worry about if you de-index the whole subdirectory.
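To make that comparison concrete, here is a small Python sketch of the two percentages. The function name and the visit counts are hypothetical, picked only so the output lines up with the 3.9% and 0.9% figures mentioned above; swap in your own analytics numbers.

```python
def subdirectory_shares(sub_visits, site_visits):
    """Compare the subdirectory's share of all visits with its share
    of search visits. Both arguments map a traffic source to visits."""
    def pct(part, whole):
        return 100.0 * part / whole if whole else 0.0

    return {
        "share_of_all_visits": pct(sum(sub_visits.values()),
                                   sum(site_visits.values())),
        "share_of_search_visits": pct(sub_visits.get("search", 0),
                                      site_visits.get("search", 0)),
    }


# Hypothetical monthly numbers, for illustration only.
site = {"search": 10000, "direct": 10000}
subdir = {"search": 90, "direct": 690}
shares = subdirectory_shares(subdir, site)
```

If the search share comes out much lower than the overall share, as it does with these made-up numbers, most of the subdirectory's visitors aren't arriving from Google, so de-indexing it would cost you very little.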
-
If it makes sense to have them there, then leave them. However, if you feel there has been a traffic drop, it may be worth investigating whether these 1,700 indexed articles caused it.