Website pages missing from seomoz crawl
-
Hi!
I just added a website and the crawling result output has only 42 pages but my website has about 75 pages. What am i missing?
Thanks!
-
Is there any chance you could email me your sitemap as produced by wordpress? info[at]pathfindermedia[dot]co[dot]uk I'll take a closer look at whats being excluded.
-
I'll do that Keri. Thanks!
-
I don't think so:
User-agent: * Disallow: /cgi-bin Disallow: /wp-admin Disallow: /wp-includes Disallow: /wp-content/plugins Disallow: /wp-content/cache Disallow: /wp-content/themes Disallow: /trackback Disallow: /feed Disallow: /comments Disallow: /category/*/* Disallow: */trackback Disallow: */feed Disallow: */comments Disallow: /*?* Disallow: /*? Allow: /wp-content/uploads # Goole Bot User-agent: Googlebot Disallow: /*/feed/$ Disallow: /*/feed/rss/$ Disallow: /*/trackback/$ # Google Image User-agent: Googlebot-Image Disallow: Allow: /*
Regards.
-
Another possibility could be your robots.txt file, is it blocking some directories?
-
Hi! It's probably best to email help@seomoz.org about this. You can give them your full URL and they can help figure out why Roger isn't crawling everything. Thanks!
-
Anyone?
-
Hi!
One thing i figured out is that the crawling on both seomoz and xml-sitemaps.com returm the same 42 pages.Here's my website homepage URL - http://bit.ly/TGjpVx
And a couple of missing pages from 36 at total - http://bit.ly/WM3Rwe and http://bit.ly/VpHJ9H.
Regards,
OV -
Hi!
My website has a xml sitemap, generated by Google XML sitemap Wordpress plugin, with all the 75 pages.The crawler http://www.xml-sitemaps.com/ also outputs just 42 pages. I think it has someting to do with the blogs being archived (?).
I need to solve this and don't know how?!
Thanks for your help do.
Regards.
-
Actually, the SEOmoz crawler should be crawling all of the pages -- it's OSE that doesn't crawl everything, but the crawler from your campaign should show all that it could find. If you email help@seomoz.org they'd be happy to help you figure it out, or if you want to share your URL here along with some pages that are missing, the Q&A people could help diagnose things too.
-
Hi Ovieira,
This is not necessarily an indication that there are pages that are hidden from crawlers and the missing pages could simply be low priority for the moment. Or could have been created after the initial crawl had taken place.
The best way to check is to run a crawler like http://www.xml-sitemaps.com/ and that will give you a better idea. If the sitemap generates a complement of your pages then it's probably just a case of waiting until the next Moz crawl.
Mulith
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Category Page Content
Hey Mozzers, I've recently been doing a content audit on the category and sub-category pages on our site. The old pages had the following "profile" Above The Fold
On-Page Optimization | | ATP
Page Heading
Image Links to Categories / Products
Below the Fold
The rest of the Image Links to Categories / Products
600 words+ of content duplicated from articles, sub categories and products My criticisms of the page were
1. No content (text) above the fold
2. Page content was mostly duplicated content
3. No keyword structure, many pages competed for the same keywords and often unwanted pages outranked the desired page for the keyword. I cleaned this up to the following structure Above The Fold
H1 Page Heading 80-200 Word of Content (Including a link to supporting article)
H2 Page Heading (Expansion or variance of the H1 making sure relevant) 80-200 150 Words of Content
Image Links to Categories / Products
Below the Fold
The rest of the Image Links to Categories / Products The new pages are now all unique content, targeted towards 1-2 themed keywords. I have a few worries I was hoping you could address. 1. The new pages are only 180-300 words of text, simply because that is all that is needed to describe that category and provide some supporting information. the pages previously contained 600 words. Should I be looking to get more content on these pages?
2. If i do need more content, It wont fit "above the fold" without pushing the products and sub categories below the fold, which isn't ideal. Should I be putting it there anyway or should I insert additional text below the products and below the fold or would this just be a waste.
3. Keyword Structure. I have designed each page to target a selction of keywords, for example.
a) The main widget pages targets all general "widget" terms and provides supporting infromation
b) The sub-category blue widget page targets anything related and terms such as "Navy Widgets" because navy widgets are a type of blue widget etc"
Is this keyword structure over-optimised or exactly what I should be doing. I dont want to spread content to thin by being over selective in my categories Any other critisms or comment welcome0 -
Duplicated page titles
Dear friends, We have a problem which occurs on our page with duplicated page titles. Landlords are posting rooms on our page and most of them are giving the same name to the rooms and after that, we are getting more and more duplicated page titles. We are applying random whit this title tag: Accommodation for students in {city.name}: {name}. English title
On-Page Optimization | | Eurasmus.com
Certified student rooms in {city.name}: {name} English title
Erasmus room for students in {city.name} | {name} English title
Student room in {city.name}: {name} English title Also our title tag is sometimes to long but there is no possibility to make them shorter. I think. If anyone would have some idea be free to comment and help us. Kind regards Miško Macolić Tomičić0 -
Are internal and external footers on all pages of a large website detrimental to seo?
Hi, I am doing an seo clean up of our site - we have about 12 footers on the bottom of each page plus an internal site map including core keywords. Most footers go to internal sites and some go to external sites which are replicated on those external sites linking back to ours. (these sites are part of the same large company but are not related in any way) Are these bad from an seo ranking perspective? Do the look like spam? We get considerable traffic from one of the external footers that is replicated on their site so need to weigh up what the best option is whether to keep the traffic or to loose it as part of the seo clean up. thanks
On-Page Optimization | | Pday0 -
Jquery in top of page vs text on bottom page
Is it the best way to use jquery sliders on the top of your page to still get all your text above the fold and score in search engines? for example: http://www.wolf-howl.com/wp-conte... is much better to score high ranks in search engines than http://www.wolf-howl.com/wp-conte... ?? Thanks!
On-Page Optimization | | HMK-NL0 -
Page title changes based on results per page
I have a product listing that allows customers to set a results per page option. This ads a GET variable to the URL. I've added the page number to the title of this page if they go past the first page. If this results per page variable is added to the URL then google see it as a different page. Do I need to change the page title for this?
On-Page Optimization | | BedInABox.com0 -
Duplicate page titles
We have search results pages on our site. They share the same page title, there is no real differentiator between the result pages, other than page 1, page 2 etc. How do we de-dup the titles? just add page 1/2/3 etc to the end of them?
On-Page Optimization | | lilibooz0 -
Is On Page SEO Dead?
Hey Guys, Search Engine Roundtable has published a short post about this a few days ago, quoting senior member at WebmasterWorld forums who said: "The way I see it, on-page text today is for the "relevance" part of the total algorithm. The whole algorithm is, in broad strokes, "relevance + connectedness + quality". After you've clearly stated the relevance of the page, then the rest of your ranking power comes from elsewhere. I've added on-page bold tags with no effect. I've added or changed h1 elements with no effect. Not too long ago, those might well have done something, but that's not the game anymore. And moving from a table layout to a CSS-P layout today might get you nowhere, too. It all depends how deeply complicated the table layout was, I think." http://www.webmasterworld.com/google/4408395.htm Is it true? Is on-page SEO really dead? What do you think?
On-Page Optimization | | ShivaS0 -
What is important for page rank?
I have heard quality is the most important factor for page rank but after the 7 Nov 11 PR update I am no longer a believer. The PR on my home page dropped from 4 to 3 and the rest of my inside pages remained the same even though I have added a significant amount of content since the previous update and kept it fresh. Any thoughts on this most recent PR update?
On-Page Optimization | | casper4340