Unexplained Crawl Diagnostic Errors & Opencart
-
Hi,
I've been looking at the crawl diagnostics for my site and trying to fix the errors that are showing up but Seomoz is producing some strange results.
It's saying pages are duplicated upto 16 times but those pages dont exist. It's adding "page=3", "page=4" to the end of the product URL but I don't see how it's finding those pages, nothing on the site(as far as I can tell) is linking to them. There is no "page=3", just the one product page.
Again on the duplicate content it's saying under the "other URLs" there's URLs like "http:///product-a" but again I don't see where it's finding these URLs and obviously those URL's dont work. Those three slashes aren't a typo either.
So far I've reduced the amount of errors from 2,005 to 543 but the rest of them I can't make sense of.
Also, what does one do when you have two products, eg: "product-a-white" and "product-a-black" to prevent Seomoz from seeing duplicates? Canonical links wont work because there's no parent item, just those two. Google Webmaster tools doesn't seem to have a problem though.
Using Opencart 1.5, if it helps.
Cheers,
-
Ah, so it may well be opencart doing something funky then. It's carrying the page url over into the product listing by the looks of it. I'll have to look into that then, thanks for pointing that out!
Do you have any idea how it could be finding the "http://maggie" style links?
Cheers for the help,
-
Ok, here is a example
http://www.lustrelingerie.com/Gracya-Lingerie/safari-wild-bra-push-up?page=5linked from
http://www.lustrelingerie.com/Gracya-Lingerie?page=5
Seems like if the pages= is on the catalog page, it is on the product links
-
Hi Alan, thanks for the response.
Yea, sure there's additional pages for the categories, I'm talking about the individual products.
Take http://www.lustrelingerie.com/Bassaya-Lingerie/camila-red for example. Seomoz's Diagnostics is saying there's a http://www.lustrelingerie.com/Bassaya-Lingerie/camila-red?page=2. The latter works if you go there, I don't understand that and that's likely down to opencart, but what I don't get is how Seomoz is finding the link to it.
And it's the same with links such as "http://maggie" (real error), I don't see where Seomoz is finding the links to those. I've checked any stray canonical links but they seem fine to me.
Thanks,
-
Yes they do exist
this page http://www.lustrelingerie.com/Everyday-Luxury-Underwear-Lingerie?page=1
is linked from this page
http://www.lustrelingerie.com/Everyday-Luxury-Underwear-Lingerie
There are many examples
-
The URL is http://www.lustrelingerie.com/
-
If you can give us a url i will tell you for sure
-
Hi Ben, thanks for the response.
The thing is I don't think it's a CMS issue, it seems to me that seomoz is getting confused somewhere. my product pages are along the lines of "www.domain.com/range/product-a/". They have a canonical link pointing to "www.domain.com/product-a/" And all only have a single page to them. Which is why I can't figure out where Seomoz is picking up these duplicates.
With regards to your latter paragraph, yea I was thinking that. I thought it might confuse customers though, or I was hoping there would be a more elegant solution. Going back in and editing 500+ products isn't something I was looking forward to hehe.
Cheers,
-
I'll speak to the duplicates issue since the other appears to be a CMS issue and how it is displaying the products. Whenever I see the "page=1" in the URL I can usually fink a pagination script that isn't helping my SEO efforts. But I don't know for sure in your situation, especially since you said you don't see any links on the product page.
As far as the "duplicates" issue. Try to get them as distinct as possible. With our product pages (starting with the most sold items) I have begun changing up the product name. We have the difference of only the height on many of our products so I'm having to get a little creative and add some other aspect to the URL that stays within the products title. I only want one page from my site competing for that exact match product SERP anyway. It's not a good idea to have two pages on your site competing for the same SERP. It seems to always be treated with less authority by Google when that happened in the past.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How to Configure & Tracking Images in Google Analytic?
Hello Experts, Can anyone tell me for my eCommerce site how to configure images tracking in google analytic? And where can I see the image traffic in google analytic? Not sure I am asking correct question but confuse about image traffic. Thanks! Wrights!
Reporting & Analytics | | wright3350 -
Huge Analytics discrepancy when I turned off Analytics Module & Turned on GA via GTM
Hello, I was running universal GA via a GA Drupal module. We setup Google Tag Manager and setup a GA tag to run through GTM. When we pushed the GTM container live our Real Time was at around 1900 and when we turned off the Drupal module the Real Time was reporting around 1100. After 24 hours the sessions for that day were about 100k lower than what they usually are. Any ideas Why the larger discrepancy? Any places that are obvious to look? *I didn't setup the original Drupal GA module, nor do I know who did. Thanks
Reporting & Analytics | | JJLWeber0 -
Are these Search Console crawl errors a major concern to new client site?
We recently (4/1) went live with a new site for a client of ours. The client site was originally Point2 before they made the switch to a template site with Real Estate Webmasters. Now when I look into the Search Console I am getting the following Crawl Errors: 111 Server Errors (photos) 104 Soft 404s (blogs, archives, tags) 6,229 Not Found (listings) I have a few questions. The server errors I know not a lot about so I generally ignore. My main concerns are the 404s and not found. The 404s are mostly tags and blog archives which I wonder if I should leave alone or do 301s for each to /blog. For not found, these are all the previous listings from the IDX. My assumption is these will naturally fall away after some time, as the new ones have already indexed. But I wonder what I should be doing here and which will be affecting me. When we launched the new site there was a large spike in clicks ( 250% increase) which has now tapered off to an average of ~85 clicks versus ~160 at time of launch. Not sure if the Crawl Errors have any effect, I'm guessing not so much right now. I'd appreciate your insights Mozzers!
Reporting & Analytics | | localwork0 -
Help Blocking Crawlers. Huge Spike in "Direct Visits" with 96% Bounce Rate & Low Pages/Visit.
Hello, I'm hoping one of you search geniuses can help me. We have a successful client who started seeing a HUGE spike in direct visits as reported by Google Analytics. This traffic now represents approximately 70% of all website traffic. These "direct visits" have a bounce rate of 96%+ and only 1-2 pages/visit. This is skewing our analytics in a big way and rendering them pretty much useless. I suspect this is some sort of crawler activity but we have no access to the server log files to verify this or identify the culprit. The client's site is on a GoDaddy Managed WordPress hosting account. The way I see it, there are a couple of possibilities.
Reporting & Analytics | | EricFish
1.) Our client's competitors are scraping the site on a regular basis to stay on top of site modifications, keyword emphasis, etc. It seems like whenever we make meaningful changes to the site, one of their competitors does a knock-off a few days later. Hmmm. 2.) Our client's competitors have this crawler hitting the site thousands of times a day to raise bounce rates and decrease the average time on site, which could like have an negative impact on SEO. Correct me if I'm wrong but I don't believe Google is going to reward sites with 90% bounce rates, 1-2 pages/visit and an 18 second average time on site. The bottom line is that we need to identify these bogus "direct visits" and find a way to block them. I've seen several WordPress plugins that claim to help with this but I certainly don't want to block valid crawlers, especially Google, from accessing the site. If someone out there could please weigh in on this and help us resolve the issue, I'd really appreciate it. Heck, I'll even name my third-born after you. Thanks for your help. Eric0 -
Clarification on Analytics Goals & Funnel Logic
I have a few questions about funnel flow and how my goals are tracked. 1. My goal is a user completing a contact form, form.html. Upon completion, user is redirected to thanks.html, ending the flow. Every page header contains a direct link to form.html. Does this mean every page on my site is considered a "step" leading the user to the contact form? If so, do I list each page containing the form link in the steps, or only form.html? 2. If the above isn't correct, how do I track how the user wound up completing my form. I'd love to know which pages are performing better and leading to more sign-ups (goal completions). Currently I think I'd only be setup to know when a user is on the form.html page and whether they complete it or leave, not how they got to form.html in the first place. 3. My .html pages are actually embedded via iFrame on WordPress pages. My form.html page is hosted on .com/quote and upon completion, form.html redirects to thanks.html within the iFrame (.com/quote remains the actual page). I'd assume this is going to throw off my stats since the physical .html pages don't have Analytics code embedded? 4. How would I track different CTAs on the same page which lead to the same goal? For example, header vs. footer CTA's to the same sign-up sheet. Let me know if you need anything clarified. Thanks a lot for the help.
Reporting & Analytics | | kirmeliux0 -
Suspect Links from Yeusaigon.net Causing Server Errors
Good morning, Webmaster Tools is reporting an increase in server errors on our site due to some very suspect links from Yeusaigon.net. After taking a quick look, it appears they are some form of search engine attempting to link to our images by using incomplete URLs. For example: http://yeusaigon.net/search/images.php?q=htc%20one%20max%20phone%20cases&page=1044 Is linking to: http://www.mobilemadhouse.co.uk/caseflex-htc-one-max-real-leather-flip... As this URL is incomplete, it's throwing up a server error. There are currently 139 instances of there errors from the same domain, and is increasing by around 5-10 per day. The domain, however, is linking to some of our pages/images correctly, but I fear Google may look at these as spammy links - they certainly look that way! So, what can we do? I can't find any contact details on Yeusaigon website so I have disavowed the entire domain. Is this the right thing to do? How do I stop the ever-increasing number of sever errors due to incorrect URLs? Cheers, Lewis
Reporting & Analytics | | PeaSoupDigital0 -
Not found and Not followed Errors on Web Master Tools
Just noticed in web master tools that we are showing 150 not found errors on our site, the majority appear to be from old blog posts that have been deleted, is this damaging from an SEO perspective on a scale from 1-10? Also we have over 100000 not followed errors throughout the same site, is this damaging from an SEO perspective on a scale from 1-10? Thanks in Advance Andy
Reporting & Analytics | | First-VehicleLeasing0 -
OSE shows URLs redirecting to our custom created error page, is this a problem?.
When I check the link metrics for my product pages in OSE, it shows a message saying that the page redirects to our custom error page. This page was recently created to display when there is an error with the website. Do I need to be concerned that OSE is seeing all product pages as redirecting to this error page? Will it affect page authority etc,? I have attached a screen shot of the message that OSE displays for reference. YWbpM.jpg
Reporting & Analytics | | pugh0