Why is OSE showing no data for this URL?
-
Hi all,
Does anyone have any ideas as to why OSE might not have any data for this URL:
http://www.ccisolutions.com/StoreFront/product/shure-slx24-sm58-wireless-microphone-system-j3
It is not a new page at all. It's been on the site for years.
Is OSE being quirky? Or is there an underlying problem with this page?
Thanks in advance for any light you can shed on this,
Dana
-
Hi Paul,
We discovered that the problem was being caused by a trailing "comma" at the end of the keyword string that we once used to populate the Meta keywords tag. Unfortunately, the keyword information in those fields is still being parsed. The parser did not know what to do when it encountered a comma followed by nothing.
We did run a query and found that this problem was affecting 128 of our product pages and had been for a long time. We haven't been populating the keywords for almost a year now, so the problem is at least that old.
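For anyone who wants to check their own keyword data for the same thing, here is a minimal sketch in Python of the kind of test involved. It is not the query we actually ran; the function names and sample keyword strings are made up for illustration, and it assumes the keywords are stored as a plain comma-separated string.

```python
def has_dangling_comma(keywords: str) -> bool:
    """Return True if a comma-separated keyword string contains an empty item,
    for example a trailing comma followed by nothing."""
    stripped = keywords.strip()
    if not stripped:
        return False  # a completely empty field has no dangling comma
    return any(part.strip() == "" for part in stripped.split(","))


def clean_keywords(keywords: str) -> str:
    """Drop empty items and re-join the remaining keywords."""
    parts = [part.strip() for part in keywords.split(",")]
    return ", ".join(part for part in parts if part)


# Hypothetical sample values, not taken from the real product database.
samples = [
    "wireless microphone, shure slx24,",
    "wireless microphone, shure slx24",
]
for value in samples:
    print(repr(value), has_dangling_comma(value), repr(clean_keywords(value)))
```

Running the first function over every product record would give a list of affected pages, and the second shows one way the stored strings could be cleaned up.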
The commas are now gone.
Thanks again to you and Andrew!
-
Glad I could help, Dana.
And yes, "borked" is a technical term. It's defined as existing in a badly broken state as a result of an inexperienced/inattentive user making unauthorised/incorrect changes to a website's code or content.
Can also be used as a verb: "he borked the database so badly the whole site went 503".
Not that it's ever been applied to me or anything.
And yea - sometimes our tools can mislead us, even though the info they provided was "technically" correct.
Suggestion for a fast way to test the rest of the site for this kind of error: Use the paid version of Screaming Frog to program a search for a snippet of code that should be in the content area of every product page. Limit the crawl to the product pages category. (Or whatever sections of the site you're worried about.)
You could search for something as simple as class="productExtendedDescription" which would at least ensure the content container was there. It still wouldn't prove there was any content in it, but if you wanted to get fancy with regex, you could even do that too. You could also search for the tag, which would indicate that the rest of the page's code likely exists.
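If you'd rather script the same idea than set it up in Screaming Frog, here is a minimal sketch in Python. It only looks at HTML you have already saved or downloaded, and it simply reports which expected snippets are absent. The function name is made up, and the closing `</html>` marker in the example is just an illustrative choice on my part; swap in whatever snippets should appear on every product page.

```python
def missing_markers(html: str, markers: list[str]) -> list[str]:
    """Return the markers that do NOT appear in the page source.
    The comparison is case-insensitive."""
    lowered = html.lower()
    return [marker for marker in markers if marker.lower() not in lowered]


# Snippets expected on every product page. The second one is an assumption
# chosen for illustration.
EXPECTED = ['class="productExtendedDescription"', "</html>"]

# A truncated page, similar in shape to the broken one in this thread.
broken_page = (
    '<html><head></head><body>'
    '<form name="headerForm" action="IAFDispatcher" '
    'onsubmit="return submitQuery()" method="post">'
)

# A hypothetical healthy page for comparison.
healthy_page = (
    '<html><head></head><body>'
    '<div class="productExtendedDescription">Product details</div>'
    '</body></html>'
)

print(missing_markers(broken_page, EXPECTED))
print(missing_markers(healthy_page, EXPECTED))
```

An empty list means every expected snippet was found. Like the Screaming Frog search, this only proves the container exists, not that it has anything useful in it.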
Just an idea to speed up the testing process.
Paul
-
Thanks so much Paul,
Yes, when I ran a "Fetch as Googlebot" it returned a "Success" message, but when I looked at what Google is seeing there is no content on the page.
"borked" - great term...I am definitely going to have to file that one away for future use!
If the problem is isolated to this page, that's one thing. I am more concerned that this problem is affecting a larger number of pages.
Once I figure it out, I'll come back here and post what we found/fixed.
I really appreciate the comments from you and Andrew!
-
Dana, there's no content on that page.
The massive head section with all its JavaScript is there, making it look like there's lots of code, but the actual body content has somehow been deleted.
This is all I see in the actual body of the page:
<form name="headerForm" action="IAFDispatcher" onsubmit="return submitQuery()" method="post">
That's it. There's no actual content, no footer, no closing or tag, which makes me think someone's actually deleted the content part of the code by accident.
Good luck figuring out who borked it.
Paul
-
I just ran the source code for this page through the validator at: http://validator.w3.org/
There are a multitude of problems that need to be addressed. Thanks very much Andrew. I do have enough HTML knowledge to provide guidance to our IT manager on how to fix the problems. I don't have access to much of the source code, so it will certainly be a "project" to fix the issues.
I am sure these problems are all over the site, as many people with very little experience in coding and design have had their hands in the pot (so to speak) over the years.
At least this will allow me to prove to our CEO that our underlying code is indeed presenting a problem for indexing and crawling.
-
I did some comparisons with other pages and it doesn't seem that the drop-down frequency selector is the culprit. This page also has one: www.ccisolutions.com/StoreFront/product/shure-slx24-sm58-wireless-microphone-system-h5
but the cache in Google seems to be fine for this page and OSE displays data for it just fine.
-
Could the coding issue be related to the drop-down box that's located just above the pricing on the right-hand side? That is one thing that makes this product page different from others on our site.
Thoughts?
-
I also see what you mean that there is a problem with Google's cache. The cache date is really old (April 11) and there is no preview of the page.
Anyone who can point me in the right direction?
-
Thanks so much for responding, Andrew. I have suspected problems with our code for a long time, but I am not a coder, so it's been a challenge to attempt to identify the specific problem.
I believe this is not just a problem with this page, but could be a problem across many pages on our site.
Can you or any of my fellow Mozzers point to what you are seeing in the source code that leads you to believe it is corrupted?
Many thanks for any help. I truly appreciate it!
Dana
-
Hi Dana,
I think your page is corrupted. I have copied a link to the source code I am seeing: http://pastebin.com/BRfFT4RR
It looks like Google Cache is also having problems with this page. Perhaps OSE had trouble too and so skipped the page?
- Andrew