Help with Roger finding phantom links
-
It Monday and Roger has done another crawl and now I have a couple of issues:
- I have two pages showing 404->302 or 500 because these links do not exist. I have to fix the 500 but the 404 is trapped correctly.
http://www.oznappies.com/nappies.faq & http://www.oznappies.com/store/value-packs/\
The issue is when I do a site scan there is no anchor text that contains these links. So, what I would like to find out is where is Roger finding them. I cannot see any where in the Crawl Report that tells me where the origin of these links is.
- I also created a blog on Tumblr and now every tag and rss feed entry is producing a duplicate content error in the crawl stats. I cannot see anywhere in Tumblr to fix this issue.
Any Ideas?
-
Thanks again Ryan, you have been very helpful answering al lot of my questions.
-
Someone else asked the same question regarding tag pages yesterday. I would suggest asking a separate Q&A on that topic.
Tag pages & forum category pages are both often used as containers. They don't have any content except links to articles. I would ask for feedback as to the best practice. I suspect noindex, following those pages would be best, but I don't have the experience to feel comfortable offering that advice.
-
I have been looking at the data that Roger is reporting for the duplicate content and in ALL cases there is either a 301 or a NoIndex. So now I do not know why Roger is reporting them as a duplicate, robots should not see the second entry.
-
I did not think of looking at the csv report. I see it now thanks Ryan. There should be a soft 404 handler in place to process the bad urls, I will have to see why it is not working.
With tumblr, I was looking for an easy way to add a blog to the site.
The RSS is coming from tumblr as is all the content.
When we specify Tags in tumblr it creates urls e.g. mypage.com/article/tag1 mypage.com/article/tag2 mypage.com/article/tag3 which all contain the content of mypage.com/article with out a canonical to the original. It is a really strange non-seo friendly approach, and so I wondered if anyone had similar problems.
-
The crawl report offers a "referrer" field. That field offers where Roger found the offending link. In my experience that field has always been accurate.
When I try to access www.oznappies.com/faq I receive a 302 redirect and a 500 error. I would recommend adjusting non-existant pages to a soft 404 page. Still provide a 404 response to browsers, but offer users a friendly way to find information (i.e. links / search) and stay on your site.
A great example of a soft 404 page is http://www.orangecoat.com/a-404-page.html
For the Tumblr issue, I am not clear on the problem. Are you writing content and publishing on both the oznappies.com site and your tumblr site? Then this content is being published again on your site via a RSS import?
-
I removed the links and just left the text so these will cut and paste now. It confuses me where Roger found the links.
Thanks for running the Xenu scan. I have tried other site scanner and come up blank.
-
That second link is anchored to the wrong place.
Regardless I also cannot find the .faq page. I just ran Xenu over it to see what it could find, but no broken links showed up.
Afraid I don't use Tumblr either, so eh, pretty useless post. Sorry.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Crawl Diagnostics saids a page is linking but I can't find the link on the page.
Hi I have just got my first Crawl Diagnostics report and I have a questions. It saids that this page: http://goo.gl/8py9wj links to http://goo.gl/Uc7qKq which is a 404. I can't recognize the URL on the page which is a 404 and when searching in the code I can't find the %7Blink%7D in the URL which gives the problems. I hope you can help me to understand what triggers it 🙂
Moz Pro | | SebastianThode0 -
Find Historical SERP Ranking for a Term?
Is there any way to find out what pages ranked for a given term historically? I.e. what were the top 10 search results for "Widgets" 6 months ago, 1 year ago, 2 years ago? If I had a campaign tracking that term, I'd be able to look back, but I do not. Does this data exist anywhere in a format that could be queried?
Moz Pro | | kpclaypool0 -
Competitors links increasing rapidly
Hi there, I have 3 main competitors who I keep my eye on within Moz and I am always comparing pages as they are all sell similar products in the same way but we're increasingly getting more things put in place to make ourselves different, which is why we rank number 1 in the UK for their competitive keywords (without blowing my own trumpet!). One thing I have noticed, a competitor seems to be doing things the old school way. He has around 15 competitive keywords in his pages meta tag (irrelevant) and then has a huge 400 - 500 word description on each page describing whats on the page, what products are in that category (if it's a category page) or localised information if it's a company page. What seems to be odd, is that before penguin/panda their website had around 1,200,000 total links. Then after panda, he went down to around 300,000 total links. In the past few weeks I've noticed that he has gradually increased in total links from 300,000(estimate) to now 961,811 total links in a matter of weeks. Is this an error/glitch within Moz or is he doing something that may be classed as "blackhat" or is it something I shouldn't really be worrying about? Any feedback appreciated 🙂 Tom
Moz Pro | | tomhall900 -
How can I find out why my Domain authority has gone down?
I need to find out why the domain authority has dropped 6 points in the last 6 months - any ideas where to start looking?
Moz Pro | | RedC0 -
Problem with advanced linking domains report in OSE
Hi everyone, I request an advanced linking domains report in Open site explorer, but my csv file have only three columns as a normal linking domains report : Root Domain, Domain Authority, Number of Linking Root Domains. I have missed some important data as: anchor text, origin, target url Etc I try to request the same report two time, but I have the same problem. Someone have experienced this issue and know the fix ? thanks
Moz Pro | | wwmind0 -
help with the inbound links side of seomoz
Hi Can somebody help with the inbound links tool of seomoz, can you point me in the direction of how it works and the best practices to get the most from it. I know i have a lot more inbound links to the site in my campaign but its shows for example 114 links but when i click on show more it only shows 3 . am i doing something wrong? Any help and advice from how people use it , what for and what is best practice thanks.
Moz Pro | | Bristolweb0 -
How do I find the corresponding duplicate content pages from my SEOmoz report?
Once I have run my report and the duplicate content pages come up, is there a way to find out which pages have the duplicate content on them? I have one URL but where can I find the duplicate content that corresponds to it? Thanks Barry
Moz Pro | | MrBarrytg0 -
Crawl test. Bot crawled only 200 or so links when it should have crawled thousands
Hi everyone, I just recieved my crawl test report and its only given me 200 or so URL's when my site has thousands, any thoughts?
Moz Pro | | Ev840