Sitemaps, 404s and URL structure
-
Hi All!
I recently acquired a client and noticed in Search Console over 1300 404s, all starting around late October this year.
What's strange is that I can access the pages that are 404ing by cutting and pasting the URLs and via inbound links from other sites.
I suspect the issue might have something to do with Sitemaps. The site has 5 Sitemaps, generated by the Yoast plugin. 2 Sitemaps seem to be working (pages being indexed), 3 Sitemaps seem to be not working (pages have warnings, errors and nothing shows up as indexed). The pages listed in the 3 broken sitemaps seem to be the same pages giving 404 errors.
I'm wondering if auto URL structure might be the culprit here. For example, one sitemap that works is called newsletter-sitemap.xml, all the URLs listed follow the structure: http://example.com/newsletter/post-title
Whereas, one sitemap that doesn't work is called culture-event-sitemap.xml. Here the URLs underneath follow the structure http://example.com/post-title.
Could it be that these URLs are not being crawled / found because they don't follow the structure http://example.com/culture-event/post-title? If not, any other ideas?
Thank you for reading this long post and helping out a relatively new SEO!
-
Hi Daniel! Thanks for your question.
It's kind of hard to know what's going on without seeing your site. Feel free to PM it to me.
There's definitely a chance that this is the case, but if it's happening with Yoast it is likely a configuration issue on your site not with Yoast's technology. You may need to adjust your tag permalinks within your WordPress admin so that the URLs are correct in your sitemaps.
John
-
I'll make my question shorter and hopefully more clear...
If my Permalink structure in Wordpress is set up for a given custom post type, lets call it "culture", as: example.com/postname,
Yet with Yoast, a sitemap is automatically generated for posts tagged with "culture" that looks like example.com/culture/postname
Could that explain why posts being tagged as "culture" are showing up as 404s in Search Console?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Which product URL to include in Sitemaps?
Hi Does the product URL's in Sitemaps affect the sub-categories authority too? For example, if I have a product with 2 URL's and which have a canonical tag: **/brands/michael-kors/bags/**jet-set-double-zip-wallet/ **/women/accessories/wallets/**jet-set-double-zip-wallet/ If I make the main URL "/women/accessories/wallets/jet-set-double-zip-wallet/" and set that as the Canonical URL & list that URL in the XML Sitemap, will it also mean the "/women/accessories/wallets/" category will get more authority and increase it's power to rank? Thanks Frankie
Technical SEO | | Frankie-BTDublin0 -
Old forum with 404s, what should I do?
Hello, So I'm helping out some friends with their SEO. I've just run a Screaming Frog crawl of their entire site (which took hours and hours I might add). They used to have a forum connected to the site, which is no longer active. Google is still indexing all of the old URLs, which unsurprisingly return 404 errors. What should they do to prevent Google from indexing these pages? That's assuming they need to do anything at all. They don't have access to these old forum posts and therefore won't be able to fix the URL or resource adding a 301 redirect pointing to the most relevant alternate page. I'm new to SEO but my instinct is that they need to have the page return a 410 ‘Gone’ response code to give search engines a clear signal that the page no longer exists and won’t be returning, and removing the internal links to that URL or resource. 1. Is this interpretation correct?
Technical SEO | | jordanayresaira
2. What is the impact of leaving these 404s? There are over a thousand, so there's a lot 3. What should I recommend?0 -
Canonical URL on frontpage
I have a site where the CMS system have added a canonical URL on my frontpage, pointing to a subpage on my site. Something like on my domain root.Google is still showing MyDomain.com as the result in the search engines which is good, but can't this approach hurt my ranking? I mean it's basically telling google that my frontpage content is located far down the hierarki, instead of my domain root, which of course have the most authority.
Technical SEO | | EdmondHong87
Something seems to indicate that this could very well be the case, as we lost several placements after moving to this new CMS system a few months ago.0 -
Site structure headache
Hello all, I'm struggling to get to grips with a websites site structure. I appreciate that quality content is key etc, and the more content the better, but then I have issues with regards to doorway pages. For example im now starting to develop a lot of ecommerce websites and want to promote this service. should we have pages that detail all of the ins and outs of ecommerce - or should we simplify it to a couple of pages. what is best practice? Also isn't a content hub similar to having doorway pages? let me know what you think! William
Technical SEO | | wseabrook0 -
Should I change or redirect this URL?
Happy Friday everyone! I just noticed that one of our Attorney Profile's url's is wrong. We used to have someone named "Dana Fortugno" as our Family Law attorney, but when he left, (over two years ago) we hired "Scott Finelli." The person who setup the site, just changed the information on the page not url. So instead of it saying "http://www.kempruge.com/scott-finelli-jd-llm/;" it says "http://www.kempruge.com/dana-fortugno-jd-llm/." I'm considering taking all the content on the page with the wrong url, copying it to a new page with the correct URL and 301 redirecting (what would now be a blank page) to the new page with the correct URL. Is this the best way to handle this? Also, I don't believe there are many SEO concerns regarding the pages specifically. The profile pages aren't what we rank for in any of our Family Law related keywords. I am worried about having a completely blank page that just 301 redirects as looking bad to google, but not sure if it would? As always, thank you for your time and any assistance you can provide. Ruben
Technical SEO | | KempRugeLawGroup0 -
Structured Data Authorship
Hi I've just successfully set up authorship for a client according to the rich snippet testing tool although bit perplexed since underneath the results theres a section called 'Extracted Structured Data'. The first section is marked hatom feed and under that it says under the field saying 'Author' it says in red: Warning: At least one field must be set for Hcard.Warning: Missing required field "name (fn)".And then under the URL field & the URL it says:Warning: Missing required field "entry-title".Any ideas what this means or even if its important ? I would have thought the tool wouldnt acknowledge authorship as being set up correctly if this was an issue but that does beg the question what is it doing there and what does it mean ?Theres another section after that called rdfa node which seems all fineAlso says page does not contain publisher mark up although i know publisher has been added to the home page, is it best to add publisher to head section in every page (as i have heard some people say) or just the home page ?Many ThanksDan
Technical SEO | | Dan-Lawrence0 -
Folder Hierarchy Structure Theory
Hi, I was wondering if search engines, in particular, Google, actually use folder hierarchy to determine how important a particular page on a website might be for ranking purposes, or is on-site page inter-linking only taken into consideration. I know that external and internal links help to support the authority or 'page rank' of a particular webpage on a website. In a typical Wordpress installation, for example, it is easy to create a page and assign child-pages to support it. These sub-pages would naturally link to their parent pages via menu and/or body links, so they would theoretically 'support' the authority of the parent folder/page. My question is... would search engines see the parent folder page as more authoritative than a child-page, even without a lot of on-site interlinking of child and parent pages, just because it is higher up in the folder structure? For example, I have a client who has a Wordpress website, but is using a plugin to make all pages have a .htm ending. The site is fairly 'flat', hierarchally speaking and does not use any /folders/, but the pages are inter-linked. In the following scenario, there are 4 testimonial pages... 1 main one and 3 supporting pages. The 3 supporting pages are linked to from the parent page and vice versa. /testimonials.htm /testimonials-quality.htm /testimonials-price.htm /testimonials-ease.htm I was wondering if it is worth suggesting to my client that we remove that plugin so that we can more easily employ the natural folder hierarchy functions of Wordpress, such as this scenario: /testimonials/ /testimonials/quality/ /testimonials/price/ /testimonials/ease/ Would the loss of 'link juice' due to redirects and the work that would be involved would be worth the possible ranking increases of potentially structuring the website better... or are we fine just relying on the existing page interlinking to show the search engines what are the important parent pages?
Technical SEO | | OrionGroup0 -
Structure of urls
**Hallo from Athens, Greece. We have to implement the following project and i need your help: ** We will build a company guide for the whole country and company local guides for each city for the same client. **Information of the country guide is the sum of information of local guides, so when a user is at the country guide he sees information from companies from all cities and when the user is at city guide he sees info only for the city. ** The problem is the structure of the url we should have. Should the page of presentation of each company should have structure as domain.gr/id/company? or city.domain.gr/id/company and the one to be canonical to the other? is this good for seo? Should both urls be included in the sitemap? Thank you
Technical SEO | | herculesopa0