Drupal Question
-
So on our site we have a plugin for our fan gallery. The issue is that I am getting a lot of duplication errors and it's saying the URL is too long and all the errors are coming from the Fan Gallery, which has over 8,000 errors. It seems to be pulling a long form query URL that has over 100 characters. You can't physically see it on the site, but the crawlers can.
Anyway I'm trying to figure out a fix for this. One method would be to just stop those pages from being crawled, but I would hate to do that as the fan gallery for us would be a great source of links and content.
So I'm wondering if anyone else has had an issue with these types of plugins before where the user can upload a photo or do a video embed and then it submits to the site.
If you have a better method please let me know. I usually work on E-comm platforms so my experience with drupal is limited.
-
Well, we're using drupal also and had the same problem, we fixed it by making a custom view with some custom paths that we're smaller for example:
previously: http://www.domain.com/news/typeofnews/paperback/issue-20-august-2011/itemtitle1/
(as you can imagine the titles could be large)Now: http://www.domain.com/news/20-09-2012/title
Maybe this is possible for u also ?
-
Hi Kate,
Looking at the URL string there I see both Pressflow and Pantheon variables being passed. It looks like the platform is in the way. I would suggest sending an email to the folks at Pantheon and/or Pressflow to get some help. I'm not sure what your technical expertise is, but Pressflow is a flavor of Drupal and Pantheon is a hosting service for Drupal. They appear to be adding variables to the URL, which probably isn't necessary.
Just my guess.
John
-
Thanks,
Subfolders. I have a few URL's with over a 100 characters because of what things are named, but then it pulls up this really long query string like this:
URL/welcome-new-raywjcom?PRESSFLOW_SETTINGS=%7B%22conf%22%3A%7B%22pressflow_smart_start%22%3Atrue%2C%22pantheon_binding%22%3A%22e92472919be14d0b93b8d8ccd2e6b8c1%22%2C%22pantheon_site_uuid%22%3A%22da9acf76-5d3a-4fab-8c70-bb1e73cbe931%22%2C%22pantheon_environment%22%
and that's only a portion of it it keeps going on and on after that. Which is why I was thinking just to block it for now.
-
Is it the actual query that is over 100 characters or is it a long URL with lot's of subfolders that is causing it to be so long? If the latter, then maybe you should try to start over again and put the gallery closest to the root.
If this can't be done I would look into other plugins, Drupal can be buggy as heck with plugins and the issue might be isolated to that plugin. I would try to fix that and wouldn't consider blocking those pages as more than a temporary option to protect the site while this is being looked at.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Subdomain Question
Having a difficult time on our site and looking for some advice. Our site pages are indexed perfectly, however, we have a subdomain where we have all of our images and PDF's. We only have the main domain set-up in Search Console with our sitemap. We can't seem to get any of our images indexed by Google that are in the subdomain however all the PDF's are indexed. My thought is to add the subdomain to SC and create a new sitemap that is just for the subdomain. Assuming we are not blocking any folders or files with our robots.txt can anyone think of any other reasons why the images wouldn't get indexed.
Technical SEO | | cbathd
Thanks0 -
Questions on switching domain name of my site
Hey guys, I purchased a domain name on May 1st and have been running my site on it since then. At the time when I added the domain name to the shopping cart, I felt that it was the best domain name a human could possibly come up with. But now, after two months, I am not so sure about it. And that means I am thinking of getting a new domain name and redirecting the old domain to the new one. Give that the domain is only two months old, there is not much to lose. However, my domain name does have some valuable backlinks, and has built some credibility in the eyes of search engines. According to MOZ Open Site Explorer, its Domain Authority is 10 and Page Authority (homepage) is 20. It's not much, but I fear that if I switch the domain name, I will reset the clock and the new domain name will have to earn the rankings from scratch That said, after searching this forum, I come across articles like this one from google and this 2012 article from MOZ that gave me some confidence. The MOZ article is from 2102, and there have been several changes since then. Is there an updated version of that article, or another legit article on the internet that you can point me to? As you can tell, I want to be more than sure that we don't take any hit during the migration process. Also want to mention that the site is on WordPress -- in case that is relevant. Thank you for your help.
Technical SEO | | deathbyseo0 -
Question on Google's Site: Search
A client currently has two domains with the same content on each. When I pull up a Cached version of the site, I noticed that it has a Cache of the correct page on it. However, when I do a site: in Google, I am seeing the domain that we don't want Google indexing. Is this a problem? There is no canonical tag and I'm not sure how Google knows to cache the correct website but it does. I'm assuming they have this set in webmaster tools? Any help is much appreciated! Thanks!
Technical SEO | | jeff_46mile0 -
Detailed ranking question for the pros
Hi Community, We've been struggling with the search engine ranking of our SEO optimised homepage for a number of months. I'm going to provide an overview of the page stats in hope that somebody might have a suggestion as to what the problem might be or where we should be focusing our efforts. I have also provided the stats of our main competitor as I have no idea why they are ranking so high based on the stats provided: URL in question: https://mysite.com On Page Grade for our targeted keyword: A Domain authority: 36 Page authority: 45 Root Doman Links: 57 Total Links: 634 SE Ranking: #17 Competitor URL in question: https://competitorsite.com On Page Grade for same targeted keyword: A Domain authority: 32 Page authority: 43 Root Doman Links: 28 Total Links: 919 SE Ranking: #1 Another strange this about our homepage is that a second tier page on our site is actually ranking higher in the search ranking for the Targeted Keyword (#9), even though this page has not been optimised and has an On Page Grade of F. Does anybody have any suggestions of what we might be overlooking or what the issue here might be? -JF
Technical SEO | | ERpro0 -
Easy Question: regarding no index meta tag vs robot.txt
This seems like a dumb question, but I'm not sure what the answer is. I have an ecommerce client who has a couple of subdirectories "gallery" and "blog". Neither directory gets a lot of traffic or really turns into much conversions, so I want to remove the pages so they don't drain my page rank from more important pages. Does this sound like a good idea? I was thinking of either disallowing the folders via robot.txt file or add a "no index" tag or 301redirect or delete them. Can you help me determine which is best. **DEINDEX: **As I understand it, the no index meta tag is going to allow the robots to still crawl the pages, but they won't be indexed. The supposed good news is that it still allows link juice to be passed through. This seems like a bad thing to me because I don't want to waste my link juice passing to these pages. The idea is to keep my page rank from being dilluted on these pages. Kind of similar question, if page rank is finite, does google still treat these pages as part of the site even if it's not indexing them? If I do deindex these pages, I think there are quite a few internal links to these pages. Even those these pages are deindexed, they still exist, so it's not as if the site would return a 404 right? ROBOTS.TXT As I understand it, this will keep the robots from crawling the page, so it won't be indexed and the link juice won't pass. I don't want to waste page rank which links to these pages, so is this a bad option? **301 redirect: **What if I just 301 redirect all these pages back to the homepage? Is this an easy answer? Part of the problem with this solution is that I'm not sure if it's permanent, but even more importantly is that currently 80% of the site is made up of blog and gallery pages and I think it would be strange to have the vast majority of the site 301 redirecting to the home page. What do you think? DELETE PAGES: Maybe I could just delete all the pages. This will keep the pages from taking link juice and will deindex, but I think there's quite a few internal links to these pages. How would you find all the internal links that point to these pages. There's hundreds of them.
Technical SEO | | Santaur0 -
One more redirect question
If there are two URLs like below: example.com/toys/batman-toys
Technical SEO | | IceIcebaby
example.com/birthday/batman-toys Both have the exact same everything, except URL key. The first example ranks for all KWs and search terms in the SEs. Does having the second page hurt my ranking potential for the first page? Should I redirect the 2nd page to the first or just leave it? As always, thanks for your help.0 -
Using Drupal to Author Websites across 2 Domains
I am new to Drupal and as an organization we are considering using it to author both our corporate site and our blog. In the future we'd like our blog to live at a separate domain and I understand Drupal is capable of publishing across 2 domains. Does anyone know of any SEO implications to this type of infrastructure set up? Are there specific things to be mindful of when setting up the Drupal CMS across 2 domains? thanks for the assistance!
Technical SEO | | Hershel.Miller0 -
301 Redirect "wildcard" question
I have been looking at the SEOmoz redirect guide for some advice but I can't seem to find the answer : http://www.seomoz.org/learn-seo/redirection I have lots of URLs from a previous version of a site that look like the following: sitename.com/-c-25.html?sort=2d&page=1 sitename.com/-c-25.html?sort=3a&page=1 etc etc. I want to write a redirect so whenever a URL with the terms "-c-25.html" is requested it redirects to a specified page, regardless of what comes after the question mark. These URLs were created by our previous ecommerce software. The 'c' is for category, and each page of the cateogry created a different URL. I want to do these so I can rediect all of these URLs to the appropraite new cateogry page in a single redirect. Thanks for any help.
Technical SEO | | craigycraig0