Session IDs and crawlers
-
Hello here.
When we setup our e-commerce website virtualsheetmusic.com to allow session IDs to be assigned to users back in 2001, we decided to not assign them if a bot called the page. We wanted to be sure that bots, which officially can't store cookies, wouldn't have found links containing every time different session IDs . Just to better clarify, the way session IDs are generated on our system, is the standard way: if users have cookies enabled, a cookie called PHPSESSID is created which stores the session cookie. If the cookies are not enabled, session IDs are added automatically by the system to any link URL included on the page which could potentially cause the bots to find every time different link URLs with the session ID appended to them.
Now, after 12 years, we are considering if this is still a valuable solution, or can it be detrimental or negative in some way? What are your thoughts about this issue?
Thank you in advance for any thoughts.
Fab.
-
Thank you Kurt, that's exactly what I thought but I wanted to have confirmation from the experts community.
Thank you again!
-
You're not giving the search engines different content, so it's not deceptive. I can't think of any way it would harm you.
-
Thank you guys for your replies and insights, I am more for keeping what we have lived with so far, which means leaving the system on our site disabling session IDs when bots request the pages, unless you tell me there is any downsides to do that... that's really what I am trying to find out here. Is there any downside to not serve pages with session IDs to search engines compared to users?
Thank you again.
-
I can't speak to the technical side of setting up session IDs, however, you can deal with the URL issue with canonical tags and setting up URL parameters in Google and Bing Webmaster Tools. That should prevent the search engines from indexing every URL with a different session id and keep all the page authority on the main URL.
Kurt Steinbrueck
OurChurch.Com -
Hi Fabrizo,
Nowadays I would say there are better solutions to fix this issue. But I'm really not sure if you could convince me for rebuilding this feature on the site as the impact for SEO would probably be not really big. I think the best way to not set any Session IDs in the URL at all so you have plain URLs. What you then could do is use these pages as the basis of your URL structure and SEO strategy. You could also then canonicalize the session ID'ed urls back to the plain ones.
Hope this helps! Makes sense?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Huge ranking flux that we cannot explain
Hello, SEO experts from around the world. We need your help; we have seen massive ranking flux across our website and others. We cannot explain what is causing this ranking flux. The content marketing is top quality, so we don’t know why we are moving from 25 to 50th on Google and sometimes even beyond that. Can any SEO experts explain why our agency is moving so much within Google’s rankings? We don’t know whether to make changes or possibly wait. Any help would be fantastic; thank you all.
Algorithm Updates | | sarahwalsh0 -
Do crawler reads ahrefs tag which is just a text written in html form ,not a hyperlink or blue text, ?
i recently posted a back link but it didnot turn into hyperlink but just a html ahref tag, does it give any link equity? does it behave as a link because its not clickable?
Algorithm Updates | | calvinkj0 -
International Homepage Advice
Hello, colleagues! We have a conundrum. A client website has a good subdirectory strategy for localized/translated content for its various international markets, but nothing currently "lives" at the root. In my mind, this presents a challenge to search engines (note that we have had some trouble getting proper visibility overall, which is why I'm asking this question). I'm looking for any links or just plain old good advice on why it's important to have a global homepage. Should that global homepage be in English? Most enterprise sites I've worked with do have a homepage that's in English, with the ability to select a country from a drop down in a nav across the site. Any advice, best practices, etc. about why a global homepage is important and what language it could/should be in would be really helpful. Hreflang tags would make sense, I guess, but each country has slightly different offerings so I'm not sure that it makes complete sense. In other words, one country's homepage may have completely different content than another's. Thank you!
Algorithm Updates | | SimpleSearch0 -
Google not showing the recent cache info: How to know the last cached version of a page?
Hi, We couldn't able to see the last Google cached version of our homepage after March 29th. Just wondering why this is happening with other websites too. When we make some changes to the website, we will wait to our website indexed and cached, so the changes will have some ranking impact. Now we couldn't able to check if the website got indexed with changes. Is there any other way to check the latest cached version or time of last index? Thanks
Algorithm Updates | | vtmoz0 -
Is using REACT SEO friendly?
Hi Guys Is REACT SEO friendly? Has anyone used REACT and what was the results? Or do you recommend something else that is better suited for SEO? Many thanks for your help in advance. Cheers Martin
Algorithm Updates | | martin19700 -
Meta robots at every page rather than using robots.txt for blocking crawlers? How they'll get indexed if we block crawlers?
Hi all, The suggestion to use meta robots tag rather than robots.txt file is to make sure the pages do not get indexed if their hyperlinks are available anywhere on the internet. I don't understand how the pages will be indexed if the entire site is blocked? Even though there are page links are available, will Google really index those pages? One of our site got blocked from robots file but internal links are available on internet for years which are not been indexed. So technically robots.txt file is quite enough right? Please clarify and guide me if I'm wrong. Thanks
Algorithm Updates | | vtmoz0 -
What does it mean to build a 'good' website.
Hi guys. I've heard a lot of SEO professionals, Google, (and Rand in a couple of whiteboard Friday's) say it's really important to build a 'good' website if you want to rank well. What does this mean in more practical terms? (Context... I've found some sites rank much better than they 'should' do based on the competition. However, when I built my own site (well-optimised (on-page) based on thorough keyword research) it was nowhere to be found (not even top 50 after I'd 'matched' the backlink profile of others on page 1). I can only put this down to there being 'good quality website' signals lacking in the latter example. I'm not a web developer so the website was the pretty basic WordPress site.)
Algorithm Updates | | isaac6630 -
Optimize for separate words or combined word.
I can't find good answers to this question so I'm asking here. Thanks for any help you can give. Most people, 4 out of 5, search for our product using two separate words, while the trademarked name of the product is one word. Think: CleanCar(tm) vs Clean Car. However our product is a leader in the industry so it would be like searching for perhaps "Play Station" vs "Playstation" if people were looking for a gaming console in general. Google separates them in the search volumes so I am assuming it does not see Clean Car in the same way it sees CleanCar. I (obviously) want to rank as highly as possible in both while keeping brand integrity in mind. Should I SEO for just the CleanCar or both? Perhaps using CleanCar in the title and Clean Car in the description? Does Google distinguish? Thanks! bnew
Algorithm Updates | | mcampanaro0