Vitor,
On 14.12.2016 at 19:23, Vitor Medina Cruz wrote:
If I tell you that my current estimate is that a Smalltalk image
with Seaside will not be able to handle more than 20 concurrent
users, in many cases even less.
Seriously? That is kinda a low number, I would expect more for each
image. Certainly it depends much on many things, but it is certainly
very low for a rough estimate, why you say that?
Seriously, I think 20 is very optimistic, for several reasons.
One, you want to be fast and responsive for every single user, so there
is absolutely no point in going too close to any limit. It's easy to
lose users by providing a bad experience.
Second, in a CRUD application you mostly do a lot of DB queries. You
connect to all kinds of stuff and do I/O, and some of these things
simply block the VM. Even if that is only for 0.3 seconds, you postpone
processing for every "unaffected" user by those 0.3 seconds, and this
adds up to significant delays in response time. For heavy DB
operations, 0.3 seconds is not a terribly bad estimate. Add to that the
materialization and related work within the Smalltalk image.
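To make that concrete, here is a back-of-the-envelope calculation (just a sketch; the 0.3 s figure is the estimate above, and the queue lengths are made up):

```python
# Back-of-the-envelope: if a blocking DB call stops the whole VM for
# 0.3 s, every request that arrives during that window waits in line.
BLOCK_S = 0.3  # assumed blocking time per heavy DB operation

def worst_case_wait(queued_heavy_ops: int, block_s: float = BLOCK_S) -> float:
    """Wait time for the last request if that many heavy operations
    are serialized in front of it (single blocking VM)."""
    return queued_heavy_ops * block_s

for n in (1, 3, 5, 10):
    print(f"{n} queued heavy ops -> last user waits {worst_case_wait(n):.1f} s")
```

Even a handful of overlapping heavy operations pushes the unlucky last user past the one-second mark, which is exactly the "bad experience" problem above.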
Seaside adapters usually spawn a green thread for each request. But
there are things that need to be serialized (like anything inside a
critical block), so in reality, users block each other far more often
than you'd like.
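The effect is easy to demonstrate outside Smalltalk. This Python sketch (the 0.05 s of "shared-state work" is an assumed number) shows four nominally concurrent request handlers serializing behind one lock, just as green threads do behind a critical section:

```python
import threading
import time

lock = threading.Lock()  # stands in for a Seaside critical section

def handle_request(results: list, i: int) -> None:
    # Every "green thread" must enter the critical section, so the
    # handlers run one after another despite being started together.
    with lock:
        time.sleep(0.05)  # pretend shared-state work
        results.append(i)

results: list = []
threads = [threading.Thread(target=handle_request, args=(results, i))
           for i in range(4)]
start = time.monotonic()
for t in threads:
    t.start()
for t in threads:
    t.join()
elapsed = time.monotonic() - start
# Four "concurrent" requests still take roughly 4 x 0.05 s in total.
print(f"4 requests, elapsed {elapsed:.2f} s")
```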
So if you asked me for a more realistic estimate, I'd correct myself
down to somewhere between 5 and a maximum of maybe 10 concurrent users.
Everything beyond that means you must use all those fancy tricks and
tools people mention in this thread.
So what you absolutely need to do is start with an estimate of 5
concurrent users per image and look for ways to distribute work among
servers/images so that these blocking situations are kept to a minimum.
If you find your software handles much more, congratulate yourself and
stack up new machines more slowly than initially estimated.
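Distributing work among images usually means putting a balancer in front of them. A minimal sketch of such a setup, assuming nginx in front of three Seaside images (the ports and names are made up); `ip_hash` keeps a user sticky to the image that holds their session objects:

```nginx
upstream seaside_images {
    ip_hash;                 # sticky: session objects live in one image
    server 127.0.0.1:8081;
    server 127.0.0.1:8082;
    server 127.0.0.1:8083;
}

server {
    listen 80;
    location / {
        proxy_pass http://seaside_images;
        proxy_set_header Host $host;
    }
}
```

Adding capacity then means starting another image and adding one `server` line.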
Before you turn around and say Smalltalk is unsuitable for the web,
let's take a brief look at what "concurrent users" really means.
Concurrent users are users that request some processing from the server
at the very same time (say, within an interval of 200-400 ms). This
is not the same as 5 people being logged on to the server and
occasionally requesting something. 5 concurrent users can mean 20, 50
or 100 users who are logged in at the same time.
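A rough way to connect logged-in users with concurrent users is Little's law: concurrency equals arrival rate times service time. A sketch, with assumed numbers (30 s average think time per user, 0.3 s of processing per request):

```python
# Little's law: concurrency = arrival_rate * service_time.
# Assumed numbers: each logged-in user issues a request every 30 s on
# average, and each request keeps the image busy for 0.3 s.
def concurrent_users(logged_in: int,
                     think_time_s: float = 30.0,
                     service_time_s: float = 0.3) -> float:
    arrival_rate = logged_in / think_time_s  # requests per second
    return arrival_rate * service_time_s

for n in (20, 50, 100):
    print(f"{n} logged-in users -> ~{concurrent_users(n):.1f} concurrent")
```

Under these assumptions, even 100 logged-in users average only about one concurrent request; it's the bursts and the slow operations that push you toward the 5-user limit.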
Then there is this sad "share all vs. share nothing" argument. In
Seaside you keep all your objects alive (read from the DB and
materialized) between web requests. In share-nothing, you read
everything back from disc/DB whenever a request comes in. This also
takes time and resources (and possibly blocks the server for the blink
of an eye or two). You trade RAM for CPU cycles and I/O. It is
extremely hard to predict which works better, and I guess nobody ever
ran A/B tests. It's all just theoretical bla bla and guesses about what
definitely must be better in one's own world.
Why do I bring up this share-everything stuff? Because it usually
means that each user who is logged on holds onto a load of objects on
the server side (session storage): their user account, shopping cart,
settings, last purchases, account information and whatnot. That's
easily a list of a few thousand objects (even if they are only proxies)
that take up space and want to be visited by the garbage collector. So
each connected user not only needs CPU cycles whenever they send a
request to the server, but also uses RAM. In our case, this can easily
be 5-10 MB of objects per user. Add to that the shadow copies that your
persistence mechanism needs for undo and such, and all the data Seaside
needs for continuations etc., and each logged-on user needs 15, 20 or
more MB of object space. Connect ten users and you have 150-200 MB.
That is not a problem per se, but it also means there is some hard
limit, especially in a 32-bit world. You don't want your server to slow
down because it cannot allocate new memory, or can't find contiguous
slots for things and GCs all the time.
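The memory arithmetic above, written out (a sketch; the per-user figures are the estimates from this paragraph, and the 32-bit ceiling is an assumed round number):

```python
# Per-user object-space estimates from above: 15-20 MB each
# (session objects, shadow copies, continuation data).
MB_PER_USER_LOW = 15
MB_PER_USER_HIGH = 20
IMAGE_CEILING_MB = 1024  # assumed practical limit for a 32-bit image

def footprint_mb(users: int, mb_per_user: int) -> int:
    """Session memory consumed by this many logged-in users."""
    return users * mb_per_user

lo = footprint_mb(10, MB_PER_USER_LOW)
hi = footprint_mb(10, MB_PER_USER_HIGH)
print(f"10 logged-in users -> {lo}-{hi} MB of session objects")
print(f"headroom left at worst case: {IMAGE_CEILING_MB - hi} MB")
```

Note that this scales with *logged-in* users, not concurrent ones, which is why RAM can become the limit long before CPU does.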
To sum up, I think the number of influencing factors is way too high to
really give a good estimate. Our experience (based on our mix of
computation and I/O) says that 5 concurrent users per image is doable
without negative impact on other users. Some operations take so much
time that you really need to move them out of the front-facing image
and distribute the work to backend servers. More than 5 is probably
possible, but chances are there are operations that will affect all
users, and with every additional user there is a growing chance that
two or more request the very same operation within a very short
interval. This will make things worse and worse.
So I trust you guys have lots of cool tools around and know loads of
tricks to wring much more power out of a single Smalltalk image, but
you also need to look at your productivity and speed in creating new
features and fixing bugs. Sometimes throwing hardware at a problem like
growth, and starting with a clever architecture that scales on multiple
layers, is just the perfect thing to do. To me, handling 7 instead of 5
concurrent users is not such a big win as long as we are not in a
position where we have so many users that this really matters. For
sites like Amazon, Google, Facebook etc., saving 40% in server cost by
optimizing the software (investing a few person-years) is significant.
I hope we'll soon change our minds about this question ;-)
So load balancing and services outsourced to backend servers are key to
scalability. This, by the way, is not Smalltalk-specific (some people
seem to think you won't get these problems in Java or Ruby because they
are made for the web...).
Joachim