On Sat, Apr 6, 2013 at 6:37 PM, janI <j...@apache.org> wrote:

> On 7 April 2013 00:16, Rob Weir <robw...@apache.org> wrote:
>
> > On Sat, Apr 6, 2013 at 5:19 PM, janI <j...@apache.org> wrote:
> >
> > > On 6 April 2013 22:21, Rob Weir <robw...@apache.org> wrote:
> > >
> > > > On Sat, Apr 6, 2013 at 12:37 PM, janI <j...@apache.org> wrote:
> > > >
> > > > > On 6 April 2013 13:37, Rob Weir <robw...@apache.org> wrote:
> > > > >
> > > > > > On Sat, Apr 6, 2013 at 5:51 AM, janI <j...@apache.org> wrote:
> > > > > >
> > > > > > > Hi.
> > > > > > >
> > > > > > > Now that our wiki (mwiki) runs smoothly and performs quite
> well,
> > it
> > > > is
> > > > > > > slowly time to take a look at our other services.
> > > > > > >
> > > > > > > In order to determine what should/can be done, I need some
> > figures.
> > > > > > >
> > > > > > > Can anybody help me with traffic figures for
> > > > > > >    wiki.o.o, forum.o.o (total for the vm not per language) and
> > > > www.o.o
> > > > > > >
> > > > > > > Does end-users download directly from www.o.o or from a
> secondary
> > > > site
> > > > > ?
> > > > > > >
> > > > > > > I need the following numbers (if possible):
> > > > > > >    - daily number of clicks (average)
> > > > > > >    - peak number of clicks within a short period like 1 hour
> > > > > > >
> > > > > > >
> > > > > >
> > > > > > From Google Analytics we have:
> > > > > >
> > > > > > For wiki:  around 33K page views/day
> > > > > >
> > > > > > For forums, I have no idea.  They are using a Google Analytics
> > > account
> > > > (
> > > > > > UA-1087265-8) that is not connected with the rest of the website.
> >  If
> > > > > that
> > > > > > could be updated to match the rest of the website we'd have
> typical
> > > > > numbers
> > > > > > in a week.
> > > > > >
> > > > >
> > > > > Thanks for your answers, but I am a bit scared you (as a PMC) tell
> me
> > > > > "they" use an account unknown to us. Who has the forum account ?
> > > > >
> > > > > I have the technical power to change it (sudo access) but if a PMC
> > tell
> > > > we
> > > > > do no know who owns it, it would not be right just to change it.
> Who
> > > but
> > > > > the "owner" can authorize a change ?
> > > > >
> > > > > I trust this is a misunderstanding somewhere, it cannot be that we
> > run
> > > a
> > > > > service, that we do not control, in the sense of change management.
> > > > >
> > > > >
> > > > I assume this is just a legacy account from Sun days.  The same GA
> code
> > > was
> > > > used for other OOo-associated web services as well.
> > > >
> > > > I raised this issue a year ago on this mailing list:
> > > >
> > > > http://markmail.org/thread/mtknhgrwjkajj2ux
> > > >
> > > > I'm assuming this is just a "lack of maintenance issue" and we can
> add
> > it
> > > > to the list of other things that need attention.
> > > >
> > >
> > > Ok, I just wanted to be sure, when I come around to it, I will find the
> > > header page in forum and change. Sadly enough the setup of the forum vm
> > is
> > > not "a piece of cake", it seems to have gone through quite a lot of
> > changes
> > > (but it works and that is important).
> > >
> > >
> > > >
> > > > > For the website as a whole (all subdomains tracked) it is around
> > > 730K/day
> > > > > > for a peak day.  If you subtract out the forum traffic and the
> > > > downloads
> > > > > > (which serviced by SourceForge) then the static website is
> probably
> > > > more
> > > > > > like 600K/day visits.
> > > > > >
> > > > >
> > > > > Why do I need to subtract forum, I thought they ran as a separate
> > dns ?
> > > > do
> > > > > we not count on basis of our template (that loads the GA account).
> > > > >
> > > > >
> > > >
> > > > Sorry, I meant "subtract out the wiki" traffic.  GA reports the total
> > > > traffic for all tracked domains, and then allows filtering by URL
> > > pattern.
> > > > So total traffic minus the /wiki/ traffic minus the download traffic
> > > gives
> > > > us the static website traffic.
> > > >
> > >
> > > ok, got it.
> > >
> > > >
> > > >
> > > > >
> > > > > >
> > > > > > So that indirectly answers your other question -- the big
> download
> > > > files
> > > > > > are from the SourceForge distribution network.  But we do have
> > > smaller
> > > > > > downloads, of documentation PDF's.  These are mainly from the
> wiki.
> > > > > >
> > > > > > In terms of peaks, our biggest day usually Wednesday, lowest on
> > > Sunday.
> > > > > > The variation there around 200K/visits.
> > > > > >
> > > > >
> > > > > Super just what I need.
> > > > >
> > > > >
> > > > > >
> > > > > > Hourly peak is around 15K visits for all tracked subdomains
> > together.
> > > > > >
> > > > >
> > > > > Do you know the division between mwiki, forum and www ?
> > > > >
> > > > >
> > > > I can't seem to filter on the GA screen that gives hourly detail, so
> I
> > > only
> > > > see the count cross-subdomain.
> > > >
> > > >
> > > > >
> > > > > > Note:  for all of these I'm talking about "pages", where a single
> > > page
> > > > > > could include several HTTP calls, for the HTML, the Javascript,
> > > images,
> > > > > > etc.  Since anyone with Javascript not enabled would not
> register,
> > > the
> > > > > > actual numbers are probably a little bit higher than this.
> > > > > >
> > > > >
> > > > > Good point, that is very important for a a traffic server, where
> e.g.
> > > > > javascripts would be cached.
> > > > >
> > > > >
> > > > We might be able to get the httpd logs from Infra, in which case we
> > could
> > > > do more traditional traffic stats
> > >
> > > I have the httpd logs for forum and mwiki, but I miss a tool to make
> > proper
> > > statistic. I have asked in the httpd channel, but no luck.
> > >
> > >
> > I use a commercial tool called WebLog Expert.  I'm happy to run a report
> on
> > the logs if they are accessible.
> >
>
> Thx, I will see if the log can come down to a size where I can transport
> them.
>
>
I'm happy to scp them as well, whatever works for you.


> Forum is now changed to our main GA account, and I changed all the language
> forums as well, can you please check that we get GA data (I assume you have
> to wait 24 hours or something).
>
>
Data is starting to come through, 52169 page views yesterday, across
languages.



> Lets see in a week or so, if my assumption is correct (that mwiki and forum
> have the same usage pattern).
>
>

Yes.



> As a side remark, I can really understand the complains there have been
> about the problems with the forum. While I did my part, apache ran constant
> with more work tasks, than configured mysql connections and the vm was real
> busy (which is good, since it means we have active users).
>
>
I know what you mean.  I used to do performance work for IBM Websphere
Portal products.  80% of it was tuning the various thread pools and work
queues.  10% of it was DB tuning.  The other 10% was black magic.


Regards,

-Rob



> Have a nice day/evening.
>
> rgds
> jan I.
>
> >
> > -Rob
> >
> >
> >
> > >
> > > > .
> > > >
> > > >
> > > >
> > > > > I am trying to convince Infra, that we should have a common
> > > trafficserver
> > > > > for wiki and forum (and maybe www). Currently we use eresbus as
> > proxy,
> > > > and
> > > > > all packets are forwarded to the ooo-wiki2-vm where that
> > trafficserver
> > > > > filters cached pages. Forum goes direct to httpd, so every packet
> > > causes
> > > > > mysql activity. If we had a trafficserver in eresbus, we would for
> > > wiki2
> > > > > save loads of network traffic and for forum network traffic as well
> > as
> > > > > mysql activity.
> > > > >
> > > > >
> > > > > >
> > > > > > In any case, one approach would be to enable GA for the Forums,
> > track
> > > > > that
> > > > > > for a week or two and then compare that to the Wiki numbers.  If
> > the
> > > > > stacks
> > > > > > are similar, then we could think of the traffic ratio between the
> > two
> > > > and
> > > > > > tune based on that.
> > > > > >
> > > > >
> > > > > I agree, but not being PMC, I do not feel I am in the position to
> > > "just"
> > > > > make that change. Who "owns" the forum, they should authorize such
> a
> > > > > change. I can make the change (provided you tell give me the GA
> line
> > to
> > > > > insert) when I am sure I am not doing a change that someone else
> > depend
> > > > on.
> > > > >
> > > > >
> > > > I've certainly raised the issue on the list previously and there were
> > no
> > > > objections.   If you have access let me know offline and I can give
> you
> > > the
> > > > codes.
> > > >
> > > > Note that this root page is not in the Apache CMS.  It doesn't look
> > > > phpBB-related so there might be a static page out there on that host
> > that
> > > > would need to be updated manually:  http://forum.openoffice.org/
> > > >
> > >
> > > thx for the info.
> > >
> > > rgds
> > > Jan I.
> > >
> > > >
> > > >
> > > > > >
> > > > > >
> > > > > > > I would like to tune at least forum before we release 4.0 to
> > > prepare
> > > > > for
> > > > > > a
> > > > > > > higher load. Tuning of www depend on a discussion with infra,
> > > because
> > > > > it
> > > > > > is
> > > > > > > a shared service.
> > > > > > >
> > > > > > >
> > > > > > If it is like the release of 3.4.0 and 3.4.1 we'll get a few hot
> > > pages:
> > > > > >
> > > > > > 1) The announcement blog post.  With both 3.4.0 and 3.4.1
> releases
> > > > these
> > > > > > were heavily linked to by the press.  Roller went down in the
> 3.4.1
> > > > case.
> > > > > > I don't know if that was due to the traffic or not.  But we might
> > > want
> > > > to
> > > > > > plan on having the announcement post be a static webpage just to
> be
> > > > sure.
> > > > > >
> > > > >
> > > > > The blog post is on the general apache server and not our www, is
> > that
> > > > > correct ?
> > > > >
> > > > >
> > > > Correct.
> > > >
> > > >
> > > > > >
> > > > > > 2) The homepage and http://www.openoffice.org/download page will
> > be
> > > > hot.
> > > > > > You can see from the download chart the boost we get when a new
> > > release
> > > > > > occurs:  http://www.openoffice.org/stats/downloads.html  (AOO
> > 3.4.0
> > > > was
> > > > > > May
> > > > > > 8th 2012 and AOO 3.4.1 was Aug 23rd).
> > > > > >
> > > > >
> > > > > That is a real obvious case for trafficserver, and until it is in
> > > place I
> > > > > can ask to have the httpd use memcached (which it might be
> already).
> > > > > Currently the www is on a general apache server, so I need strong
> > > > arguments
> > > > > to make changes (high load is a strong argument)
> > > > >
> > > > >
> > > > > >
> > > > > >
> > > > > >
> > > > > >
> > > > > > > Translate-vm also needs tunings, but that is in progress, and
> > infra
> > > > has
> > > > > > > been very kind and prepared an extra vm, so we/I install/tune
> > > without
> > > > > > > affecting the active system.
> > > > > > >
> > > > > > > thx in advance for figures.
> > > > > > > rgds
> > > > > > > jan I.
> > > > > >
> > > > >
> > > > > THANKS a lot for your answer, which help me a lot doing calc..
> Sorry
> > > for
> > > > > being frank about the forum, but it really make me nervous that we
> > > have a
> > > > > service where part it unknown.
> > > > >
> > > > > rgds
> > > > > Jan I
> > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
>

Reply via email to