On 23 April 2013 20:39, Rob Weir <robw...@apache.org> wrote:

> On Mon, Apr 8, 2013 at 2:04 PM, Rob Weir <robw...@apache.org> wrote:
> >
> >
> >
> > On Sat, Apr 6, 2013 at 6:37 PM, janI <j...@apache.org> wrote:
> >>
> >> On 7 April 2013 00:16, Rob Weir <robw...@apache.org> wrote:
> >>
> >> > On Sat, Apr 6, 2013 at 5:19 PM, janI <j...@apache.org> wrote:
> >> >
> >> > > On 6 April 2013 22:21, Rob Weir <robw...@apache.org> wrote:
> >> > >
> >> > > > On Sat, Apr 6, 2013 at 12:37 PM, janI <j...@apache.org> wrote:
> >> > > >
> >> > > > > On 6 April 2013 13:37, Rob Weir <robw...@apache.org> wrote:
> >> > > > >
> >> > > > > > On Sat, Apr 6, 2013 at 5:51 AM, janI <j...@apache.org> wrote:
> >> > > > > >
> >> > > > > > > Hi.
> >> > > > > > >
> >> > > > > > > Now that our wiki (mwiki) runs smoothly and performs quite
> >> > > > > > > well, it is slowly time to take a look at our other services.
> >> > > > > > >
> >> > > > > > > In order to determine what should/can be done, I need some
> >> > > > > > > figures.
> >> > > > > > >
> >> > > > > > > Can anybody help me with traffic figures for
> >> > > > > > >    wiki.o.o, forum.o.o (total for the vm, not per language)
> >> > > > > > >    and www.o.o?
> >> > > > > > >
> >> > > > > > > Do end users download directly from www.o.o or from a
> >> > > > > > > secondary site?
> >> > > > > > >
> >> > > > > > > I need the following numbers (if possible):
> >> > > > > > >    - daily number of clicks (average)
> >> > > > > > >    - peak number of clicks within a short period like 1 hour
> >> > > > > > >
> >> > > > > > >
> >> > > > > >
> >> > > > > > From Google Analytics we have:
> >> > > > > >
> >> > > > > > For wiki:  around 33K page views/day
> >> > > > > >
> >> > > > > > For forums, I have no idea.  They are using a Google Analytics
> >> > > > > > account (UA-1087265-8) that is not connected with the rest of
> >> > > > > > the website.  If that could be updated to match the rest of the
> >> > > > > > website we'd have typical numbers in a week.
> >> > > > > >
> >> > > > >
> >> > > > > Thanks for your answers, but I am a bit scared that you (as a PMC
> >> > > > > member) tell me "they" use an account unknown to us. Who has the
> >> > > > > forum account?
> >> > > > >
> >> > > > > I have the technical power to change it (sudo access), but if a
> >> > > > > PMC member tells me we do not know who owns it, it would not be
> >> > > > > right just to change it. Who but the "owner" can authorize a
> >> > > > > change?
> >> > > > >
> >> > > > > I trust this is a misunderstanding somewhere; it cannot be that
> >> > > > > we run a service that we do not control, in the sense of change
> >> > > > > management.
> >> > > > >
> >> > > > >
> >> > > > I assume this is just a legacy account from Sun days.  The same GA
> >> > > > code was used for other OOo-associated web services as well.
> >> > > >
> >> > > > I raised this issue a year ago on this mailing list:
> >> > > >
> >> > > > http://markmail.org/thread/mtknhgrwjkajj2ux
> >> > > >
> >> > > > I'm assuming this is just a "lack of maintenance" issue and we can
> >> > > > add it to the list of other things that need attention.
> >> > > >
> >> > >
> >> > > Ok, I just wanted to be sure. When I come around to it, I will find
> >> > > the header page in the forum and change it. Sadly enough, the setup
> >> > > of the forum vm is not "a piece of cake"; it seems to have gone
> >> > > through quite a lot of changes (but it works, and that is important).
> >> > >
> >> > >
> >> > > >
> >> > > > > > For the website as a whole (all subdomains tracked) it is around
> >> > > > > > 730K/day for a peak day.  If you subtract out the forum traffic
> >> > > > > > and the downloads (which are serviced by SourceForge) then the
> >> > > > > > static website is probably more like 600K/day visits.
> >> > > > > >
> >> > > > >
> >> > > > > Why do I need to subtract the forum? I thought it ran under a
> >> > > > > separate DNS name. Do we not count on the basis of our template
> >> > > > > (that loads the GA account)?
> >> > > > >
> >> > > > >
> >> > > >
> >> > > > Sorry, I meant "subtract out the wiki" traffic.  GA reports the
> >> > > > total traffic for all tracked domains, and then allows filtering
> >> > > > by URL pattern.  So total traffic minus the /wiki/ traffic minus
> >> > > > the download traffic gives us the static website traffic.
> >> > > >
> >> > >
> >> > > ok, got it.
> >> > >
> >> > > >
> >> > > >
> >> > > > >
> >> > > > > >
> >> > > > > > So that indirectly answers your other question -- the big
> >> > > > > > download files are from the SourceForge distribution network.
> >> > > > > > But we do have smaller downloads, of documentation PDFs.  These
> >> > > > > > are mainly from the wiki.
> >> > > > > >
> >> > > > > > In terms of peaks, our biggest day is usually Wednesday, lowest
> >> > > > > > on Sunday.  The variation there is around 200K visits.
> >> > > > > >
> >> > > > >
> >> > > > > Super just what I need.
> >> > > > >
> >> > > > >
> >> > > > > >
> >> > > > > > Hourly peak is around 15K visits for all tracked subdomains
> >> > > > > > together.
> >> > > > > >
> >> > > > >
> >> > > > > Do you know the division between mwiki, forum and www ?
> >> > > > >
> >> > > > >
> >> > > > I can't seem to filter on the GA screen that gives hourly detail,
> >> > > > so I only see the count cross-subdomain.
> >> > > >
> >> > > >
> >> > > > >
> >> > > > > > Note:  for all of these I'm talking about "pages", where a
> >> > > > > > single page could include several HTTP calls, for the HTML, the
> >> > > > > > JavaScript, images, etc.  Since anyone with JavaScript not
> >> > > > > > enabled would not register, the actual numbers are probably a
> >> > > > > > little bit higher than this.
> >> > > > > >
> >> > > > >
> >> > > > > Good point, that is very important for a traffic server, where
> >> > > > > e.g. JavaScript files would be cached.
> >> > > > >
> >> > > > >
> >> > > > We might be able to get the httpd logs from Infra, in which case
> >> > > > we could do more traditional traffic stats
> >> > >
> >> > > I have the httpd logs for forum and mwiki, but I lack a tool to
> >> > > produce proper statistics. I have asked in the httpd channel, but no
> >> > > luck.
> >> > >
> >> > >
> >> > I use a commercial tool called WebLog Expert.  I'm happy to run a
> >> > report on the logs if they are accessible.
> >> >
> >>
> >> Thx, I will see if the logs can come down to a size where I can
> >> transport them.
> >>
> >
> > I'm happy to scp them as well, whatever works for you.
> >
> >>
> >> Forum is now changed to our main GA account, and I changed all the
> >> language forums as well. Can you please check that we get GA data (I
> >> assume you have to wait 24 hours or something)?
> >>
> >
> > Data is starting to come through, 52169 page views yesterday, across
> > languages.
> >
> >
> >>
> >> Let's see in a week or so if my assumption is correct (that mwiki and
> >> forum have the same usage pattern).
> >>
> >
>
> We've had a couple of weeks of GA on the Forums, so here is what I'm
> seeing:
>
> Forums: Peak is 72K pages per day, low 42K per day (weekend)
>
> Wiki: Peak is 32K pages per day, low 20K per day
>
> So looks like a 2:1 ratio.
>
Less than what I expected, but thx a lot for the numbers !!

rgds
jan I.

>
> -Rob
>
>
> >
> > Yes.
> >
> >
> >>
> >> As a side remark, I can really understand the complaints there have
> >> been about the problems with the forum. While I did my part, apache
> >> constantly ran with more work tasks than configured mysql connections,
> >> and the vm was really busy (which is good, since it means we have
> >> active users).
> >>
> >
> > I know what you mean.  I used to do performance work for IBM Websphere
> > Portal products.  80% of it was tuning the various thread pools and work
> > queues.  10% of it was DB tuning.  The other 10% was black magic.
> >
> >
> > Regards,
> >
> > -Rob
> >
> >
> >>
> >> Have a nice day/evening.
> >>
> >> rgds
> >> jan I.
> >>
> >> >
> >> > -Rob
> >> >
> >> >
> >> >
> >> > >
> >> > > > .
> >> > > >
> >> > > >
> >> > > >
> >> > > > > I am trying to convince Infra that we should have a common
> >> > > > > trafficserver for wiki and forum (and maybe www). Currently we
> >> > > > > use eresbus as proxy, and all packets are forwarded to the
> >> > > > > ooo-wiki2-vm, where that trafficserver filters cached pages.
> >> > > > > Forum goes direct to httpd, so every packet causes mysql
> >> > > > > activity. If we had a trafficserver in eresbus, we would save
> >> > > > > loads of network traffic for wiki2, and for forum we would save
> >> > > > > network traffic as well as mysql activity.
> >> > > > >
> >> > > > >
> >> > > > > >
> >> > > > > > In any case, one approach would be to enable GA for the Forums,
> >> > > > > > track that for a week or two and then compare that to the Wiki
> >> > > > > > numbers.  If the stacks are similar, then we could think of the
> >> > > > > > traffic ratio between the two and tune based on that.
> >> > > > > >
> >> > > > >
> >> > > > > I agree, but not being PMC, I do not feel I am in the position to
> >> > > > > "just" make that change. Whoever "owns" the forum should
> >> > > > > authorize such a change. I can make the change (provided you give
> >> > > > > me the GA line to insert) when I am sure I am not making a change
> >> > > > > that someone else depends on.
> >> > > > >
> >> > > > >
> >> > > > I've certainly raised the issue on the list previously and there
> >> > > > were no objections.   If you have access let me know offline and I
> >> > > > can give you the codes.
> >> > > >
> >> > > > Note that this root page is not in the Apache CMS.  It doesn't
> >> > > > look phpBB-related so there might be a static page out there on
> >> > > > that host that would need to be updated manually:
> >> > > > http://forum.openoffice.org/
> >> > > >
> >> > >
> >> > > thx for the info.
> >> > >
> >> > > rgds
> >> > > Jan I.
> >> > >
> >> > > >
> >> > > >
> >> > > > > >
> >> > > > > >
> >> > > > > > > I would like to tune at least forum before we release 4.0 to
> >> > > > > > > prepare for a higher load. Tuning of www depends on a
> >> > > > > > > discussion with infra, because it is a shared service.
> >> > > > > > >
> >> > > > > > >
> >> > > > > > If it is like the release of 3.4.0 and 3.4.1 we'll get a few hot
> >> > > > > > pages:
> >> > > > > >
> >> > > > > > 1) The announcement blog post.  With both 3.4.0 and 3.4.1
> >> > > > > > releases these were heavily linked to by the press.  Roller went
> >> > > > > > down in the 3.4.1 case.  I don't know if that was due to the
> >> > > > > > traffic or not.  But we might want to plan on having the
> >> > > > > > announcement post be a static webpage just to be sure.
> >> > > > > >
> >> > > > >
> >> > > > > The blog post is on the general apache server and not our www, is
> >> > > > > that correct?
> >> > > > >
> >> > > > >
> >> > > > Correct.
> >> > > >
> >> > > >
> >> > > > > >
> >> > > > > > 2) The homepage and http://www.openoffice.org/download page will
> >> > > > > > be hot.  You can see from the download chart the boost we get
> >> > > > > > when a new release occurs:
> >> > > > > > http://www.openoffice.org/stats/downloads.html  (AOO 3.4.0 was
> >> > > > > > May 8th 2012 and AOO 3.4.1 was Aug 23rd).
> >> > > > > >
> >> > > > >
> >> > > > > That is a really obvious case for trafficserver, and until it is
> >> > > > > in place I can ask to have the httpd use memcached (which it
> >> > > > > might be already). Currently the www is on a general apache
> >> > > > > server, so I need strong arguments to make changes (high load is
> >> > > > > a strong argument).
> >> > > > >
> >> > > > >
> >> > > > > >
> >> > > > > >
> >> > > > > >
> >> > > > > >
> >> > > > > > > Translate-vm also needs tuning, but that is in progress, and
> >> > > > > > > infra has been very kind and prepared an extra vm, so we/I
> >> > > > > > > can install/tune without affecting the active system.
> >> > > > > > >
> >> > > > > > > thx in advance for figures.
> >> > > > > > > rgds
> >> > > > > > > jan I.
> >> > > > > >
> >> > > > >
> >> > > > > THANKS a lot for your answer, which helps me a lot in doing the
> >> > > > > calculations. Sorry for being frank about the forum, but it
> >> > > > > really makes me nervous that we have a service where part of it
> >> > > > > is unknown.
> >> > > > >
> >> > > > > rgds
> >> > > > > Jan I
> >> > > > >
> >> > > > > > >
> >> > > > > >
> >> > > > >
> >> > > >
> >> > >
> >> >
> >
> >
>
>
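
For the httpd logs mentioned in the thread, the two figures asked for (average requests per day and the peak within one hour) could be approximated with a small script. This is only a sketch, not the tool anyone in the thread used: it assumes the logs are in Apache common/combined format with English month abbreviations, the function name traffic_summary is made up for illustration, and it counts raw HTTP requests, which will be higher than the GA "page" counts because each page pulls in several files.

```python
import re
from collections import Counter
from datetime import datetime

# Timestamp field of an Apache common/combined log line, e.g.
# [06/Apr/2013:13:37:00 +0000]  (assumed log format)
TIMESTAMP = re.compile(
    r"\[(\d{2}/[A-Za-z]{3}/\d{4}:\d{2}:\d{2}:\d{2}) [+-]\d{4}\]"
)


def traffic_summary(lines):
    """Return (average requests per day, peak requests in one clock hour).

    Hypothetical helper: counts every request line, not page views.
    """
    per_day = Counter()
    per_hour = Counter()
    for line in lines:
        match = TIMESTAMP.search(line)
        if match is None:
            continue  # skip lines that do not carry the expected timestamp
        # %b parses English month names in the default C locale
        stamp = datetime.strptime(match.group(1), "%d/%b/%Y:%H:%M:%S")
        per_day[stamp.date()] += 1
        per_hour[(stamp.date(), stamp.hour)] += 1
    if not per_day:
        return 0.0, 0
    average_per_day = sum(per_day.values()) / len(per_day)
    return average_per_day, max(per_hour.values())


if __name__ == "__main__":
    import sys

    # Usage: python traffic_summary.py < access.log
    average, peak = traffic_summary(sys.stdin)
    print(f"average requests/day: {average:.0f}")
    print(f"peak requests/hour:   {peak}")
```

The hourly bucket is a fixed clock hour rather than a sliding 60-minute window, which is simpler but can understate a peak that straddles two hours.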
