On Mon, Apr 8, 2013 at 2:04 PM, Rob Weir <robw...@apache.org> wrote:
>
>
>
> On Sat, Apr 6, 2013 at 6:37 PM, janI <j...@apache.org> wrote:
>>
>> On 7 April 2013 00:16, Rob Weir <robw...@apache.org> wrote:
>>
>> > On Sat, Apr 6, 2013 at 5:19 PM, janI <j...@apache.org> wrote:
>> >
>> > > On 6 April 2013 22:21, Rob Weir <robw...@apache.org> wrote:
>> > >
>> > > > On Sat, Apr 6, 2013 at 12:37 PM, janI <j...@apache.org> wrote:
>> > > >
>> > > > > On 6 April 2013 13:37, Rob Weir <robw...@apache.org> wrote:
>> > > > >
>> > > > > > On Sat, Apr 6, 2013 at 5:51 AM, janI <j...@apache.org> wrote:
>> > > > > >
>> > > > > > > Hi.
>> > > > > > >
>> > > > > > > Now that our wiki (mwiki) runs smoothly and performs quite
>> > > > > > > well, it is slowly time to take a look at our other services.
>> > > > > > >
>> > > > > > > In order to determine what should/can be done, I need some
>> > > > > > > figures.
>> > > > > > >
>> > > > > > > Can anybody help me with traffic figures for
>> > > > > > >    wiki.o.o, forum.o.o (total for the vm, not per language) and
>> > > > > > >    www.o.o
>> > > > > > >
>> > > > > > > Do end users download directly from www.o.o or from a
>> > > > > > > secondary site?
>> > > > > > >
>> > > > > > > I need the following numbers (if possible):
>> > > > > > >    - daily number of clicks (average)
>> > > > > > >    - peak number of clicks within a short period like 1 hour
>> > > > > > >
>> > > > > > >
>> > > > > >
>> > > > > > From Google Analytics we have:
>> > > > > >
>> > > > > > For wiki:  around 33K page views/day
>> > > > > >
>> > > > > > For forums, I have no idea.  They are using a Google Analytics
>> > > > > > account (UA-1087265-8) that is not connected with the rest of
>> > > > > > the website.  If that could be updated to match the rest of the
>> > > > > > website we'd have typical numbers in a week.
>> > > > > >
>> > > > >
>> > > > > Thanks for your answers, but I am a bit scared that you (as a PMC
>> > > > > member) tell me "they" use an account unknown to us. Who has the
>> > > > > forum account?
>> > > > >
>> > > > > I have the technical power to change it (sudo access), but if a
>> > > > > PMC member tells me we do not know who owns it, it would not be
>> > > > > right just to change it. Who but the "owner" can authorize a
>> > > > > change?
>> > > > >
>> > > > > I trust this is a misunderstanding somewhere; it cannot be that we
>> > > > > run a service that we do not control, in the sense of change
>> > > > > management.
>> > > > >
>> > > > >
>> > > > I assume this is just a legacy account from Sun days.  The same GA
>> > > > code was used for other OOo-associated web services as well.
>> > > >
>> > > > I raised this issue a year ago on this mailing list:
>> > > >
>> > > > http://markmail.org/thread/mtknhgrwjkajj2ux
>> > > >
>> > > > I'm assuming this is just a "lack of maintenance" issue and we can
>> > > > add it to the list of other things that need attention.
>> > > >
>> > >
>> > > Ok, I just wanted to be sure. When I come around to it, I will find
>> > > the header page in the forum and change it. Sadly, the setup of the
>> > > forum vm is not "a piece of cake"; it seems to have gone through quite
>> > > a lot of changes (but it works and that is important).
>> > >
>> > >
>> > > >
>> > > > > > For the website as a whole (all subdomains tracked) it is around
>> > > > > > 730K/day for a peak day.  If you subtract out the forum traffic
>> > > > > > and the downloads (which are serviced by SourceForge) then the
>> > > > > > static website is probably more like 600K/day visits.
>> > > > > >
>> > > > >
>> > > > > Why do I need to subtract forum? I thought they ran as a separate
>> > > > > dns. Do we not count on the basis of our template (that loads the
>> > > > > GA account)?
>> > > > >
>> > > > >
>> > > >
>> > > > Sorry, I meant "subtract out the wiki" traffic.  GA reports the
>> > > > total traffic for all tracked domains, and then allows filtering by
>> > > > URL pattern.  So total traffic minus the /wiki/ traffic minus the
>> > > > download traffic gives us the static website traffic.
>> > > >
>> > >
>> > > ok, got it.
>> > >
>> > > >
>> > > >
>> > > > >
>> > > > > >
>> > > > > > So that indirectly answers your other question -- the big
>> > > > > > download files are from the SourceForge distribution network.
>> > > > > > But we do have smaller downloads, of documentation PDFs.  These
>> > > > > > are mainly from the wiki.
>> > > > > >
>> > > > > > In terms of peaks, our biggest day is usually Wednesday, lowest
>> > > > > > on Sunday.  The variation there is around 200K visits.
>> > > > > >
>> > > > >
>> > > > > Super just what I need.
>> > > > >
>> > > > >
>> > > > > >
>> > > > > > Hourly peak is around 15K visits for all tracked subdomains
>> > > > > > together.
>> > > > > >
>> > > > >
>> > > > > Do you know the division between mwiki, forum and www ?
>> > > > >
>> > > > >
>> > > > I can't seem to filter on the GA screen that gives hourly detail, so
>> > > > I only see the count cross-subdomain.
>> > > >
>> > > >
>> > > > >
>> > > > > > Note:  for all of these I'm talking about "pages", where a
>> > > > > > single page could include several HTTP calls, for the HTML, the
>> > > > > > JavaScript, images, etc.  Since anyone with JavaScript not
>> > > > > > enabled would not register, the actual numbers are probably a
>> > > > > > little bit higher than this.
>> > > > > >
>> > > > >
>> > > > > Good point, that is very important for a traffic server, where
>> > > > > e.g. JavaScript files would be cached.
>> > > > >
>> > > > >
>> > > > We might be able to get the httpd logs from Infra, in which case we
>> > > > could do more traditional traffic stats
>> > >
>> > > I have the httpd logs for forum and mwiki, but I am missing a tool to
>> > > produce proper statistics. I have asked in the httpd channel, but no
>> > > luck.
>> > >
>> > >
>> > I use a commercial tool called WebLog Expert.  I'm happy to run a report
>> > on the logs if they are accessible.
>> >
>>
>> Thx, I will see if the logs can be brought down to a size where I can
>> transport them.
>>
>
> I'm happy to scp them as well, whatever works for you.
>
>>
>> Forum is now changed to our main GA account, and I changed all the
>> language forums as well. Can you please check that we get GA data
>> (I assume you have to wait 24 hours or something)?
>>
>
> Data is starting to come through, 52169 page views yesterday, across
> languages.
>
>
>>
>> Let's see in a week or so if my assumption is correct (that mwiki and
>> forum have the same usage pattern).
>>
>

We've had a couple of weeks of GA on the Forums, so here is what I'm seeing:

Forums: Peak is 72K pages per day, low 42K per day (weekend)

Wiki: Peak is 32K pages per day, low 20K per day

So it looks like roughly a 2:1 ratio of forum to wiki page views.
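
As a quick sanity check on that ratio, here is a small Python sketch that uses only the four page-view figures above. The variable names are just for illustration and are not taken from GA or any tool.

```python
# Daily page views from Google Analytics, as quoted above.
forum = {"peak": 72_000, "low": 42_000}
wiki = {"peak": 32_000, "low": 20_000}

# Forum-to-wiki ratio for the peak day and for the low (weekend) day.
ratios = {day: forum[day] / wiki[day] for day in forum}

for day, ratio in ratios.items():
    print(f"{day}: forum/wiki = {ratio:.2f}")
```

Both the peak-day and the low-day ratios come out slightly above 2, so "2:1" is a reasonable rule of thumb for sizing the forum relative to the wiki.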

-Rob


>
> Yes.
>
>
>>
>> As a side remark, I can really understand the complaints there have been
>> about the problems with the forum. While I did my part, apache ran
>> constantly with more work tasks than configured mysql connections, and
>> the vm was really busy (which is good, since it means we have active
>> users).
>>
>
> I know what you mean.  I used to do performance work for IBM Websphere
> Portal products.  80% of it was tuning the various thread pools and work
> queues.  10% of it was DB tuning.  The other 10% was black magic.
>
>
> Regards,
>
> -Rob
>
>
>>
>> Have a nice day/evening.
>>
>> rgds
>> jan I.
>>
>> >
>> > -Rob
>> >
>> >
>> >
>> > >
>> > > > .
>> > > >
>> > > >
>> > > >
>> > > > > I am trying to convince Infra that we should have a common
>> > > > > trafficserver for wiki and forum (and maybe www). Currently we use
>> > > > > eresbus as proxy, and all packets are forwarded to the
>> > > > > ooo-wiki2-vm where that trafficserver filters cached pages. Forum
>> > > > > goes direct to httpd, so every packet causes mysql activity. If we
>> > > > > had a trafficserver in eresbus, we would save loads of network
>> > > > > traffic for wiki2, and for forum both network traffic and mysql
>> > > > > activity.
>> > > > >
>> > > > >
>> > > > > >
>> > > > > > In any case, one approach would be to enable GA for the Forums,
>> > > > > > track that for a week or two and then compare that to the Wiki
>> > > > > > numbers.  If the stacks are similar, then we could think of the
>> > > > > > traffic ratio between the two and tune based on that.
>> > > > > >
>> > > > >
>> > > > > I agree, but not being PMC, I do not feel I am in the position to
>> > > > > "just" make that change. Who "owns" the forum? They should
>> > > > > authorize such a change. I can make the change (provided you give
>> > > > > me the GA line to insert) when I am sure I am not changing
>> > > > > something that someone else depends on.
>> > > > >
>> > > > >
>> > > > I've certainly raised the issue on the list previously and there
>> > > > were no objections.   If you have access let me know offline and I
>> > > > can give you the codes.
>> > > >
>> > > > Note that this root page is not in the Apache CMS.  It doesn't look
>> > > > phpBB-related so there might be a static page out there on that host
>> > > > that would need to be updated manually:  http://forum.openoffice.org/
>> > > >
>> > >
>> > > thx for the info.
>> > >
>> > > rgds
>> > > Jan I.
>> > >
>> > > >
>> > > >
>> > > > > >
>> > > > > >
>> > > > > > > I would like to tune at least forum before we release 4.0 to
>> > > > > > > prepare for a higher load. Tuning of www depends on a
>> > > > > > > discussion with infra, because it is a shared service.
>> > > > > > >
>> > > > > > >
>> > > > > > If it is like the release of 3.4.0 and 3.4.1 we'll get a few hot
>> > > > > > pages:
>> > > > > >
>> > > > > > 1) The announcement blog post.  With both 3.4.0 and 3.4.1
>> > > > > > releases these were heavily linked to by the press.  Roller went
>> > > > > > down in the 3.4.1 case.  I don't know if that was due to the
>> > > > > > traffic or not.  But we might want to plan on having the
>> > > > > > announcement post be a static webpage just to be sure.
>> > > > > >
>> > > > >
>> > > > > The blog post is on the general apache server and not our www, is
>> > > > > that correct ?
>> > > > >
>> > > > >
>> > > > Correct.
>> > > >
>> > > >
>> > > > > >
>> > > > > > 2) The homepage and http://www.openoffice.org/download page will
>> > > > > > be hot.  You can see from the download chart the boost we get
>> > > > > > when a new release occurs:
>> > > > > > http://www.openoffice.org/stats/downloads.html  (AOO 3.4.0 was
>> > > > > > May 8th 2012 and AOO 3.4.1 was Aug 23rd).
>> > > > > >
>> > > > >
>> > > > > That is a real obvious case for trafficserver, and until it is in
>> > > > > place I can ask to have the httpd use memcached (which it might be
>> > > > > already).  Currently the www is on a general apache server, so I
>> > > > > need strong arguments to make changes (high load is a strong
>> > > > > argument).
>> > > > >
>> > > > >
>> > > > > >
>> > > > > >
>> > > > > >
>> > > > > >
>> > > > > > > Translate-vm also needs tuning, but that is in progress, and
>> > > > > > > infra has been very kind and prepared an extra vm, so we/I can
>> > > > > > > install/tune without affecting the active system.
>> > > > > > >
>> > > > > > > thx in advance for figures.
>> > > > > > > rgds
>> > > > > > > jan I.
>> > > > > >
>> > > > >
>> > > > > THANKS a lot for your answer, which helps me a lot in doing the
>> > > > > calculations. Sorry for being frank about the forum, but it
>> > > > > really makes me nervous that we have a service where part of it
>> > > > > is unknown.
>> > > > >
>> > > > > rgds
>> > > > > Jan I
>> > > > >
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>> >
>
>
