openoffice-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From janI <j...@apache.org>
Subject Re: wiki, forum, www performance upgrade.
Date Mon, 08 Apr 2013 18:36:18 GMT
On 8 April 2013 20:04, Rob Weir <robweir@apache.org> wrote:

> On Sat, Apr 6, 2013 at 6:37 PM, janI <jani@apache.org> wrote:
>
> > On 7 April 2013 00:16, Rob Weir <robweir@apache.org> wrote:
> >
> > > On Sat, Apr 6, 2013 at 5:19 PM, janI <jani@apache.org> wrote:
> > >
> > > > On 6 April 2013 22:21, Rob Weir <robweir@apache.org> wrote:
> > > >
> > > > > On Sat, Apr 6, 2013 at 12:37 PM, janI <jani@apache.org> wrote:
> > > > >
> > > > > > On 6 April 2013 13:37, Rob Weir <robweir@apache.org> wrote:
> > > > > >
> > > > > > > On Sat, Apr 6, 2013 at 5:51 AM, janI <jani@apache.org>
wrote:
> > > > > > >
> > > > > > > > Hi.
> > > > > > > >
> > > > > > > > Now that our wiki (mwiki) runs smoothly and performs
quite
> > well,
> > > it
> > > > > is
> > > > > > > > slowly time to take a look at our other services.
> > > > > > > >
> > > > > > > > In order to determine what should/can be done, I need
some
> > > figures.
> > > > > > > >
> > > > > > > > Can anybody help me with traffic figures for
> > > > > > > >    wiki.o.o, forum.o.o (total for the vm not per language)
> and
> > > > > www.o.o
> > > > > > > >
> > > > > > > > Does end-users download directly from www.o.o or from
a
> > secondary
> > > > > site
> > > > > > ?
> > > > > > > >
> > > > > > > > I need the following numbers (if possible):
> > > > > > > >    - daily number of clicks (average)
> > > > > > > >    - peak number of clicks within a short period like
1 hour
> > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > > > From Google Analytics we have:
> > > > > > >
> > > > > > > For wiki:  around 33K page views/day
> > > > > > >
> > > > > > > For forums, I have no idea.  They are using a Google Analytics
> > > > account
> > > > > (
> > > > > > > UA-1087265-8) that is not connected with the rest of the
> website.
> > >  If
> > > > > > that
> > > > > > > could be updated to match the rest of the website we'd
have
> > typical
> > > > > > numbers
> > > > > > > in a week.
> > > > > > >
> > > > > >
> > > > > > Thanks for your answers, but I am a bit scared you (as a PMC)
> tell
> > me
> > > > > > "they" use an account unknown to us. Who has the forum account
?
> > > > > >
> > > > > > I have the technical power to change it (sudo access) but if
a
> PMC
> > > tell
> > > > > we
> > > > > > do no know who owns it, it would not be right just to change
it.
> > Who
> > > > but
> > > > > > the "owner" can authorize a change ?
> > > > > >
> > > > > > I trust this is a misunderstanding somewhere, it cannot be that
> we
> > > run
> > > > a
> > > > > > service, that we do not control, in the sense of change
> management.
> > > > > >
> > > > > >
> > > > > I assume this is just a legacy account from Sun days.  The same GA
> > code
> > > > was
> > > > > used for other OOo-associated web services as well.
> > > > >
> > > > > I raised this issue a year ago on this mailing list:
> > > > >
> > > > > http://markmail.org/thread/mtknhgrwjkajj2ux
> > > > >
> > > > > I'm assuming this is just a "lack of maintenance issue" and we can
> > add
> > > it
> > > > > to the list of other things that need attention.
> > > > >
> > > >
> > > > Ok, I just wanted to be sure, when I come around to it, I will find
> the
> > > > header page in forum and change. Sadly enough the setup of the forum
> vm
> > > is
> > > > not "a piece of cake", it seems to have gone through quite a lot of
> > > changes
> > > > (but it works and that is important).
> > > >
> > > >
> > > > >
> > > > > > For the website as a whole (all subdomains tracked) it is around
> > > > 730K/day
> > > > > > > for a peak day.  If you subtract out the forum traffic
and the
> > > > > downloads
> > > > > > > (which serviced by SourceForge) then the static website
is
> > probably
> > > > > more
> > > > > > > like 600K/day visits.
> > > > > > >
> > > > > >
> > > > > > Why do I need to subtract forum, I thought they ran as a separate
> > > dns ?
> > > > > do
> > > > > > we not count on basis of our template (that loads the GA
> account).
> > > > > >
> > > > > >
> > > > >
> > > > > Sorry, I meant "subtract out the wiki" traffic.  GA reports the
> total
> > > > > traffic for all tracked domains, and then allows filtering by URL
> > > > pattern.
> > > > > So total traffic minus the /wiki/ traffic minus the download
> traffic
> > > > gives
> > > > > us the static website traffic.
> > > > >
> > > >
> > > > ok, got it.
> > > >
> > > > >
> > > > >
> > > > > >
> > > > > > >
> > > > > > > So that indirectly answers your other question -- the big
> > download
> > > > > files
> > > > > > > are from the SourceForge distribution network.  But we
do have
> > > > smaller
> > > > > > > downloads, of documentation PDF's.  These are mainly from
the
> > wiki.
> > > > > > >
> > > > > > > In terms of peaks, our biggest day usually Wednesday, lowest
on
> > > > Sunday.
> > > > > > > The variation there around 200K/visits.
> > > > > > >
> > > > > >
> > > > > > Super just what I need.
> > > > > >
> > > > > >
> > > > > > >
> > > > > > > Hourly peak is around 15K visits for all tracked subdomains
> > > together.
> > > > > > >
> > > > > >
> > > > > > Do you know the division between mwiki, forum and www ?
> > > > > >
> > > > > >
> > > > > I can't seem to filter on the GA screen that gives hourly detail,
> so
> > I
> > > > only
> > > > > see the count cross-subdomain.
> > > > >
> > > > >
> > > > > >
> > > > > > > Note:  for all of these I'm talking about "pages", where
a
> single
> > > > page
> > > > > > > could include several HTTP calls, for the HTML, the Javascript,
> > > > images,
> > > > > > > etc.  Since anyone with Javascript not enabled would not
> > register,
> > > > the
> > > > > > > actual numbers are probably a little bit higher than this.
> > > > > > >
> > > > > >
> > > > > > Good point, that is very important for a a traffic server, where
> > e.g.
> > > > > > javascripts would be cached.
> > > > > >
> > > > > >
> > > > > We might be able to get the httpd logs from Infra, in which case
we
> > > could
> > > > > do more traditional traffic stats
> > > >
> > > > I have the httpd logs for forum and mwiki, but I miss a tool to make
> > > proper
> > > > statistic. I have asked in the httpd channel, but no luck.
> > > >
> > > >
> > > I use a commercial tool called WebLog Expert.  I'm happy to run a
> report
> > on
> > > the logs if they are accessible.
> > >
> >
> > Thx, I will see if the log can come down to a size where I can transport
> > them.
> >
> >
> I'm happy to scp them as well, whatever works for you.
>
>
> > Forum is now changed to our main GA account, and I changed all the
> language
> > forums as well, can you please check that we get GA data (I assume you
> have
> > to wait 24 hours or something).
> >
> >
> Data is starting to come through, 52169 page views yesterday, across
> languages.
>
>
>
> > Lets see in a week or so, if my assumption is correct (that mwiki and
> forum
> > have the same usage pattern).
> >
> >
>
> Yes.
>
>
>
> > As a side remark, I can really understand the complains there have been
> > about the problems with the forum. While I did my part, apache ran
> constant
> > with more work tasks, than configured mysql connections and the vm was
> real
> > busy (which is good, since it means we have active users).
> >
> >
> I know what you mean.  I used to do performance work for IBM Websphere
> Portal products.  80% of it was tuning the various thread pools and work
> queues.  10% of it was DB tuning.  The other 10% was black magic.
>

Do I hear a new volunteer...could for sure use a hand.

The 10% black magic is exactly the difference between reading the manual
(like most do) and really doing the job.

Is anyone considering giving wiki/www/forum a facelift parallel to the
launch of 4.0. We should really make them more look alike and  a lot more
moden. If we look at other similar products they have much more modern
looks.

thx for the numbers.

rgds
jan I.



> Regards,
>
> -Rob
>
>
>
> > Have a nice day/evening.
> >
> > rgds
> > jan I.
> >
> > >
> > > -Rob
> > >
> > >
> > >
> > > >
> > > > > .
> > > > >
> > > > >
> > > > >
> > > > > > I am trying to convince Infra, that we should have a common
> > > > trafficserver
> > > > > > for wiki and forum (and maybe www). Currently we use eresbus
as
> > > proxy,
> > > > > and
> > > > > > all packets are forwarded to the ooo-wiki2-vm where that
> > > trafficserver
> > > > > > filters cached pages. Forum goes direct to httpd, so every packet
> > > > causes
> > > > > > mysql activity. If we had a trafficserver in eresbus, we would
> for
> > > > wiki2
> > > > > > save loads of network traffic and for forum network traffic
as
> well
> > > as
> > > > > > mysql activity.
> > > > > >
> > > > > >
> > > > > > >
> > > > > > > In any case, one approach would be to enable GA for the
Forums,
> > > track
> > > > > > that
> > > > > > > for a week or two and then compare that to the Wiki numbers.
>  If
> > > the
> > > > > > stacks
> > > > > > > are similar, then we could think of the traffic ratio between
> the
> > > two
> > > > > and
> > > > > > > tune based on that.
> > > > > > >
> > > > > >
> > > > > > I agree, but not being PMC, I do not feel I am in the position
to
> > > > "just"
> > > > > > make that change. Who "owns" the forum, they should authorize
> such
> > a
> > > > > > change. I can make the change (provided you tell give me the
GA
> > line
> > > to
> > > > > > insert) when I am sure I am not doing a change that someone
else
> > > depend
> > > > > on.
> > > > > >
> > > > > >
> > > > > I've certainly raised the issue on the list previously and there
> were
> > > no
> > > > > objections.   If you have access let me know offline and I can give
> > you
> > > > the
> > > > > codes.
> > > > >
> > > > > Note that this root page is not in the Apache CMS.  It doesn't look
> > > > > phpBB-related so there might be a static page out there on that
> host
> > > that
> > > > > would need to be updated manually:  http://forum.openoffice.org/
> > > > >
> > > >
> > > > thx for the info.
> > > >
> > > > rgds
> > > > Jan I.
> > > >
> > > > >
> > > > >
> > > > > > >
> > > > > > >
> > > > > > > > I would like to tune at least forum before we release
4.0 to
> > > > prepare
> > > > > > for
> > > > > > > a
> > > > > > > > higher load. Tuning of www depend on a discussion
with infra,
> > > > because
> > > > > > it
> > > > > > > is
> > > > > > > > a shared service.
> > > > > > > >
> > > > > > > >
> > > > > > > If it is like the release of 3.4.0 and 3.4.1 we'll get
a few
> hot
> > > > pages:
> > > > > > >
> > > > > > > 1) The announcement blog post.  With both 3.4.0 and 3.4.1
> > releases
> > > > > these
> > > > > > > were heavily linked to by the press.  Roller went down
in the
> > 3.4.1
> > > > > case.
> > > > > > > I don't know if that was due to the traffic or not.  But
we
> might
> > > > want
> > > > > to
> > > > > > > plan on having the announcement post be a static webpage
just
> to
> > be
> > > > > sure.
> > > > > > >
> > > > > >
> > > > > > The blog post is on the general apache server and not our www,
is
> > > that
> > > > > > correct ?
> > > > > >
> > > > > >
> > > > > Correct.
> > > > >
> > > > >
> > > > > > >
> > > > > > > 2) The homepage and http://www.openoffice.org/download
page
> will
> > > be
> > > > > hot.
> > > > > > > You can see from the download chart the boost we get when
a new
> > > > release
> > > > > > > occurs:  http://www.openoffice.org/stats/downloads.html
 (AOO
> > > 3.4.0
> > > > > was
> > > > > > > May
> > > > > > > 8th 2012 and AOO 3.4.1 was Aug 23rd).
> > > > > > >
> > > > > >
> > > > > > That is a real obvious case for trafficserver, and until it
is in
> > > > place I
> > > > > > can ask to have the httpd use memcached (which it might be
> > already).
> > > > > > Currently the www is on a general apache server, so I need strong
> > > > > arguments
> > > > > > to make changes (high load is a strong argument)
> > > > > >
> > > > > >
> > > > > > >
> > > > > > >
> > > > > > >
> > > > > > >
> > > > > > > > Translate-vm also needs tunings, but that is in progress,
and
> > > infra
> > > > > has
> > > > > > > > been very kind and prepared an extra vm, so we/I install/tune
> > > > without
> > > > > > > > affecting the active system.
> > > > > > > >
> > > > > > > > thx in advance for figures.
> > > > > > > > rgds
> > > > > > > > jan I.
> > > > > > >
> > > > > >
> > > > > > THANKS a lot for your answer, which help me a lot doing calc..
> > Sorry
> > > > for
> > > > > > being frank about the forum, but it really make me nervous that
> we
> > > > have a
> > > > > > service where part it unknown.
> > > > > >
> > > > > > rgds
> > > > > > Jan I
> > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message