httpd-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Rainer Jung <rainer.j...@kippdata.de>
Subject Re: [users@httpd] Missing log entries?
Date Fri, 03 Apr 2015 05:17:44 GMT
Am 03.04.2015 um 03:41 schrieb Andrew Moise:
>    Hi Richard.
>    Okay, that was exactly my question -- yes, I'm pretty confident that
> that's what's happening (the page is not completely being served and
> the transfer is aborted, even though from the user's perspective the
> page is present and they can interact with it e.g. click off onto
> another page). I just didn't know that it didn't go into the log in
> that case; I assumed that it would go in with a count of the bytes
> that were successfully sent.
>    That leads me to my next question: Assuming that I've understood
> this correctly, is there any accurate way to see what apache is
> serving (even partially sent pages)? This is forming a significant
> part of my traffic (i.e. apache log analysis only shows a fraction of
> the visitors that google analytics tells me are present).
>    Thanks,
>    Andrew

mod_log_forensic?

Regards,

Rainer

> On Thu, Apr 2, 2015 at 8:29 PM, Richard
> <lists-apache@listmail.innovate.net> wrote:
>> If the log entry for the image has a "referer" for a site page, then
>> it is unlikely that the image is being served from another page/site
>> (unless there is referer-based access control and someone *really*
>> wants the image(s), so is mucking the referer).
>>
>> Apache doesn't write the log entry until the page has been delivered
>> (otherwise it wouldn't be able to give you the number of bytes
>> served). If your page(s) have something that keeps them from being
>> fully served (js, and ad, etc.) then the user may click off the page
>> before everything closes out and the entry is logged. I.e., this
>> wouldn't require the user to explicitly abort the page delivery,
>> they may just be doing it implicitly if the page has delivery issues.
>>
>> You should be able to check this fairly easily yourself.
>>
>>
>>
>> ------------ Original Message ------------
>>> Date: Thursday, April 02, 2015 20:00:00 -0400
>>> From: 5k Kate <5k.kate.1114@gmail.com>
>>>
>>> Is it possible that the image links are being shared separately
>>> from the page? Theoretically someone could go and post your image
>>> into another page.
>>>
>>> -Kate
>>>
>>> On Wed, Apr 1, 2015 at 1:09 PM, Andrew Moise
>>> <andrew.moise@gmail.com> wrote:
>>>
>>>>    Hello all.
>>>>    I'm trying to analyze the behavior of visitors to my web site by
>>>> parsing my Apache access log, and it seems that the log is missing
>>>> some entries. For example, it's happened before that there's an
>>>> entry in the log for serving an image, with a referrer set to one
>>>> of my HTML pages, but there is no entry in the log for that HTML
>>>> page (either before or after the entry for the image loading)
>>>> from any similar IP address. I don't think that the headers I'm
>>>> emitting would cause the browser to cache that particular HTML
>>>> page.
>>>>    Furthermore (and more to the point of why I'm concerned about
>>>>    this), parsing the apache log indicates a small fraction (maybe
>>>> 25%) of the number of visitors indicated by a third party logging
>>>> tool (in this case clicky.com).
>>>>    Is there anything that could cause this? For example, does
>>>>    apache not put an entry in the log if the connection is
>>>> interrupted before the entire HTML file is emitted?
>>>>    Thanks in advance for your help.
>>>>    -Andrew

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@httpd.apache.org
For additional commands, e-mail: users-help@httpd.apache.org


Mime
View raw message