[Dspace-general] [Possible Spam]::Re: Statistics Issues and DSpace
Angelo Miranda
amiranda at reitoria.uminho.pt
Wed Feb 28 06:30:41 EST 2007
Hi,
Just some additional info:
The stats_detect_spider procedure scans the Apache access_log and stores a
pointer (date and time). The next time the procedure is run, it starts
scanning from that point.
As access_log gets bigger, the procedure takes more time to run. When this
happens, you can start a new Apache access_log and clear the pointer
(stats_control). That way, stats-detect-spider will start scanning
from the beginning of access_log again.
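To illustrate the pointer idea, here is a minimal Python sketch. It is not the add-on's actual code: the function name scan_since, the in-memory pointer, and the assumption that access_log uses Apache's common/combined timestamp format are all hypothetical and only meant to show the technique.

```python
import re
from datetime import datetime

# Timestamp as it appears in Apache's common/combined log format,
# e.g. [28/Feb/2007:06:30:41 -0500]. Assumed format, not taken from the add-on.
TIMESTAMP_RE = re.compile(r"\[([^\]]+)\]")
TIMESTAMP_FORMAT = "%d/%b/%Y:%H:%M:%S %z"


def scan_since(log_lines, pointer=None):
    """Return (new_lines, new_pointer), skipping entries at or before pointer.

    pointer is a timezone-aware datetime, or None to scan from the beginning
    (the situation after the stored pointer has been cleared).
    """
    new_lines = []
    latest = pointer
    for line in log_lines:
        match = TIMESTAMP_RE.search(line)
        if match is None:
            continue  # not a recognisable access_log entry
        try:
            stamp = datetime.strptime(match.group(1), TIMESTAMP_FORMAT)
        except ValueError:
            continue  # bracketed text that is not a timestamp
        if pointer is not None and stamp <= pointer:
            continue  # already handled by an earlier run
        new_lines.append(line)
        if latest is None or stamp > latest:
            latest = stamp
    return new_lines, latest
```

In this sketch every line is still read and compared against the pointer, so the run time grows with the size of the log. Passing pointer=None corresponds to clearing the pointer after starting a new access_log.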
Thank you
Angelo Miranda
-----Original Message-----
From: Bjorn Skobba [mailto:bjorn.skobba at brunel.ac.uk]
Sent: Wednesday, 28 February 2007 8:53
To: Angelo Miranda
Cc: 'John Murtagh'; dspace-general at mit.edu
Subject: Re: [Possible Spam]::Re: [Dspace-general] Statistics Issues
and DSpace
Hi Angelo,
Thanks for your reply.
Is the mechanism for excluding crawlers, etc. the stats-detect-spiders
script?
Can you tell me a little bit about it - is it supposed to be scheduled
from cron (and how often), how does it work, what does it do, etc.?
Many thanks for your help.
Bjorn Skobba
Brunel University
On Tue, 2007-02-27 at 16:45 +0000, Angelo Miranda wrote:
> Hi,
>
>
>
> I am from the RepositoriUM team at the University of Minho.
>
> I am not sure if I understood your question.
>
> Are you asking if the statistics add-on excludes the views and
> downloads from crawlers, robots and similar devices?
>
> If that's the question, the answer is yes. The add-on has a mechanism
> for ignoring those accesses.
>
>
>
> Thank You
>
> Angelo Miranda
>
>
>
> -----Original Message-----
> From: dspace-general-bounces at mit.edu
> [mailto:dspace-general-bounces at mit.edu] On Behalf Of John Murtagh
> Sent: Tuesday, 27 February 2007 12:08
> To: dspace-general at mit.edu
> Subject: [Dspace-general] Statistics Issues and DSpace
>
>
>
> Hello all,
>
> The use of statistics for DSpace is an important part of our strategy
> to increase deposits and we wish to protect the integrity of that
> information.
>
> A quick question about the statistics element of DSpace, namely the
> add-on provided by Universidade do Minho
> <http://www.ecs.soton.ac.uk/~harnad/Hypermail/Amsci/6086.html>
>
> I wondered if there had been any issues or concerns in the collection
> and processing of downloads and views for items on DSpace? This could
> be in the form of retrieval robots, OAI harvesters, Google or the
> manipulation of statistics.
>
> Anyone got any news or info they'd like to share?
>
> Thanks in advance
>
>
> John Murtagh
> ________________________________________________
>
>
> Website: http://bura.brunel.ac.uk
>
>
>
> John Murtagh
> Project Manager - Brunel University Research Archive
> Brunel Library
> Kingston Road
> Uxbridge
> UB8 3PH
>
> Tel: 0189 526 5417
> Fax: 01895269741
> E-mail: john.murtagh at brunel.ac.uk
>
>
> _______________________________________________
> Dspace-general mailing list
> Dspace-general at mit.edu
> http://mailman.mit.edu/mailman/listinfo/dspace-general