[Dspace-general] [Possible Spam]::Re: Statistics Issues and DSpace

Angelo Miranda amiranda at reitoria.uminho.pt
Wed Feb 28 06:30:41 EST 2007


Hi,

Just some additional info:

The stats_detect_spider procedure scans the Apache access_log and stores a
pointer (date and time). The next time the procedure runs, it starts scanning
from that point.
As access_log gets bigger, the procedure takes longer to run. When this
happens, you can start a new Apache access_log and clear the pointer
(stats_control). The stats-detect-spider will then start scanning from the
beginning of access_log again.
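
To illustrate the idea, here is a minimal Python sketch of a pointer-based
scan. It is not the add-on's actual code: the function names, the plain-text
pointer file (standing in for stats_control) and the timestamp pattern are
assumptions made only for this example.

```python
import re
from datetime import datetime
from pathlib import Path

# Timestamp field of an Apache access log line, e.g. [28/Feb/2007:06:30:41 -0500]
TIMESTAMP_RE = re.compile(r"\[(\d{2}/\w{3}/\d{4}:\d{2}:\d{2}:\d{2} [+-]\d{4})\]")
TIMESTAMP_FORMAT = "%d/%b/%Y:%H:%M:%S %z"


def scan_new_entries(log_path, pointer_path):
    """Return log lines newer than the stored pointer, then advance the pointer.

    The pointer file is a hypothetical stand-in for stats_control.
    """
    pointer_file = Path(pointer_path)
    pointer = None
    if pointer_file.exists():
        text = pointer_file.read_text().strip()
        if text:
            pointer = datetime.strptime(text, TIMESTAMP_FORMAT)

    new_lines = []
    latest = pointer
    # The whole file is still read on every run, which is why a large
    # access_log makes each run slower.
    with open(log_path, encoding="utf-8", errors="replace") as log:
        for line in log:
            match = TIMESTAMP_RE.search(line)
            if match is None:
                continue  # not a recognisable access log line
            stamp = datetime.strptime(match.group(1), TIMESTAMP_FORMAT)
            if pointer is not None and stamp <= pointer:
                continue  # already scanned on an earlier run
            new_lines.append(line.rstrip("\n"))
            if latest is None or stamp > latest:
                latest = stamp

    if latest is not None:
        pointer_file.write_text(latest.strftime(TIMESTAMP_FORMAT))
    return new_lines


def reset_pointer(pointer_path):
    """Clear the pointer so the next run starts from the beginning of the log."""
    Path(pointer_path).unlink(missing_ok=True)
```

Calling scan_new_entries twice on an unchanged log returns nothing the second
time. After rotating the log, calling reset_pointer makes the next run scan
the new file from its first line.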

Thank you
Angelo Miranda

-----Original Message-----
From: Bjorn Skobba [mailto:bjorn.skobba at brunel.ac.uk] 
Sent: Wednesday, 28 February 2007 8:53
To: Angelo Miranda
Cc: 'John Murtagh'; dspace-general at mit.edu
Subject: Re: [Possible Spam]::Re: [Dspace-general] Statistics Issues and DSpace

Hi Angelo,

Thanks for your reply.

Is the stats-detect-spiders script the mechanism for excluding crawlers,
etc.?

Can you tell me a little bit about it: is it supposed to be scheduled
from cron (and how often), how does it work, what does it do, etc.?

Many thanks for your help.

Bjorn Skobba
Brunel University

On Tue, 2007-02-27 at 16:45 +0000, Angelo Miranda wrote:
> Hi,
> 
>  
> 
> I am from RepositoriUM team at University of Minho.
> 
> I am not sure if I understood your question.
> 
> Are you asking if the statistics add-on excludes the views and
> downloads from crawlers, robots and similar devices?
> 
> If that's the question, the answer is yes. The add-on has a mechanism
> for ignoring those accesses.
> 
>  
> 
> Thank You
> 
> Angelo Miranda
> 
>  
> 
> -----Original Message-----
> From: dspace-general-bounces at mit.edu
> [mailto:dspace-general-bounces at mit.edu] On Behalf Of John Murtagh
> Sent: Tuesday, 27 February 2007 12:08
> To: dspace-general at mit.edu
> Subject: [Dspace-general] Statistics Issues and DSpace
> 
>  
> 
> Hello all,
> 
> The use of statistics for DSpace is an important part of our strategy
> to increase deposits and we wish to protect the integrity of that
> information.
> 
> A quick question about the statistics element of DSpace, namely the
> add-on provided by Universidade do Minho
> <http://www.ecs.soton.ac.uk/~harnad/Hypermail/Amsci/6086.html>
> 
> I wondered if there had been any issues or concerns in the collection
> and processing of downloads and views for items on DSpace? This could
> be in the form of retrieval robots, OAI harvesters, Google or the
> manipulation of statistics.
> 
> Has anyone got any news or info they'd like to share?
> 
> Thanks in advance
> 
> 
> John Murtagh
> ________________________________________________
> 
> 
> Website: http://bura.brunel.ac.uk
> 
> 
> 
> John Murtagh
> Project Manager - Brunel University Research Archive
> Brunel Library
> Kingston Road
> Uxbridge
> UB8 3PH
> 
> Tel: 0189 526 5417
> Fax: 01895269741
> E-mail: john.murtagh at brunel.ac.uk
> 
> _______________________________________________
> Dspace-general mailing list
> Dspace-general at mit.edu
> http://mailman.mit.edu/mailman/listinfo/dspace-general




