nutch-dev mailing list archives

From: Carl Cerecke
Subject: OPIC scoring differences
Date: Sun, 08 Jul 2007 22:38:08 GMT

The docs for the OPICScoringFilter mention that the plugin implements a
variant of OPIC from Abiteboul et al.'s paper. What exactly is different,
and how does the difference affect the scores?

Also, there's a comment in the code:

// XXX (ab) no adjustment? I think this is contrary to the algorithm descr.
// XXX in the paper, where page "loses" its score if it's distributed to
// XXX linked pages...

Is this something that will be looked at eventually, or is the scoring
"good enough" at the moment without such an "adjustment"?
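
For reference, here is a minimal sketch of the step the comment alludes to, as I understand the paper: when a page is processed, its current "cash" is added to its history, split equally among the pages it links to, and then reset to zero. This is not Nutch code. The class OpicSketch, the method visit, and the cash and history maps are hypothetical names used only for illustration, and skipping pages with no outlinks is a simplification of the sketch.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; none of these names come from Nutch.
class OpicSketch {
    // Cash currently held by each page.
    final Map<String, Double> cash = new HashMap<>();
    // Total cash each page has held and passed on so far.
    final Map<String, Double> history = new HashMap<>();

    // Process one page: record its cash, reset it to zero, and
    // distribute the amount equally among its outlinks.
    void visit(String page, List<String> outlinks) {
        if (outlinks.isEmpty()) {
            return; // simplification: pages without outlinks keep their cash
        }
        double c = cash.getOrDefault(page, 0.0);
        history.merge(page, c, Double::sum);
        // Reset before distributing so that a self-link is handled correctly.
        cash.put(page, 0.0);
        double share = c / outlinks.size();
        for (String target : outlinks) {
            cash.merge(target, share, Double::sum);
        }
    }

    public static void main(String[] args) {
        OpicSketch s = new OpicSketch();
        s.cash.put("a", 1.0);
        s.visit("a", List.of("b", "c"));
        System.out.println("cash: " + s.cash);
        System.out.println("history: " + s.history);
    }
}
```

The "adjustment" in the comment would correspond to the cash.put(page, 0.0) line: without it, a page keeps its score even after handing it out to the pages it links to.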

