search engine spam detector

classic Classic list List threaded Threaded
5 messages Options
Reply | Threaded
Open this post in threaded view
|

search engine spam detector

Stefan Groschupf-2
Hi,

a interesting tool:
http://tool.motoricerca.info/spam-detector/

Stefan
Reply | Threaded
Open this post in threaded view
|

Re: search engine spam detector

Stefan Neufeind
Stefan Groschupf wrote:
>
> a interesting tool:
> http://tool.motoricerca.info/spam-detector/

Do you have good/bad experience with that tool? The idea to have
someething like this as a nutch-module (dropping pages or ranking them
very low) might come up :-)

From the FAQ I read that the author is a PHP-guy - I'd say luckily ...
but for nutch that would at least mean translating a big part. Question
still remains how advanced his ideas already are and if he would
contribute to such an extension. But contributing the ideas behind it
might be an interesting collaboration.

  Stefan
Reply | Threaded
Open this post in threaded view
|

Re: search engine spam detector

Stefan Groschupf-2
>
> The idea to have
> someething like this as a nutch-module (dropping pages or ranking them
> very low) might come up :-)

This will be a very long way.
I collect some thoughts and a list of web spam related papers in my  
blog.
http://www.find23.net/Web-Site/blog/521BA1CD-14C4-4E84-A072- 
F98E13CAEFE1.html
Feedback is welcome.


Stefan

Reply | Threaded
Open this post in threaded view
|

RE: search engine spam detector

Arsen Popovyan
In reply to this post by Stefan Groschupf-2

Hello!!!

-----Original Message-----
From: Stefan Groschupf [mailto:[hidden email]]
Sent: Sunday, June 04, 2006 9:15 PM
To: [hidden email]
Subject: search engine spam detector

Hi,

a interesting tool:
http://tool.motoricerca.info/spam-detector/

Stefan

Reply | Threaded
Open this post in threaded view
|

Re: search engine spam detector

Andrzej Białecki-2
In reply to this post by Stefan Groschupf-2
Stefan Groschupf wrote:

>>
>> The idea to have
>> someething like this as a nutch-module (dropping pages or ranking them
>> very low) might come up :-)
>
> This will be a very long way.
> I collect some thoughts and a list of web spam related papers in my blog.
> http://www.find23.net/Web-Site/blog/521BA1CD-14C4-4E84-A072-F98E13CAEFE1.html 
>
> Feedback is welcome.

Have a look also at the published papers from the (just finished)
WWW2006 conference: http://www2006.org/tracks/#session_paper03 . Other
papers related to search (Search Engineering, Search tracks) are equally
interesting ... Enjoy the reading! :)

--
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com