[Developers] Blog properties and ping idea

William Pietri william at scissor.com
Tue Aug 21 17:27:18 UTC 2007


Will Moffat wrote:
>>> is where Liz suggestion for automatic population (through a ping
>>> service) can be really interesting :)
>>>       
> Tthere were 111 _thousand_ pings in the last hour! Many of which look
> like spam blogs. So I think Pat's suggestion of cataloguing individual
> posts is a little ambitious :-)
>   

I can definitely confirm both the volume and the spam. A year or so ago 
when I spent some time parsing that feed and crawling blogs, spam was in 
the majority, and a crawler that could keep up with pings was drawing 
~12 megabits a second. I'm sure both spam proportion and volume are up 
substantially from then.

If somebody gets excited about the topic, I'd recommend chatting with 
the guy behind this site:

http://www.blogsnow.com/

Not only does he have experience in extracting interesting meta 
information from blogs, but he's got a solid grip on the spam issues. He 
mentioned to me that 70% of his code is for spam removal.

William

-- 
William Pietri - william at scissor.com - +1-415-643-1024
Agile consulting, coaching, and development: http://www.scissor.com/
Use your geek-fu to fight poverty: http://www.mifos.org/


More information about the Developers mailing list