[Developers] Blog properties and ping idea
William Pietri
william at scissor.com
Tue Aug 21 17:27:18 UTC 2007
Will Moffat wrote:
>>> is where Liz's suggestion for automatic population (through a ping
>>> service) can be really interesting :)
>>>
> There were 111 _thousand_ pings in the last hour! Many of which look
> like spam blogs. So I think Pat's suggestion of cataloguing individual
> posts is a little ambitious :-)
>
I can definitely confirm both the volume and the spam. A year or so ago
when I spent some time parsing that feed and crawling blogs, spam was in
the majority, and a crawler that could keep up with pings was drawing
~12 megabits a second. I'm sure both spam proportion and volume are up
substantially from then.
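
For a sense of scale, here is a rough back-of-envelope conversion of the
two figures above, as a small Python sketch. The function names are made
up for illustration, and the two numbers come from different points in
time (the ping count is from the quoted message, the bandwidth is my
approximate figure from a year or so earlier), so they are converted
separately rather than combined.

```python
# Back-of-envelope unit conversions for the figures quoted in this thread.
# Function names are hypothetical; the inputs are the approximate numbers
# mentioned above, not fresh measurements.

PINGS_PER_HOUR = 111_000      # from the quoted message
CRAWLER_MBIT_PER_SEC = 12     # approximate crawler bandwidth, a year or so earlier


def pings_per_second(pings_per_hour):
    """Average ping rate a consumer of the feed would have to sustain."""
    return pings_per_hour / 3600


def megabytes_per_second(mbit_per_sec):
    """Convert megabits per second to megabytes per second (8 bits per byte)."""
    return mbit_per_sec / 8


def gigabytes_per_day(mbit_per_sec):
    """Data pulled per day at a sustained rate, in decimal gigabytes."""
    return megabytes_per_second(mbit_per_sec) * 86_400 / 1000


if __name__ == "__main__":
    print(f"{pings_per_second(PINGS_PER_HOUR):.1f} pings/second")
    print(f"{megabytes_per_second(CRAWLER_MBIT_PER_SEC):.1f} MB/second")
    print(f"{gigabytes_per_day(CRAWLER_MBIT_PER_SEC):.1f} GB/day")
```

That works out to roughly 31 pings a second on average, and about 1.5
megabytes a second (on the order of 130 gigabytes a day) for the crawler.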
If somebody gets excited about the topic, I'd recommend chatting with
the guy behind this site:
http://www.blogsnow.com/
Not only does he have experience in extracting interesting meta
information from blogs, but he's got a solid grip on the spam issues. He
mentioned to me that 70% of his code is for spam removal.
William
--
William Pietri - william at scissor.com - +1-415-643-1024
Agile consulting, coaching, and development: http://www.scissor.com/
Use your geek-fu to fight poverty: http://www.mifos.org/