[Developers] Google Custom Search Engine + Vancouver Base
Jim Pick
jim at jimpick.com
Tue Dec 9 07:38:23 UTC 2008
It shouldn't be too hard to parameterize a few parts, and make it
generic. It doesn't use any custom types from the Vancouver base.
One thing I noticed is that I need to sanitize the data a bit better
before passing it on -- any poorly formatted URIs that get added to the
base will kill it for everybody. I bet there's probably already a
JavaScript library out there somewhere to do that.
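
A minimal sketch of that kind of sanitizing, assuming a modern JavaScript
runtime that provides the standard URL class (Acre itself may not).
sanitizeUris is a hypothetical helper name, not something from the app; it
drops anything that does not parse as an absolute http(s) URI and removes
duplicates:

```javascript
// Hypothetical helper (not from the Acre app): keep only well-formed,
// absolute http(s) URIs, normalized and de-duplicated.
function sanitizeUris(rawUris) {
  const seen = new Set();
  const clean = [];
  for (const raw of rawUris) {
    if (typeof raw !== "string") continue; // skip non-string values
    let parsed;
    try {
      parsed = new URL(raw.trim()); // throws on malformed or relative URIs
    } catch (e) {
      continue;
    }
    // Assumption: only web pages are of interest to the search engine.
    if (parsed.protocol !== "http:" && parsed.protocol !== "https:") continue;
    const normalized = parsed.href;
    if (!seen.has(normalized)) {
      seen.add(normalized);
      clean.push(normalized);
    }
  }
  return clean;
}

const sample = [
  "http://example.com",
  "www.example.com",       // no scheme, rejected
  "http://exa mple.com",   // space in host, rejected
  "ftp://example.com/x",   // not http(s), rejected
];
console.log(sanitizeUris(sample));
```

Filtering at this step means one bad entry is skipped instead of breaking
the whole list.
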
The custom search engine needs to be created and registered first with
Google (tied to a user or group of users). They do have an automated
way to generate custom search engine descriptions. Also, they require
advertising, unless it's for a non-profit or charity. They do pay
AdSense revenue to whoever has registered the search engine.
So in order to make it generic, some decisions have to be made about how
to register and configure it on the Google side (including decisions
about who gets the AdSense money, however small that may be).
Google also has limits on the number of "annotations" (a.k.a. sites) they
will index. I think it's 3MB per annotation file, 10MB max. That
limits the total size. I'm at 132K for 1168 URIs now. If I'm
aggressive in collecting data, I might eventually hit the limit for the
Vancouver base - at which point I'll have to get selective.
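
A rough back-of-the-envelope estimate of how far away that limit is, using
the figures above. The 3MB and 10MB limits are the unconfirmed numbers
quoted in this message, the units are assumed to be binary (1K = 1024
bytes), and estimateCapacity is a hypothetical helper:

```javascript
// Figures quoted in the message; the limits themselves are unconfirmed.
const currentBytes = 132 * 1024;            // "132K" so far
const currentUris = 1168;                   // URIs collected so far
const perFileLimitBytes = 3 * 1024 * 1024;  // "3MB per annotation file"
const totalLimitBytes = 10 * 1024 * 1024;   // "10MB max"

// Hypothetical helper: how many URIs fit under a byte limit, assuming the
// average size per URI stays the same as it is today.
function estimateCapacity(limitBytes, bytesUsed, uriCount) {
  const bytesPerUri = bytesUsed / uriCount;
  return Math.floor(limitBytes / bytesPerUri);
}

const perFile = estimateCapacity(perFileLimitBytes, currentBytes, currentUris);
const total = estimateCapacity(totalLimitBytes, currentBytes, currentUris);
console.log("average bytes per URI:", Math.round(currentBytes / currentUris));
console.log("estimated URIs per file:", perFile);
console.log("estimated URIs in total:", total);
```

Under those assumptions the limits work out to roughly 27,000 URIs per
file and about 90,000 overall, so there is a fair amount of headroom at
the current size.
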
Yahoo BOSS looks like an interesting alternative to try out as well
(with news and images too). There are probably other similar services
lurking out there (e.g. sites that read OPML files).
Cheers,
- Jim
Robert Cook wrote:
> This is very cool! Jim, if you do the small tweak so it can work with
> any base, we'll weave it in to the interface.
>
> R
> On Dec 8, 2008, at 5:42 PM, Jason Douglas wrote:
>
>> Very clever!!
>>
>> I never thought of that mash-up before, but the results speak for
>> themselves. :-)
>>
>> -jason
>>
>>
>> On Dec 8, 2008, at 5:33 PM, Jim Pick wrote:
>>
>>> Hi,
>>>
>>> I just used Acre to build a Google Custom Search Engine based on the
>>> data contained in the Vancouver Base.
>>>
>>> http://vanbase.jpick.user.dev.freebaseapps.com/search
>>>
>>> Please look at the Acre source code and feel free to clone it.
>>>
>>> There are currently 1145 URLs it searches against. I've only done a
>>> few searches with it, but it does a very good job for things in the
>>> Vancouver area.
>>>
>>> Cheers,
>>>
>>> - Jim
>>> _______________________________________________
>>> Developers mailing list
>>> Developers at freebase.com
>>> http://lists.freebase.com/mailman/listinfo/developers