[Data-modeling] keeping Freebase topics and Wikipedia pages in sync

Iain Sproat iainsproat at gmail.com
Wed Jul 22 19:12:23 UTC 2009


On Wed, Jul 22, 2009 at 9:56 PM, Brian Karlak <zenkat at metaweb.com> wrote:
> about 30% of the "wikipedia merges" are true reconciliations that should
> result in a merge in freebase. We considered putting these on the merge
> queue, but with >2000 merges every two weeks, we quickly realized that
> we would swamp our community's bandwidth.


Does that mean that every 2 weeks some 600 duplicate topics accumulate
that should be merged?
Or are they handled behind the scenes, without community input?

Although to be fair, the ~30k dupe topics accumulated over the last 2
years are only a small dent (roughly 0.5%) in the ~6 million topics
currently in freebase.
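
For anyone who wants to check my sums, here is a rough back-of-envelope
sketch. It assumes exactly 2000 merges per fortnight (the quoted figure is
a lower bound), a constant rate over the full two years, and 26 fortnights
per year; the names are only illustrative, not anything from Freebase.

```python
# Back-of-envelope estimate of the duplicate backlog (assumptions in comments).
MERGES_PER_FORTNIGHT = 2000    # assumed: lower bound from the quoted ">2000"
TRUE_MERGE_PERCENT = 30        # "about 30%" are true reconciliations
FORTNIGHTS_PER_YEAR = 26       # assumed: 52 weeks / 2
YEARS = 2
TOTAL_TOPICS = 6_000_000       # "~6 million topics"


def estimate_backlog():
    """Return (dupes per fortnight, total backlog, share of all topics)."""
    dupes_per_fortnight = MERGES_PER_FORTNIGHT * TRUE_MERGE_PERCENT // 100
    backlog = dupes_per_fortnight * FORTNIGHTS_PER_YEAR * YEARS
    share = backlog / TOTAL_TOPICS
    return dupes_per_fortnight, backlog, share


if __name__ == "__main__":
    per_fortnight, backlog, share = estimate_backlog()
    print(f"dupes per fortnight: {per_fortnight}")
    print(f"backlog after {YEARS} years: {backlog}")
    print(f"share of all topics: {share:.2%}")
```

Under those assumptions that comes to 600 per fortnight and about 31,200
over two years, or about 0.52% of all topics, so right around the 0.5% mark.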

Iain