[Data-modeling] keeping Freebase topics and Wikipedia pages in sync
Iain Sproat
iainsproat at gmail.com
Wed Jul 22 19:12:23 UTC 2009
On Wed, Jul 22, 2009 at 9:56 PM, Brian Karlak <zenkat at metaweb.com> wrote:
>
>
> about 30% of the "wikipedia merges" are true reconciliations that should
result in a merge in freebase. We considered putting these on the merge
> queue, but with >2000 merges
every two weeks, we quickly realized that we would swamp our
> community's bandwidth.
Does that mean that every 2 weeks we have some 600 topics duplicate topics
accumulating which should be merged?
Or are they handled behind the scenes, without community input?
Although to be fair, the 30k of dupe topics accumulated over the last 2
years is only a small dent (< 0.5%) in the ~6 million topics currently in
freebase.
Iain
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.freebase.com/pipermail/data-modeling/attachments/20090722/c3f3dd0c/attachment-0001.htm
More information about the Data-modeling
mailing list