[Data-modeling] keeping Freebase topics and Wikipedia pages in sync

Brian Karlak zenkat at metaweb.com
Thu Jul 23 18:26:21 UTC 2009


On Jul 22, 2009, at 12:12 PM, Iain Sproat wrote:

>  Does that mean that every 2 weeks we have some 600 topics duplicate  
> topics accumulating which should be merged?

Theoretically, yes.

> Although to be fair, the 30k of dupe topics accumulated over the  
> last 2 years is only a small dent (< 0.5%) in the ~6 million topics  
> currently in freebase.

Exactly - it just wasn't considered a high priority at the time.

Now that we have Data Games, however, we could consider revisiting it  
-- possibly by making a special "one-vote-only" merge queue.

Brian


More information about the Data-modeling mailing list