[Data-modeling] keeping Freebase topics and Wikipedia pages in sync; uncertainty in who is the composer

Brian Karlak zenkat at metaweb.com
Wed Jul 22 17:39:06 UTC 2009


On Jul 21, 2009, at 3:50 PM, Raymond Yee wrote:

> 1) I go ahead with creating Freebase topics for the cantatas w/o any
> Freebase IDs currently
>
> 2) Start filling out the data as I can find them or as I can recruit
> help to fill them in for all the cantatas.
>
> 3) As I gather enough data to create Bach stuff articles on the
> Wikipedia, do so.
>
> 4) Wait for Freebase to discover the new Bach cantata pages and then
> flag them for merging.

This is currently the best course of action.  Since wikipedia topics  
are created without any properties, it's hard to reconcile them on  
import.

However, it does seem that we could provide a "hint" property that  
could be set on topic creation that would auto-reconcile to an as-yet- 
uncreated wikipedia page.  When you create the wikipedia page, you  
could add a property to your page of "/dataworld/expected_key" with  
the new article name, which we'd pick up on the next load.

Brian


More information about the Data-modeling mailing list