[Developers] Question about references to wikipedia articles
Colin Evans
colin at metaweb.com
Wed Feb 20 01:45:32 UTC 2008
Hi Kavitha,
Every Freebase topic that has a corresponding Wikipedia topic has a set
of keys with the namespace "/wikipedia/en" and also a "/wikipedia/en_id"
key. You can see examples of those keys here:
http://freebase.com/view/explore/topic/en/abraham_lincoln
The "en" keys are the Wikipedia article titles and redirects that
correspond to the Freebase topic. The "en_id" is the Wikipedia page_id
from the Mediawiki database.
We re-synchronize with Wikipedia roughly twice a month, with a strong
emphasis on keeping Freebase topics stable. Thus, if a page is split,
merged, moved, or deleted in Wikipedia, the corresponding topic(s) in
Freebase will reflect the changes conservatively or maybe not at all,
depending on the data and our current sync algorithm. We work hard to
keep Freebase.com aligned with Wikipedia, so for most cases the "en"
keys should be a good mapping.
The dbpedia extraction is run on a regular basis from scratch, and
therefore their canonical reference is to a specific article version in
a specific Wikipedia dump. Because Freebase.com is continuously
synchronizing with Wikipedia, there isn't a more canonical reference
than the "en" and "en_id" keys.
If you're looking to reconcile a specific dbpedia dump with
Freebase.com, I'd recommend aligning on article names and redirects --
the "en" keys in Freebase. There may be some inconsistencies as
Wikiepdia redirects move around over time, but this will probably work
for the majority of records.
Also, I'd like to get in a plug for the new Freebase Wikipedia
Extraction, which is the basis for a lot of our sync algorithms and data
mining of Wikipdia. The "freebase_names" table has a mapping of
Freebase guids to Wikipedia article names, which should make alignment
easier: http://download.freebase.com/wex/doc/#freebase_names
Hope that helps!
-Colin
Kavitha Srinivas wrote:
> Is there a way to get a canonical reference to the current version of
> the wikipedia article for a given guid in freebase? We are trying to
> establish connections between freebase and dbpedia, and it looks like
> dbpedia has a canonical reference, whereas freebase uses a link to
> the version from which it was created. Finding equivalences between
> dbpedia and freebase has not been easy, and is prone to errors. Thanks!
> Kavitha
> _______________________________________________
> Developers mailing list
> Developers at freebase.com
> http://lists.freebase.com/mailman/listinfo/developers
>
More information about the Developers
mailing list