[Developers] WEX (wpid/fbid lookup) database
Alexander Marks
al at metaweb.com
Thu Nov 6 23:11:23 UTC 2008
Hey Sam. We're working on a new WEX dump that will be out soon. In the meantime, you can generate the guid->wpid map you want from the quad dump with something like this Python script:
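# Extract guid -> Wikipedia page ID pairs from the quad dump.
with open("freebase-datadump-quadruples.tsv", "r") as dump, \
     open("guid2wpid.tsv", "w") as out:
    for line in dump:
        # Each quad is: source, property, destination, value.
        # Strip the trailing newline so it does not end up inside val.
        src, prop, dst, val = line.rstrip("\n").split("\t")
        if prop == "/type/object/key" and dst == "/wikipedia/en_id":
            out.write("%s\t%s\n" % (src, val))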
which will give you a format like this:
/guid/9202a8c04000641f8000000000009e89 3746
/guid/9202a8c04000641f8000000000032ded 25493
...
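To use the resulting file for lookups in either direction, something along these lines might work. It is only a sketch: the function name load_guid2wpid is made up for illustration, and it assumes each row has exactly two tab-separated fields as in the sample above.

```python
import csv
import io

def load_guid2wpid(handle):
    """Read tab-separated (guid, wpid) rows into two lookup dicts.

    Hypothetical helper, not part of any Freebase tooling.
    """
    guid_to_wpid = {}
    wpid_to_guid = {}
    for row in csv.reader(handle, delimiter="\t"):
        if len(row) != 2:
            continue  # skip blank or malformed rows
        guid, wpid = row
        guid_to_wpid[guid] = wpid
        wpid_to_guid[wpid] = guid
    return guid_to_wpid, wpid_to_guid

# In-memory stand-in for guid2wpid.tsv, using the two rows shown above.
sample = io.StringIO(
    "/guid/9202a8c04000641f8000000000009e89\t3746\n"
    "/guid/9202a8c04000641f8000000000032ded\t25493\n"
)
guid_to_wpid, wpid_to_guid = load_guid2wpid(sample)
print(wpid_to_guid["3746"])
```

With the real file you would pass open("guid2wpid.tsv", newline="") instead of the in-memory sample.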
As for Wikipedia article redirects, you will always need the WEX dump for the complete set, although replacing /wikipedia/en_id with /wikipedia/en in the script above might get you part of the data you want. The /wikipedia/en Freebase namespace contains both the name of the Wikipedia article and the names of all redirects to that article.
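As a rough sketch of that /wikipedia/en variant, the keys can be grouped per guid so that each topic ends up with a list holding its article name and redirect names together. The function name collect_wikipedia_keys and the two sample key values are invented for illustration, and the sketch does not try to tell the article title apart from the redirects.

```python
from collections import defaultdict

def collect_wikipedia_keys(lines, namespace="/wikipedia/en"):
    """Group /type/object/key values in one namespace by source guid.

    Hypothetical helper; assumes four tab-separated fields per quad.
    """
    keys_by_guid = defaultdict(list)
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) != 4:
            continue  # skip lines that are not well-formed quads
        src, prop, dst, val = fields
        if prop == "/type/object/key" and dst == namespace:
            keys_by_guid[src].append(val)
    return dict(keys_by_guid)

# Made-up quads: the key names below are placeholders, not real dump data.
guid = "/guid/9202a8c04000641f8000000000009e89"
sample_quads = [
    guid + "\t/type/object/key\t/wikipedia/en\tExample_article\n",
    guid + "\t/type/object/key\t/wikipedia/en\tExample_redirect\n",
    guid + "\t/type/object/key\t/wikipedia/en_id\t3746\n",
]
keys = collect_wikipedia_keys(sample_quads)
print(keys[guid])
```

For the real dump, pass the open file object for freebase-datadump-quadruples.tsv as the lines argument.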
Hope that helps,
Al
----- Original Message -----
From: "Sam Halliday" <sam.halliday at gmail.com>
To: developers at freebase.com
Sent: Tuesday, November 4, 2008 2:30:09 AM GMT -08:00 US/Canada Pacific
Subject: [Developers] WEX (wpid/fbid lookup) database
Hi all,
I noticed that Freebase released a new database dump for October, but
no corresponding WEX dump. I am not interested in the WEX part of the
latter database, but in the piece that links wpids to fbuids and
gathers all the Wikipedia redirects... I am confused as to why this is
not part of the Freebase download in the first place.
I used Freebase for a project on the assumption that new data would be
available on a quarterly basis. Is this not the case for the WEX data?
I'd also like to request that the wpid and redirect tables be included
in the Freebase data dumps in the future, and not exclusively in the
WEX data. I understand that WEX can be built from the Wikipedia
data, but the wpid/fbuid lookup cannot... it must be Freebase that does
this.
_______________________________________________
Developers mailing list
Developers at freebase.com
http://lists.freebase.com/mailman/listinfo/developers