[Developers] Bulk download from sandbox.freebase.com
Kavitha Srinivas
ksrinivs at gmail.com
Mon Oct 8 17:30:23 UTC 2007
Hello
We are trying to see what it takes to do a bulk download from
sandbox.freebase.com so we can generate an RDF like dump of Freebase
data.
Here's what we tried -- we tried to get any instance of /common/topic
and dump all of its links to other instances of /common/topic. When
we try this with no explicit limits set, this gives us some randomly
selected instances (within a default limit, which I guess is 100).
Is there a way of (a) not having a limit so we can get everything, or
(b) getting a well defined subset at a time. For (b) we tried
sorting it by various attributes (name, id, timestamp), and in each
case, adding the sort caused a query timeout. Here's an example of
what we tried with sorting by timestamp:
[{
"/type/reflect/any_master":[{
"link":{
"master_property":{
"name":null
}
},
"name":null,
"id":null,
"type":"/common/topic"
}],
"limit":10,
"timestamp":null,
"sort":"timestamp",
"name":null,
"type":"/common/topic"
}]
If there's a canonical way of getting everything, that would be
great! Thanks!
Kavitha
More information about the Developers
mailing list