[Developers] Bulk download from sandbox.freebase.com

Kavitha Srinivas ksrinivs at gmail.com
Mon Oct 8 17:30:23 UTC 2007


Hello,
    We are trying to work out what it takes to do a bulk download from
sandbox.freebase.com so that we can generate an RDF-like dump of
Freebase data.
Here's what we tried: we queried for any instance of /common/topic
and dumped all of its links to other instances of /common/topic. With
no explicit limit set, this returns what look like randomly selected
instances (capped at a default limit, which I guess is 100).
Is there a way of (a) removing the limit so we can get everything, or
(b) getting a well-defined subset at a time? For (b) we tried
sorting by various attributes (name, id, timestamp), and in each
case adding the sort caused a query timeout. Here's an example of
what we tried, sorting by timestamp:

[{
  "/type/reflect/any_master": [{
    "link": {
      "master_property": {
        "name": null
      }
    },
    "name": null,
    "id": null,
    "type": "/common/topic"
  }],
  "limit": 10,
  "timestamp": null,
  "sort": "timestamp",
  "name": null,
  "type": "/common/topic"
}]
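
To make option (b) concrete, here is a rough Python sketch of the kind
of paging loop we have in mind. It is only an illustration under
assumptions: `run_query` is a hypothetical stand-in for whatever call
submits the query, and the opaque `cursor` it returns is likewise
hypothetical. Whether the service exposes anything like it is exactly
what we are asking.

```python
from typing import Callable, Iterator, Optional, Tuple

# Hypothetical signature (an assumption, not a documented Freebase call):
# takes a query plus an opaque cursor (None for the first page) and
# returns (results, next_cursor); next_cursor is None when nothing is left.
RunQuery = Callable[[list, Optional[str]], Tuple[list, Optional[str]]]


def iter_all(run_query: RunQuery, query: list) -> Iterator[dict]:
    """Yield every result by requesting one bounded page at a time."""
    cursor: Optional[str] = None
    while True:
        # Each call is expected to return at most one page of results.
        results, cursor = run_query(query, cursor)
        for item in results:
            yield item
        if cursor is None:
            # The server reported that there are no further pages.
            break
```

Each request stays small, so no single query has to sort or return
the whole of /common/topic at once.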

If there's a canonical way of getting everything, that would be  
great!  Thanks!

Kavitha



