[Freebase-discuss] Converting Freebase mid to integer

Paul Houle paul at ontology2.com
Tue Sep 27 14:12:28 UTC 2011


       It's not crazy to be concerned about the efficiency of database 
keys.  On the other hand,  properly used mids are fairly efficient.

     Those who use sequential integer primary keys are almost always 
satisfied with performance;  some of them have tried a different kind of 
key and found performance degraded by orders of magnitude.

     The first explanation that comes to many people for this is the 
small size of the integer key,  which uses less RAM,  less disk space,  
less room in the cache,  fewer disk blocks,  etc.  This is a real 
effect,  but it's nowhere near large enough to account for the huge 
performance change.  This guy did a nice experiment

http://www.informit.com/articles/printerfriendly.aspx?p=25862

     who showed that sequential integer keys are drastically faster in 
Microsoft's SQL server than GUIDs,  not because of the the size,  but 
because they're sequential.  Once he started generating sequential 
"GUID" objects (breaking the GUID property of the objects) he found only 
a tiny difference compared with primary keys.


More information about the Freebase-discuss mailing list