[Freebase-discuss] No "Metaweb System Types" in data dump?
Roland Bouman
roland.bouman at gmail.com
Fri Jul 23 21:54:46 UTC 2010
Hi Shawn!
On Fri, Jul 23, 2010 at 9:52 PM, Shawn Simister <narphorium at gmail.com> wrote:
> What you're describing sounds more like what the link export is intended
> for. I know it seems intuitively easier to map TSV files onto relational
> tables but I'm worried that you're going to run into a lot more problems by
> trying to create a complete version of Freebase from tables which were not
> really meant to be stitched together like that.
>
> For example, if you look at the /people/ethnicity topic dump you'll see a
> column called includes_groups where the values are comma delimited lists of
> other ethnicities. Even if you parse out these lists and make a separate
> join table, you still only have a denormalized list of names with no
> guarantee that those names uniquely identify a single ethnicity. This is
> pretty common across all the topic dumps so I think you might find it hard
> to link topics together by foreign key in that way.
Thanks for your reply. The "link export" is the same as the
"quadruples dump", correct? If so, you're the third or fourth person
to suggest looking at it, so I think I will, and drop the idea
of using the TSV dumps.
The point you make about the comma-delimited lists not being proper
identifiers is of course a valid one - I actually hadn't realized the
TSV dumps used names rather than ids.
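To make the problem concrete, here is a minimal Python sketch of what
splitting such a column into a join table might look like, and where it
breaks down. Everything in it is an assumption for illustration: the
column names other than includes_groups, the sample rows, and the ids
are made up and are not taken from the real /people/ethnicity dump.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample data: the "name" and "id" columns, the rows, and the
# id values are invented for illustration. Only the includes_groups column
# name comes from the discussion above.
SAMPLE_TSV = (
    "name\tid\tincludes_groups\n"
    "Group A\t/m/001\tGroup B,Group C\n"
    "Group B\t/m/002\t\n"
    "Group B\t/m/003\t\n"
    "Group C\t/m/004\t\n"
)


def build_join_rows(tsv_text):
    """Split includes_groups into (parent_id, child_name) pairs.

    Also returns an index of every id seen for each name.
    """
    rows = list(csv.DictReader(io.StringIO(tsv_text), delimiter="\t"))
    ids_by_name = defaultdict(list)
    for row in rows:
        ids_by_name[row["name"]].append(row["id"])

    join_rows = []
    for row in rows:
        for child in row["includes_groups"].split(","):
            child = child.strip()
            if child:  # skip empty cells
                join_rows.append((row["id"], child))
    return join_rows, ids_by_name


def resolve(join_rows, ids_by_name):
    """Turn child names into ids; keep apart the ones that cannot be resolved."""
    resolved, ambiguous = [], []
    for parent_id, child_name in join_rows:
        candidates = ids_by_name.get(child_name, [])
        if len(candidates) == 1:
            resolved.append((parent_id, candidates[0]))
        else:
            # Zero matches or several matches: no safe foreign key exists.
            ambiguous.append((parent_id, child_name, candidates))
    return resolved, ambiguous


join_rows, ids_by_name = build_join_rows(SAMPLE_TSV)
resolved, ambiguous = resolve(join_rows, ids_by_name)
print("resolved:", resolved)
print("ambiguous:", ambiguous)
```

In this made-up sample, "Group C" resolves to a single id, but "Group B"
matches two rows, so that link cannot become a foreign key. A name that
itself contains a comma would presumably also be split incorrectly.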
So - I won't make this thread any longer than needed.
Thanks all for taking the time to explain it to me and for the
valuable suggestions and workarounds - it is much appreciated.
kind regards,
Roland.
>
> If your goal is to create a local, queryable copy of the Freebase graph then
> I think you may have a difficult road ahead. However, if there is a specific
> problem that you're trying to solve with that data we may be able to help
> you find a simpler solution.
>
> Shawn
--
Roland Bouman
blog: http://rpbouman.blogspot.com/
twitter: @rolandbouman
Author of "Pentaho Solutions: Business Intelligence and Data
Warehousing with Pentaho and MySQL",
http://tinyurl.com/lvxa88 (Wiley, ISBN: 978-0-470-48432-6)
Author of "Pentaho Kettle Solutions: Building Open Source ETL
Solutions with Pentaho Data Integration",
http://tinyurl.com/33r7a8m (Wiley, ISBN: 978-0-470-63517-9)