[Developers] Ambiguities in /film domain
Christopher R. Maden
crism at metaweb.com
Thu Jul 10 16:45:59 UTC 2008
Dave Boyce wrote:
> I'm working on a movies viewer, and I'm finding a number of data issues.
>
> 1. A person can be represented by more than one object.
>
> 2. Two people with the same name result in data that is scattered, and
> there's
> a lot of ambiguity.
>
> I appreciate the way in which this situation has arisen and also that
> there isn't
> an immediate or easy fix, but this is a problem. I guess this isn't
> news, so
> does Freebase have any plans to rectify these kinds of issues, or is
> it just
> going to be left to the community to fix them as they're encountered?
The community *is* the plan.
Some things can be detected automatically; for instance, two different
directors with the same name on one film is probably a sign of a data
problem. There is an internal tracking issue to look for problems like
that. However, most of these problems will rely on someone like you to
notice them and flag them for correction.
[Most of these sort of problems come into being from our Wikipedia
import and analysis. Someone adds a simple wikilink like [[Paul
Verhoeven]] which selects the wrong one; someone else corrects it, but
we’ve already captured the bad association. We aren’t yet good at
*correcting* errors like that, so we end up with two identically-named
directors on the film.]
So please do keep flagging these as you find them. Thanks.
~Chris
--
Christopher R. Maden
Data Architect
Freebase.com: <URL: http://www.freebase.com/ >
Metaweb Technologes, Inc. <URL: http://www.metaweb.com/ >
More information about the Developers
mailing list