[Data-modeling] State of the commons (A-B)
Kirrily Robert
kirrily at metaweb.com
Wed Jul 1 19:18:44 UTC 2009
On Jun 30, 2009, at 9:53 AM, Robert Cook wrote:
> It would be very helpful if people could list the domains they think
> need serious work and what they'd like to be done and we'll work
> together to fix them. I think this mailing list is a good place to
> air pent up concerns about domains and if it becomes too detailed,
> we'll move it to a different venue.
Here are my current thoughts, alphabetically by commons domain:
American Football: schema is fairly solid, I think. However, there
are no very active admins who actually know football.
Anime/Manga: no active admins, no schema development in years, and
duplicates information already found in the TV, Film, and Comics
Commons. If we'd had bases when this was created, it probably should
have been built as a base. I don't believe anyone really uses this
much, and I'm very tempted to suggest we delete it and refactor all
the data into other commons.
Architecture: Brendan, Iain, and Spencer are working on integrating
stuff from structure2 into Architecture, which is great. We also need
to move out the "Museum" type and model it properly. I started
working on that in http://mladraft.freebase.com/
Astronomy: fairly good, has active community, I think it's doing OK.
However, there are ongoing issues of "Location" as applied to extra-
terrestrial locations.
Aviation: Could use some work in splitting "Aircraft" into "Aircraft
class" and "Aircraft instance" as we already have in eg. boats and
rail. Some discussion around aircraft accidents / disasters that
never got resolved. Basically there are no active admins with expert
knowledge in this field, and there's bit rot afoot.
Awards: pretty solid.
Baseball: People are requesting properties, but there are no active/
responsive admins with a knowledge of the field. Needs work.
Basketball: Very little schema here, very little activity, looks kinda
moribund. Many of these sports domains exist only because "position"
varies between sports. Have we learnt anything about modeling in the
last year or so to know a better way to describe these things, so we
don't need a different domain for every sport?
Bicycles: 3 types, very skeletonish. Needs data import work to make
it useful for base-builders (which is why it was originally made a
Commons, because there were so many bike-related bases), then some
build-out for bicycle specs and the like, which will vary considerably
between eg. MTB and racing.
Biology: so-so, quality-wise. I think what we really need is some
data gardening work on organism classifications, not schema work per se.
Boats: needs major reworking. Draft in progress at http://www.freebase.com/view/user/skud/boats
Broadcast: Needs work. A lot of the schema design here, especially
around Broadcast Content and Broadcast Artist, was done to support a
particular application, but it's hard to understand outside of that
context. Luckily, the major types are fairly self-explanatory, eg.
"Radio station" and "TV station". But some are kind of bewildering.
K.
--
Kirrily Robert
Freebase Community Director
kirrily at metaweb.com
http://freebase.com/
More information about the Data-modeling
mailing list