[Data-modeling] English Words

Jeff Prucher jeff at metaweb.com
Thu May 7 17:29:59 UTC 2009


Sorry!  Let me try again. The noun "dog" has seven different senses in
WordNet. (Note that this really is one word, and not seven different words).
These senses in WordNet are linked as synonyms, not directly to other words,
but to specific senses of other words. To give an example, Dog is not a
synonym of Heel, but Dog4 (someone who is morally reprehensible) is a
synonym of Heel3. My question is how one would represent this structure in
Freebase.
 
Jeff



  _____  

From: data-modeling-bounces at freebase.com
[mailto:data-modeling-bounces at freebase.com] On Behalf Of Arthur van Hoff
Sent: Wednesday, May 06, 2009 5:17 PM
To: Freebase data modeling mailing list
Subject: Re: [Data-modeling] English Words


Sorry, but you already lost me. I am neither a native English speaker, nor a
linguist. Examples would help.

We use the wordnet data to find the stems of words with synonyms, and
descriptions. 
Then we use an English lexicons and Wiktionary to find derivations and
additional stems.
It is not perfect, but it is a start.


On Wed, May 6, 2009 at 5:00 PM, Jeff Prucher <jeff at metaweb.com> wrote:


One of the things that's prevented anyone from tackling this yet is the
issue of polysemy. WordNet handles this by having separate entities for each
sense, which might be very cumbersome in Freebase.  Each sense could be a
CVT, but if you're linking synonyms, you have to be linking two CVTs (that
is, a sense of a word is only synonymous with a sense of another word),
which is not easily done in the client (although it can be done via MQL). 
 
Talking of senses opens another issue -- whose breakdown do you use?
Different dictionaries break down senses differently (lumping vs. splitting
is an age-old issue in lexicography). Would there be a need to represent
sense breakdowns by multiple authorities?
 
Jeff


  _____  

From: data-modeling-bounces at freebase.com
[mailto:data-modeling-bounces at freebase.com] On Behalf Of Arthur van Hoff
Sent: Wednesday, May 06, 2009 1:30 PM
To: data-modeling at freebase.com
Subject: [Data-modeling] English Words


Hi,

Has anyone thought about loading English (and other language) words into
Freebase?
We are currently using WordNet and Wiktionary data, but it would be really
convenient if this was available Freebase.
For each word we need the language, POS (noun, verb, adverb, adjective),
synonyms, sample usage, translations, etc.

Thanks.

-- 
Arthur van Hoff
arthur.van.hoff at gmail.com
650-283-0842



_______________________________________________
Data-modeling mailing list
Data-modeling at freebase.com
http://lists.freebase.com/mailman/listinfo/data-modeling






-- 
Arthur van Hoff
arthur.van.hoff at gmail.com
650-283-0842


-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.freebase.com/pipermail/data-modeling/attachments/20090507/b353daa9/attachment.htm 


More information about the Data-modeling mailing list