[Data-modeling] English Words
Spencer Kelly
spencerkelly86 at gmail.com
Tue Aug 25 01:34:37 UTC 2009
On Mon, Aug 24, 2009 at 2:53 PM, Jeff Prucher<jeff at metaweb.com> wrote:
> does anyone have data for it or does it just sound like a good idea, I
can't say),
you betcha.
the CMU Pronouncing
Dictionary<http://www.speech.cs.cmu.edu/cgi-bin/cmudict#about>
is great and gpl, aswell The moby lexicon
<http://icon.shef.ac.uk/Moby/>(also gpl) has a great Pronunciator.
etymology is harder, but theres
lots<http://www.dmoz.org/Reference/Dictionaries/Etymology//>of data.
> For example, WordNet gives five noun synsets
> for "rat"; these are all synsets of the same English word -- there are not
> five different words "rat", with unique etymologies, inflections, etc.
ya, yuck. well is it easier to split these homonyms before or after?
...and lets take a moment to realize that what we're doing is making the
world's greatest ever dictionary.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.freebase.com/pipermail/data-modeling/attachments/20090824/be859947/attachment.htm
More information about the Data-modeling
mailing list