[Developers] language codes
Christopher R. Maden
crism at metaweb.com
Fri Feb 1 03:55:13 UTC 2008
Arthur van Hoff wrote:
> I'll special case English for now, and move on, but I would appreciate
> it if someone could take a look at this and fix up the various language
> codes to be correct.
This has been tested on sandbox, and should be deployed on OTG tomorrow.
Note that there are two properties on /language/human_language, though;
the ISO 639-1 and ISO 639-2 codes may both need to be checked.
> I tried that mapping, but I don't think that file maps 639-1 to 639-3
> correctly. The 3 letter codes appear to be SIL codes (not sure what
> these are), and the mapping is ambiguous (some 2 letter codes map to
> many 3 letter codes).
SIL was formerly the Summer Institute of Linguistics, a missionary
group whose work included translating the Bible into as many languages
as possible; as a result, they have better language knowledge coverage
than many other institutions.
However, you’re right that that’s a short file; the ambiguity is
unavoidable since a single 2-letter code (zh for Chinese) may map to
multiple 3-letter codes (yue for Cantonese, cmn for Mandarin).
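
As a rough sketch of how a consumer might cope with that one-to-many
mapping: the table below is a tiny hand-picked sample, not the mapping
file under discussion, and the names SAMPLE_639_1_TO_639_3, candidates
and unambiguous are made up for illustration.

```python
# Hypothetical sample table: ISO 639-1 code -> list of ISO 639-3 codes.
# Only two entries are shown; a real table would be far larger.
SAMPLE_639_1_TO_639_3 = {
    "en": ["eng"],
    "zh": ["cmn", "yue"],  # Mandarin, Cantonese (among others)
}


def candidates(code_639_1):
    """Return every 639-3 code listed for a 639-1 code (empty if unknown)."""
    return list(SAMPLE_639_1_TO_639_3.get(code_639_1, []))


def unambiguous(code_639_1):
    """Return the single 639-3 code, or None if there are zero or several."""
    found = candidates(code_639_1)
    return found[0] if len(found) == 1 else None
```

With this shape, a lookup for "en" resolves directly, while "zh" returns
None from unambiguous() and the caller has to choose among the
candidates explicitly rather than silently picking the first one.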
~Chris
--
Christopher R. Maden
Data Architect
Metaweb Technologies, Inc.
<URL: http://www.metaweb.com/ >