[Developers] language codes

Christopher R. Maden crism at metaweb.com
Fri Feb 1 03:55:13 UTC 2008


Arthur van Hoff wrote:
> I'll special case English for now, and move on, but I would appreciate
> it if someone could take a look at this and fix up the various language
> codes to be correct.

This has been tested on sandbox, and should be deployed on OTG tomorrow.

Note that there are two code properties on /language/human_language,
though: the ISO 639-1 code and the ISO 639-2 code. Both may need to be
checked.

> I tried that mapping, but I don't think that file maps 639-1 to 639-3
> correctly. The 3 letter codes appear to be SIL codes (not sure what
> these are), and the mapping is ambiguous (some 2 letter codes map to
> many 3 letter codes).

SIL was formerly the Summer Institute of Linguistics, a missionary
group whose work included translating the Bible into as many languages
as possible; as a result, it has better coverage of the world's
languages than many other institutions.

However, you’re right that that’s a short file; the ambiguity is 
unavoidable since a single 2-letter code (zh for Chinese) may map to 
multiple 3-letter codes (yue for Cantonese, cmn for Mandarin).
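
To make the one-to-many problem concrete, here is a minimal sketch in
Python. The pairs are illustrative only and are not the contents of the
file under discussion; the names PAIRS, build_index and ambiguous are
hypothetical. It groups 3-letter codes under their 2-letter code and
reports which 2-letter codes cannot be resolved to a single value.

```python
from collections import defaultdict

# Illustrative (2-letter, 3-letter) pairs only; an assumption for this
# sketch, not the contents of the mapping file discussed in the thread.
PAIRS = [
    ("zh", "cmn"),  # Mandarin
    ("zh", "yue"),  # Cantonese
    ("en", "eng"),
    ("nl", "nld"),
]


def build_index(pairs):
    """Group 3-letter codes under the 2-letter code they are listed with."""
    index = defaultdict(list)
    for two, three in pairs:
        index[two].append(three)
    return dict(index)


def ambiguous(index):
    """Return only the 2-letter codes that map to more than one 3-letter code."""
    return {two: threes for two, threes in index.items() if len(threes) > 1}


INDEX = build_index(PAIRS)
AMBIGUOUS = ambiguous(INDEX)
```

Anything that ends up in AMBIGUOUS needs a human decision (or an extra
hint such as region or script) before it can be stored as a single
value; everything else can be converted mechanically.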

~Chris
-- 
Christopher R. Maden
Data Architect
Metaweb Technologies, Inc.
<URL: http://www.metaweb.com/ >

