| Source text | Target text | Regex find | Regex replace with |
| $218,480 | 218.480 $ | \$(\d+)\,(\d+) | $1.$2 \$ |
| €57.495 | 57.495 € | \€(\d.+\.\d.+) | $1 € |
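The find/replace pairs in the table above can be tried out in Python as a rough sketch. Python's `re.sub` writes backreferences as `\1` rather than `$1`, and the function names below are illustrative only. The euro pattern is tightened to `\d+\.\d+`, which is an assumption about the intended match. Like the table's pattern, the dollar rule handles a single thousands separator.

```python
import re


def dollar_to_suffix(text: str) -> str:
    """Turn "$218,480" into "218.480 $" (one thousands separator only)."""
    # In a Python replacement string "$" is literal; groups are \1 and \2.
    return re.sub(r"\$(\d+),(\d+)", r"\1.\2 $", text)


def euro_to_suffix(text: str) -> str:
    """Turn "€57.495" into "57.495 €"."""
    # Assumed tightening of the table's pattern: digits, a dot, digits.
    return re.sub(r"€(\d+\.\d+)", r"\1 €", text)


if __name__ == "__main__":
    print(dollar_to_suffix("Price: $218,480"))
    print(euro_to_suffix("Price: €57.495"))
```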
Sunday, August 31, 2025
Regex Localization Conversions
Friday, August 29, 2025
Python Script to Download Bergamot Models
import httpx
import tarfile
import os
from urllib.parse import urlparse
# https://data.statmt.org/bergamot/models/models.json
# List of model URLs
urls = [
"https://data.statmt.org/bergamot/models/csen/csen.student.base.v1.cd5418ba6a412fc7.tar.gz",
"https://data.statmt.org/bergamot/models/csen/csen.student.tiny11.v1.8f603aded58f0a3c.tar.gz",
"https://data.statmt.org/bergamot/models/csen/encs.student.base.v1.db770d87e491b0dc.tar.gz",
"https://data.statmt.org/bergamot/models/csen/encs.student.tiny11.v1.b5c1ff605296b0e5.tar.gz",
"https://data.statmt.org/bergamot/models/deen/deen.student.base.v2.caa7c0ce3c8eaf05.tar.gz",
"https://data.statmt.org/bergamot/models/deen/deen.student.tiny11.v2.9f70fcb17bf9572d.tar.gz",
"https://data.statmt.org/bergamot/models/deen/ende.student.base.v2.37b172bc9b594f9b.tar.gz",
"https://data.statmt.org/bergamot/models/deen/ende.student.tiny11.v2.93821e13b3c511b5.tar.gz",
"https://data.statmt.org/bergamot/models/esen/esen.student.tiny11.v1.09576f06d0ad805e.tar.gz",
"https://data.statmt.org/bergamot/models/esen/enes.student.tiny11.v1.a7203a8f8e9daea8.tar.gz",
"https://data.statmt.org/bergamot/models/eten/eten.student.tiny11.v1.38de61c668e42f36.tar.gz",
"https://data.statmt.org/bergamot/models/eten/enet.student.tiny11.v1.0b8f835b0c154aaa.tar.gz",
"https://data.statmt.org/bergamot/models/isen/isen.student.base.v2.536d6b8808a5c076.tar.gz",
"https://data.statmt.org/bergamot/models/isen/isen.student.tiny11.v2.829203cf37b7bdc4.tar.gz",
"https://data.statmt.org/bergamot/models/nben/nben.student.tiny11.v1.e410ce34f8337aab.tar.gz",
"https://data.statmt.org/bergamot/models/nnen/nnen.student.tiny11.v1.0efa37c16887eea4.tar.gz",
"https://data.statmt.org/bergamot/models/bgen/bgen.student.tiny11.v1.f9c89a3a25ff8dca.tar.gz",
"https://data.statmt.org/bergamot/models/bgen/enbg.student.tiny11.v1.3ea060c1b76470a7.tar.gz",
"https://data.statmt.org/bergamot/models/plen/plen.student.tiny11.v1.87148203cbda2842.tar.gz",
"https://data.statmt.org/bergamot/models/plen/enpl.student.tiny11.v1.c33219daa12e7872.tar.gz",
"https://data.statmt.org/bergamot/models/fren/fren.student.tiny11.v1.dccea16d03c0a389.tar.gz",
"https://data.statmt.org/bergamot/models/fren/enfr.student.tiny11.v1.805d112122af03d0.tar.gz",
"https://data.statmt.org/bergamot/models/hbseng/hbseng.student.tiny11.v1.fa8a29e01a5332ba.tar.gz",
"https://data.statmt.org/bergamot/models/slen/slen.student.tiny11.v1.d029034e49c3bb08.tar.gz",
"https://data.statmt.org/bergamot/models/mken/mken.student.tiny11.v1.dd03ef56f4695c7b.tar.gz",
"https://data.statmt.org/bergamot/models/mten/mten.student.tiny11.v1.4089a5a036eff1c3.tar.gz",
"https://data.statmt.org/bergamot/models/tren/tren.student.tiny11.v1.d7728d17a313230a.tar.gz",
"https://data.statmt.org/bergamot/models/sqen/sqen.student.tiny11.v1.6ead0c9b236f942b.tar.gz",
"https://data.statmt.org/bergamot/models/caen/caen.student.tiny11.v1.edaf67d1938e80d3.tar.gz",
"https://data.statmt.org/bergamot/models/elen/elen.student.tiny11.v1.0006442831596378.tar.gz",
"https://data.statmt.org/bergamot/models/uken/uken.student.tiny11.v1.108d04d1e160153a.tar.gz"
]
# Create a folder to store all models
os.makedirs("models", exist_ok=True)

for url in urls:
    # Derive the archive name and the per-model folder from the URL
    filename = os.path.basename(url)
    folder_name = filename.replace(".tar.gz", "")
    folder_path = os.path.join("models", folder_name)
    os.makedirs(folder_path, exist_ok=True)

    print(f"📥 Downloading {filename}...")
    response = httpx.get(url)
    response.raise_for_status()  # fail on HTTP errors instead of saving an error page
    archive_path = os.path.join(folder_path, filename)
    with open(archive_path, "wb") as f:
        f.write(response.content)

    print(f"📦 Extracting to {folder_path}...")
    with tarfile.open(archive_path, "r:gz") as tar:
        tar.extractall(path=folder_path)

    # Remove the archive once its contents are extracted
    os.remove(archive_path)
    print(f"✅ Done: {folder_name}\n")

print("🎉 All models downloaded and extracted!")
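The archive names in the URL list appear to follow a `<pair>.<kind>.<size>.<version>.<hash>.tar.gz` pattern. Assuming that pattern holds for every entry, a small helper can split a URL into its parts, for example to pick only the `tiny11` models. `ModelArchive` and `parse_model_url` are hypothetical names used for illustration; they are not part of any Bergamot tooling.

```python
import os
from typing import NamedTuple
from urllib.parse import urlparse


class ModelArchive(NamedTuple):
    pair: str     # e.g. "csen" (source + target language codes)
    kind: str     # e.g. "student"
    size: str     # e.g. "base" or "tiny11"
    version: str  # e.g. "v1"
    digest: str   # hash-like suffix in the file name


def parse_model_url(url: str) -> ModelArchive:
    """Split a model archive URL into its dot-separated name parts."""
    filename = os.path.basename(urlparse(url).path)
    suffix = ".tar.gz"
    if not filename.endswith(suffix):
        raise ValueError(f"not a .tar.gz archive: {filename}")
    parts = filename[: -len(suffix)].split(".")
    if len(parts) != 5:
        raise ValueError(f"unexpected archive name: {filename}")
    return ModelArchive(*parts)


if __name__ == "__main__":
    example = (
        "https://data.statmt.org/bergamot/models/csen/"
        "csen.student.tiny11.v1.8f603aded58f0a3c.tar.gz"
    )
    print(parse_model_url(example))
```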
Thursday, August 28, 2025
Bergamot Translator Linux and Python - Bergamot and TranslateLocally Models
Running bergamot-translator on Linux
$ git clone git@github.com:browsermt/bergamot-translator.git
$ cd bergamot-translator
$ mkdir build
$ cd build
$ sudo apt install libpcre2-dev libopenblas-dev
$ cmake ..
$ make -j
YAML file (config.yml):
bergamot-mode: native
models:
- firefox-translations-models/models/prod/esen/model.esen.intgemm.alphas.bin
vocabs:
- firefox-translations-models/models/prod/esen/vocab.esen.spm
- firefox-translations-models/models/prod/esen/vocab.esen.spm
shortlist:
- firefox-translations-models/models/prod/esen/lex.50.50.esen.s2t.bin
- false
beam-size: 1
normalize: 1.0
word-penalty: 0
max-length-break: 128
mini-batch-words: 1024
workspace: 128
max-length-factor: 2.0
skip-cost: true
cpu-threads: 0
quiet: false
quiet-translation: false
gemm-precision: int8shiftAlphaAll
alignment: soft
Here, esen is the language pair for the translation, in this case es→en (Spanish to English).
The models/vocabs/shortlist files should be sourced from the firefox-translations-models repository, using git-lfs. Some docs still point to Google Cloud Storage for downloads, but those are stale.
Pipe some data through bergamot-translator:
$ echo "Hola mundo" | ./bergamot-translator --model-config-paths config.yml

Requirement: Python <= 3.10 (wheels are not available for newer versions)
pip install bergamot
import bergamot
config = bergamot.ServiceConfig(numWorkers=4)
service = bergamot.Service(config)
model = service.modelFromConfigPath("bergamot.config.yml")
options = bergamot.ResponseOptions(
alignment=False, qualityScores=False, HTML=False
)
response = service.translate(model, bergamot.VectorString([
"In the last 3 months, over 80 arrestees were released from the Central Booking facility without being formally charged.",
"Since its inception, The Onion has become a veritable news parody empire.",
"The hostel’s guests were mostly citizens of the United Arab Emirates.",
]), options)
for r in response:
    print(r.target.text)
bergamot.config.yml:
# To imitate production setting, these Marian options are set according to
# https://github.com/mozilla/firefox-translations/blob/main/extension/controller/translation/translationWorker.js
# For reference, see https://github.com/mozilla/firefox-translations-models/blob/main/evals/translators/bergamot.sh
bergamot-mode: wasm
models:
- ./model.enro.intgemm.alphas.bin
vocabs:
- ./vocab.enro.spm
- ./vocab.enro.spm
shortlist:
- ./lex.50.50.enro.s2t.bin
- false
beam-size: 1
normalize: 1.0
word-penalty: 0
max-length-break: 128
mini-batch-words: 1024
workspace: 128
max-length-factor: 2.0
skip-cost: true
cpu-threads: 4
quiet: false
quiet-translation: false
gemm-precision: int8shiftAlphaAll
alignment: soft
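Both configs above differ only in the language pair, the file locations, the mode, and the thread count. As a minimal sketch, the file can be generated per pair. This assumes the model directory uses the same naming as the examples (`model.<pair>.intgemm.alphas.bin`, a single shared `vocab.<pair>.spm`, and `lex.50.50.<pair>.s2t.bin`), which may not hold for every model. `render_config` is a hypothetical helper, not part of bergamot.

```python
CONFIG_TEMPLATE = """\
bergamot-mode: {mode}
models:
  - {model_dir}/model.{pair}.intgemm.alphas.bin
vocabs:
  - {model_dir}/vocab.{pair}.spm
  - {model_dir}/vocab.{pair}.spm
shortlist:
  - {model_dir}/lex.50.50.{pair}.s2t.bin
  - false
beam-size: 1
normalize: 1.0
word-penalty: 0
max-length-break: 128
mini-batch-words: 1024
workspace: 128
max-length-factor: 2.0
skip-cost: true
cpu-threads: {threads}
quiet: false
quiet-translation: false
gemm-precision: int8shiftAlphaAll
alignment: soft
"""


def render_config(pair: str, model_dir: str, mode: str = "native", threads: int = 0) -> str:
    """Fill the template for one language pair (file naming is an assumption)."""
    return CONFIG_TEMPLATE.format(
        pair=pair,
        model_dir=model_dir.rstrip("/"),
        mode=mode,
        threads=threads,
    )


if __name__ == "__main__":
    text = render_config("esen", "firefox-translations-models/models/prod/esen")
    with open("config.yml", "w", encoding="utf-8") as f:
        f.write(text)
```

Calling `render_config("enro", ".", mode="wasm", threads=4)` would produce the same settings as the bergamot.config.yml example.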
TranslateLocally-compatible models:
https://translatelocally.com/models.json
Firefox models:
https://github.com/mozilla/firefox-translations-models/tree/main/models
Wednesday, August 27, 2025
Energy Glossary DE-RO
| Heizkraftwerk | centrală termică de încălzire |
| Abbaurate | rată de descompunere |
| Abfallbehandlungsanlage | instalaţie de tratare a reziduurilor |
| Abgabe | transmitere |
| Absorber | instalaţie de absorbţie |
| Additiv | aditiv |
| Altholzsortimente | sortimente de lemn utilizat |
| anfallend | apărut |
| Anlieferung | livrare |
| Annahmebereich | zonă de alimentare |
| Aufbereitungsanlage | instalaţie de tratare |
| Ausbeute | exploatare |
| Ausbringung | evacuare |
| Ausgangsstoff | materie primă |
| Basismaterial | material de bază |
| Bedienkomfort | confort de utilizare |
| BHKW | unitate de cogenerare |
| Bioabfall | resturi vegetale |
| Biogas | biogaz |
| Biomasse | biomasă |
| Blockheizkraftwerk | centrală termică combinată |
| Brikettierungsanlagen | instalaţii de brichetare |
| Bunker | buncăr |
| CO2-Ausstoß | evacuare CO2 |
| CO2-neutral | fără CO2 |
| combined heat and power / Kraft-Wärme-Kopplung | CHP / cogenerare |
| Dampfaustausch | schimb de aburi |
| Dampfkessel | cazan cu aburi |
| Deponie | depozit de reziduuri |
| Deponiefraktion | fracţiunea depozitului de reziduuri |
| Durchforstung | rărirea arboretului |
| Eigenkapitalbeschaffung | procurarea capitalului propriu |
| Einkommensquelle | sursă de venit |
| eisenhaltig | feros |
| Elektrizitätsnetz | reţea de electricitate |
| Emissionsgrenzwert | valoare limită a emisiilor |
| Endlager | depozit final |
| Energiepflanze | plantă energetică |
| Energieträger | suport energetic |
| Energieversorgungskonzept | concept de alimentare cu energie |
| entschwefelt | desulfurizat |
| erneuerbar | regenerabil |
| Ersatzbrennstoff | combustibil alternativ |
| Fassungsvermögen | capacitate |
| Faulraum | autoclavă [ch.] |
| Faulraum | autoclavă de devulcanizare |
| Faulraum | bazin de fermentare |
| Faulraum | cazan fierbător |
| Faulraum | devulcanizator |
| Faulraum | digestor [plast.] |
| Faulraum | fierbător [term.] |
| Ferment | ferment |
| Fermenter | fermenter |
| Festmist | gunoi solid de grajd |
| Feststoff | material solid |
| Feuerraum | cameră de ardere |
| Feuerraum | focar |
| Feuerwärmeleistung | putere calorică de ardere |
| Flotatfett | grăsime de flotaţie |
| fossile Energiequelle | sursă fosilă de energie |
| Frischdampf | abur proaspăt |
| Gärrestlager | depozit pentru resturile de fermentare |
| gasdicht | etanş |
| Gasertrag | debit de gaz, producţie de gaz |
| Gaslager | depozit de acumulare a gazelor |
| Getreidespelzen | pleavă de cereale |
| Großflügelrührwerk | malaxor cu pale de mari dimensiuni |
| Grünschnittpellets | pelete din tăierea plantelor |
| Gülle | must de bălegar |
| Halogenverbindung | compus cu halogen |
| Hausmüll | gunoi menajer |
| Heizöl | combustibil lichid de încălzire |
| Heizöl | păcură |
| Hochdruckdampfkessel | cazan cu aburi de înaltă presiune |
| Holzbriketts | brichete de lemn |
| Inputstoff | material de input |
| Kesselhaus | sala cazanelor |
| Kesselwasser | apă din cazan |
| Klärschlamm | nămol de la staţia de epurare |
| Kleinfeuerungsanlagen | instalaţii de ardere de mici dimensiuni |
| klimaschädlich | dăunător climei |
| Kohlekraftwerk | centrală termică pe cărbuni |
| Kohlendioxid | dioxid de carbon |
| Kondensat | condens |
| konventionelles Kraftwerk | centrală termică convenţională |
| Konvertierung | convertire |
| Kraft-Wärme-Koppelungseinheiten | unităţi de cogenerare |
| Kraft-Wärme-Kopplung | cuplare curent-căldură |
| Kraft-Wärme-Kopplung / KWK | cogenerare |
| Landschaftspflege | amenajarea peisajului |
| Luftkondensator | condensator de aer |
| Maissilage | însilozarea porumbului |
| Maschinenhaus | sala maşinilor |
| Methan | metan |
| mikrobiell | microbian |
| Mist | gunoi de grajd |
| Nachgärer | fermenter secundar |
| Nachgärung | fermentare ulterioară |
| Nachwachsende Rohstoffe | resurse primare regenerabile |
| naturbelassenen Biomassen | biomase naturale |
| NaWaRo | resurse primare regenerabile |
| Nebenprodukt | produs auxiliar |
| nicht eisenhaltig | neferos |
| Nutzvolumen | volum util |
| Olivenkern | sâmbure de măsline |
| organische Reststoffe | resturi organice |
| Pelletheizung | încălzire cu peleţi |
| Pelletierung | peletizare |
| Preisspirale | spirala preţurilor |
| Privathaushalt | gospodărie privată |
| Produktionsablauf | desfăşurarea producţiei |
| Produktionsstandort | locaţie de producţie |
| Prozessoptimierung | optimizarea producţiei |
| Prozessdampf | abur de proces |
| Prozesswasser | apă de proces |
| Pumpsystem | sistem de pompare |
| Rauchgas | gaz de ardere |
| Reaktionsturm | turn de reacţie |
| refuse-derived fuel | RDF |
| regenerativ | regenerativ |
| Restholz | resturi de material lemnos |
| Rindenbriketts | brichete din coajă de lemn |
| Rohfaser | fibră brută |
| Rohrschlange | serpentină din ţeavă |
| Rost | grilaj |
| Rostschlacke | zgură |
| sauerstofffrei | anaerob |
| Saugzug | tiraj artificial |
| Saugzug | tiraj forţat |
| Schadstoff | substanţă dăunătoară |
| Schadstoffe | noxe |
| Schallschutzcontainer | container cu protecţie fonică |
| Schlackebunker | buncăr pentru zgură |
| Schlauchfilter | filtru cu furtun |
| Schwefeloxid | oxid de sulf |
| Schwermetall | metal greu |
| Schwermetalle | metale grele |
| Sekundärluft | aer secundar |
| Silage | însilozare |
| Silo | siloz |
| Speiseabfälle | resturi alimentare |
| Speisereste | resturi alimentare |
| Speisewasser | apă de alimentare |
| Spurengas | gaz remanent |
| Stickgas | azot |
| Stickoxid | oxid de azot |
| Stickstoff | azot |
| Stößel | bătător |
| Stößel | berbec de sonetă |
| Stößel | tachet |
| Stößel | cap de mortezat |
| Stößel | pisălog |
| Strauchschnitt | tăierea arbuştilor |
| Strombedarf | necesar de curent |
| substituieren | substitui |
| Substrat | substrat |
| Symbiose | simbioză |
| Tauchmotor | motor submersibil |
| Treibhausgas | gaz de seră |
| Turbine | turbină |
| Turbogeneratorsatz | unitate turbogeneratoare |
| unter Luftabschluss | fără aer |
| Verbrennungsprozess | proces de ardere |
| Vergärung | fermentare |
| Versorger | companie de alimentare cu energie |
| Verweildauer | durată de menţinere |
| Verwertung | valorificare |
| Vorgrube | rezervor preliminar |
| Waldrestholz | resturi de lemn din pădure |
| Wärmetauscher | schimbător de căldură |
| Wasserbad | baie de apă |
| Wertschöpfungskette | lanţ de valoare adăugată |
| Wirtschaftsdünger | îngrăşăminte economice |
| Wirtschaftsdünger | fertilizant industrial |
| zersetzen | descompune |
| Zufeuerung zu fossilen Brennstoffen | co-firing / combustie combinată cu combustibili fosili |
| Zersetzung | descompunere |
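Several German terms in the glossary (for example Faulraum, Feuerraum, Stößel) have more than one Romanian rendering, so a plain one-to-one dictionary would lose entries. A minimal sketch of loading the pipe-delimited rows into a term-to-list mapping follows; `parse_glossary` is an illustrative name, and the two-column row format is assumed from the table above.

```python
from collections import defaultdict
from typing import Dict, Iterable, List


def parse_glossary(lines: Iterable[str]) -> Dict[str, List[str]]:
    """Collect every translation of each source term, in order of appearance."""
    glossary: Dict[str, List[str]] = defaultdict(list)
    for line in lines:
        # "| Feuerraum | focar |" -> ["Feuerraum", "focar"]
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        if len(cells) != 2 or not all(cells):
            continue  # skip blank or malformed rows
        term, translation = cells
        glossary[term].append(translation)
    return dict(glossary)


if __name__ == "__main__":
    rows = [
        "| Feuerraum | cameră de ardere |",
        "| Feuerraum | focar |",
        "| Biogas | biogaz |",
    ]
    print(parse_glossary(rows))
```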
Monday, August 18, 2025
ISO 639 Language Codes
| ISO Language Names | Set 1 | Set 2 (T, then B where it differs) | Set 3 | Endonym(s) | Other Name(s) | Notes |
| Abkhazian | ab | abk | abk | Аҧсуа; Apsua; აფსუა | Abkhaz | ||
| Afar | aa | aar | aar | Qafar af | |||
| Afrikaans | af | afr | afr | Afrikaans | |||
| Akan | ak | aka | aka + 2 | Ákán | Twi is tw/twi, Fanti is fat | ||
| Albanian | sq | sqi | alb | sqi + 4 | Shqip | called "Albanian Phylozone" in 639-6 | |
| Amharic | am | amh | amh | አማርኛ (Amarəñña) | |||
| Arabic | ar | ara | ara + 28 | اَلْعَرَبِيَّةُ (al-ʿarabiyyah) | Standard Arabic is arb | ||
| Aragonese | an | arg | arg | Aragonés | |||
| Armenian | hy | hye | arm | hye | Հայերեն (Hayeren) | ISO 639-3 code hye is for Eastern Armenian, hyw is for Western Armenian, and xcl is for Classical Armenian | |
| Assamese | as | asm | asm | অসমীয়া (Ôxômiya) | Asamiya | ||
| Avaric | av | ava | ava | Авар мацӏ; اوار ماض (Avar maz) | Avar | ||
| Avestan | ae | ave | ave | Upastawakaēna | |||
| Aymara | ay | aym | aym + 2 | Aymara | Aymaran | ||
| Azerbaijani | az | aze | aze + 2 | Azərbaycan dili; آذربایجان دیلی; Азәрбајҹан дили | Azeri | ||
| Bambara | bm | bam | bam | بَمَنَنكَن ;ߓߡߊߣߊ߲ߞߊ߲ (Bamanankan) | Bamana; Bamanankan | ||
| Bashkir | ba | bak | bak | Башҡорт теле; Başqort tele | Bashkort | ||
| Basque | eu | eus | baq | eus | Euskara/Euskera | ||
| Belarusian | be | bel | bel | Беларуская мова (Biełaruskaja mova) | |||
| Bengali | bn | ben | ben | বাংলা (Bāŋlā) | Bangla | ||
| Bislama | bi | bis | bis | Bislama | Language formed from English and Vanuatuan languages, with some French influence. | ||
| Bosnian | bs | bos | bos | Босански (Bosanski) | Bosniak | Member language of Serbo-Croatian with code sh deprecated in 2000 | |
| Breton | br | bre | bre | Brezhoneg | |||
| Bulgarian | bg | bul | bul | Български (Bulgarski) | |||
| Burmese | my | mya | bur | mya | မြန်မာစာ (Mrãmācā) | Myanmar | |
| Catalan, Valencian | ca | cat | cat | Català; Valencià | |||
| Central Khmer | km | khm | khm | ខេមរភាសា; (Khémôrôphéasa) | Khmer; Cambodian | ||
| Chamorro | ch | cha | cha | Finu' Chamoru | |||
| Chechen | ce | che | che | Нохчийн мотт; (Noxçiyn mott) | Chechnyan; Chechnian | ||
| Chichewa, Chewa, Nyanja | ny | nya | nya | Chichewa; Chinyanja | |||
| Chinese | zh | zho | chi | zho + 19 | 中文 (Zhōngwén); 汉语; 漢語 (Hànyǔ) | ||
| Church Slavonic, Old Slavonic, Old Church Slavonic | cu | chu | chu | Славе́нскїй ѧ҆зы́къ | In use by the Eastern Orthodox Church | ||
| Chuvash | cv | chv | chv | Чӑвашла (Çăvaşla) | |||
| Cornish | kw | cor | cor | Kernowek | |||
| Corsican | co | cos | cos | Corsu | |||
| Cree | cr | cre | cre + 6 | ᓀᐦᐃᔭᐁᐧᐃᐧᐣ (Nehiyawewin) | |||
| Croatian | hr | hrv | hrv | Hrvatski | Crovatian | Member language of Serbo-Croatian with code sh deprecated in 2000 | |
| Czech | cs | ces | cze | ces | Čeština | Czechian | |
| Danish | da | dan | dan | Dansk | |||
| Divehi, Dhivehi, Maldivian | dv | div | div | ދިވެހި (Dhivehi) | |||
| Dutch, Flemish | nl | nld | dut | nld | Nederlands | Flemish is not to be confused with the closely related West Flemish which is referred to as "Vlaams" and has the code vls in ISO 639-3 | |
| Dzongkha | dz | dzo | dzo | རྫོང་ཁ་ (Dzongkha) | Bhutanese | ||
| English | en | eng | eng | English | |||
| Esperanto | eo | epo | epo | Esperanto | |||
| Estonian | et | est | est + 2 | Eesti keel | |||
| Ewe | ee | ewe | ewe | Èʋegbe | |||
| Faroese | fo | fao | fao | Føroyskt | Faeroese | ||
| Fijian | fj | fij | fij | Na Vosa Vakaviti | |||
| Finnish | fi | fin | fin | Suomi | |||
| French | fr | fra | fre | fra | Français | ||
| Fulah | ff | ful | ful + 9 | 𞤊𞤵𞤤𞤬𞤵𞤤𞤣𞤫 ;ࢻُلْࢻُلْدٜ; Fulfulde 𞤆𞤵𞤤𞤢𞥄𞤪 ;ݒُلَارْ; Pulaar | Fula; Fulani | ||
| Gaelic, Scottish Gaelic | gd | gla | gla | Gàidhlig | Scots Gaelic | ||
| Galician | gl | glg | glg | Galego | Galego | ||
| Ganda | lg | lug | lug | Luganda | Luganda | ||
| Georgian | ka | kat | geo | kat | ქართული (Kharthuli) | ||
| German | de | deu | ger | deu | Deutsch | ||
| Greek, Modern (1453–) | el | ell | gre | ell | Νέα Ελληνικά; (Néa Ellêniká) | for Ancient Greek, use the ISO 639-3 code grc | |
| Guarani | gn | grn | grn + 5 | Avañe'ẽ | |||
| Gujarati | gu | guj | guj | ગુજરાતી (Gujarātī) | |||
| Haitian, Haitian Creole | ht | hat | hat | Kreyòl ayisyen | |||
| Hausa | ha | hau | hau | هَرْشٜن هَوْس (halshen Hausa) | Hausan | ||
| Hebrew | he | heb | heb | עברית (Ivrit) | Modern Hebrew. Code changed in 1989 from original ISO 639:1988, iw.[3] | ||
| Herero | hz | her | her | Otjiherero | Otjiherero | ||
| Hindi | hi | hin | hin | हिन्दी (Hindī) | |||
| Hiri Motu | ho | hmo | hmo | Hiri Motu | Police Motu; Pidgin Motu | ||
| Hungarian | hu | hun | hun | Magyar nyelv | Magyar | ||
| Icelandic | is | isl | ice | isl | Íslenska | ||
| Ido | io | ido | ido | Ido | |||
| Igbo | ig | ibo | ibo | ásụ̀sụ́ Ìgbò | |||
| Indonesian | id | ind | ind | bahasa Indonesia | covered by macrolanguage ms/msa. Changed in 1989 from original ISO 639:1988, in.[3] | ||
| Interlingua (International Auxiliary Language Association) | ia | ina | ina | Interlingua | |||
| Interlingue, Occidental | ie | ile | ile | Interlingue; Occidental | |||
| Inuktitut | iu | iku | iku + 2 | ᐃᓄᒃᑎᑐᑦ (Inuktitut) | |||
| Inupiaq | ik | ipk | ipk + 2 | Iñupiaq | Inupiat; Inupiatun | ||
| Irish | ga | gle | gle | Gaeilge | Irish Gaelic | ||
| Italian | it | ita | ita | Italiano | |||
| Japanese | ja | jpn | jpn | 日本語 (Nihongo) | |||
| Javanese | jv | jav | jav | ꦧꦱꦗꦮ; basa Jawa | |||
| Kalaallisut, Greenlandic | kl | kal | kal | Kalaallisut | |||
| Kannada | kn | kan | kan | ಕನ್ನಡ (Kannađa) | Kannadan; Canarese | ||
| Kanuri | kr | kau | kau + 3 | كَنُرِيِه; Kànùrí | |||
| Kashmiri | ks | kas | kas | कॉशुर; كأشُر (Kosher) | Koshur | ||
| Kazakh | kk | kaz | kaz | Қазақша; Qazaqşa; قازاقشا | Qazaq | ||
| Kikuyu, Gikuyu | ki | kik | kik | Gĩgĩkũyũ | |||
| Kinyarwanda | rw | kin | kin | Ikinyarwanda | Rwandan; Rwanda; Ikinyarwanda | ||
| Komi | kv | kom | kom + 2 | Коми кыв | Zyran; Zyrian; Komi-Zyryan | ||
| Kongo | kg | kon | kon + 3 | Kikongo | Kikongo | ||
| Korean | ko | kor | kor | 한국어 (Hangugeo); 조선말 (Chosŏnmal) | |||
| Kuanyama, Kwanyama | kj | kua | kua | Oshikwanyama | Cuanhama; Oshikwanyama | ||
| Kurdish | ku | kur | kur + 3 | کوردی; Kurdî | |||
| Kyrgyz, Kirghiz | ky | kir | kir | Кыргыз; قىرعىز | |||
| Lao | lo | lao | lao | ພາສາລາວ (phasa Lao) | Laotian | ||
| Latin | la | lat | lat | Latinum | In use by several Christian organization of churches, and for sciences | ||
| Latvian | lv | lav | lav + 2 | Latviski | Lettish | ||
| Limburgan, Limburger, Limburgish | li | lim | lim | Lèmburgs | |||
| Lingala | ln | lin | lin | Lingála | Ngala | ||
| Lithuanian | lt | lit | lit | Lietuvių | |||
| Luba-Katanga | lu | lub | lub | Kiluba | Luba-Shaba | ||
| Luxembourgish, Letzeburgesch | lb | ltz | ltz | Lëtzebuergesch | Luxembourgian | ||
| Macedonian | mk | mkd | mac | mkd | Македонски (Makedonski) | ||
| Malagasy | mg | mlg | mlg + 11 | مَلَغَسِ; Malagasy | |||
| Malay | ms | msa | may | msa + 36 | بهاس ملايو (bahasa Melayu) | Standard Malay is zsm, Indonesian is id/ind | |
| Malayalam | ml | mal | mal | മലയാളം (Malayāļã) | |||
| Maltese | mt | mlt | mlt | Malti | |||
| Manx | gv | glv | glv | Gaelg; Gailck | Manx Gaelic | ||
| Maori | mi | mri | mao | mri | reo Māori | ||
| Marathi | mr | mar | mar | मराठी (Marāṭhī) | Maharashtran | ||
| Marshallese | mh | mah | mah | kajin M̧ajeļ | Ebon | ||
| Mongolian | mn | mon | mon + 2 | ᠮᠣᠩᠭᠣᠯ ᠬᠡᠯᠡ; Монгол хэл (Mongol xel) | Mongol | ||
| Nauru | na | nau | nau | dorerin Naoe | Nauruan | ||
| Navajo, Navaho | nv | nav | nav | Diné bizaad; Naabeehó bizaad | |||
| Ndonga | ng | ndo | ndo | Ndonga | Oshindonga | ||
| Nepali | ne | nep | nep + 2 | नेपाली भाषा (Nepālī bhāśā) | Nepalese; Gorkhali | ||
| North Ndebele | nd | nde | nde | isiNdebele; saseNyakatho; Mthwakazi Ndebele | Northern Ndebele | ||
| Northern Sami | se | sme | sme | Davvisámegiella | North Sami | ||
| Norwegian | no | nor | nor + 2 | Norsk | Bokmål is nb/nob, Nynorsk is nn/nno | ||
| Norwegian Bokmål | nb | nob | nob | Norsk Bokmål | covered by macrolanguage no/nor | ||
| Norwegian Nynorsk | nn | nno | nno | Norsk Nynorsk | covered by macrolanguage no/nor | ||
| Occitan | oc | oci | oci | Occitan; Provençal | Provential; Provencal | ||
| Ojibwa | oj | oji | oji + 7 | ᐊᓂᔑᓈᐯᒧᐎᓐ (Anishinaabemowin) | Ojibwe; Ojibway; Otchipwe; Ojibwemowin | ||
| Oriya | or | ori | ori + 2 | ଓଡ଼ିଆ (Odia) | Odian; Odishan; Orissan | ||
| Oromo | om | orm | orm + 4 | afaan Oromoo | Oromoo | ||
| Ossetian, Ossetic | os | oss | oss | ирон Ӕвзаг (iron Ævzag) | Ossete | ||
| Pali | pi | pli | pli | Pāli | Pali-Magadhi | ||
| Pashto, Pushto | ps | pus | pus + 3 | پښتو (Pax̌tow) | |||
| Persian | fa | fas | per | fas + 2 | فارسی (Fārsiy) | Farsi | |
| Polish | pl | pol | pol | Polski | |||
| Portuguese | pt | por | por | Português | |||
| Punjabi, Panjabi | pa | pan | pan | ਪੰਜਾਬੀ; پنجابی (Pãjābī) | |||
| Quechua | qu | que | que + 43 | Runa simi; kichwa simi; Nuna shimi | Quechuan | ||
| Romanian, Moldavian, Moldovan | ro | ron | rum | ron | Română; Ромынэ | the identifiers mo and mol for Moldavian are deprecated. They will not be assigned to different items, and recordings using these identifiers will not be invalid. | |
| Romansh | rm | roh | roh | Rumantsch; Rumàntsch; Romauntsch; Romontsch | Romansch | ||
| Rundi | rn | run | run | Ikirundi | Kirundi | ||
| Russian | ru | rus | rus | Русский язык (Russkiĭ âzyk) | |||
| Samoan | sm | smo | smo | gagana Sāmoa | |||
| Sango | sg | sag | sag | yângâ tî Sängö | Sangoic | ||
| Sanskrit | sa | san | san + 2 | संस्कृतम् (Saṃskṛtam) | In use by some Indian states on judicial purposes | ||
| Sardinian | sc | srd | srd + 4 | Sardu | Sard | ||
| Serbian | sr | srp | srp | Српски (Srpski) | Member language of Serbo-Croatian with code sh deprecated in 2000, the ISO 639-2/T code srp deprecated the ISO 639-2/B code scc[4] | ||
| Shona | sn | sna | sna | chiShona | |||
| Sichuan Yi, Nuosu | ii | iii | iii | ꆈꌠꉙ (Nuosuhxop) | Northern Yi; Liangshan Yi; Nosu | standard form of the Yi languages | |
| Sindhi | sd | snd | snd | سنڌي; सिन्धी (Sindhī) | |||
| Sinhala, Sinhalese | si | sin | sin | සිංහල (Siṁhala) | |||
| Slovak | sk | slk | slo | slk | Slovenčina | Slovakian | |
| Slovenian | sl | slv | slv | Slovenščina | Slovene | ||
| Somali | so | som | som | Soomaali; 𐒈𐒝𐒑𐒛𐒐𐒘; سٝومالِ | Somalian | ||
| South Ndebele | nr | nbl | nbl | isiNdebele; sakwaNdzundza | Southern Ndebele | ||
| Southern Sotho | st | sot | sot | Sesotho | Sesotho; Sotho | ||
| Spanish, Castilian | es | spa | spa | Español; Castellano | |||
| Sundanese | su | sun | sun | basa Sunda; ᮘᮞ ᮞᮥᮔ᮪ᮓ; بَاسَا سُوْندَا | |||
| Swahili | sw | swa | swa + 2 | Kiswahili; كِسوَحِيلِ | Kiswahili | ||
| Swati | ss | ssw | ssw | siSwati | Swazi | ||
| Swedish | sv | swe | swe | Svenska | |||
| Tagalog | tl | tgl | tgl | Wikang Tagalog | note: Filipino (Pilipino) has the code fil | ||
| Tahitian | ty | tah | tah | reo Tahiti | One of the Reo Mā`ohi (languages of French Polynesia)[5] | ||
| Tajik | tg | tgk | tgk | Тоҷикӣ (Tojikī) | Tajiki | ||
| Tamil | ta | tam | tam | தமிழ் (Tamiḻ) | Thamizh | ||
| Tatar | tt | tat | tat | Татар теле; Tatar tele; تاتار تئلئ | |||
| Telugu | te | tel | tel | తెలుగు (Telugu) | |||
| Thai | th | tha | tha | ภาษาไทย (Phasa Thai) | Central Thai; Siamese | ||
| Tibetan | bo | bod | tib | bod | བོད་སྐད་ (Bodskad); ལྷ་སའི་སྐད་ (Lhas'iskad) | Standard Tibetan; Lhasa Tibetan | |
| Tigrinya | ti | tir | tir | ትግርኛ (Təgrəñña) | Tigrigna | ||
| Tonga (Tonga Islands) | to | ton | ton | lea faka-Tonga | Tongan | ||
| Tsonga | ts | tso | tso | Xitsonga | Xitsonga | ||
| Tswana | tn | tsn | tsn | Setswana | Setswana; Sechuana | ||
| Turkish | tr | tur | tur | Türkçe | |||
| Turkmen | tk | tuk | tuk | Türkmençe; Түркменче; تۆرکمنچه | |||
| Twi | tw | twi | twi | Twi | covered by macrolanguage ak/aka | ||
| Uighur, Uyghur | ug | uig | uig | ئۇيغۇر تىلى; Уйғур тили; Uyƣur tili | |||
| Ukrainian | uk | ukr | ukr | Українська (Ukraїnska) | |||
| Urdu | ur | urd | urd | اُردُو (Urduw) | |||
| Uzbek | uz | uzb | uzb + 2 | Ózbekça; ўзбекча; ئوزبېچه | |||
| Venda | ve | ven | ven | Tshivenḓa | Tshivenda | ||
| Vietnamese | vi | vie | vie | tiếng Việt | |||
| Volapük | vo | vol | vol | Volapük | |||
| Walloon | wa | wln | wln | Walon | |||
| Welsh | cy | cym | wel | cym | Cymraeg | ||
| Western Frisian | fy | fry | fry | Frysk | West Frisian; Frisian; Fries | ||
| Wolof | wo | wol | wol | وࣷلࣷفْ | |||
| Xhosa | xh | xho | xho | isiXhosa | Xosa | ||
| Yiddish | yi | yid | yid + 2 | ייִדיש (Yidiš) | Judeo-German | Changed in 1989 from original ISO 639:1988, ji.[3] | |
| Yoruba | yo | yor | yor | èdè Yorùbá | |||
| Zhuang, Chuang | za | zha | zha + 16 | 話僮 (Vahcuengh) | |||
| Zulu | zu | zul | zul | isiZulu | |||
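In the table above, twenty languages list two Set 2 codes: a terminological (T) code followed by a bibliographic (B) code. A minimal sketch of converting between the two follows, with the pairs copied from those rows. The function names are illustrative, and codes not in the mapping are returned unchanged on the assumption that T and B are identical for them.

```python
# ISO 639-2 codes where the T and B forms differ, as listed in the table.
T_TO_B = {
    "sqi": "alb", "hye": "arm", "eus": "baq", "mya": "bur", "zho": "chi",
    "ces": "cze", "nld": "dut", "fra": "fre", "kat": "geo", "deu": "ger",
    "ell": "gre", "isl": "ice", "mkd": "mac", "msa": "may", "mri": "mao",
    "fas": "per", "ron": "rum", "slk": "slo", "bod": "tib", "cym": "wel",
}
B_TO_T = {b: t for t, b in T_TO_B.items()}


def to_bibliographic(code: str) -> str:
    """Return the B form of a Set 2 code (unchanged if T and B coincide)."""
    code = code.lower()
    return T_TO_B.get(code, code)


def to_terminological(code: str) -> str:
    """Return the T form of a Set 2 code (unchanged if T and B coincide)."""
    code = code.lower()
    return B_TO_T.get(code, code)


if __name__ == "__main__":
    print(to_bibliographic("deu"), to_terminological("rum"))
```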

