Here’s a non-exhaustive list of links to existing free/open-source machine translation systems, which I will try to complete as I find about them. To the best of my knowledge, software listed here has:
Rule-based systems
- either a free license as defined by the Free Software Foundation,
- or an open-source license as defined by the Open Source Initiative.
Statistical machine translation systems
- Apertium, a free/open-source rule-based machine translation platform.
- Matxin, a free/open-source rule-based machine translation system for Basque.
- OpenLogos, a free/open-source version of the historical Logos machine translation system.
- Anusaaraka, English-Hindi machine translation system.
Decoders
- Moses, a statistical machine translation system.
- Marie, an n-gram-based statistical machine translation decoder.
- Joshua, an open source decoder for statistical translation models based on synchronous context free grammars
- Phramer, an open-source statistical phrase-based machine translation decoder
- GREAT, a decoder based on stochastic finite-state transducers, which includes a training toolkit.
Training translation models
- Giza++ is a tool to train translation models for statistical machine translation (see also the related mkcls tool to train word classes)
- Thot is a toolkit to train phrase-based models for statistical machine translation. Read more. Source: DCU School of Computing
Saturday, November 19, 2011
Free/open-source machine translation software
Labels:
Translation