Friday, January 19, 2018

HTML Codes for Romanian Language Characters

Even if your site is written in English only and does not include multilingual translations, you may need to add Romanian language characters to that site on certain pages or for certain words. The list below includes the HTML codes necessary to use Romanian characters that are not in the standard character set and are not found on a keyboard's keys.
Not all browsers support all these codes (mainly, older browsers may cause problems - newer browsers should be fine), so be sure to test your HTML codes before you use them.
Some Romanian characters may be part of the Unicode character set, so you need to declare that in the head of your documents:
<meta http-equiv="content-type" content="text/html;charset=utf-8" />
Here are the different characters you may need to use.
DisplayFriendly CodeNumerical CodeHex CodeDescription
Ă &#258;&#x102;Capital A-breve
ă &#259;&#x103;Lowercase a-breve
 &Acirc;&#194;&#xC2;Capital A-circumflex
â &acirc;&#226;&#xE2;Lowercase a-circumflex
Π&Icirc;&#206;&#xCE;Capital I-circumflex
î &icirc;&#238;&#xEE;Lowercase i-circumflex
Ș &#218;&#xDA;Capital S-comma
ș &#219;&#xDB;Lowercase s-comma
Ş &#350;&#x15E;Capital S-cedilla
ş &#351;&#x15F;Lowercase s-cedilla
Ț &#538;&#x21A;Capital T-comma
ț &#539;&#x21B;Lowercase t-comma
Ţ &#354;&#x162;Capital T-cedilla
ţ &#355;&#x163;Lowercase t-cedilla
Using these characters is simple. In the HTML markup, you would place these special character codes where you want the Romanian character to appear.
These are used similarly to other HTML special character codes that allow you to add characters that are also not found on the traditional keyboard, and therefore cannot be simply typed into the HTML in order to display on a web page.
Remember, these characters codes may be used on an English language website if you need to display a word with one of these characters.
 These characters would also be used in HTML that was actually displaying full Romanian translations, whether you actually coded those pages by hand and had a full Romanian version of the site, or if you used a more automated approach to multilingual webpages and went with a solution like Google Translate.
Source: https://www.thoughtco.com
Romanian special characters:[ăĂ,âÂ,îÎ,şŞ,ţŢ] or (if your browser can't deal with ISO-8859-2 charset).

Unicode (ro)
ISO-10646
If you want to use the Unicode (UTF-8) charset, be aware that for older browsers you might have to set View->Encoding to UTF-8 or Central european.

Latin-1 SupplementLatin Extended-ALatin Extended-B
&#194226258259206238351350355354537536539538
CharÂâĂăÎţŢșȘțȚ
1. First you need to tell the browser to use unicode. You do that by adding the meta line of code after the opening html tag:
<html>
<meta http-equiv="content-type" content="text/html; charset=UTF-8" />

...

2. In your HTML code you would use "&#NNN", where NNN is the number in the middle row in the above table.
For reference the special chars were retrieved from these UNICODE charts:
3. Text example using UTF-8 Romanian characters:
  • Tudor Arghezi -- O zi
4. Perl script to convert Romanian texts from ISO-8859-2 to UTF-8: roconvert.pl
ISO-8859-20. If you can't see the ISO-8859-2 charset correctly, you could upgrade your browser to either the latest Internet Explorer, Netscape, Opera, or Mozilla. Lynx 2.8x (although not supporting Javascript) seems to do at least a decent job at approximating those non-ASCII characters...
1. Add this "meta" tag right after your oppening "html" tag in order to tell the viewer/browser to use the proper charset:

<meta http-equiv="content-type" content="text/html; charset=iso-8859-2">

2. ş does not have a cedilla underneath but a comma (ISO-8859-2 has the Turkish version of "sh"). See Unicode for the correct glyphs.
3. Text examples using ISO-8859-2 Romanian characters:

Romanian Keyboard Control using your browser

Romanian Keyboard Control in Mac OS/X

Open System Preferences and select International icom under the Personal category. Select the Input Menu and scroll down to Romanian. Select it and the "Show input menu in menu bar" checkbox below in order to be able to switch between different keyboard layouts. Close System Preferences. Now, you're able to switch to/from the Romanian keyboard layout using the top menu bar. Typically the special characters map as illustrated below:
US[{]}\|;:'"
ROăĂîÎâÂşŞţŢ

Romanian Keyboard Control in Windows

Control Panel->Keyboard ->Language (95/98) ->Add...->Romanian->OK

->Input Locales (NT/XP/2000)
If you also check the Enable indicator on taskbar box on the same Input Locales property page, you'll be able to easily switch between RO and EN by left-click on symbol on the taskbar and choosing the desired keyboard mapping.
This setting allows the following mapping of Romanian characters on a standard QWERTY 101-AT US keyboard:
US[{]}\|;:'"
ROăĂîÎâÂşŞţŢ
Pronounciation guide - as close as I can describe it in plain English
  • The pronunciation of ă is similar to the final vowel in the word "mother".
  • â and î are pronounced the same. It's similar to the Russian . Also, î is used only if the first letter in a word, and â only if it's not the first letter in a word. Here is a lame sound clip of me saying "Câmpina cânt" (it sounds like KHM-PE-NAH KHNT).
  • The last 2 special characters(ş and ţ) displayed are pronounced as "sh" and "ts" in English. 
Source: http://www.marinel.net

The below list includes HTML character codes in Romanian which are usually not found in the standard character set. Should you wish to produce any of the unique characters below from the ASCII Romanian Library, use the respective codes when writing your HTML code.
In order to use these characters, simply place the unique character code where you'd like the character to appear. Codes as such play a key part in translation projects.
Ă
&#258;
&#x102;

Capital A-breve
ă
&#259;
&#x103;

Lowercase a-breve
Â
&#194;
&#xc2;
&Acirc;
Capital A-circumflex
â
&#226;
&#xe2;
&acirc;
Lowercase a-circumflex
Î
&#206;
&#xce;
&Icirc;
Capital I-circumflex
î
&#238;
&#xee;
&icirc;
Lowercase i-circumflex
Ș
&#x218;


Capital S-comma
ș
&#x219;


Lowercase s-comma
Ş
&#350;
&#x15e;

Capital S-cedilla
ş
&#351;
&#x15f;

Lowercase s-cedilla
Ț
&#538;
&#x21a;

Capital T-comma
ț
&#539;
&#x21b;

Lowercase t-comma
Ţ
&#354;
&#x162;

Capital T-cedilla
ţ
&#355;
&#x163;

Lowercase t-cedilla