Posted by Dom K on Wednesday, January 25, 2017 - 0:07
What Is Transliteration?
Transliteration is the process of converting words or letters in one script to another script. With transliteration, one maps word to word or letter to letter. In many cases, reverse transliteration is also possible. However, transliteration may not preserve the subtle differences in sounds that are specific to a language.
Supported Character Sets
|Character set||ISO-15924||Primary dataset(s)|
Using Transliteration on the Everything Location API
Transliteration options are exposed on our /address/verify, /address/complete and /address/capture endpoints. By default, transliteration is set to match the script detected in the the input record, but the server option OutputScript can force the output in either Native script (Native or respective ISO), Latin script (Latn).
How Transliteration Works
Everything Location has a compiled set of words for commonly used address field values in each character set. The compiled data for field based transform does transliteration in some cases and translation in some cases.
We map between characters in supported languages and Latin characters. The character mapping is used to do character based transliteration.
Native Language to Latin (English/Roman)
If user enters an address in native language and needs the output in English, the individual address fields are queried in the address field set for the transliterated value. If not found, individual characters are transliterated. If either of these do not work, we return the original native word.
Transliteration Is Not Translation
Translation is converting text in one script into text in another script with equivalent sound. Transliteration is looking for equivalent alphabets and does not strive for equivalent meaning of the word or sentence.