Joined: June 2006
||Posted: Mar. 11 2008,14:04
Damn Small Characters for Interlingual Interchange (DaSCII)
-- a proposal --
The majority languages spoken by over 50% of people worldwide are(in order): Chinese (962 million), English (322m), Spanish (266m), Russian (170m), Portuguese (170m), Japanese (125m), German (98m), Bengali (189m) & Hindi 182m). Neither 7-bit nor 8-bit systems provide enough characters to directly cover these languages.
By examining the transliteration systems for each of the languages, the number of unique characters can be drastically reduced. The dravidian languages (Bengali, Hindi, et cetera) present the greatest difficulty due to a great number of diacritical and other marks. Therefore:
1. Start with the IBM 8-bit character set.
2. Insert characters 177 - 250 from the Indian Script Code for Information Interchange.
3. Add the following characters:
158 --- (the Euro symbol)
166 - z (with a tail underneath, used in Arabic transliteration)
167 - t (with a tail underneath, used in Arabic transliteration)
169 - e rising tone (the other rising-tone characters are covered)
170 - a falling-rising tone
235 - e falling-rising tone
236 - i falling-rising tone
237 - o falling-rising tone
238 - u falling-falling tone
251 - s (with a tail underneath, used in Arabic transliteration)
252 - d (with a tail underneath, used in Arabic transliteration)
253 - o (with a shallow u-shaped mark above, used in Korean transliteration)
254 - u (with a shallow u-shaped mark above, used in Korean transliteration)
1. Where a character is found in DaSCII, use it.
2. Where a character is not found in DaSCII, use transliterations, for example:
German - Use dipthongs suggested in the BGN/PCGN 2000 Agreement
Russian - Use dipthongs suggested in the BGN/PCGN 1947 Agreement
Romanji - Use the umlauted character for the high tone.
== This will allow transliterations of over 50% of human languages with _one_ font ==