Conventions for Encoding the Vietnamese Language VISCII: VIetnamese Standard Code for Information Interchange VIQR: VIetnamese Quoted-Readable Specification




RFC 1456          Conventions for Encoding Vietnamese           May 1993


    4.  Character placement has been designed with
        consideration for Unix/X integration, ISO-8859/Latin-1
        compatibility, coexistence with a wide array of
        existing software, including provisions for single-
        and double-line drawing characters in the IBM graphic
        character set.

   The 8-bit VISCII encoding is shown below.  Because of the limitations
   of the 7-bit US ASCII character set, here we use the mnemonic form to
   represent Vietnamese glyphs.  See the VIQR specification below for
   clarification of how diacritical marks are applied.  The online
   PostScript version of reference [2] may also be useful as it does
   display each character correctly.

               Table 1.  VISCII 8-bit Encoding Table (v1.1)
*=======================================================================*
|    | 0x  1x  2x  3x  4x  5x  6x  7x | 8x  9x  Ax  Bx  Cx  Dx  Ex  Fx  |
|====|==================================================================|
| x0 | nul dle sp  0   @   P   `   p  | A.  O^` O~  o^` A`  DD  a`  dd  |
| x1 | soh dc1 !   1   A   Q   a   q  | A(' O^? a(' o^? A'  u+' a'  u+. |
| x2 | A(? dc2 "   2   B   R   b   r  | A(` O^~ a(` o^~ A^  O`  a^  o`  |
| x3 | etx dc3 #   3   C   S   c   s  | A(. O^. a(. O+~ A~  O'  a~  o'  |
| x4 | eot Y?  $   4   D   T   d   t  | A^' O+. a^' O+  A?  O^  a?  o^  |
| x5 | A(~ nak %   5   E   U   e   u  | A^` O+' a^` o^. A(  a.  a(  o~  |
| x6 | A^~ syn &   6   F   V   f   v  | A^? O+` a^? o+` a(? y?  u+~ o?  |
| x7 | bel etb '   7   G   W   g   w  | A^. O+? a^. o+? a(~ u+` a^~ o.  |
| x8 | bs  can (   8   H   X   h   x  | E~  I.  e~  i.  E`  u+? e`  u.  |
| x9 | ht  Y~  )   9   I   Y   i   y  | E.  O?  e.  U+. E'  U`  e'  u`  |
| xA | lf  sub *   :   J   Z   j   z  | E^' O.  e^' U+' E^  U'  e^  u'  |
| xB | vt  esc +   ;   K   [   k   {  | E^` I?  e^` U+` E?  y~  e?  u~  |
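| xC | ff  fs  ,   <   L   \   l   |  | E^? U?  e^? U+? I`  y.  i`  u?  |
| xD | cr  gs  -   =   M   ]   m   }  | E^~ U~  e^~ o+  I'  Y'  i'  y'  |
| xE | so  Y.  .   >   N   ^   n   ~  | E^. U.  e^. o+' I~  o+~ i~  o+. |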
| xF | si  us  /   ?   O   _   o   DEL| O^' Y`  o^' U+  y`  u+  i?  U+~ |
*=======================================================================*
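
   As an illustration only, the mnemonics used in Table 1 can be turned
   into Unicode letters by appending combining marks to the base letter
   and normalizing.  The sketch below is not part of this
   specification.  The mark-to-Unicode mapping, the function names, and
   the small SAMPLE_CELLS dictionary are assumptions made for this
   example; SAMPLE_CELLS copies only a handful of cells from Table 1
   and is not a complete code table.

```python
import unicodedata

# Assumed mapping from mnemonic mark characters to Unicode combining marks.
MARKS = {
    "(": "\u0306",  # breve
    "^": "\u0302",  # circumflex
    "+": "\u031B",  # horn
    "'": "\u0301",  # acute accent
    "`": "\u0300",  # grave accent
    "?": "\u0309",  # hook above
    "~": "\u0303",  # tilde
    ".": "\u0323",  # dot below
}

def mnemonic_to_unicode(mnemonic):
    """Convert one table mnemonic, e.g. "A('" or "u+.", to a Unicode letter."""
    # DD and dd stand for the barred D and are not built from a mark.
    if mnemonic == "DD":
        return "\u0110"
    if mnemonic == "dd":
        return "\u0111"
    base, marks = mnemonic[0], mnemonic[1:]
    combined = base + "".join(MARKS[m] for m in marks)
    # NFC folds the base letter and its marks into one precomposed letter.
    return unicodedata.normalize("NFC", combined)

# A few cells copied from Table 1 (byte value -> mnemonic). Hypothetical
# helper data for this sketch, NOT the full table.
SAMPLE_CELLS = {
    0x80: "A.",
    0xA1: "a('",
    0xB4: "O+",
    0xD0: "DD",
    0xDF: "u+",
    0xF1: "u+.",
}

def decode_cells(data, cells=SAMPLE_CELLS):
    """Decode bytes using only printable ASCII and the cells listed above."""
    out = []
    for byte in data:
        if byte in cells:
            out.append(mnemonic_to_unicode(cells[byte]))
        elif 0x20 <= byte < 0x7F:
            # Printable ASCII positions are unchanged in Table 1.
            out.append(chr(byte))
        else:
            raise ValueError("byte 0x%02X is not covered by this sketch" % byte)
    return "".join(out)
```

   A complete decoder would need all 256 cells of Table 1; the sketch
   raises an error for any byte outside its sample so that missing
   entries are not silently mis-decoded.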

4. SPECIFICATION OF VIQR MNEMONICS

   VIQR, VIetnamese Quoted-Readable specification, is not an encoding
   convention but is rather a convention for typing, reading, and
   transferring Vietnamese data using only the 7-bit ASCII character
   set.  With VIQR, accented Vietnamese letters are represented by the
   vowel followed by ASCII characters whose appearances resemble those
   of the corresponding Vietnamese diacritical marks.  For example, the
   phrase "Nước Việt Nam" is represented in 7 bits by
   "Nu+o+'c Vie^.t Nam".  The complete list of diacritical mark
   equivalents is given in Table 2.  There is also provision in the VIQR
   specification to prevent undesirable composition, for example, to



Vietnamese Standardization Working Group