RFC 1468 (rfc1468) - Page 2 of 6
Japanese Character Encoding for Internet Messages
Alternative Format: Original Text Document
RFC 1468 Japanese Character Encoding for Internet Messages June 1993 on March 1st, 1987. Likewise, JIS C 6220 was renamed JIS X 0201. The "Roman" character set of JIS X 0201 [JISX0201] is identical to ASCII except for backslash () and tilde (~). The backslash is replaced by the Yen sign, and the tilde is replaced by overline. This set is Japan's national variant of ISO 646 [ISO646]. The JIS X 0208 [JISX0208] character sets consist of Kanji, Hiragana, Katakana and some other symbols and characters. Each character takes up two bytes. For further details about the JIS Japanese national character set standards, refer to [JISX0201] and [JISX0208]. For further information about the escape sequences, see [ISO2022] and [ISOREG]. If there are JIS X 0208 characters on a line, there must be a switch to ASCII or to the "Roman" set of JIS X 0201 before the end of the line (i.e., before the CRLF). This means that the next line starts in the character set that was switched to before the end of the previous line. Also, the text must end in ASCII. Other restrictions are given in the Formal Syntax below. Formal Syntax The notational conventions used here are identical to those used in RFC 822 [RFC 822]. The * (asterisk) convention is as follows: l*m something meaning at least l and at most m somethings, with l and m taking default values of 0 and infinity, respectively. message = headers 1*( CRLF *single-byte-char *segment single-byte-seq *single-byte-char ) ; see also [MIME1] "body-part" ; note: must end in ASCII headers = RFC 822] "fields" and [MIME1] "body-part"> segment = single-byte-segment / double-byte-segment single-byte-segment = single-byte-seq 1*single-byte-char Murai, Crispin & van der Poel