Character map for Fowler corpora (Based on AATSEEL student keyboard) The tables below summarize the ASCII coding I use in the Russian corpora I maintain (a list is available separately). The coding is based on the AATSEEL-mandated "student" transliteration-style keyboard, with one modification on the basic keyboard (I substituted an m-dash for the underline character, which is rarely used on the Mac). I display corpora on the Mac using the font family "Fowler", which includes a full set of bitmapped fonts in various sizes and is easy to read on the screen; it also includes a 42-point font with identical coding called "LaserFowler", which is nearly illegible on the screen for 12-pt text but prints out rather nicely on a 300-dpi laser printer. These fonts are freely available with corpora; just ask. George Fowler GFowler@Indiana.Edu [Email] Dept. of Slavic Languages (812) 855-2829 [office] Ballantine 502 (317) 726-1482 [home] Indiana University (812) 855-2624/-2608/-9906 [dept.] Bloomington, IN 47405 USA (812) 855-2107 [dept. fax] ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Cyrillic Transliteration ASCII Notes UPPER CASE A A 65 B B 66 V V 86 G G 71 D D 68 E E 69 JO ~ 126 Not used very much ZH " 34 Z Z 90 I I 73 J J 74 K K 75 L L 76 M M 77 N N 78 O O 79 P P 80 R R 82 S S 83 T T 84 U U 85 F F 70 X X 88 C C 67 CH H 72 SH W 87 SHCH } 125 HARD + 43 Y Y 89 SOFT : 58 E OBOR. | 124 JU { 123 JA Q 81 lower case a a 97 b b 98 v v 118 g g 103 d d 100 e e 101 jo ` 96 Not used very much zh ' 39 z z 122 i i 105 j j 106 k k 107 l l 108 m m 109 n n 110 o o 111 p p 112 r r 114 s s 115 t t 116 u u 117 f f 102 x x 120 c c 99 ch h 104 sh w 119 shch ] 93 hard = 61 y y 121 soft ; 59 e obor \ 92 ju [ 91 ja q 113 Non-alphabetic deviations from standard keyboard (low ascii) % / 47 = ? 63 accent < 60 Follows stressed vowel ; > 62 No ! 33 ! @ 64 / # 35 " $ 36 Straight double quotes : % 37 << ^ 94 Double left angle brackets >> & 38 ? * 42 M-dash _ 95 Not used often; usually a double hyphen instead Non-alphabetic high-ascii characters used occasionally Left low double curly quote Opt-3 163 Right high double curly quote Sh-Opt-3 220 Left high double curly quote Sh-Opt-4 221 Left single curly quote Opt-] 212 Right single curly quote Sh-Opt-] 213 N-dash Opt-hyphen 208 _ Sh-Opt-hyphen 209 [ Opt-[ 210 ] Sh-Opt-[ 211 { Opt-9 187 } Opt-0 188 < Opt-, 178 > Opt-. 179 \ Opt-\ 199 | Sh-Opt-\ 200 || Opt-/ 214 + Sh-Opt-= 177 The fonts I use on the Mac have a full complement of other high ascii characters (non-Russian Cyrillic, OCS, lexicographic symbols, etc.), but these should be irrelevant to Russian text corpora.