Internal Storage Encoding of Characters

The document discusses three character encoding schemes: ASCII, ISCII, and Unicode. [1] ASCII encodes 128 characters and is the basis for most modern encoding schemes. [2] ISCII encodes the main Indic scripts of India, such as Devanagari, Tamil, and Telugu. [3] A table shows the ISCII Devanagari character set with the decimal code and Unicode equivalent for each character.

Uploaded by

Naseeb

9/13/2018 Internal Storage encoding of Characters | KVSeContents.in

Internal Storage encoding of Characters


ASCII

The American Standard Code for Information Interchange (ASCII) is a
character-encoding scheme originally based on the English alphabet.
It encodes 128 specified characters (the digits 0-9, the letters a-z
and A-Z, some basic punctuation symbols, some control codes that
originated with Teletype machines, and a blank space) as 7-bit binary
integers. ASCII codes represent text in computers, communications
equipment, and other devices that use text. Most modern
character-encoding schemes are based on ASCII, though they support
many additional characters.
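
As a minimal sketch of the 7-bit property described above, the following Python snippet converts text to its ASCII codes and rejects any character outside the 128-character set. The function name to_ascii_codes is illustrative and does not come from the source.

```python
def to_ascii_codes(text: str) -> list[int]:
    """Return the ASCII code of each character in text.

    Raises UnicodeEncodeError if text contains a non-ASCII character.
    """
    return list(text.encode("ascii"))


codes = to_ascii_codes("Hi 9")
print(codes)                        # [72, 105, 32, 57]
print(all(c < 128 for c in codes))  # True: every code fits in 7 bits
```

Passing a string such as "é" raises UnicodeEncodeError, because that character has no code in the 128-character ASCII set.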

ASCII printable code chart

Binary Oct Dec Hex Glyph Binary Oct Dec Hex Glyph Binary Oct Dec Hex Glyph
010 0000 040 32 20 (space) 100 0000 100 64 40 @ 110 0000 140 96 60 `
010 0001 041 33 21 ! 100 0001 101 65 41 A 110 0001 141 97 61 a
010 0010 042 34 22 " 100 0010 102 66 42 B 110 0010 142 98 62 b
010 0011 043 35 23 # 100 0011 103 67 43 C 110 0011 143 99 63 c
010 0100 044 36 24 $ 100 0100 104 68 44 D 110 0100 144 100 64 d
010 0101 045 37 25 % 100 0101 105 69 45 E 110 0101 145 101 65 e
010 0110 046 38 26 & 100 0110 106 70 46 F 110 0110 146 102 66 f
010 0111 047 39 27 ' 100 0111 107 71 47 G 110 0111 147 103 67 g
010 1000 050 40 28 ( 100 1000 110 72 48 H 110 1000 150 104 68 h
010 1001 051 41 29 ) 100 1001 111 73 49 I 110 1001 151 105 69 i
010 1010 052 42 2A * 100 1010 112 74 4A J 110 1010 152 106 6A j
010 1011 053 43 2B + 100 1011 113 75 4B K 110 1011 153 107 6B k
010 1100 054 44 2C , 100 1100 114 76 4C L 110 1100 154 108 6C l
010 1101 055 45 2D - 100 1101 115 77 4D M 110 1101 155 109 6D m
010 1110 056 46 2E . 100 1110 116 78 4E N 110 1110 156 110 6E n
010 1111 057 47 2F / 100 1111 117 79 4F O 110 1111 157 111 6F o
011 0000 060 48 30 0 101 0000 120 80 50 P 111 0000 160 112 70 p
011 0001 061 49 31 1 101 0001 121 81 51 Q 111 0001 161 113 71 q
011 0010 062 50 32 2 101 0010 122 82 52 R 111 0010 162 114 72 r
011 0011 063 51 33 3 101 0011 123 83 53 S 111 0011 163 115 73 s
011 0100 064 52 34 4 101 0100 124 84 54 T 111 0100 164 116 74 t
011 0101 065 53 35 5 101 0101 125 85 55 U 111 0101 165 117 75 u

http://www.kvsecontents.in/index.php/internal-storage-encoding-of-characters?tmpl=component&print=1&page= 1/4
011 0110 066 54 36 6 101 0110 126 86 56 V 111 0110 166 118 76 v
011 0111 067 55 37 7 101 0111 127 87 57 W 111 0111 167 119 77 w
011 1000 070 56 38 8 101 1000 130 88 58 X 111 1000 170 120 78 x
011 1001 071 57 39 9 101 1001 131 89 59 Y 111 1001 171 121 79 y
011 1010 072 58 3A : 101 1010 132 90 5A Z 111 1010 172 122 7A z
011 1011 073 59 3B ; 101 1011 133 91 5B [ 111 1011 173 123 7B {
011 1100 074 60 3C < 101 1100 134 92 5C \ 111 1100 174 124 7C |
011 1101 075 61 3D = 101 1101 135 93 5D ] 111 1101 175 125 7D }
011 1110 076 62 3E > 101 1110 136 94 5E ^ 111 1110 176 126 7E ~
011 1111 077 63 3F ? 101 1111 137 95 5F _
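
The columns of the chart can be reproduced for any single character. The sketch below is illustrative only (chart_row is a hypothetical helper name); it formats one ASCII character as the binary, octal, decimal, and hexadecimal values used in the chart.

```python
def chart_row(ch: str) -> tuple[str, str, int, str, str]:
    """Return (binary, octal, decimal, hex, glyph) for one ASCII character."""
    code = ord(ch)
    if code > 127:
        raise ValueError(f"{ch!r} is not an ASCII character")
    bits = format(code, "07b")          # 7-bit binary, e.g. '1000001'
    binary = f"{bits[:3]} {bits[3:]}"   # grouped 3 + 4 as in the chart
    return binary, format(code, "03o"), code, format(code, "02X"), ch


print(chart_row("A"))  # ('100 0001', '101', 65, '41', 'A')
print(chart_row("z"))  # ('111 1010', '172', 122, '7A', 'z')
```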

ISCII
Indian Standard Code for Information Interchange (ISCII) is a
coding scheme for representing various writing systems of India.
It encodes the main Indic scripts and a Roman transliteration. The
supported scripts are: Assamese, Bengali (Bangla), Devanagari,
Gujarati, Gurmukhi, Kannada, Malayalam, Oriya, Tamil, and Telugu.
ISCII does not encode the writing systems of India that are based on
the Arabic script, but its writing-system switching codes nonetheless
provide for Kashmiri, Sindhi, Urdu, Persian, Pashto and Arabic.
The following table shows the character set for Devanagari. The
code sets for Assamese, Bengali, Gujarati, Gurmukhi, Kannada,
Malayalam, Oriya, Tamil, and Telugu are similar, with each
Devanagari form replaced by the equivalent form in each writing
system. Each character is shown with its decimal code and its
Unicode equivalent.
ISCII Devanagari
—0 —1 —2 —3 —4 —5 —6 —7 —8 —9 —A —B —C —D —E —F
NUL SOH STX ETX EOT ENQ ACK BEL BS HT LF VT FF CR SO SI
0_ 0000 0001 0002 0003 0004 0005 0006 0007 0008 0009 000A 000B 000C 000D 000E 000F
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
DLE DC1 DC2 DC3 DC4 NAK SYN ETB CAN EM SUB ESC FS GS RS US
1_ 0010 0011 0012 0013 0014 0015 0016 0017 0018 0019 001A 001B 001C 001D 001E 001F
16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

SP ! " # $ % & ' ( ) * + , - . /
2_ 0020 0021 0022 0023 0024 0025 0026 0027 0028 0029 002A 002B 002C 002D 002E 002F
32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47

0 1 2 3 4 5 6 7 8 9 : ; < = > ?
3_ 0030 0031 0032 0033 0034 0035 0036 0037 0038 0039 003A 003B 003C 003D 003E 003F
48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63

@ A B C D E F G H I J K L M N O
4_ 0040 0041 0042 0043 0044 0045 0046 0047 0048 0049 004A 004B 004C 004D 004E 004F
64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79

P Q R S T U V W X Y Z [ \ ] ^ _
5_ 0050 0051 0052 0053 0054 0055 0056 0057 0058 0059 005A 005B 005C 005D 005E 005F
80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95

` a b c d e f g h i j k l m n o
6_ 0060 0061 0062 0063 0064 0065 0066 0067 0068 0069 006A 006B 006C 006D 006E 006F
96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111

p q r s t u v w x y z { | } ~ DEL
7_ 0070 0071 0072 0073 0074 0075 0076 0077 0078 0079 007A 007B 007C 007D 007E 007F
112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127

In the rows below, -- marks a position that has no assigned character or no Unicode equivalent.

8_ (unassigned)

9_ (unassigned)

-- ◌ँ ◌ं ◌ः अ आ इ ई उ ऊ ऋ ऎ ए ऐ ऍ ऒ
A_ -- 0901 0902 0903 0905 0906 0907 0908 0909 090A 090B 090E 090F 0910 090D 0912
-- 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175

ओ औ ऑ क ख ग घ ङ च छ ज झ ञ ट ठ ड
B_ 0913 0914 0911 0915 0916 0917 0918 0919 091A 091B 091C 091D 091E 091F 0920 0921
176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191

ढ ण त थ द ध न ऩ प फ ब भ म य य़ र
C_ 0922 0923 0924 0925 0926 0927 0928 0929 092A 092B 092C 092D 092E 092F 095F 0930
192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207

ऱ ल ळ ऴ व श ष स ह INV ◌ा ◌ि ◌ी ◌ु ◌ू ◌ृ
D_ 0931 0932 0933 0934 0935 0936 0937 0938 0939 -- 093E 093F 0940 0941 0942 0943
208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223

◌ॆ ◌े ◌ै ◌ॅ ◌ॊ ◌ो ◌ौ ◌ॉ ◌् ◌़ । -- -- -- -- ATR
E_ 0946 0947 0948 0945 094A 094B 094C 0949 094D 093C 0964 -- -- -- -- --
224 225 226 227 228 229 230 231 232 233 234 -- -- -- -- 239

EXT ० १ २ ३ ४ ५ ६ ७ ८ ९ -- -- -- -- --
F_ -- 0966 0967 0968 0969 096A 096B 096C 096D 096E 096F -- -- -- -- --
240 241 242 243 244 245 246 247 248 249 250 -- -- -- -- --
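
To illustrate how the table relates ISCII codes to Unicode, here is a minimal sketch that decodes a few ISCII Devanagari bytes. The dictionary holds only a handful of entries copied from the table, and the names ISCII_DEVANAGARI and decode_iscii are hypothetical. It is not a complete ISCII decoder: it ignores the control codes INV, ATR, and EXT.

```python
# Partial ISCII Devanagari -> Unicode mapping, copied from the table above.
# Only a few codes are listed; this is an illustrative subset, not a full decoder.
ISCII_DEVANAGARI = {
    164: 0x0905,  # अ
    179: 0x0915,  # क
    198: 0x0928,  # न
    204: 0x092E,  # म
    209: 0x0932,  # ल
    218: 0x093E,  # vowel sign AA
}


def decode_iscii(data: bytes) -> str:
    """Decode bytes using ASCII for codes below 128 and the partial table above."""
    out = []
    for b in data:
        if b < 128:
            out.append(chr(b))  # the lower half of ISCII is plain ASCII
        elif b in ISCII_DEVANAGARI:
            out.append(chr(ISCII_DEVANAGARI[b]))
        else:
            raise ValueError(f"byte {b} is not in this partial table")
    return "".join(out)


print(decode_iscii(bytes([179, 204, 209])))  # कमल
```

A dictionary lookup is enough here because each listed ISCII code corresponds to exactly one Unicode code point in the table.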

Unicode

Unicode is a computing industry standard for the consistent
encoding, representation and handling of text expressed in most of
the world's writing systems. It was developed in conjunction with the
Universal Character Set standard and is published in book form as The
Unicode Standard. At the time of writing, the latest version of
Unicode contained a repertoire of more than 110,000 characters
covering 100 scripts and various symbols. The standard consists of a
set of code charts for visual reference, an encoding method and set
of standard character encodings, a set of reference data computer
files, and a number of related items, such as character properties
and rules for normalization, decomposition, collation, rendering, and
bidirectional display order (for the correct display of text
containing both right-to-left scripts, such as Arabic and Hebrew,
and left-to-right scripts). As of June 2014, the most recent
version was Unicode 7.0. The standard is maintained by the Unicode
Consortium.
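
Some of the items listed above (code points, character properties, standard encodings, and decomposition) can be inspected with Python's standard unicodedata module. This is a sketch only: describe is a hypothetical helper name, and UTF-8 is used as one example of a standard Unicode encoding, a choice the source does not make.

```python
import unicodedata


def describe(ch: str) -> tuple[str, str, bytes]:
    """Return (code point, character name, UTF-8 bytes) for one character."""
    return f"U+{ord(ch):04X}", unicodedata.name(ch), ch.encode("utf-8")


print(describe("क"))  # ('U+0915', 'DEVANAGARI LETTER KA', b'\xe0\xa4\x95')

# Decomposition: U+095F (ISCII code 206 in the table) splits into a base
# letter plus the nukta sign under normalization form NFD.
decomposed = unicodedata.normalize("NFD", "\u095F")
print([f"U+{ord(c):04X}" for c in decomposed])  # ['U+092F', 'U+093C']

# Bidirectional class, a character property used for display ordering.
print(unicodedata.bidirectional("A"))       # L (left-to-right)
print(unicodedata.bidirectional("\u05D0"))  # R (right-to-left, Hebrew alef)
```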
