music
OSdata.com: programming text book 

OSdata.com

character set

summary

    Each language has at least one character set, the characters that are legally allowed in programs.

    A source character set is the character set used for writing source code.

    An execution character set is a character set used when a program is running and usually includes any characters that may appear in either input or putput.

free computer programming text book project

table of contents
If you like the idea of this project,
then please donate some money.
more information on donating

Google

character set

    Each language has at least one character set, the characters that are legally allowed in programs.

    A source character set is the character set used for writing source code.

    An execution character set is a character set used when a program is running and usually includes any characters that may appear in either input or output.

    The first computer keyboards were usually teletypes or teletype-compaible. These early keyboards traced back to closely related keyboards used for telegraph networks.

    The code used for these teletypes was based on a code invented by Frenchman Emile Baudot in 1870. Only the original code was Baudot Code, but in slang all subsequent codes were also called Baudot Code.

    Baudot Code uses five bits, which allows 32 different character codes. Using two characters to indicate a shift between interpretations (letter shift and figures shift), it was possible to encode all of the capital letters, the decimal digits, a few punctuation marks, and special control codes.

    The resulting character set explains the small character sets of several early programming languages, such as BASIC, COBOL, FORTRAN, LISP, and PL/I.

    Baudot includes ABCDEFGHIJKLMNOPQRSTUVWXYZ and 012345678, as well as punctuation -$',!:(")#&./;+= and control codes: Space, Bell, Carriage Return (CR), and Line Feed (LF). Not all teletypes had += or ;"

Binary Decimal Hex Octal Letter U.S.
Figures
CCITT No. 2
Figures
00000 0 0 0 N/A N/A N/A
00001 1 1 1 E 3 3
00010 2 2 2 LF LF LF
00011 3 3 3 A - -
00100 4 4 4 Space Space Space
00101 5 5 5 S BELL '
00110 6 6 6 I 8 8
00111 7 7 7 U 7 7
01000 8 8 10 CR CR CR
01001 9 9 11 D $ WRU
01010 10 A 12 R 4 4
01011 11 B 13 J ' Bell
01100 12 C 14 N , ,
01101 13 D 15 F ! !
01110 14 E 16 C : :
01111 15 F 17 K ( (
10000 16 10 20 T 5 5
10001 17 11 21 Z " +
10010 18 12 22 L ) )
10011 19 13 23 W 2 2
10100 20 14 24 H # £
10101 21 15 25 Y 6 6
10110 22 16 26 P 0 0
10111 23 17 27 Q 1 1
11000 24 18 30 O 9 9
11001 25 19 31 B ? ?
11010 26 1A 32 G & &
11011 27 1B 33 Figures Shift Figures Shift Figures Shift
11100 28 1C 34 M . .
11101 29 1D 35 X / /
11110 30 1E 36 V ; =
11111 31 1F 37 Letters Shift Letters Shift Letters Shift

name C PL/I
  A A
  B B
  C C
  D D
  E E
  F F
  G G
  H H
  I I
  J J
  K K
  L L
  M M
  N N
  O O
  P P
  Q Q
  R R
  S S
  T T
  U U
  V V
  W W
  X X
  Y Y
  Z Z
name C PL/I
  a  
  b  
  c  
  d  
  e  
  f  
  g  
  h  
  i  
  j  
  k  
  l  
  m  
  n  
  o  
  p  
  q  
  r  
  s  
  t  
  u  
  v  
  w  
  x  
  y  
  z  
name C PL/I
  0 0
  1 1
  2 2
  3 3
  4 4
  5 5
  6 6
  7 7
  8 8
  9 9
name C PL/I
alert
bell
\a
ANSI
 
backspace
BS
\b
formatting character
 
horizontal tab
HT
\t
formatting character
 
vertical tab
VT
\v
formatting character
 
form feed
FF
\f<
formatting character/td>
 
carriage return
CR
\r
formatting character
 
space
blank
   
exclamation point !  
double quote "
\"
 
number sign #  
dollar sign $
extra graphic
 
percent sign % %
ampersand & &
AND symbol
apostrophe
single quotation mark
'
\'
'
left parenthesis ( (
right parenthesis ) )
asterisk * *
multiplication symbol
plus sign + +
comma , ,
hyphen or
minus sign
- -
period
point
. .
slash
forward slash
/ /
divide symbol
colon : :
semicolon : ;
less than < <
equal = =
also assignment symbol
greater than > >
question mark ?
\?
ANSI
?
at sign @
extra graphic
 
left bracket [  
backslash \
\\
 
right bracket ]  
circumflex ^  
underscore _ _
break character
accent grave `
extra graphic
 
left brace [  
vertical bar
stroke
| |
OR symbol
right brace ]  
tilde ~  
NOT symbol   ¬
null must equal zero  
newline
end-of-line marker
\n  

Python

    The official character set for Python is the orginal ASCII character codes (0 to 127 inclusive).

    You can use additional characters in both comments and string literals if you declare a character coding.

    # _*_ coding: utf-8 _*_

    # _*_ coding: iso-8859-1 _*_

    The character set coding line must go at the beginning of the source file (or immediately after the she-bang or hash-bang line).


free music player coding example

    Coding example: I am making heavily documented and explained open source code for a method to play music for free — almost any song, no subscription fees, no download costs, no advertisements, all completely legal. This is done by building a front-end to YouTube (which checks the copyright permissions for you).

    View music player in action: www.musicinpublic.com/.

    Create your own copy from the original source code/ (presented for learning programming).


return to table of contents
free downloadable college text book

view text book
HTML file

Because I no longer have the computer and software to make PDFs, the book is available as an HTML file, which you can convert into a PDF.

previous page next page
previous page next page

free computer programming text book project

Building a free downloadable text book on computer programming for university, college, community college, and high school classes in computer programming.

If you like the idea of this project,
then please donate some money.

send donations to:
Milo
PO Box 1361
Tustin, California 92781

Supporting the entire project:

    If you have a business or organization that can support the entire cost of this project, please contact Pr Ntr Kmt (my church)

more information on donating

Some or all of the material on this web page appears in the
free downloadable college text book on computer programming.


Google


Made with Macintosh

    This web site handcrafted on Macintosh computers using Tom Bender’s Tex-Edit Plus and served using FreeBSD .

Viewable With Any Browser


    †UNIX used as a generic term unless specifically used as a trademark (such as in the phrase “UNIX certified”). UNIX is a registered trademark in the United States and other countries, licensed exclusively through X/Open Company Ltd.

    Names and logos of various OSs are trademarks of their respective owners.

    Copyright © 2010, 2011 Milo

    Created: November 10, 2010

    Last Updated: March 21, 2011


return to table of contents
free downloadable college text book

previous page next page
previous page next page