KompoZer User Guide 17 December 2007 Based on version 0.7.10 48
means that, in principle, any character in the
UCS may be used in any HTML document. In
practice support for the complete character
range is uneconomic and systems provide
support for subsets only.
A6.3.4 Character Encoding
Character Encoding, at its simplest, refers
to the process whereby the codes for the char-
acters are mapped to the code points for the
Unicode characters appropriate to the lan-
guage in use. In the case of ISO-8859-1 the
character codes are mapped to identical Unicode
code points. (The first 256 Unicode characters
are the same as the Latin-1 set.) As
another example, ISO-8859-7 encodes Greek
characters displacing many from the Latin-1
set to make room. (Compare table A6.3-2 to
table A6.3-1.) In this case the code EA instead
of being mapped to Unicode code point EA
(giving e circumflex ê) is mapped to code point
03BA, which returns a small kappa (κ). In fact
ISO-8859-7 does not include the ê character.
All ISO-8859 encodings retain the ASCII
characters at the original positions.
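
The mapping described above can be checked with a short Python sketch. It is an illustration only, not part of KompoZer; the codec names "iso-8859-1" and "iso-8859-7" are those of Python's standard library, and the variable names are made up for the example.

```python
# Decode the single byte 0xEA under two different ISO-8859 encodings.
raw = bytes([0xEA])

latin1_char = raw.decode("iso-8859-1")  # code EA maps to code point U+00EA
greek_char = raw.decode("iso-8859-7")   # code EA maps to code point U+03BA

print(latin1_char, hex(ord(latin1_char)))
print(greek_char, hex(ord(greek_char)))

# ISO-8859-7 has no code for e circumflex, so encoding it fails.
try:
    "\u00ea".encode("iso-8859-7")
    e_circumflex_encodable = True
except UnicodeEncodeError:
    e_circumflex_encodable = False

# The printable ASCII range decodes identically under both encodings.
ascii_bytes = bytes(range(0x20, 0x7F))
same_ascii = ascii_bytes.decode("iso-8859-1") == ascii_bytes.decode("iso-8859-7")
```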
The HTML version of this document uses
ISO-8859-1 encoding but, in spite of this, has
no difficulty in representing the full repertoire
of the Greek characters covered by ISO-8859-
7 as can be seen in the table. How this is
achieved is explained in the next section.
Authors should note that every page uses one
character encoding, and one only, irrespective of
the number or range of languages encountered on a
page.
In HTML pages character encoding is specified us-
ing the 'charset' parameter in the head area for each
page. Several options are permissible but KompoZer
always uses the form <meta http-equiv="content-type"
content="text/html; charset=ISO-8859-7">.
Note 'charset', in spite of its name, does not
specify a character set. The character set for
HTML documents is always the UCS. 'charset'
specifies the encoding.
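
As a rough sketch of how software might act on this declaration, the Python below reads the 'charset' value from the meta element and then uses it to decode the page bytes. It is not how any particular browser or KompoZer works; the class name CharsetFinder and the sample page are invented for the illustration. It relies on the fact that ISO-8859 encodings keep ASCII in place, so the tag itself can be read before the encoding is known.

```python
from html.parser import HTMLParser


class CharsetFinder(HTMLParser):
    """Hypothetical helper: records the charset named in a content-type meta tag."""

    def __init__(self):
        super().__init__()
        self.charset = None

    def handle_starttag(self, tag, attrs):
        if tag != "meta" or self.charset is not None:
            return
        attr = dict(attrs)
        if (attr.get("http-equiv") or "").lower() != "content-type":
            return
        # content looks like: text/html; charset=ISO-8859-7
        for part in (attr.get("content") or "").split(";"):
            name, _, value = part.strip().partition("=")
            if name.lower() == "charset" and value:
                self.charset = value.strip()


# A sample page stored as ISO-8859-7 bytes; \u03ba is a small kappa.
page_bytes = (
    '<html><head><meta http-equiv="content-type" '
    'content="text/html; charset=ISO-8859-7"></head>'
    "<body><p>\u03ba</p></body></html>"
).encode("iso-8859-7")

finder = CharsetFinder()
# The markup is ASCII, so a lossy ASCII decode is enough to find the tag.
finder.feed(page_bytes.decode("ascii", errors="replace"))

declared = finder.charset
text = page_bytes.decode(declared)  # now decode properly with the named encoding
```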
A6.3.5 Character references
ISO-8859 uses a single byte per character to repre-
sent all the characters commonly expected in a lan-
guage but clearly there may be a need to represent
uncommon characters. HTML provides two mecha-
nisms - Character entity references (entities) and nu-
meric character references. Using these methods any
character in the UCS may be reached by using a
sequence of ASCII characters to point to the required
character. Entities take the form &euro; and numeric
references the form &#8364; or &#x20AC;, all representing
the euro symbol. The 8364 and x20AC represent
the Unicode code point for the symbol in decimal
and hexadecimal notation respectively.
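
The equivalence of the three forms can be confirmed with Python's standard html module. This is a minimal sketch for illustration; the variable names are arbitrary.

```python
import html

# The entity, decimal and hexadecimal forms of the euro symbol.
forms = ["&euro;", "&#8364;", "&#x20AC;"]

# html.unescape resolves each reference to the character it points to.
resolved = [html.unescape(form) for form in forms]

# The decimal and hexadecimal numbers are the same Unicode code point.
code_point = ord(resolved[0])
print(resolved, code_point, hex(code_point))
```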
These methods free the author to employ Unicode
characters, irrespective of the encoding in use, at the
expense of increasing file size. Where such use is
limited this is inconsequential.
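
The size penalty can be illustrated with a short Python sketch. It uses the standard "xmlcharrefreplace" error handler to write a Greek word into ISO-8859-1 as numeric character references and compares the result with the same word stored directly in ISO-8859-7. The sample word is an arbitrary choice for the example.

```python
import html

# A six-letter Greek word, written with escapes to avoid ambiguity.
greek_word = "\u03ba\u03cc\u03c3\u03bc\u03bf\u03c2"

# In ISO-8859-1 none of these letters has a code, so each one becomes
# a numeric character reference made of ASCII characters.
as_latin1 = greek_word.encode("iso-8859-1", errors="xmlcharrefreplace")

# In ISO-8859-7 each letter is a single byte.
as_greek = greek_word.encode("iso-8859-7")

print(as_latin1, len(as_latin1))
print(as_greek, len(as_greek))

# Resolving the references recovers the original word.
round_trip = html.unescape(as_latin1.decode("iso-8859-1"))
```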
The list of entities is included at section 24 of the
HTML specification [Ref 16]. About 250 are defined;
numeric character references must be used for characters
outside this range. Characters do not have to be
out of range of the encoding for entity references to be
provided as is clear from Table A6.3-3 which lists some
of the most frequently used including some in the
ASCII set.
Table A6.3-3
Important entity references

Character       Entity     Numeric character reference
                           Decimal    Hexadecimal
€               &euro;     &#8364;    &#x20AC;
<               &lt;       &#60;      &#x3c;
>               &gt;       &#62;      &#x3e;
×               &times;    &#215;     &#xd7;
÷               &divide;   &#247;     &#xf7;
&               &amp;      &#38;      &#x26;
"               &quot;     &#34;      &#x22;
no-break space  &nbsp;     &#160;     &#xa0;
Note Entities are case sensitive thus &Eacute;
represents upper case E with an acute accent (É)
while &eacute; represents the corresponding
lower case letter (é). &EacutE; does not repre-
sent anything (&EacutE;). (The error just gets
printed out.)
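
Case sensitivity can be checked against the entity table in Python's standard library. This is a sketch only; html.entities.name2codepoint lists the HTML 4 entity names, and a browser's handling of an unknown name may differ in detail from html.unescape.

```python
import html
from html.entities import name2codepoint

upper = html.unescape("&Eacute;")  # upper case E with acute accent
lower = html.unescape("&eacute;")  # lower case e with acute accent

# The wrongly capitalised name is not a defined entity...
known = "EacutE" in name2codepoint

# ...so the reference is left as literal text.
unchanged = html.unescape("&EacutE;")

print(upper, lower, known, unchanged)
```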
Note Irrespective of the ISO-8859 encoding em-
ployed the entity or numeric reference to be
input remains the same. So, although in ISO-
8859-7 the euro symbol is represented as byte
A4, entering the code &#xA4; will generate a ¤
symbol not a euro symbol. The code to be input
is the entity or numeric character reference for
the character required.
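
The distinction between a byte in the encoding and a numeric character reference can be sketched in Python as follows. The example assumes that Python's "iso-8859-7" codec follows the revision of the standard that places the euro symbol at byte A4, as described in the note above.

```python
import html

# Byte A4 read as ISO-8859-7 is the euro symbol (assumes the codec
# follows the revision of ISO-8859-7 that includes the euro).
byte_a4_greek = bytes([0xA4]).decode("iso-8859-7")

# The reference &#xA4; names Unicode code point A4, whatever the encoding.
ref_a4 = html.unescape("&#xA4;")

# The reference for the euro symbol uses its Unicode code point.
ref_euro = html.unescape("&#x20AC;")

print(byte_a4_greek, ref_a4, ref_euro)
```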
Table A6.3-2
Printable ASCII and Greek characters using ISO-8859-7 encoding
Hex code for character: MSD in row, LSD in column
(Rows 0x to 7x: ASCII)

     x0   x1   x2   x3   x4   x5   x6   x7   x8   x9   xA   xB   xC   xD   xE   xF
0x
1x
2x   sp   !    "    #    $    %    &    '    (    )    *    +    ,    -    .    /
3x   0    1    2    3    4    5    6    7    8    9    :    ;    <    =    >    ?
4x   @    A    B    C    D    E    F    G    H    I    J    K    L    M    N    O
5x   P    Q    R    S    T    U    V    W    X    Y    Z    [    \    ]    ^    _
6x   `    a    b    c    d    e    f    g    h    i    j    k    l    m    n    o
7x   p    q    r    s    t    u    v    w    x    y    z    {    |    }    ~
8x
9x
Ax   nbsp ‘    ’    £    €    ₯    ¦    §    ¨    ©    ͺ    «    ¬    shy       ―
Bx   °    ±    ²    ³    ΄    ΅    Ά    ·    Έ    Ή    Ί    »    Ό    ½    Ύ    Ώ
Cx   ΐ    Α    Β    Γ    Δ    Ε    Ζ    Η    Θ    Ι    Κ    Λ    Μ    Ν    Ξ    Ο
Dx   Π    Ρ         Σ    Τ    Υ    Φ    Χ    Ψ    Ω    Ϊ    Ϋ    ά    έ    ή    ί
Ex   ΰ    α    β    γ    δ    ε    ζ    η    θ    ι    κ    λ    μ    ν    ξ    ο
Fx   π    ρ    ς    σ    τ    υ    φ    χ    ψ    ω    ϊ    ϋ    ό    ύ    ώ