Unicode Normalization for Vietnam

Test Page 4.0

A Test Page by Stefan Probst

Version 1.0: 2002-01-27 Stefan Probst -- first version

These 8 pages test, whether the browser can show the title of a web page, when it is written in Unicode.
This corresponds to the HTML code:
  <title> ... Việt Nam </title>
Since there can be only one title per page, this test consists of 8 pages:


A) Unicode Characters outside the ASCII table
are encoded using "Numerical Character References" (NCR)

(when using only NCRs, the HTML file may also be specified as "charset=ISO 8859-1",
i.e. as a normal 7-bit plain ASCII file)
Only usable when the applications can handle NCRs, e.g. web browsers.
Those applications recognize an NCR at the initial "&#" escape sequence.

1) Fully Precomposed:
Unicode Normalization Form C ("NFC"):
Việt Nam: Vi&#x1EC7;t Nam Test Page 4a1

2) "VN-Combining", i.e. characters pre-composed, tone marks combining
Note: There is no international standard for this behaviour:
Viê  ̣t Nam: Vi&#x00EA;&#x0323;t Nam Test Page 4a2

3) "VN-Canonical", i.e. only combining characters, tone marks sorted last
Note: There is no international standard for this order:
Vie ^  ̣t Nam: Vie&#x0302;&#x0323;t Nam Test Page 4a3

4) Only combining characters, sorted by canonical order:
Unicode Normalization Form D ("NFD"):
Vie  ̣^t Nam: Vie&#x0323;&#x0302;t Nam Test Page 4a4

 

B) Unicode Characters outside the ASCII table
are encoded using UTF-8 format

(i.e. they are encoded as a sequence of printable bytes including the right side of the 8-bit table.)
In order to instruct the browser or any other application to interpret those sequences
as representations of an Unicode character (and not to render them directly),
the file format has to be specified as "charset=utf-8"

1) Fully Precomposed:
Unicode Normalization Form C ("NFC"):
Việt Nam: Việt Nam Test Page 4b1

2) "VN-Combining", i.e. characters pre-composed, tone marks combining
Note: There is no international standard for this behaviour:
Viê  ̣t Nam: Việt Nam Test Page 4b2

3) "VN-Canonical", i.e. only combining characters, tone marks sorted last
Note: There is no international standard for this order:
Vie ^  ̣t Nam: VieÌ‚Ì£t Nam Test Page 4b3

4) Only combining characters, sorted by canonical order:
Unicode Normalization Form D ("NFD"):
Vie  ̣^t Nam: Việt Nam Test Page 4b4


Test Instructions:
Open all those 8 files in a web browser (e.g. Internet Explorer).
- Is the title of the page properly displayed in the title bar of the window?
- Is the title of the page properly displayed in the window list (e.g. at the bottom)?


Copy the following form into your eMail program,
fill it as far as possible, and send it to Unicode-Tests@isoc-vn.org

*******************************************
Results of Unicode Tests
Used Testpage: 4.0

1) Platform:
OS (kind, version)     :
Browser (incl. version):

2) Results:
Page Title in Browser Window
  page title in version A1 is displayed as:
  page title in version A2 is displayed as:
  page title in version A3 is displayed as:
  page title in version A4 is displayed as:
  page title in version B1 is displayed as:
  page title in version B2 is displayed as:
  page title in version B3 is displayed as:
  page title in version B4 is displayed as:
Page Title in Window List (usually at bottom of screen)
  page title in version A1 is displayed as:
  page title in version A2 is displayed as:
  page title in version A3 is displayed as:
  page title in version A4 is displayed as:
  page title in version B1 is displayed as:
  page title in version B2 is displayed as:
  page title in version B3 is displayed as:
  page title in version B4 is displayed as:

Other comments         :
Tested by              :
*******************************************
Example:
*******************************************
Results of Unicode Tests
Used Testpage: 4.0

1) Platform:
OS (kind, version)     : Windows 2000
Browser (incl. version): Internet Explorer 5.9

2) Results:
Page Title in Browser Window
  page title in version A1 is displayed as: page title
  page title in version A2 is displayed as: page title, but space after "e"
  page title in version A3 is displayed as: page title, but "e" completely wrong
  page title in version A4 is displayed as: page title, but "e" completely wrong
  page title in version B1 is displayed as: page title
  page title in version B2 is displayed as: page title, but space after "e"
  page title in version B3 is displayed as: page title, but "e" completely wrong
  page title in version B4 is displayed as: (file title only)
Page Title in Window List (usually at bottom of screen)
  page title in version A1 is displayed as: page title
  page title in version A2 is displayed as: page title
  page title in version A3 is displayed as: page title, but "e" completely wrong
  page title in version A4 is displayed as: page title, but "e" completely wrong
  page title in version B1 is displayed as: page title
  page title in version B2 is displayed as: page title
  page title in version B3 is displayed as: page title, but "e" completely wrong
  page title in version B4 is displayed as: (file title only)

Other comments         :
Tested by              : Stefan Probst
*******************************************