Character mapping table (encodingMap.xml)
The encodingMap.xml file maps character set aliases, fallback
character set names to try if the alias is unsupported, the name of the character set used by Genero
Studio, and the corresponding language definition for the LANG
variable for Genero
on Unix.
A default list of text encodings is provided in Genero Studio. The mappings between each character set alias and its corresponding attributes are defined in the encodingMap.xml file in the GSTDIR/conf directory.
For some of the encodings there is only a partial match between Unix and Windows® platforms.
This file contains a table used to map each encoding name (alias) to:
- A list of fallback character set names to try if the alias is not supported.
- The name of the character set used for encoding/decoding.
- An implementation name in Genero Studio (impl attribute)
- a
LANG
qualified country for Unix/Linux (Country attribute)
<Alias name="IBM852" fallback="852,ibm852" impl="IBM852" unixCountry="pl_PL"/>
This means that:
- IBM852 is the name Genero Studio should use for this text encoding.
- 852 and ibm852 are character set names to try if IBM852 is not supported.
- IBM852 is the name of the character set used for encoding/decoding.
- Under Unix or Linux™ the
LANG
language definition ispl_PL
.
The impl attribute is defined within the POSIX2 charmap file after the code_set_name. It appears in the Genero Studio preferences combobox: <code_set_name> IBM852.
The name attribute is the name Genero Studio should use for this text encoding. By default it is the character set name (code_set_name); when an alias to an existing encoding is needed, the name attribute should contain the alias name.
<Alias name="CP852" fallback="IBM852,852" impl="IBM852" unixCountry="pl_PL"/>