Character mapping table (encodingMap.xml)

The encodingMap.xml file maps character set aliases, fallback character set names to try if the alias is unsupported, the name of the character set used by Genero Studio, and the corresponding language definition for the LANG variable for Genero on Unix.

A default list of text encodings is provided in Genero Studio. The mappings between each character set alias and its corresponding attributes are defined in the encodingMap.xml file in the $GSTUSERDIR directory (for example, C:\Users\Jean Dupont\AppData\Roaming\FourJs\Genero Studio 4.00.00-202102161845).

For some of the encodings there is only a partial match between UNIX™ and Windows® platforms.

This file contains a table used to map each encoding name (alias) to:

  • A list of fallback character set names to try if the alias is not supported.
  • The name of the character set used for encoding/decoding.
  • An implementation name in Genero Studio (impl attribute)
  • A LANG qualified country for UNIX/Linux™ (Country attribute)
<Alias name="IBM852" fallback="852,ibm852" impl="IBM852" unixCountry="pl_PL"/>

This means that:

  • IBM852 is the name Genero Studio should use for this text encoding.
  • 852 and ibm852 are character set names to try if IBM852 is not supported.
  • IBM852 is the name of the character set used for encoding/decoding.
  • Under UNIX or Linux the LANG language definition is pl_PL.

The impl attribute is defined within the POSIX2 charmap file after the code_set_name. It appears in the Genero Studio preferences combobox: <code_set_name> IBM852.

The name attribute is the name Genero Studio should use for this text encoding. By default it is the character set name (code_set_name); when an alias to an existing encoding is needed, the name attribute should contain the alias name.

<Alias name="CP852" fallback="IBM852,852" impl="IBM852" unixCountry="pl_PL"/>