On Wed Sep 18 20:37:34 2013, WIML wrote:
> The Unicode Consortium's mapping table for GSM 03.38 has this to say
> on the matter:
>
>
> # The ETSI GSM 03.38 specification shows an uppercase C-cedilla
> # glyph at 0x09. This may be the result of limited display
> # capabilities for handling characters with descenders. However, the
> # language coverage intent is clearly for the lowercase c-cedilla, as shown
> # in the mapping below. The mapping for uppercase C-cedilla is shown
> # in a commented line in the mapping table.
>
>
> The other accented characters in column 0000 of the table are mostly
> lowercase with no uppercase equivalents elsewhere in the mapping, so
> who knows.
I'm afraid the Unicode Consortium's mapping table is incorrect here. Older GSM specifications may not have been clear on this point (which would explain how the Unicode Consortium arrived at that interpretation), but the latest version of GSM 03.38, ETSI TS 123 038 V16.0.0 (2020-07), available at
https://www.etsi.org/deliver/etsi_ts/123000_123099/123038/16.00.00_60/ts_123038v160000p.pdf is explicit: uppercase C-cedilla is at position 0x09 of the GSM 7 bit Default Alphabet, and lowercase c-cedilla is available in some National Single Shift Alphabets. A character from a National Single Shift Alphabet is selected by prefixing it with the escape byte 0x1B. So GSM 03.38 supports both lowercase and uppercase C-cedilla. The Unicode Consortium's mapping table above supports only lowercase c-cedilla, a limitation that follows from this incorrect interpretation.
Note that Encode::GSM0338 currently does not provide National Single Shift Alphabets and therefore does not support lowercase c-cedilla. Which National Single Shift Alphabet is in use is indicated out-of-band, so these alphabets cannot be implemented directly in the Encode::GSM0338 module: the Encode API has no way to pass out-of-band settings when encoding or decoding strings. The best option for a future implementation of National Single Shift Alphabets and National Locking Shift Alphabets in the Encode API would therefore be a separate encoding module for each alphabet, e.g. GSM0338-Turkish, GSM0338-Spanish, ...
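
To make the per-alphabet idea concrete, here is a minimal sketch in Python (not Perl, and not the Encode API). It assumes that the choice of alphabet is carried entirely by the encoding name, so no out-of-band setting is needed at decode time. The encoding names, the decode() function and the tiny table excerpts are hypothetical and only illustrative; the excerpts reflect my reading of the tables and should be checked against ETSI TS 123 038 before being relied on. Input is one septet per byte, already unpacked.

```python
ESC = 0x1B  # escape byte: the next septet comes from the single shift table

# Partial default alphabet (illustrative excerpt, not the full table).
# Digits, ASCII letters and space share their code points with ASCII.
DEFAULT = {b: chr(b) for b in range(0x30, 0x3A)}
DEFAULT.update({b: chr(b) for b in range(0x41, 0x5B)})
DEFAULT.update({b: chr(b) for b in range(0x61, 0x7B)})
DEFAULT[0x20] = " "
DEFAULT[0x09] = "\u00C7"  # uppercase C-cedilla at 0x09

# One single shift table per hypothetical encoding name (excerpts only,
# to be verified against the specification).
SINGLE_SHIFT = {
    "gsm0338": {0x65: "\u20AC"},                          # euro sign
    "gsm0338-turkish": {0x63: "\u00E7", 0x65: "\u20AC"},  # lowercase c-cedilla
    "gsm0338-spanish": {0x09: "\u00E7", 0x65: "\u20AC"},  # lowercase c-cedilla
}


def decode(septets: bytes, encoding: str = "gsm0338") -> str:
    """Decode unpacked septets using the tables selected by `encoding`."""
    shift = SINGLE_SHIFT[encoding]
    out = []
    it = iter(septets)
    for b in it:
        if b == ESC:
            nxt = next(it, None)
            if nxt is None:
                raise ValueError("truncated escape sequence")
            ch = shift.get(nxt)
            if ch is None:
                # This sketch simply rejects escapes missing from its excerpt.
                raise ValueError(f"0x1B 0x{nxt:02X} not mapped in {encoding}")
            out.append(ch)
        else:
            ch = DEFAULT.get(b)
            if ch is None:
                raise ValueError(f"0x{b:02X} not in this partial table")
            out.append(ch)
    return "".join(out)
```

With these excerpts, decode(bytes([0x09])) yields uppercase C-cedilla under every encoding name, while decode(bytes([0x1B, 0x63]), "gsm0338-turkish") and decode(bytes([0x1B, 0x09]), "gsm0338-spanish") both yield lowercase c-cedilla. The same byte sequence means different things under different names, which is why a single GSM0338 encoding cannot cover all alphabets.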