Expand description
This module contains reusable components for encoding and decoding text in DICOM data structures, including support for character repertoires.
At the moment the following character sets are supported:
| Character Set | decoding support | encoding support | 
|---|---|---|
| ISO-IR 6 (default) | ✓ | ✓ | 
| ISO-IR 100 (ISO-8859-1): Right-hand part of the Latin alphabet no. 1, the Western Europe character set | ✓ | ✓ | 
| ISO-IR 101 (ISO-8859-2): Right-hand part of the Latin alphabet no. 2, the Central/Eastern Europe character set | ✓ | ✓ | 
| ISO-IR 109 (ISO-8859-3): Right-hand part of the Latin alphabet no. 3, the South Europe character set | ✓ | ✓ | 
| ISO-IR 110 (ISO-8859-4): Right-hand part of the Latin alphabet no. 4, the North Europe character set | ✓ | ✓ | 
| ISO-IR 144 (ISO-8859-5): The Latin/Cyrillic character set | ✓ | ✓ | 
| ISO-IR 192: The Unicode character set based on the UTF-8 encoding | ✓ | ✓ | 
| GB18030: The Simplified Chinese character set | ✓ | ✓ | 
| JIS X 0201-1976: Code for Information Interchange | x | x | 
| JIS X 0208-1990: Code for the Japanese Graphic Character set for information interchange | x | x | 
| JIS X 0212-1990: Code of the supplementary Japanese Graphic Character set for information interchange | x | x | 
| KS X 1001 (registered as ISO-IR 149) for Korean Language | x | x | 
| TIS 620-2533 (1990) Thai Characters Code for Information Interchange | x | x | 
| GB2312: Simplified Chinese character set | x | x | 
These capabilities are available through SpecificCharacterSet.
Structs§
- Default
Character SetCodec  - Data type representing the default character set.
 - Gb18030
Character SetCodec  - Data type for the GB18030 character set encoding.
 - IsoIr100
Character SetCodec  - Data type for the ISO_IR 100 character set encoding.
 - IsoIr101
Character SetCodec  - Data type for the ISO_IR 101 character set encoding.
 - IsoIr109
Character SetCodec  - Data type for the ISO_IR 109 character set encoding.
 - IsoIr110
Character SetCodec  - Data type for the ISO_IR 110 character set encoding.
 - IsoIr144
Character SetCodec  - Data type for the ISO_IR 144 character set encoding.
 - Specific
Character Set  - A descriptor for a specific character set, taking part in text encoding and decoding as per PS3.5 ch 6 6.1.
 - Utf8
Character SetCodec  - Data type for the ISO_IR 192 character set encoding.
 
Enums§
- Decode
Text Error  - An error type for text decoding issues.
 - Encode
Text Error  - An error type for text encoding issues.
 - Text
Validation Outcome  - The result of a text validation procedure (please see 
validate_iso_8859). 
Traits§
- Text
Codec  - A holder of encoding and decoding mechanisms for text in DICOM content, which according to the standard, depends on the specific character set.
 
Functions§
- validate_
cs  - Check whether the given byte slice contains only valid characters for a Code String value representation.
 - validate_
da  - Check whether the given byte slice contains only valid characters for a Date value representation.
 - validate_
dt  - Check whether the given byte slice contains only valid characters for a Date Time value representation.
 - validate_
iso_ 8859  - Check whether the given byte slice contains valid text from the default character repertoire.
 - validate_
tm  - Check whether the given byte slice contains only valid characters for a Time value representation.