Package com.ibm.icu.charset
Class CharsetCompoundText
java.lang.Object
java.nio.charset.Charset
com.ibm.icu.charset.CharsetICU
com.ibm.icu.charset.CharsetCompoundText
- All Implemented Interfaces:
Comparable<Charset>
-
Nested Class Summary
Nested ClassesModifier and TypeClassDescription(package private) class(package private) class -
Field Summary
FieldsModifier and TypeFieldDescriptionprivate static final byteprivate static final byteprivate static final byteprivate static final byteprivate static final byteprivate static final byteprivate static final byte[][]private static final byte[]private static final byteprivate static final byteprivate static final byteprivate static final byteprivate static final byteprivate static final byteprivate static final byteprivate static final byteprivate static final byteprivate CharsetMBCS[]private static final byteprivate static final byteprivate byteFields inherited from class com.ibm.icu.charset.CharsetICU
codepage, conversionType, hasFromUnicodeFallback, hasToUnicodeFallback, icuCanonicalName, maxBytesPerChar, maxCharsPerByte, minBytesPerChar, name, options, platform, ROUNDTRIP_AND_FALLBACK_SET, ROUNDTRIP_SET, subChar, subChar1, subCharLen, unicodeMask -
Constructor Summary
ConstructorsConstructorDescriptionCharsetCompoundText(String icuCanonicalName, String javaCanonicalName, String[] aliases) -
Method Summary
Modifier and TypeMethodDescriptionprivate static intfindNextEsc(ByteBuffer source) private static bytefindStateFromEscSeq(ByteBuffer source, byte[] toUBytes, int toUBytesLength) private static bytegetState(int codepoint) (package private) voidgetUnicodeSetImpl(UnicodeSet setFillIn, int which) This follows ucnv.c method ucnv_detectUnicodeSignature() to detect the start of the stream for example U+FEFF (the Unicode BOM/signature character) that can be ignored.private static booleanisASCIIRange(int codepoint) private static booleanisCompoundS1(int codepoint) private static booleanisCompoundS2(int codepoint) private static booleanisCompoundS3(int codepoint) private static booleanisIBM874(int codepoint) private static booleanisIBM912(int codepoint) private static booleanisIBM913(int codepoint) private static booleanisIBM914(int codepoint) private static booleanisIBM915(int codepoint) private static booleanisIBM916(int codepoint) private static booleanisIBM923(int codepoint) private static booleanisISO8859_14(int codepoint) private voidMethods inherited from class com.ibm.icu.charset.CharsetICU
contains, forNameICU, getCharset, getCompleteUnicodeSet, getNonSurrogateUnicodeSet, getUnicodeSet, isFixedWidth, isSurrogateMethods inherited from class java.nio.charset.Charset
aliases, availableCharsets, canEncode, compareTo, decode, defaultCharset, displayName, displayName, encode, encode, equals, forName, forName, hashCode, isRegistered, isSupported, name, toString
-
Field Details
-
fromUSubstitution
private static final byte[] fromUSubstitution -
myConverterArray
-
state
private byte state -
INVALID
private static final byte INVALID- See Also:
-
DO_SEARCH
private static final byte DO_SEARCH- See Also:
-
COMPOUND_TEXT_SINGLE_0
private static final byte COMPOUND_TEXT_SINGLE_0- See Also:
-
COMPOUND_TEXT_SINGLE_1
private static final byte COMPOUND_TEXT_SINGLE_1- See Also:
-
COMPOUND_TEXT_SINGLE_2
private static final byte COMPOUND_TEXT_SINGLE_2- See Also:
-
COMPOUND_TEXT_SINGLE_3
private static final byte COMPOUND_TEXT_SINGLE_3- See Also:
-
IBM_915
private static final byte IBM_915- See Also:
-
IBM_916
private static final byte IBM_916- See Also:
-
IBM_914
private static final byte IBM_914- See Also:
-
IBM_874
private static final byte IBM_874- See Also:
-
IBM_912
private static final byte IBM_912- See Also:
-
IBM_913
private static final byte IBM_913- See Also:
-
ISO_8859_14
private static final byte ISO_8859_14- See Also:
-
IBM_923
private static final byte IBM_923- See Also:
-
NUM_OF_CONVERTERS
private static final byte NUM_OF_CONVERTERS- See Also:
-
SEARCH_LENGTH
private static final byte SEARCH_LENGTH- See Also:
-
escSeqCompoundText
private static final byte[][] escSeqCompoundText -
ESC_START
private static final byte ESC_START- See Also:
-
-
Constructor Details
-
CharsetCompoundText
-
-
Method Details
-
isASCIIRange
private static boolean isASCIIRange(int codepoint) -
isIBM915
private static boolean isIBM915(int codepoint) -
isIBM916
private static boolean isIBM916(int codepoint) -
isCompoundS3
private static boolean isCompoundS3(int codepoint) -
isCompoundS2
private static boolean isCompoundS2(int codepoint) -
isIBM914
private static boolean isIBM914(int codepoint) -
isIBM874
private static boolean isIBM874(int codepoint) -
isIBM912
private static boolean isIBM912(int codepoint) -
isIBM913
private static boolean isIBM913(int codepoint) -
isCompoundS1
private static boolean isCompoundS1(int codepoint) -
isISO8859_14
private static boolean isISO8859_14(int codepoint) -
isIBM923
private static boolean isIBM923(int codepoint) -
findNextEsc
-
getState
private static byte getState(int codepoint) -
findStateFromEscSeq
-
LoadConverters
private void LoadConverters() -
newDecoder
- Specified by:
newDecoderin classCharset
-
newEncoder
- Specified by:
newEncoderin classCharset
-
getUnicodeSetImpl
Description copied from class:CharsetICUThis follows ucnv.c method ucnv_detectUnicodeSignature() to detect the start of the stream for example U+FEFF (the Unicode BOM/signature character) that can be ignored. Detects Unicode signature byte sequences at the start of the byte stream and returns number of bytes of the BOM of the indicated Unicode charset. 0 is returned when no Unicode signature is recognized.- Specified by:
getUnicodeSetImplin classCharsetICU
-