Package com.ibm.icu.charset
Class CharsetCompoundText
- java.lang.Object
-
- java.nio.charset.Charset
-
- com.ibm.icu.charset.CharsetICU
-
- com.ibm.icu.charset.CharsetCompoundText
-
- All Implemented Interfaces:
java.lang.Comparable<java.nio.charset.Charset>
class CharsetCompoundText extends CharsetICU
-
-
Nested Class Summary
Nested Classes
Modifier and Type    Class    Description
(package private) class
CharsetCompoundText.CharsetDecoderCompoundText
(package private) class
CharsetCompoundText.CharsetEncoderCompoundText
-
Field Summary
Fields
Modifier and Type    Field    Description
private static byte
COMPOUND_TEXT_SINGLE_0
private static byte
COMPOUND_TEXT_SINGLE_1
private static byte
COMPOUND_TEXT_SINGLE_2
private static byte
COMPOUND_TEXT_SINGLE_3
private static byte
DO_SEARCH
private static byte
ESC_START
private static byte[][]
escSeqCompoundText
private static byte[]
fromUSubstitution
private static byte
IBM_874
private static byte
IBM_912
private static byte
IBM_913
private static byte
IBM_914
private static byte
IBM_915
private static byte
IBM_916
private static byte
IBM_923
private static byte
INVALID
private static byte
ISO_8859_14
private CharsetMBCS[]
myConverterArray
private static byte
NUM_OF_CONVERTERS
private static byte
SEARCH_LENGTH
private byte
state
-
Fields inherited from class com.ibm.icu.charset.CharsetICU
codepage, conversionType, hasFromUnicodeFallback, hasToUnicodeFallback, icuCanonicalName, maxBytesPerChar, maxCharsPerByte, minBytesPerChar, name, options, platform, ROUNDTRIP_AND_FALLBACK_SET, ROUNDTRIP_SET, subChar, subChar1, subCharLen, unicodeMask
-
-
Constructor Summary
Constructors
Constructor    Description
CharsetCompoundText(java.lang.String icuCanonicalName, java.lang.String javaCanonicalName, java.lang.String[] aliases)
-
Method Summary
Modifier and Type    Method    Description
private static int
findNextEsc(java.nio.ByteBuffer source)
private static byte
findStateFromEscSeq(java.nio.ByteBuffer source, byte[] toUBytes, int toUBytesLength)
private static byte
getState(int codepoint)
(package private) void
getUnicodeSetImpl(UnicodeSet setFillIn, int which)
This follows the ucnv.c function ucnv_detectUnicodeSignature() to detect the start of the stream, for example U+FEFF (the Unicode BOM/signature character), which can be ignored.
private static boolean
isASCIIRange(int codepoint)
private static boolean
isCompoundS1(int codepoint)
private static boolean
isCompoundS2(int codepoint)
private static boolean
isCompoundS3(int codepoint)
private static boolean
isIBM874(int codepoint)
private static boolean
isIBM912(int codepoint)
private static boolean
isIBM913(int codepoint)
private static boolean
isIBM914(int codepoint)
private static boolean
isIBM915(int codepoint)
private static boolean
isIBM916(int codepoint)
private static boolean
isIBM923(int codepoint)
private static boolean
isISO8859_14(int codepoint)
private void
LoadConverters()
java.nio.charset.CharsetDecoder
newDecoder()
java.nio.charset.CharsetEncoder
newEncoder()
-
Methods inherited from class com.ibm.icu.charset.CharsetICU
contains, forNameICU, getCharset, getCompleteUnicodeSet, getNonSurrogateUnicodeSet, getUnicodeSet, isFixedWidth, isSurrogate
-
Field Detail
-
fromUSubstitution
private static final byte[] fromUSubstitution
-
myConverterArray
private CharsetMBCS[] myConverterArray
-
state
private byte state
-
INVALID
private static final byte INVALID
- See Also:
- Constant Field Values
-
DO_SEARCH
private static final byte DO_SEARCH
- See Also:
- Constant Field Values
-
COMPOUND_TEXT_SINGLE_0
private static final byte COMPOUND_TEXT_SINGLE_0
- See Also:
- Constant Field Values
-
COMPOUND_TEXT_SINGLE_1
private static final byte COMPOUND_TEXT_SINGLE_1
- See Also:
- Constant Field Values
-
COMPOUND_TEXT_SINGLE_2
private static final byte COMPOUND_TEXT_SINGLE_2
- See Also:
- Constant Field Values
-
COMPOUND_TEXT_SINGLE_3
private static final byte COMPOUND_TEXT_SINGLE_3
- See Also:
- Constant Field Values
-
IBM_915
private static final byte IBM_915
- See Also:
- Constant Field Values
-
IBM_916
private static final byte IBM_916
- See Also:
- Constant Field Values
-
IBM_914
private static final byte IBM_914
- See Also:
- Constant Field Values
-
IBM_874
private static final byte IBM_874
- See Also:
- Constant Field Values
-
IBM_912
private static final byte IBM_912
- See Also:
- Constant Field Values
-
IBM_913
private static final byte IBM_913
- See Also:
- Constant Field Values
-
ISO_8859_14
private static final byte ISO_8859_14
- See Also:
- Constant Field Values
-
IBM_923
private static final byte IBM_923
- See Also:
- Constant Field Values
-
NUM_OF_CONVERTERS
private static final byte NUM_OF_CONVERTERS
- See Also:
- Constant Field Values
-
SEARCH_LENGTH
private static final byte SEARCH_LENGTH
- See Also:
- Constant Field Values
-
escSeqCompoundText
private static final byte[][] escSeqCompoundText
-
ESC_START
private static final byte ESC_START
- See Also:
- Constant Field Values
-
-
Method Detail
-
isASCIIRange
private static boolean isASCIIRange(int codepoint)
-
isIBM915
private static boolean isIBM915(int codepoint)
-
isIBM916
private static boolean isIBM916(int codepoint)
-
isCompoundS3
private static boolean isCompoundS3(int codepoint)
-
isCompoundS2
private static boolean isCompoundS2(int codepoint)
-
isIBM914
private static boolean isIBM914(int codepoint)
-
isIBM874
private static boolean isIBM874(int codepoint)
-
isIBM912
private static boolean isIBM912(int codepoint)
-
isIBM913
private static boolean isIBM913(int codepoint)
-
isCompoundS1
private static boolean isCompoundS1(int codepoint)
-
isISO8859_14
private static boolean isISO8859_14(int codepoint)
-
isIBM923
private static boolean isIBM923(int codepoint)
-
findNextEsc
private static int findNextEsc(java.nio.ByteBuffer source)
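The page gives only the signature of findNextEsc, so the following is a minimal sketch of what a scan for the next escape sequence might look like, not the ICU implementation. It assumes the escape introducer is the ASCII ESC byte (0x1B), that the scan starts at the buffer's current position, and that the buffer's limit is returned when no escape byte is found; the class name EscScanSketch is hypothetical.

```java
import java.nio.ByteBuffer;

public class EscScanSketch {
    // Assumption: escape sequences begin with the ASCII ESC byte.
    static final byte ESC = 0x1B;

    // Returns the absolute index of the first ESC byte at or after the
    // buffer's position, or the buffer's limit if there is none.
    // Absolute get(int) is used so the buffer's position is not moved.
    static int findNextEsc(ByteBuffer source) {
        for (int i = source.position(); i < source.limit(); i++) {
            if (source.get(i) == ESC) {
                return i;
            }
        }
        return source.limit();
    }

    public static void main(String[] args) {
        ByteBuffer buf = ByteBuffer.wrap(new byte[] {'a', 'b', 0x1B, '-', 'A'});
        System.out.println(findNextEsc(buf)); // prints 2
    }
}
```

Returning the limit rather than -1 lets a caller treat the result as the end of the current run of bytes in either case.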
-
getState
private static byte getState(int codepoint)
-
findStateFromEscSeq
private static byte findStateFromEscSeq(java.nio.ByteBuffer source, byte[] toUBytes, int toUBytesLength)
-
LoadConverters
private void LoadConverters()
-
newDecoder
public java.nio.charset.CharsetDecoder newDecoder()
- Specified by:
newDecoder
in class java.nio.charset.Charset
-
newEncoder
public java.nio.charset.CharsetEncoder newEncoder()
- Specified by:
newEncoder
in class java.nio.charset.Charset
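Because newDecoder() and newEncoder() are the standard java.nio.charset.Charset entry points, a caller would typically use them as in the sketch below. To keep it runnable without ICU on the classpath, it uses ISO-8859-1 as a stand-in charset; this is an assumption for illustration only. The class name CharsetRoundTrip and the helper roundTrip are hypothetical, and this page does not state the lookup name under which the compound text charset is registered.

```java
import java.nio.ByteBuffer;
import java.nio.CharBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.Charset;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CharsetEncoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

public class CharsetRoundTrip {
    // Encodes text with a fresh encoder, then decodes the resulting bytes
    // with a fresh decoder from the same charset.
    static String roundTrip(Charset charset, String text) {
        CharsetEncoder encoder = charset.newEncoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        CharsetDecoder decoder = charset.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        try {
            ByteBuffer bytes = encoder.encode(CharBuffer.wrap(text));
            return decoder.decode(bytes).toString();
        } catch (CharacterCodingException e) {
            // Reported when the text cannot be represented in the charset.
            throw new IllegalArgumentException("text not representable in " + charset, e);
        }
    }

    public static void main(String[] args) {
        // Stand-in charset (assumption); with ICU available, a Charset obtained
        // from ICU's charset lookup could be passed here instead.
        Charset charset = StandardCharsets.ISO_8859_1;
        System.out.println(roundTrip(charset, "caf\u00e9").equals("caf\u00e9")); // prints true
    }
}
```

Encoders and decoders are stateful, so the sketch creates a new pair for each call instead of sharing them.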
-
getUnicodeSetImpl
void getUnicodeSetImpl(UnicodeSet setFillIn, int which)
Description copied from class: CharsetICU
This follows the ucnv.c function ucnv_detectUnicodeSignature() to detect the start of the stream, for example U+FEFF (the Unicode BOM/signature character), which can be ignored. Detects Unicode signature byte sequences at the start of the byte stream and returns the number of bytes in the BOM of the indicated Unicode charset. Returns 0 when no Unicode signature is recognized.
- Specified by:
getUnicodeSetImpl
in class CharsetICU
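The copied description talks about recognising a Unicode signature and returning its byte length. The sketch below illustrates that idea only; it is not the ICU code and does not show what getUnicodeSetImpl does. The class name BomSketch and the method signatureLength are hypothetical, and the set of signatures checked (UTF-8, UTF-16, UTF-32) is an assumption.

```java
public class BomSketch {
    // Returns the length in bytes of a recognised Unicode signature at the
    // start of the array, or 0 when none is recognised.
    static int signatureLength(byte[] bytes) {
        // UTF-32 is checked before UTF-16 because FF FE 00 00 starts with FF FE.
        if (startsWith(bytes, 0x00, 0x00, 0xFE, 0xFF)) return 4; // UTF-32BE
        if (startsWith(bytes, 0xFF, 0xFE, 0x00, 0x00)) return 4; // UTF-32LE
        if (startsWith(bytes, 0xEF, 0xBB, 0xBF)) return 3;       // UTF-8
        if (startsWith(bytes, 0xFE, 0xFF)) return 2;             // UTF-16BE
        if (startsWith(bytes, 0xFF, 0xFE)) return 2;             // UTF-16LE
        return 0;
    }

    // Compares the leading bytes, treated as unsigned values, to the prefix.
    private static boolean startsWith(byte[] bytes, int... prefix) {
        if (bytes.length < prefix.length) return false;
        for (int i = 0; i < prefix.length; i++) {
            if ((bytes[i] & 0xFF) != prefix[i]) return false;
        }
        return true;
    }

    public static void main(String[] args) {
        byte[] utf8 = {(byte) 0xEF, (byte) 0xBB, (byte) 0xBF, 'h', 'i'};
        System.out.println(signatureLength(utf8)); // prints 3
    }
}
```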