Class UCharacterProperty
- java.lang.Object
-
- com.ibm.icu.impl.UCharacterProperty
-
public final class UCharacterProperty extends java.lang.Object
Internal class used for Unicode character property database.
This classes store binary data read from uprops.icu. It does not have the capability to parse the data into more high-level information. It only returns bytes of information when required.
Due to the form most commonly used for retrieval, array of char is used to store the binary data.
UCharacterPropertyDB also contains information on accessing indexes to significant points in the binary data.
Responsibility for molding the binary data into more meaning form lies on UCharacter.
- Since:
- release 2.1, february 1st 2002
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description private class
UCharacterProperty.BiDiIntProperty
private class
UCharacterProperty.BinaryProperty
private class
UCharacterProperty.CaseBinaryProperty
private class
UCharacterProperty.CombiningClassIntProperty
private class
UCharacterProperty.EmojiBinaryProperty
private class
UCharacterProperty.IntProperty
private static class
UCharacterProperty.IsAcceptable
private static class
UCharacterProperty.LayoutProps
private class
UCharacterProperty.MathCompatBinaryProperty
private class
UCharacterProperty.MCMBinaryProperty
private class
UCharacterProperty.NormInertBinaryProperty
private class
UCharacterProperty.NormQuickCheckIntProperty
-
Field Summary
Fields Modifier and Type Field Description private static int
AGE_SHIFT_
Age value shiftprivate static int
ALPHABETIC_PROPERTY_
private static int
ASCII_HEX_DIGIT_PROPERTY_
(package private) UCharacterProperty.BinaryProperty[]
binProps
private static int
CGJ
private static int
CR
private static int
DASH_PROPERTY_
private static java.lang.String
DATA_FILE_NAME_
Default name of the datafileprivate static int
DATA_FORMAT
private static int
DECOMPOSITION_TYPE_MASK_
Integer properties mask for decomposition type.private static int
DEFAULT_IGNORABLE_CODE_POINT_PROPERTY_
private static int
DEL
private static int
DEPRECATED_PROPERTY_
private static int
DIACRITIC_PROPERTY_
private static int
EAST_ASIAN_MASK_
private static int
EAST_ASIAN_SHIFT_
private static int
EXTENDER_PROPERTY_
private static int
FIGURESP
private static int
GC_CC_MASK
private static int
GC_CN_MASK
private static int
GC_CS_MASK
private static int
GC_Z_MASK
Mask constant for multiple UCharCategory bits (Z Separators).private static int
GC_ZL_MASK
private static int
GC_ZP_MASK
private static int
GC_ZS_MASK
private static int
GCB_MASK
private static int
GCB_SHIFT
private static int[]
gcbToHst
private static int
GRAPHEME_BASE_PROPERTY_
private static int
GRAPHEME_EXTEND_PROPERTY_
private static int
GRAPHEME_LINK_PROPERTY_
private static int
HAIRSP
private static int
HEX_DIGIT_PROPERTY_
private static int
HYPHEN_PROPERTY_
private static int[]
ID_COMPAT_MATH_CONTINUE
Ranges (start/limit pairs) of ID_Compat_Math_Continue (only), from UCD PropList.txt.private static int[]
ID_COMPAT_MATH_START
ID_Compat_Math_Start characters, from UCD PropList.txt.private static int
ID_CONTINUE_PROPERTY_
private static int
ID_START_PROPERTY_
private static int
ID_TYPE_ALLOWED_MIN
private static int
ID_TYPE_BIT
private static int
ID_TYPE_DEFAULT_IGNORABLE
private static int
ID_TYPE_DEPRECATED
private static int
ID_TYPE_EXCLUSION
private static int
ID_TYPE_FORBIDDEN
private static int
ID_TYPE_INCLUSION
private static int
ID_TYPE_LIMITED_USE
private static int
ID_TYPE_NOT_CHARACTER
private static int
ID_TYPE_NOT_NFKC
private static int
ID_TYPE_NOT_XID
private static int
ID_TYPE_OBSOLETE
private static int
ID_TYPE_RECOMMENDED
private static int
ID_TYPE_SHIFT
private static int
ID_TYPE_TECHNICAL
private static int
ID_TYPE_UNCOMMON_USE
private static int
IDEOGRAPHIC_PROPERTY_
private static int
IDS_BINARY_OPERATOR_PROPERTY_
private static int
IDS_TRINARY_OPERATOR_PROPERTY_
private static int[]
idTypeToEncoded
Maps UIdentifierType to encoded bits.private static int
INCB_MASK
private static int
INCB_SHIFT
private static int
INHSWAP
static UCharacterProperty
INSTANCE
(package private) UCharacterProperty.IntProperty[]
intProps
static char
LATIN_CAPITAL_LETTER_I_WITH_DOT_ABOVE_
Latin capital letter i with dot abovestatic char
LATIN_SMALL_LETTER_DOTLESS_I_
Latin small letter i with dot abovestatic char
LATIN_SMALL_LETTER_I_
Latin lowercase iprivate static int
LB_MASK
private static int
LB_SHIFT
private static int
LOGICAL_ORDER_EXCEPTION_PROPERTY_
(package private) int
m_additionalColumnsCount_
Number of additional columns(package private) Trie2_16
m_additionalTrie_
Extra property trie(package private) int[]
m_additionalVectors_
Extra property vectors, 1st column for age and second for binary properties.(package private) CodePointTrie
m_blockTrie_
(package private) int
m_maxBlockScriptValue_
Maximum values for block, bits used as in vector word 0(package private) int
m_maxJTGValue_
Maximum values for script, bits used as in vector word 0(package private) int
m_maxValuesOther_
maximum values for other code valueschar[]
m_scriptExtensions_
Script_Extensions dataTrie2_16
m_trie_
Trie dataVersionInfo
m_unicodeVersion_
Unicode versionprivate static int
MATH_PROPERTY_
private static int
MAX_BLOCK
static int
MAX_SCRIPT
private static int[]
MODIFIER_COMBINING_MARK
Ranges (start/limit pairs) of Modifier_Combining_mark (only), from UCD PropList.txt.(package private) static int
MY_MASK
private static int
NBSP
private static int
NL
private static int
NNBSP
private static int
NOMDIG
private static int
NONCHARACTER_CODE_POINT_PROPERTY_
private static int
NTV_BASE60_START_
Sexagesimal numbers: ((ntv>>2)-0xbf) * 60^((ntv&3)+1) = (1..9)*(60^1..60^4)private static int
NTV_DECIMAL_START_
Decimal digits: nv=0..9private static int
NTV_DIGIT_START_
Other digits: nv=0..9private static int
NTV_FRACTION_START_
Fractions: ((ntv>>4)-12) / ((ntv&0xf)+1) = -1..17 / 1..16private static int
NTV_FRACTION20_START_
Fraction-20 values: frac20 = ntv-0x324 = 0..0x17 -> 1|3|5|7 / 20|40|80|160|320|640 numerator: num = 2*(frac20&3)+1 denominator: den = 20<<(frac20>>2)private static int
NTV_FRACTION32_START_
Fraction-32 values: frac32 = ntv-0x34c = 0..15 -> 1|3|5|7 / 32|64|128|256 numerator: num = 2*(frac32&3)+1 denominator: den = 32<<(frac32>>2)private static int
NTV_LARGE_START_
Large integers: ((ntv>>5)-14) * 10^((ntv&0x1f)+2) = (1..9)*(10^2..10^33) (only one significant decimal digit)private static int
NTV_NONE_
No numeric value.private static int
NTV_NUMERIC_START_
Small integers: nv=0..154private static int
NTV_RESERVED_START_
No numeric value (yet).private static int
NUMERIC_TYPE_VALUE_SHIFT_
Numeric types and values in the main properties words.private static int
PATTERN_SYNTAX
private static int
PATTERN_WHITE_SPACE
private static int
PREPENDED_CONCATENATION_MARK
private static int
QUOTATION_MARK_PROPERTY_
private static int
RADICAL_PROPERTY_
private static int
RLM
private static int
S_TERM_PROPERTY_
private static int
SB_MASK
private static int
SB_SHIFT
static int
SCRIPT_X_MASK
Script_Extensions: mask includes Scriptstatic int
SCRIPT_X_WITH_COMMON
static int
SCRIPT_X_WITH_INHERITED
static int
SCRIPT_X_WITH_OTHER
static int
SRC_BIDI
From ubidi_props.c/ubidi.icustatic int
SRC_BLOCK
static int
SRC_CASE
From ucase.c/ucase.icustatic int
SRC_CASE_AND_NORM
From ucase.c/ucase.icu as well as unorm.cpp/unorm.icustatic int
SRC_CHAR
From uchar.c/uprops.icu main triestatic int
SRC_CHAR_AND_PROPSVEC
From uchar.c/uprops.icu main trie as well as properties vectors triestatic int
SRC_COUNT
One more than the highest UPropertySource (SRC_) constant.static int
SRC_EMOJI
static int
SRC_ID_COMPAT_MATH
static int
SRC_IDSU
static int
SRC_INPC
static int
SRC_INSC
static int
SRC_MCM
static int
SRC_NAMES
From unames.c/unames.icustatic int
SRC_NFC
From normalizer2impl.cpp/nfc.nrmstatic int
SRC_NFC_CANON_ITER
From normalizer2impl.cpp/nfc.nrm canonical iterator datastatic int
SRC_NFKC
From normalizer2impl.cpp/nfkc.nrmstatic int
SRC_NFKC_CF
From normalizer2impl.cpp/nfkc_cf.nrmstatic int
SRC_NONE
No source, not a supported property.static int
SRC_PROPSVEC
From uchar.c/uprops.icu properties vectors triestatic int
SRC_VO
private static int
TAB
private static int
TERMINAL_PUNCTUATION_PROPERTY_
static int
TYPE_MASK
Character type maskprivate static int
U_a
private static int
U_A
private static int
U_f
private static int
U_F
private static int
U_FW_a
private static int
U_FW_A
private static int
U_FW_f
private static int
U_FW_F
private static int
U_FW_z
private static int
U_FW_Z
private static int
U_z
private static int
U_Z
private static int
UNIFIED_IDEOGRAPH_PROPERTY_
private static int
VARIATION_SELECTOR_PROPERTY_
private static int
WB_MASK
private static int
WB_SHIFT
private static int
WHITE_SPACE_PROPERTY_
Additional properties used in internal trie dataprivate static int
WJ
private static int
XID_CONTINUE_PROPERTY_
private static int
XID_START_PROPERTY_
private static int
ZWNBSP
-
Constructor Summary
Constructors Modifier Constructor Description private
UCharacterProperty()
Constructor
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description UnicodeSet
addPropertyStarts(UnicodeSet set)
int
digit(int c)
int
getAdditional(int codepoint, int column)
Gets the unicode additional properties.VersionInfo
getAge(int codepoint)
Get the "age" of the code point.static int
getEuropeanDigit(int ch)
Returns the digit values of characters like 'A' - 'Z', normal, half-width and full-width.int
getIDTypes(int c, java.util.EnumSet<UCharacter.IdentifierType> types)
int
getIntPropertyMaxValue(int which)
int
getIntPropertyValue(int c, int which)
static int
getMask(int type)
Gets the type maskint
getMaxValues(int column)
Get the the maximum values for some enum/int properties.private static int
getNumericTypeValue(int props)
int
getNumericValue(int c)
int
getProperty(int ch)
Gets the main property value for code point ch.(package private) int
getSource(int which)
int
getType(int c)
double
getUnicodeNumericValue(int c)
boolean
hasBinaryProperty(int c, int which)
boolean
hasIDType(int c, int typeIndex)
boolean
hasIDType(int c, UCharacter.IdentifierType type)
private static boolean
isgraphPOSIX(int c)
Checks if c is in [^\p{space}\p{gc=Control}\p{gc=Surrogate}\p{gc=Unassigned}] with space=\p{Whitespace} and Control=Cc.(package private) static void
mathCompat_addPropertyStarts(UnicodeSet set)
private static void
maybeAddType(int value, int bit, UCharacter.IdentifierType t, java.util.EnumSet<UCharacter.IdentifierType> types)
(package private) static void
mcm_addPropertyStarts(UnicodeSet set)
private static int
ntvGetType(int ntv)
void
ublock_addPropertyStarts(UnicodeSet set)
(package private) static UnicodeSet
ulayout_addPropertyStarts(int src, UnicodeSet set)
void
upropsvec_addPropertyStarts(UnicodeSet set)
-
-
-
Field Detail
-
INSTANCE
public static final UCharacterProperty INSTANCE
-
m_trie_
public Trie2_16 m_trie_
Trie data
-
m_unicodeVersion_
public VersionInfo m_unicodeVersion_
Unicode version
-
LATIN_CAPITAL_LETTER_I_WITH_DOT_ABOVE_
public static final char LATIN_CAPITAL_LETTER_I_WITH_DOT_ABOVE_
Latin capital letter i with dot above- See Also:
- Constant Field Values
-
LATIN_SMALL_LETTER_DOTLESS_I_
public static final char LATIN_SMALL_LETTER_DOTLESS_I_
Latin small letter i with dot above- See Also:
- Constant Field Values
-
LATIN_SMALL_LETTER_I_
public static final char LATIN_SMALL_LETTER_I_
Latin lowercase i- See Also:
- Constant Field Values
-
TYPE_MASK
public static final int TYPE_MASK
Character type mask- See Also:
- Constant Field Values
-
SRC_NONE
public static final int SRC_NONE
No source, not a supported property.- See Also:
- Constant Field Values
-
SRC_CHAR
public static final int SRC_CHAR
From uchar.c/uprops.icu main trie- See Also:
- Constant Field Values
-
SRC_PROPSVEC
public static final int SRC_PROPSVEC
From uchar.c/uprops.icu properties vectors trie- See Also:
- Constant Field Values
-
SRC_NAMES
public static final int SRC_NAMES
From unames.c/unames.icu- See Also:
- Constant Field Values
-
SRC_CASE
public static final int SRC_CASE
From ucase.c/ucase.icu- See Also:
- Constant Field Values
-
SRC_BIDI
public static final int SRC_BIDI
From ubidi_props.c/ubidi.icu- See Also:
- Constant Field Values
-
SRC_CHAR_AND_PROPSVEC
public static final int SRC_CHAR_AND_PROPSVEC
From uchar.c/uprops.icu main trie as well as properties vectors trie- See Also:
- Constant Field Values
-
SRC_CASE_AND_NORM
public static final int SRC_CASE_AND_NORM
From ucase.c/ucase.icu as well as unorm.cpp/unorm.icu- See Also:
- Constant Field Values
-
SRC_NFC
public static final int SRC_NFC
From normalizer2impl.cpp/nfc.nrm- See Also:
- Constant Field Values
-
SRC_NFKC
public static final int SRC_NFKC
From normalizer2impl.cpp/nfkc.nrm- See Also:
- Constant Field Values
-
SRC_NFKC_CF
public static final int SRC_NFKC_CF
From normalizer2impl.cpp/nfkc_cf.nrm- See Also:
- Constant Field Values
-
SRC_NFC_CANON_ITER
public static final int SRC_NFC_CANON_ITER
From normalizer2impl.cpp/nfc.nrm canonical iterator data- See Also:
- Constant Field Values
-
SRC_INPC
public static final int SRC_INPC
- See Also:
- Constant Field Values
-
SRC_INSC
public static final int SRC_INSC
- See Also:
- Constant Field Values
-
SRC_VO
public static final int SRC_VO
- See Also:
- Constant Field Values
-
SRC_EMOJI
public static final int SRC_EMOJI
- See Also:
- Constant Field Values
-
SRC_IDSU
public static final int SRC_IDSU
- See Also:
- Constant Field Values
-
SRC_ID_COMPAT_MATH
public static final int SRC_ID_COMPAT_MATH
- See Also:
- Constant Field Values
-
SRC_BLOCK
public static final int SRC_BLOCK
- See Also:
- Constant Field Values
-
SRC_MCM
public static final int SRC_MCM
- See Also:
- Constant Field Values
-
SRC_COUNT
public static final int SRC_COUNT
One more than the highest UPropertySource (SRC_) constant.- See Also:
- Constant Field Values
-
MY_MASK
static final int MY_MASK
- See Also:
- Constant Field Values
-
GC_CN_MASK
private static final int GC_CN_MASK
-
GC_CC_MASK
private static final int GC_CC_MASK
-
GC_CS_MASK
private static final int GC_CS_MASK
-
GC_ZS_MASK
private static final int GC_ZS_MASK
-
GC_ZL_MASK
private static final int GC_ZL_MASK
-
GC_ZP_MASK
private static final int GC_ZP_MASK
-
GC_Z_MASK
private static final int GC_Z_MASK
Mask constant for multiple UCharCategory bits (Z Separators).
-
ID_COMPAT_MATH_CONTINUE
private static final int[] ID_COMPAT_MATH_CONTINUE
Ranges (start/limit pairs) of ID_Compat_Math_Continue (only), from UCD PropList.txt.
-
ID_COMPAT_MATH_START
private static final int[] ID_COMPAT_MATH_START
ID_Compat_Math_Start characters, from UCD PropList.txt.
-
MODIFIER_COMBINING_MARK
private static final int[] MODIFIER_COMBINING_MARK
Ranges (start/limit pairs) of Modifier_Combining_mark (only), from UCD PropList.txt.
-
binProps
UCharacterProperty.BinaryProperty[] binProps
-
gcbToHst
private static final int[] gcbToHst
-
intProps
UCharacterProperty.IntProperty[] intProps
-
m_additionalTrie_
Trie2_16 m_additionalTrie_
Extra property trie
-
m_additionalVectors_
int[] m_additionalVectors_
Extra property vectors, 1st column for age and second for binary properties.
-
m_additionalColumnsCount_
int m_additionalColumnsCount_
Number of additional columns
-
m_maxBlockScriptValue_
int m_maxBlockScriptValue_
Maximum values for block, bits used as in vector word 0
-
m_maxJTGValue_
int m_maxJTGValue_
Maximum values for script, bits used as in vector word 0
-
m_maxValuesOther_
int m_maxValuesOther_
maximum values for other code values
-
m_scriptExtensions_
public char[] m_scriptExtensions_
Script_Extensions data
-
m_blockTrie_
CodePointTrie m_blockTrie_
-
DATA_FILE_NAME_
private static final java.lang.String DATA_FILE_NAME_
Default name of the datafile- See Also:
- Constant Field Values
-
NUMERIC_TYPE_VALUE_SHIFT_
private static final int NUMERIC_TYPE_VALUE_SHIFT_
Numeric types and values in the main properties words.- See Also:
- Constant Field Values
-
NTV_NONE_
private static final int NTV_NONE_
No numeric value.- See Also:
- Constant Field Values
-
NTV_DECIMAL_START_
private static final int NTV_DECIMAL_START_
Decimal digits: nv=0..9- See Also:
- Constant Field Values
-
NTV_DIGIT_START_
private static final int NTV_DIGIT_START_
Other digits: nv=0..9- See Also:
- Constant Field Values
-
NTV_NUMERIC_START_
private static final int NTV_NUMERIC_START_
Small integers: nv=0..154- See Also:
- Constant Field Values
-
NTV_FRACTION_START_
private static final int NTV_FRACTION_START_
Fractions: ((ntv>>4)-12) / ((ntv&0xf)+1) = -1..17 / 1..16- See Also:
- Constant Field Values
-
NTV_LARGE_START_
private static final int NTV_LARGE_START_
Large integers: ((ntv>>5)-14) * 10^((ntv&0x1f)+2) = (1..9)*(10^2..10^33) (only one significant decimal digit)- See Also:
- Constant Field Values
-
NTV_BASE60_START_
private static final int NTV_BASE60_START_
Sexagesimal numbers: ((ntv>>2)-0xbf) * 60^((ntv&3)+1) = (1..9)*(60^1..60^4)- See Also:
- Constant Field Values
-
NTV_FRACTION20_START_
private static final int NTV_FRACTION20_START_
Fraction-20 values: frac20 = ntv-0x324 = 0..0x17 -> 1|3|5|7 / 20|40|80|160|320|640 numerator: num = 2*(frac20&3)+1 denominator: den = 20<<(frac20>>2)- See Also:
- Constant Field Values
-
NTV_FRACTION32_START_
private static final int NTV_FRACTION32_START_
Fraction-32 values: frac32 = ntv-0x34c = 0..15 -> 1|3|5|7 / 32|64|128|256 numerator: num = 2*(frac32&3)+1 denominator: den = 32<<(frac32>>2)- See Also:
- Constant Field Values
-
NTV_RESERVED_START_
private static final int NTV_RESERVED_START_
No numeric value (yet).- See Also:
- Constant Field Values
-
EAST_ASIAN_MASK_
private static final int EAST_ASIAN_MASK_
- See Also:
- Constant Field Values
-
EAST_ASIAN_SHIFT_
private static final int EAST_ASIAN_SHIFT_
- See Also:
- Constant Field Values
-
INCB_MASK
private static final int INCB_MASK
- See Also:
- Constant Field Values
-
INCB_SHIFT
private static final int INCB_SHIFT
- See Also:
- Constant Field Values
-
SCRIPT_X_MASK
public static final int SCRIPT_X_MASK
Script_Extensions: mask includes Script- See Also:
- Constant Field Values
-
SCRIPT_X_WITH_OTHER
public static final int SCRIPT_X_WITH_OTHER
- See Also:
- Constant Field Values
-
SCRIPT_X_WITH_INHERITED
public static final int SCRIPT_X_WITH_INHERITED
- See Also:
- Constant Field Values
-
SCRIPT_X_WITH_COMMON
public static final int SCRIPT_X_WITH_COMMON
- See Also:
- Constant Field Values
-
MAX_SCRIPT
public static final int MAX_SCRIPT
- See Also:
- Constant Field Values
-
WHITE_SPACE_PROPERTY_
private static final int WHITE_SPACE_PROPERTY_
Additional properties used in internal trie data- See Also:
- Constant Field Values
-
DASH_PROPERTY_
private static final int DASH_PROPERTY_
- See Also:
- Constant Field Values
-
HYPHEN_PROPERTY_
private static final int HYPHEN_PROPERTY_
- See Also:
- Constant Field Values
-
QUOTATION_MARK_PROPERTY_
private static final int QUOTATION_MARK_PROPERTY_
- See Also:
- Constant Field Values
-
TERMINAL_PUNCTUATION_PROPERTY_
private static final int TERMINAL_PUNCTUATION_PROPERTY_
- See Also:
- Constant Field Values
-
MATH_PROPERTY_
private static final int MATH_PROPERTY_
- See Also:
- Constant Field Values
-
HEX_DIGIT_PROPERTY_
private static final int HEX_DIGIT_PROPERTY_
- See Also:
- Constant Field Values
-
ASCII_HEX_DIGIT_PROPERTY_
private static final int ASCII_HEX_DIGIT_PROPERTY_
- See Also:
- Constant Field Values
-
ALPHABETIC_PROPERTY_
private static final int ALPHABETIC_PROPERTY_
- See Also:
- Constant Field Values
-
IDEOGRAPHIC_PROPERTY_
private static final int IDEOGRAPHIC_PROPERTY_
- See Also:
- Constant Field Values
-
DIACRITIC_PROPERTY_
private static final int DIACRITIC_PROPERTY_
- See Also:
- Constant Field Values
-
EXTENDER_PROPERTY_
private static final int EXTENDER_PROPERTY_
- See Also:
- Constant Field Values
-
NONCHARACTER_CODE_POINT_PROPERTY_
private static final int NONCHARACTER_CODE_POINT_PROPERTY_
- See Also:
- Constant Field Values
-
GRAPHEME_EXTEND_PROPERTY_
private static final int GRAPHEME_EXTEND_PROPERTY_
- See Also:
- Constant Field Values
-
GRAPHEME_LINK_PROPERTY_
private static final int GRAPHEME_LINK_PROPERTY_
- See Also:
- Constant Field Values
-
IDS_BINARY_OPERATOR_PROPERTY_
private static final int IDS_BINARY_OPERATOR_PROPERTY_
- See Also:
- Constant Field Values
-
IDS_TRINARY_OPERATOR_PROPERTY_
private static final int IDS_TRINARY_OPERATOR_PROPERTY_
- See Also:
- Constant Field Values
-
RADICAL_PROPERTY_
private static final int RADICAL_PROPERTY_
- See Also:
- Constant Field Values
-
UNIFIED_IDEOGRAPH_PROPERTY_
private static final int UNIFIED_IDEOGRAPH_PROPERTY_
- See Also:
- Constant Field Values
-
DEFAULT_IGNORABLE_CODE_POINT_PROPERTY_
private static final int DEFAULT_IGNORABLE_CODE_POINT_PROPERTY_
- See Also:
- Constant Field Values
-
DEPRECATED_PROPERTY_
private static final int DEPRECATED_PROPERTY_
- See Also:
- Constant Field Values
-
LOGICAL_ORDER_EXCEPTION_PROPERTY_
private static final int LOGICAL_ORDER_EXCEPTION_PROPERTY_
- See Also:
- Constant Field Values
-
XID_START_PROPERTY_
private static final int XID_START_PROPERTY_
- See Also:
- Constant Field Values
-
XID_CONTINUE_PROPERTY_
private static final int XID_CONTINUE_PROPERTY_
- See Also:
- Constant Field Values
-
ID_START_PROPERTY_
private static final int ID_START_PROPERTY_
- See Also:
- Constant Field Values
-
ID_CONTINUE_PROPERTY_
private static final int ID_CONTINUE_PROPERTY_
- See Also:
- Constant Field Values
-
GRAPHEME_BASE_PROPERTY_
private static final int GRAPHEME_BASE_PROPERTY_
- See Also:
- Constant Field Values
-
S_TERM_PROPERTY_
private static final int S_TERM_PROPERTY_
- See Also:
- Constant Field Values
-
VARIATION_SELECTOR_PROPERTY_
private static final int VARIATION_SELECTOR_PROPERTY_
- See Also:
- Constant Field Values
-
PATTERN_SYNTAX
private static final int PATTERN_SYNTAX
- See Also:
- Constant Field Values
-
PATTERN_WHITE_SPACE
private static final int PATTERN_WHITE_SPACE
- See Also:
- Constant Field Values
-
PREPENDED_CONCATENATION_MARK
private static final int PREPENDED_CONCATENATION_MARK
- See Also:
- Constant Field Values
-
ID_TYPE_SHIFT
private static final int ID_TYPE_SHIFT
- See Also:
- Constant Field Values
-
ID_TYPE_BIT
private static final int ID_TYPE_BIT
- See Also:
- Constant Field Values
-
ID_TYPE_EXCLUSION
private static final int ID_TYPE_EXCLUSION
- See Also:
- Constant Field Values
-
ID_TYPE_LIMITED_USE
private static final int ID_TYPE_LIMITED_USE
- See Also:
- Constant Field Values
-
ID_TYPE_UNCOMMON_USE
private static final int ID_TYPE_UNCOMMON_USE
- See Also:
- Constant Field Values
-
ID_TYPE_TECHNICAL
private static final int ID_TYPE_TECHNICAL
- See Also:
- Constant Field Values
-
ID_TYPE_OBSOLETE
private static final int ID_TYPE_OBSOLETE
- See Also:
- Constant Field Values
-
ID_TYPE_NOT_XID
private static final int ID_TYPE_NOT_XID
- See Also:
- Constant Field Values
-
ID_TYPE_NOT_CHARACTER
private static final int ID_TYPE_NOT_CHARACTER
- See Also:
- Constant Field Values
-
ID_TYPE_FORBIDDEN
private static final int ID_TYPE_FORBIDDEN
- See Also:
- Constant Field Values
-
ID_TYPE_DEPRECATED
private static final int ID_TYPE_DEPRECATED
- See Also:
- Constant Field Values
-
ID_TYPE_DEFAULT_IGNORABLE
private static final int ID_TYPE_DEFAULT_IGNORABLE
- See Also:
- Constant Field Values
-
ID_TYPE_NOT_NFKC
private static final int ID_TYPE_NOT_NFKC
- See Also:
- Constant Field Values
-
ID_TYPE_ALLOWED_MIN
private static final int ID_TYPE_ALLOWED_MIN
- See Also:
- Constant Field Values
-
ID_TYPE_INCLUSION
private static final int ID_TYPE_INCLUSION
- See Also:
- Constant Field Values
-
ID_TYPE_RECOMMENDED
private static final int ID_TYPE_RECOMMENDED
- See Also:
- Constant Field Values
-
idTypeToEncoded
private static final int[] idTypeToEncoded
Maps UIdentifierType to encoded bits. When UPROPS_ID_TYPE_BIT is set, then use "&" to test whether the value bit is set. When UPROPS_ID_TYPE_BIT is not set, then compare ("==") the array value with the data value.
-
LB_MASK
private static final int LB_MASK
- See Also:
- Constant Field Values
-
LB_SHIFT
private static final int LB_SHIFT
- See Also:
- Constant Field Values
-
SB_MASK
private static final int SB_MASK
- See Also:
- Constant Field Values
-
SB_SHIFT
private static final int SB_SHIFT
- See Also:
- Constant Field Values
-
WB_MASK
private static final int WB_MASK
- See Also:
- Constant Field Values
-
WB_SHIFT
private static final int WB_SHIFT
- See Also:
- Constant Field Values
-
GCB_MASK
private static final int GCB_MASK
- See Also:
- Constant Field Values
-
GCB_SHIFT
private static final int GCB_SHIFT
- See Also:
- Constant Field Values
-
DECOMPOSITION_TYPE_MASK_
private static final int DECOMPOSITION_TYPE_MASK_
Integer properties mask for decomposition type. Equivalent to icu4c UPROPS_DT_MASK.- See Also:
- Constant Field Values
-
AGE_SHIFT_
private static final int AGE_SHIFT_
Age value shift- See Also:
- Constant Field Values
-
MAX_BLOCK
private static final int MAX_BLOCK
- See Also:
- Constant Field Values
-
DATA_FORMAT
private static final int DATA_FORMAT
- See Also:
- Constant Field Values
-
TAB
private static final int TAB
- See Also:
- Constant Field Values
-
CR
private static final int CR
- See Also:
- Constant Field Values
-
U_A
private static final int U_A
- See Also:
- Constant Field Values
-
U_F
private static final int U_F
- See Also:
- Constant Field Values
-
U_Z
private static final int U_Z
- See Also:
- Constant Field Values
-
U_a
private static final int U_a
- See Also:
- Constant Field Values
-
U_f
private static final int U_f
- See Also:
- Constant Field Values
-
U_z
private static final int U_z
- See Also:
- Constant Field Values
-
DEL
private static final int DEL
- See Also:
- Constant Field Values
-
NL
private static final int NL
- See Also:
- Constant Field Values
-
NBSP
private static final int NBSP
- See Also:
- Constant Field Values
-
CGJ
private static final int CGJ
- See Also:
- Constant Field Values
-
FIGURESP
private static final int FIGURESP
- See Also:
- Constant Field Values
-
HAIRSP
private static final int HAIRSP
- See Also:
- Constant Field Values
-
RLM
private static final int RLM
- See Also:
- Constant Field Values
-
NNBSP
private static final int NNBSP
- See Also:
- Constant Field Values
-
WJ
private static final int WJ
- See Also:
- Constant Field Values
-
INHSWAP
private static final int INHSWAP
- See Also:
- Constant Field Values
-
NOMDIG
private static final int NOMDIG
- See Also:
- Constant Field Values
-
U_FW_A
private static final int U_FW_A
- See Also:
- Constant Field Values
-
U_FW_F
private static final int U_FW_F
- See Also:
- Constant Field Values
-
U_FW_Z
private static final int U_FW_Z
- See Also:
- Constant Field Values
-
U_FW_a
private static final int U_FW_a
- See Also:
- Constant Field Values
-
U_FW_f
private static final int U_FW_f
- See Also:
- Constant Field Values
-
U_FW_z
private static final int U_FW_z
- See Also:
- Constant Field Values
-
ZWNBSP
private static final int ZWNBSP
- See Also:
- Constant Field Values
-
-
Method Detail
-
getProperty
public final int getProperty(int ch)
Gets the main property value for code point ch.- Parameters:
ch
- code point whose property value is to be retrieved- Returns:
- property value of code point
-
getAdditional
public int getAdditional(int codepoint, int column)
Gets the unicode additional properties. Java version of C u_getUnicodeProperties().- Parameters:
codepoint
- codepoint whose additional properties is to be retrievedcolumn
- The column index.- Returns:
- unicode properties
-
getAge
public VersionInfo getAge(int codepoint)
Get the "age" of the code point.
The "age" is the Unicode version when the code point was first designated (as a non-character or for Private Use) or assigned a character.
This can be useful to avoid emitting code points to receiving processes that do not accept newer characters.
The data is from the UCD file DerivedAge.txt.
This API does not check the validity of the codepoint.
- Parameters:
codepoint
- The code point.- Returns:
- the Unicode version number
-
isgraphPOSIX
private static final boolean isgraphPOSIX(int c)
Checks if c is in [^\p{space}\p{gc=Control}\p{gc=Surrogate}\p{gc=Unassigned}] with space=\p{Whitespace} and Control=Cc. Implements UCHAR_POSIX_GRAPH.
-
hasBinaryProperty
public boolean hasBinaryProperty(int c, int which)
-
getType
public int getType(int c)
-
getIntPropertyValue
public int getIntPropertyValue(int c, int which)
-
getIntPropertyMaxValue
public int getIntPropertyMaxValue(int which)
-
getSource
final int getSource(int which)
-
getMaxValues
public int getMaxValues(int column)
Get the the maximum values for some enum/int properties.- Returns:
- maximum values for the integer properties.
-
getMask
public static final int getMask(int type)
Gets the type mask- Parameters:
type
- character type- Returns:
- mask
-
getEuropeanDigit
public static int getEuropeanDigit(int ch)
Returns the digit values of characters like 'A' - 'Z', normal, half-width and full-width. This method assumes that the other digit characters are checked by the calling method.- Parameters:
ch
- character to test- Returns:
- -1 if ch is not a character of the form 'A' - 'Z', otherwise its corresponding digit will be returned.
-
digit
public int digit(int c)
-
getNumericValue
public int getNumericValue(int c)
-
getUnicodeNumericValue
public double getUnicodeNumericValue(int c)
-
getNumericTypeValue
private static final int getNumericTypeValue(int props)
-
ntvGetType
private static final int ntvGetType(int ntv)
-
addPropertyStarts
public UnicodeSet addPropertyStarts(UnicodeSet set)
-
upropsvec_addPropertyStarts
public void upropsvec_addPropertyStarts(UnicodeSet set)
-
ulayout_addPropertyStarts
static UnicodeSet ulayout_addPropertyStarts(int src, UnicodeSet set)
-
mathCompat_addPropertyStarts
static void mathCompat_addPropertyStarts(UnicodeSet set)
-
mcm_addPropertyStarts
static void mcm_addPropertyStarts(UnicodeSet set)
-
ublock_addPropertyStarts
public void ublock_addPropertyStarts(UnicodeSet set)
-
hasIDType
public boolean hasIDType(int c, int typeIndex)
-
hasIDType
public boolean hasIDType(int c, UCharacter.IdentifierType type)
-
maybeAddType
private static void maybeAddType(int value, int bit, UCharacter.IdentifierType t, java.util.EnumSet<UCharacter.IdentifierType> types)
-
getIDTypes
public int getIDTypes(int c, java.util.EnumSet<UCharacter.IdentifierType> types)
-
-