Interface UProperty


  • public interface UProperty

    Selection constants for Unicode properties.

    These constants are used in functions like UCharacter.hasBinaryProperty(int) to select one of the Unicode properties.

    The properties APIs are intended to reflect Unicode properties as defined in the Unicode Character Database (UCD) and Unicode Technical Reports (UTR).

    For details about the properties see UAX #44: Unicode Character Database.

    Important: If ICU is built with UCD files from Unicode versions below 3.2, then properties marked with "new" are not or not fully available. Check UCharacter.getUnicodeVersion() to be sure.

    See Also:
    UCharacter
    • Field Detail

      • UNDEFINED

        @Deprecated
        static final int UNDEFINED
        Deprecated.
        This API is ICU internal only.
        Special value indicating undefined property.
        See Also:
        Constant Field Values
      • ALPHABETIC

        static final int ALPHABETIC

        Binary property Alphabetic.

        Property for UCharacter.isUAlphabetic(), different from the property in UCharacter.isalpha().

        Lu + Ll + Lt + Lm + Lo + Nl + Other_Alphabetic.

        See Also:
        Constant Field Values
      • BINARY_START

        static final int BINARY_START
        First constant for binary Unicode properties.
        See Also:
        Constant Field Values
      • ASCII_HEX_DIGIT

        static final int ASCII_HEX_DIGIT
        Binary property ASCII_Hex_Digit (0-9 A-F a-f).
        See Also:
        Constant Field Values
      • BIDI_CONTROL

        static final int BIDI_CONTROL

        Binary property Bidi_Control.

        Format controls which have specific functions in the Bidi Algorithm.

        See Also:
        Constant Field Values
      • BIDI_MIRRORED

        static final int BIDI_MIRRORED

        Binary property Bidi_Mirrored.

        Characters that may change display in RTL text.

        Property for UCharacter.isMirrored().

        See Bidi Algorithm; UTR 9.

        See Also:
        Constant Field Values
      • DASH

        static final int DASH

        Binary property Dash.

        Variations of dashes.

        See Also:
        Constant Field Values
      • DEFAULT_IGNORABLE_CODE_POINT

        static final int DEFAULT_IGNORABLE_CODE_POINT

        Binary property Default_Ignorable_Code_Point (new).

        Property that indicates codepoint is ignorable in most processing.

        Codepoints (2060..206F, FFF0..FFFB, E0000..E0FFF) + Other_Default_Ignorable_Code_Point + (Cf + Cc + Cs - White_Space)

        See Also:
        Constant Field Values
      • DEPRECATED

        static final int DEPRECATED

        Binary property Deprecated (new).

        The usage of deprecated characters is strongly discouraged.

        See Also:
        Constant Field Values
      • DIACRITIC

        static final int DIACRITIC

        Binary property Diacritic.

        Characters that linguistically modify the meaning of another character to which they apply.

        See Also:
        Constant Field Values
      • EXTENDER

        static final int EXTENDER

        Binary property Extender.

        Extend the value or shape of a preceding alphabetic character, e.g. length and iteration marks.

        See Also:
        Constant Field Values
      • FULL_COMPOSITION_EXCLUSION

        static final int FULL_COMPOSITION_EXCLUSION

        Binary property Full_Composition_Exclusion.

        CompositionExclusions.txt + Singleton Decompositions + Non-Starter Decompositions.

        See Also:
        Constant Field Values
      • GRAPHEME_BASE

        static final int GRAPHEME_BASE

        Binary property Grapheme_Base (new).

        For programmatic determination of grapheme cluster boundaries. [0..10FFFF]-Cc-Cf-Cs-Co-Cn-Zl-Zp-Grapheme_Link-Grapheme_Extend-CGJ

        See Also:
        Constant Field Values
      • GRAPHEME_EXTEND

        static final int GRAPHEME_EXTEND

        Binary property Grapheme_Extend (new).

        For programmatic determination of grapheme cluster boundaries.

        Me+Mn+Mc+Other_Grapheme_Extend-Grapheme_Link-CGJ

        See Also:
        Constant Field Values
      • GRAPHEME_LINK

        static final int GRAPHEME_LINK

        Binary property Grapheme_Link (new).

        For programmatic determination of grapheme cluster boundaries.

        See Also:
        Constant Field Values
      • HEX_DIGIT

        static final int HEX_DIGIT

        Binary property Hex_Digit.

        Characters commonly used for hexadecimal numbers.

        See Also:
        Constant Field Values
      • HYPHEN

        static final int HYPHEN

        Binary property Hyphen.

        Dashes used to mark connections between pieces of words, plus the Katakana middle dot.

        See Also:
        Constant Field Values
      • ID_CONTINUE

        static final int ID_CONTINUE

        Binary property ID_Continue.

        Characters that can continue an identifier.

        ID_Start+Mn+Mc+Nd+Pc

        See Also:
        Constant Field Values
      • ID_START

        static final int ID_START

        Binary property ID_Start.

        Characters that can start an identifier.

        Lu+Ll+Lt+Lm+Lo+Nl

        See Also:
        Constant Field Values
      • IDEOGRAPHIC

        static final int IDEOGRAPHIC

        Binary property Ideographic.

        CJKV ideographs.

        See Also:
        Constant Field Values
      • IDS_BINARY_OPERATOR

        static final int IDS_BINARY_OPERATOR

        Binary property IDS_Binary_Operator (new).

        For programmatic determination of Ideographic Description Sequences.

        See Also:
        Constant Field Values
      • IDS_TRINARY_OPERATOR

        static final int IDS_TRINARY_OPERATOR

        Binary property IDS_Trinary_Operator (new).

        For programmatic determination of Ideographic Description Sequences.

        See Also:
        Constant Field Values
      • JOIN_CONTROL

        static final int JOIN_CONTROL

        Binary property Join_Control.

        Format controls for cursive joining and ligation.

        See Also:
        Constant Field Values
      • LOGICAL_ORDER_EXCEPTION

        static final int LOGICAL_ORDER_EXCEPTION

        Binary property Logical_Order_Exception (new).

        Characters that do not use logical order and require special handling in most processing.

        See Also:
        Constant Field Values
      • LOWERCASE

        static final int LOWERCASE

        Binary property Lowercase.

        Same as UCharacter.isULowercase(), different from UCharacter.islower().

        Ll+Other_Lowercase

        See Also:
        Constant Field Values
      • NONCHARACTER_CODE_POINT

        static final int NONCHARACTER_CODE_POINT

        Binary property Noncharacter_Code_Point.

        Code points that are explicitly defined as illegal for the encoding of characters.

        See Also:
        Constant Field Values
      • QUOTATION_MARK

        static final int QUOTATION_MARK

        Binary property Quotation_Mark.

        See Also:
        Constant Field Values
      • RADICAL

        static final int RADICAL

        Binary property Radical (new).

        For programmatic determination of Ideographic Description Sequences.

        See Also:
        Constant Field Values
      • SOFT_DOTTED

        static final int SOFT_DOTTED

        Binary property Soft_Dotted (new).

        Characters with a "soft dot", like i or j.

        An accent placed on these characters causes the dot to disappear.

        See Also:
        Constant Field Values
      • TERMINAL_PUNCTUATION

        static final int TERMINAL_PUNCTUATION

        Binary property Terminal_Punctuation.

        Punctuation characters that generally mark the end of textual units.

        See Also:
        Constant Field Values
      • UNIFIED_IDEOGRAPH

        static final int UNIFIED_IDEOGRAPH

        Binary property Unified_Ideograph (new).

        For programmatic determination of Ideographic Description Sequences.

        See Also:
        Constant Field Values
      • UPPERCASE

        static final int UPPERCASE

        Binary property Uppercase.

        Same as UCharacter.isUUppercase(), different from UCharacter.isUpperCase().

        Lu+Other_Uppercase

        See Also:
        Constant Field Values
      • WHITE_SPACE

        static final int WHITE_SPACE

        Binary property White_Space.

        Same as UCharacter.isUWhiteSpace(), different from UCharacter.isSpace() and UCharacter.isWhitespace(). Space characters+TAB+CR+LF-ZWSP-ZWNBSP

        See Also:
        Constant Field Values
      • XID_CONTINUE

        static final int XID_CONTINUE

        Binary property XID_Continue.

        ID_Continue modified to allow closure under normalization forms NFKC and NFKD.

        See Also:
        Constant Field Values
      • XID_START

        static final int XID_START

        Binary property XID_Start.

        ID_Start modified to allow closure under normalization forms NFKC and NFKD.

        See Also:
        Constant Field Values
      • CASE_SENSITIVE

        static final int CASE_SENSITIVE

        Binary property Case_Sensitive.

        Either the source of a case mapping or in the target of a case mapping. Not the same as the general category Cased_Letter.

        See Also:
        Constant Field Values
      • S_TERM

        static final int S_TERM
        Binary property STerm (new in Unicode 4.0.1). Sentence Terminal. Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/)
        See Also:
        Constant Field Values
      • VARIATION_SELECTOR

        static final int VARIATION_SELECTOR
        Binary property Variation_Selector (new in Unicode 4.0.1). Indicates all those characters that qualify as Variation Selectors. For details on the behavior of these characters, see StandardizedVariants.html and 15.6 Variation Selectors.
        See Also:
        Constant Field Values
      • NFD_INERT

        static final int NFD_INERT
        Binary property NFD_Inert. ICU-specific property for characters that are inert under NFD, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions. There is one such property per normalization form. These properties are computed as follows - an inert character is: a) unassigned, or ALL of the following: b) of combining class 0. c) not decomposed by this normalization form. AND if NFC or NFKC, d) can never compose with a previous character. e) can never compose with a following character. f) can never change if another character is added. Example: a-breve might satisfy all but f, but if you add an ogonek it changes to a-ogonek + breve See also com.ibm.text.UCD.NFSkippable in the ICU4J repository, and icu/source/common/unormimp.h .
        See Also:
        Constant Field Values
      • NFKD_INERT

        static final int NFKD_INERT
        Binary property NFKD_Inert. ICU-specific property for characters that are inert under NFKD, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions.
        See Also:
        NFD_INERT, Constant Field Values
      • NFC_INERT

        static final int NFC_INERT
        Binary property NFC_Inert. ICU-specific property for characters that are inert under NFC, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions.
        See Also:
        NFD_INERT, Constant Field Values
      • NFKC_INERT

        static final int NFKC_INERT
        Binary property NFKC_Inert. ICU-specific property for characters that are inert under NFKC, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions.
        See Also:
        NFD_INERT, Constant Field Values
      • SEGMENT_STARTER

        static final int SEGMENT_STARTER
        Binary Property Segment_Starter. ICU-specific property for characters that are starters in terms of Unicode normalization and combining character sequences. They have ccc=0 and do not occur in non-initial position of the canonical decomposition of any character (like " in NFD(a-umlaut) and a Jamo T in an NFD(Hangul LVT)). ICU uses this property for segmenting a string for generating a set of canonically equivalent strings, e.g. for canonical closure while processing collation tailoring rules.
        See Also:
        Constant Field Values
      • PATTERN_SYNTAX

        static final int PATTERN_SYNTAX
        Binary property Pattern_Syntax (new in Unicode 4.1). See UAX #31 Identifier and Pattern Syntax (http://www.unicode.org/reports/tr31/)
        See Also:
        Constant Field Values
      • PATTERN_WHITE_SPACE

        static final int PATTERN_WHITE_SPACE
        Binary property Pattern_White_Space (new in Unicode 4.1). See UAX #31 Identifier and Pattern Syntax (http://www.unicode.org/reports/tr31/)
        See Also:
        Constant Field Values
      • POSIX_ALNUM

        static final int POSIX_ALNUM
        Binary property alnum (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.
        See Also:
        Constant Field Values
      • POSIX_BLANK

        static final int POSIX_BLANK
        Binary property blank (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.
        See Also:
        Constant Field Values
      • POSIX_GRAPH

        static final int POSIX_GRAPH
        Binary property graph (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.
        See Also:
        Constant Field Values
      • POSIX_PRINT

        static final int POSIX_PRINT
        Binary property print (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.
        See Also:
        Constant Field Values
      • POSIX_XDIGIT

        static final int POSIX_XDIGIT
        Binary property xdigit (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.
        See Also:
        Constant Field Values
      • CASED

        static final int CASED
        Binary property Cased. For Lowercase, Uppercase and Titlecase characters.
        See Also:
        Constant Field Values
      • CASE_IGNORABLE

        static final int CASE_IGNORABLE
        Binary property Case_Ignorable. Used in context-sensitive case mappings.
        See Also:
        Constant Field Values
      • CHANGES_WHEN_LOWERCASED

        static final int CHANGES_WHEN_LOWERCASED
        Binary property Changes_When_Lowercased.
        See Also:
        Constant Field Values
      • CHANGES_WHEN_UPPERCASED

        static final int CHANGES_WHEN_UPPERCASED
        Binary property Changes_When_Uppercased.
        See Also:
        Constant Field Values
      • CHANGES_WHEN_TITLECASED

        static final int CHANGES_WHEN_TITLECASED
        Binary property Changes_When_Titlecased.
        See Also:
        Constant Field Values
      • CHANGES_WHEN_CASEFOLDED

        static final int CHANGES_WHEN_CASEFOLDED
        Binary property Changes_When_Casefolded.
        See Also:
        Constant Field Values
      • CHANGES_WHEN_CASEMAPPED

        static final int CHANGES_WHEN_CASEMAPPED
        Binary property Changes_When_Casemapped.
        See Also:
        Constant Field Values
      • CHANGES_WHEN_NFKC_CASEFOLDED

        static final int CHANGES_WHEN_NFKC_CASEFOLDED
        Binary property Changes_When_NFKC_Casefolded.
        See Also:
        Constant Field Values
      • EMOJI

        static final int EMOJI
        Binary property Emoji. See http://www.unicode.org/reports/tr51/#Emoji_Properties
        See Also:
        Constant Field Values
      • EMOJI_PRESENTATION

        static final int EMOJI_PRESENTATION
        Binary property Emoji_Presentation. See http://www.unicode.org/reports/tr51/#Emoji_Properties
        See Also:
        Constant Field Values
      • EMOJI_MODIFIER

        static final int EMOJI_MODIFIER
        Binary property Emoji_Modifier. See http://www.unicode.org/reports/tr51/#Emoji_Properties
        See Also:
        Constant Field Values
      • EMOJI_MODIFIER_BASE

        static final int EMOJI_MODIFIER_BASE
        Binary property Emoji_Modifier_Base. See http://www.unicode.org/reports/tr51/#Emoji_Properties
        See Also:
        Constant Field Values
      • EMOJI_COMPONENT

        static final int EMOJI_COMPONENT
        Binary property Emoji_Component. See http://www.unicode.org/reports/tr51/#Emoji_Properties
        See Also:
        Constant Field Values
      • REGIONAL_INDICATOR

        static final int REGIONAL_INDICATOR
        Binary property Regional_Indicator.
        See Also:
        Constant Field Values
      • PREPENDED_CONCATENATION_MARK

        static final int PREPENDED_CONCATENATION_MARK
        Binary property Prepended_Concatenation_Mark.
        See Also:
        Constant Field Values
      • EXTENDED_PICTOGRAPHIC

        static final int EXTENDED_PICTOGRAPHIC
        Binary property Extended_Pictographic. See http://www.unicode.org/reports/tr51/#Emoji_Properties
        See Also:
        Constant Field Values
      • BASIC_EMOJI

        static final int BASIC_EMOJI
        Binary property of strings Basic_Emoji. See https://www.unicode.org/reports/tr51/#Emoji_Sets
        See Also:
        Constant Field Values
      • EMOJI_KEYCAP_SEQUENCE

        static final int EMOJI_KEYCAP_SEQUENCE
        Binary property of strings Emoji_Keycap_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets
        See Also:
        Constant Field Values
      • RGI_EMOJI_MODIFIER_SEQUENCE

        static final int RGI_EMOJI_MODIFIER_SEQUENCE
        Binary property of strings RGI_Emoji_Modifier_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets
        See Also:
        Constant Field Values
      • RGI_EMOJI_FLAG_SEQUENCE

        static final int RGI_EMOJI_FLAG_SEQUENCE
        Binary property of strings RGI_Emoji_Flag_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets
        See Also:
        Constant Field Values
      • RGI_EMOJI_TAG_SEQUENCE

        static final int RGI_EMOJI_TAG_SEQUENCE
        Binary property of strings RGI_Emoji_Tag_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets
        See Also:
        Constant Field Values
      • RGI_EMOJI_ZWJ_SEQUENCE

        static final int RGI_EMOJI_ZWJ_SEQUENCE
        Binary property of strings RGI_Emoji_ZWJ_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets
        See Also:
        Constant Field Values
      • RGI_EMOJI

        static final int RGI_EMOJI
        Binary property of strings RGI_Emoji. See https://www.unicode.org/reports/tr51/#Emoji_Sets
        See Also:
        Constant Field Values
      • IDS_UNARY_OPERATOR

        static final int IDS_UNARY_OPERATOR
        Binary property IDS_Unary_Operator. For programmatic determination of Ideographic Description Sequences.
        See Also:
        Constant Field Values
      • ID_COMPAT_MATH_START

        static final int ID_COMPAT_MATH_START
        Binary property ID_Compat_Math_Start.

        Used in mathematical identifier profile in UAX #31.

        See Also:
        Constant Field Values
      • ID_COMPAT_MATH_CONTINUE

        static final int ID_COMPAT_MATH_CONTINUE
        Binary property ID_Compat_Math_Continue.

        Used in mathematical identifier profile in UAX #31.

        See Also:
        Constant Field Values
      • BINARY_LIMIT

        @Deprecated
        static final int BINARY_LIMIT
        Deprecated.
        ICU 58 The numeric value may change over time, see ICU ticket #12420.
        One more than the last constant for binary Unicode properties.
        See Also:
        Constant Field Values
      • BIDI_CLASS

        static final int BIDI_CLASS
        Enumerated property Bidi_Class. Same as UCharacter.getDirection(int), returns UCharacterDirection values.
        See Also:
        Constant Field Values
      • INT_START

        static final int INT_START
        First constant for enumerated/integer Unicode properties.
        See Also:
        Constant Field Values
      • BLOCK

        static final int BLOCK
        Enumerated property Block. Same as UCharacter.UnicodeBlock.of(int), returns UCharacter.UnicodeBlock values.
        See Also:
        Constant Field Values
      • CANONICAL_COMBINING_CLASS

        static final int CANONICAL_COMBINING_CLASS
        Enumerated property Canonical_Combining_Class. Same as UCharacter.getCombiningClass(int), returns 8-bit numeric values.
        See Also:
        Constant Field Values
      • DECOMPOSITION_TYPE

        static final int DECOMPOSITION_TYPE
        Enumerated property Decomposition_Type. Returns UCharacter.DecompositionType values.
        See Also:
        Constant Field Values
      • EAST_ASIAN_WIDTH

        static final int EAST_ASIAN_WIDTH
        Enumerated property East_Asian_Width. See http://www.unicode.org/reports/tr11/ Returns UCharacter.EastAsianWidth values.
        See Also:
        Constant Field Values
      • GENERAL_CATEGORY

        static final int GENERAL_CATEGORY
        Enumerated property General_Category. Same as UCharacter.getType(int), returns UCharacterCategory values.
        See Also:
        Constant Field Values
      • JOINING_GROUP

        static final int JOINING_GROUP
        Enumerated property Joining_Group. Returns UCharacter.JoiningGroup values.
        See Also:
        Constant Field Values
      • JOINING_TYPE

        static final int JOINING_TYPE
        Enumerated property Joining_Type. Returns UCharacter.JoiningType values.
        See Also:
        Constant Field Values
      • LINE_BREAK

        static final int LINE_BREAK
        Enumerated property Line_Break. Returns UCharacter.LineBreak values.
        See Also:
        Constant Field Values
      • NUMERIC_TYPE

        static final int NUMERIC_TYPE
        Enumerated property Numeric_Type. Returns UCharacter.NumericType values.
        See Also:
        Constant Field Values
      • SCRIPT

        static final int SCRIPT
        Enumerated property Script. Same as UScript.getScript(int), returns UScript values.
        See Also:
        Constant Field Values
      • HANGUL_SYLLABLE_TYPE

        static final int HANGUL_SYLLABLE_TYPE
        Enumerated property Hangul_Syllable_Type, new in Unicode 4. Returns UCharacter.HangulSyllableType values.
        See Also:
        Constant Field Values
      • NFD_QUICK_CHECK

        static final int NFD_QUICK_CHECK
        Enumerated property NFD_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.
        See Also:
        Constant Field Values
      • NFKD_QUICK_CHECK

        static final int NFKD_QUICK_CHECK
        Enumerated property NFKD_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.
        See Also:
        Constant Field Values
      • NFC_QUICK_CHECK

        static final int NFC_QUICK_CHECK
        Enumerated property NFC_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.
        See Also:
        Constant Field Values
      • NFKC_QUICK_CHECK

        static final int NFKC_QUICK_CHECK
        Enumerated property NFKC_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.
        See Also:
        Constant Field Values
      • LEAD_CANONICAL_COMBINING_CLASS

        static final int LEAD_CANONICAL_COMBINING_CLASS
        Enumerated property Lead_Canonical_Combining_Class. ICU-specific property for the ccc of the first code point of the decomposition, or lccc(c)=ccc(NFD(c)[0]). Useful for checking for canonically ordered text; see Normalizer.FCD and http://www.unicode.org/notes/tn5/#FCD . Returns 8-bit numeric values like CANONICAL_COMBINING_CLASS.
        See Also:
        Constant Field Values
      • TRAIL_CANONICAL_COMBINING_CLASS

        static final int TRAIL_CANONICAL_COMBINING_CLASS
        Enumerated property Trail_Canonical_Combining_Class. ICU-specific property for the ccc of the last code point of the decomposition, or lccc(c)=ccc(NFD(c)[last]). Useful for checking for canonically ordered text; see Normalizer.FCD and http://www.unicode.org/notes/tn5/#FCD . Returns 8-bit numeric values like CANONICAL_COMBINING_CLASS.
        See Also:
        Constant Field Values
      • GRAPHEME_CLUSTER_BREAK

        static final int GRAPHEME_CLUSTER_BREAK
        Enumerated property Grapheme_Cluster_Break (new in Unicode 4.1). Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/) Returns UCharacter.GraphemeClusterBreak values.
        See Also:
        Constant Field Values
      • SENTENCE_BREAK

        static final int SENTENCE_BREAK
        Enumerated property Sentence_Break (new in Unicode 4.1). Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/) Returns UCharacter.SentenceBreak values.
        See Also:
        Constant Field Values
      • WORD_BREAK

        static final int WORD_BREAK
        Enumerated property Word_Break (new in Unicode 4.1). Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/) Returns UCharacter.WordBreak values.
        See Also:
        Constant Field Values
      • BIDI_PAIRED_BRACKET_TYPE

        static final int BIDI_PAIRED_BRACKET_TYPE
        Enumerated property Bidi_Paired_Bracket_Type (new in Unicode 6.3). Used in UAX #9: Unicode Bidirectional Algorithm (http://www.unicode.org/reports/tr9/) Returns UCharacter.BidiPairedBracketType values.
        See Also:
        Constant Field Values
      • INDIC_POSITIONAL_CATEGORY

        static final int INDIC_POSITIONAL_CATEGORY
        Enumerated property Indic_Positional_Category. New in Unicode 6.0 as provisional property Indic_Matra_Category; renamed and changed to informative in Unicode 8.0. See http://www.unicode.org/reports/tr44/#IndicPositionalCategory.txt
        See Also:
        Constant Field Values
      • INDIC_SYLLABIC_CATEGORY

        static final int INDIC_SYLLABIC_CATEGORY
        Enumerated property Indic_Syllabic_Category. New in Unicode 6.0 as provisional; informative since Unicode 8.0. See http://www.unicode.org/reports/tr44/#IndicSyllabicCategory.txt
        See Also:
        Constant Field Values
      • VERTICAL_ORIENTATION

        static final int VERTICAL_ORIENTATION
        Enumerated property Vertical_Orientation. Used for UAX #50 Unicode Vertical Text Layout (https://www.unicode.org/reports/tr50/). New as a UCD property in Unicode 10.0.
        See Also:
        Constant Field Values
      • IDENTIFIER_STATUS

        static final int IDENTIFIER_STATUS
        Enumerated property Identifier_Status. Used for UTS #39 General Security Profile for Identifiers (https://www.unicode.org/reports/tr39/#General_Security_Profile).
        See Also:
        Constant Field Values
      • INT_LIMIT

        @Deprecated
        static final int INT_LIMIT
        Deprecated.
        ICU 58 The numeric value may change over time, see ICU ticket #12420.
        One more than the last constant for enumerated/integer Unicode properties.
        See Also:
        Constant Field Values
      • GENERAL_CATEGORY_MASK

        static final int GENERAL_CATEGORY_MASK
        Bitmask property General_Category_Mask. This is the General_Category property returned as a bit mask. When used in UCharacter.getIntPropertyValue(c), returns bit masks for UCharacterCategory values where exactly one bit is set. When used with UCharacter.getPropertyValueName() and UCharacter.getPropertyValueEnum(), a multi-bit mask is used for sets of categories like "Letters".
        See Also:
        Constant Field Values
      • MASK_START

        static final int MASK_START
        First constant for bit-mask Unicode properties.
        See Also:
        Constant Field Values
      • MASK_LIMIT

        @Deprecated
        static final int MASK_LIMIT
        Deprecated.
        ICU 58 The numeric value may change over time, see ICU ticket #12420.
        One more than the last constant for bit-mask Unicode properties.
        See Also:
        Constant Field Values
      • NUMERIC_VALUE

        static final int NUMERIC_VALUE
        Double property Numeric_Value. Corresponds to UCharacter.getUnicodeNumericValue(int).
        See Also:
        Constant Field Values
      • DOUBLE_START

        static final int DOUBLE_START
        First constant for double Unicode properties.
        See Also:
        Constant Field Values
      • DOUBLE_LIMIT

        @Deprecated
        static final int DOUBLE_LIMIT
        Deprecated.
        ICU 58 The numeric value may change over time, see ICU ticket #12420.
        One more than the last constant for double Unicode properties.
        See Also:
        Constant Field Values
      • AGE

        static final int AGE
        String property Age. Corresponds to UCharacter.getAge(int).
        See Also:
        Constant Field Values
      • STRING_START

        static final int STRING_START
        First constant for string Unicode properties.
        See Also:
        Constant Field Values
      • BIDI_MIRRORING_GLYPH

        static final int BIDI_MIRRORING_GLYPH
        String property Bidi_Mirroring_Glyph. Corresponds to UCharacter.getMirror(int).
        See Also:
        Constant Field Values
      • CASE_FOLDING

        static final int CASE_FOLDING
        String property Case_Folding. Corresponds to UCharacter.foldCase(String, boolean).
        See Also:
        Constant Field Values
      • ISO_COMMENT

        @Deprecated
        static final int ISO_COMMENT
        Deprecated.
        ICU 49
        Deprecated string property ISO_Comment. Corresponds to UCharacter.getISOComment(int).
        See Also:
        Constant Field Values
      • LOWERCASE_MAPPING

        static final int LOWERCASE_MAPPING
        String property Lowercase_Mapping. Corresponds to UCharacter.toLowerCase(String).
        See Also:
        Constant Field Values
      • NAME

        static final int NAME
        String property Name. Corresponds to UCharacter.getName(int).
        See Also:
        Constant Field Values
      • SIMPLE_CASE_FOLDING

        static final int SIMPLE_CASE_FOLDING
        String property Simple_Case_Folding. Corresponds to UCharacter.foldCase(int, boolean).
        See Also:
        Constant Field Values
      • SIMPLE_LOWERCASE_MAPPING

        static final int SIMPLE_LOWERCASE_MAPPING
        String property Simple_Lowercase_Mapping. Corresponds to UCharacter.toLowerCase(int).
        See Also:
        Constant Field Values
      • SIMPLE_TITLECASE_MAPPING

        static final int SIMPLE_TITLECASE_MAPPING
        String property Simple_Titlecase_Mapping. Corresponds to UCharacter.toTitleCase(int).
        See Also:
        Constant Field Values
      • SIMPLE_UPPERCASE_MAPPING

        static final int SIMPLE_UPPERCASE_MAPPING
        String property Simple_Uppercase_Mapping. Corresponds to UCharacter.toUpperCase(int).
        See Also:
        Constant Field Values
      • TITLECASE_MAPPING

        static final int TITLECASE_MAPPING
        String property Titlecase_Mapping. Corresponds to UCharacter.toTitleCase(String).
        See Also:
        Constant Field Values
      • UNICODE_1_NAME

        @Deprecated
        static final int UNICODE_1_NAME
        Deprecated.
        ICU 49
        String property Unicode_1_Name. This property is of little practical value. Beginning with ICU 49, ICU APIs return null or an empty string for this property. Corresponds to UCharacter.getName1_0(int).
        See Also:
        Constant Field Values
      • UPPERCASE_MAPPING

        static final int UPPERCASE_MAPPING
        String property Uppercase_Mapping. Corresponds to UCharacter.toUpperCase(String).
        See Also:
        Constant Field Values
      • BIDI_PAIRED_BRACKET

        static final int BIDI_PAIRED_BRACKET
        String property Bidi_Paired_Bracket (new in Unicode 6.3). Corresponds to UCharacter.getBidiPairedBracket.
        See Also:
        Constant Field Values
      • STRING_LIMIT

        @Deprecated
        static final int STRING_LIMIT
        Deprecated.
        ICU 58 The numeric value may change over time, see ICU ticket #12420.
        One more than the last constant for string Unicode properties.
        See Also:
        Constant Field Values
      • SCRIPT_EXTENSIONS

        static final int SCRIPT_EXTENSIONS
        Miscellaneous property Script_Extensions (new in Unicode 6.0). Some characters are commonly used in multiple scripts. For more information, see UAX #24: http://www.unicode.org/reports/tr24/. Corresponds to UScript.hasScript and UScript.getScriptExtensions.
        See Also:
        Constant Field Values
      • OTHER_PROPERTY_START

        static final int OTHER_PROPERTY_START
        First constant for Unicode properties with unusual value types.
        See Also:
        Constant Field Values
      • OTHER_PROPERTY_LIMIT

        @Deprecated
        static final int OTHER_PROPERTY_LIMIT
        Deprecated.
        ICU 58 The numeric value may change over time, see ICU ticket #12420.
        One more than the last constant for Unicode properties with unusual value types.
        See Also:
        Constant Field Values