Class DictionaryBreakEngine

    • Constructor Detail

      • DictionaryBreakEngine

        public DictionaryBreakEngine()
        Constructor
    • Method Detail

      • handles

        public boolean handles​(int c)
        Specified by:
        handles in interface LanguageBreakEngine
        Parameters:
        c - A Unicode codepoint value
        Returns:
        true if the engine can handle this character, false otherwise
      • findBreaks

        public int findBreaks​(java.text.CharacterIterator text,
                              int startPos,
                              int endPos,
                              DictionaryBreakEngine.DequeI foundBreaks,
                              boolean isPhraseBreaking)
        Description copied from interface: LanguageBreakEngine
        Implements the actual breaking logic. Find any breaks within a run in the supplied text.
        Specified by:
        findBreaks in interface LanguageBreakEngine
        Parameters:
        text - The text to break over. The iterator is left at the end of the run of characters which the engine has handled.
        startPos - The index of the beginning of the range
        endPos - The index of the possible end of our range. It is possible, however, that the range ends earlier
        foundBreaks - A data structure to receive the break positions.
        Returns:
        the number of breaks found
      • setCharacters

        void setCharacters​(UnicodeSet set)
      • divideUpDictionaryRange

        abstract int divideUpDictionaryRange​(java.text.CharacterIterator text,
                                             int rangeStart,
                                             int rangeEnd,
                                             DictionaryBreakEngine.DequeI foundBreaks,
                                             boolean isPhraseBreaking)

        Divide up a range of known dictionary characters handled by this break engine.

        Parameters:
        text - A UText representing the text
        rangeStart - The start of the range of dictionary characters
        rangeEnd - The end of the range of dictionary characters
        foundBreaks - Output of break positions. Positions are pushed. Pre-existing contents of the output stack are unaltered.
        Returns:
        The number of breaks found