Package com.ibm.icu.impl.breakiter
Class DictionaryBreakEngine
java.lang.Object
com.ibm.icu.impl.breakiter.DictionaryBreakEngine
- All Implemented Interfaces:
LanguageBreakEngine
- Direct Known Subclasses:
BurmeseBreakEngine,CjkBreakEngine,KhmerBreakEngine,LaoBreakEngine,LSTMBreakEngine,ThaiBreakEngine
-
Nested Class Summary
Nested ClassesModifier and TypeClassDescriptionstatic classA deque-like structure holding raw ints.(package private) static class -
Field Summary
Fields -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescription(package private) abstract intdivideUpDictionaryRange(CharacterIterator text, int rangeStart, int rangeEnd, DictionaryBreakEngine.DequeI foundBreaks, boolean isPhraseBreaking) Divide up a range of known dictionary characters handled by this break engine.intfindBreaks(CharacterIterator text, int startPos, int endPos, DictionaryBreakEngine.DequeI foundBreaks, boolean isPhraseBreaking) Implements the actual breaking logic.booleanhandles(int c) (package private) voidsetCharacters(UnicodeSet set)
-
Field Details
-
fSet
UnicodeSet fSet
-
-
Constructor Details
-
DictionaryBreakEngine
public DictionaryBreakEngine()Constructor
-
-
Method Details
-
handles
public boolean handles(int c) - Specified by:
handlesin interfaceLanguageBreakEngine- Parameters:
c- A Unicode codepoint value- Returns:
- true if the engine can handle this character, false otherwise
-
findBreaks
public int findBreaks(CharacterIterator text, int startPos, int endPos, DictionaryBreakEngine.DequeI foundBreaks, boolean isPhraseBreaking) Description copied from interface:LanguageBreakEngineImplements the actual breaking logic. Find any breaks within a run in the supplied text.- Specified by:
findBreaksin interfaceLanguageBreakEngine- Parameters:
text- The text to break over. The iterator is left at the end of the run of characters which the engine has handled.startPos- The index of the beginning of the rangeendPos- The index of the possible end of our range. It is possible, however, that the range ends earlierfoundBreaks- A data structure to receive the break positions.- Returns:
- the number of breaks found
-
setCharacters
-
divideUpDictionaryRange
abstract int divideUpDictionaryRange(CharacterIterator text, int rangeStart, int rangeEnd, DictionaryBreakEngine.DequeI foundBreaks, boolean isPhraseBreaking) Divide up a range of known dictionary characters handled by this break engine.
- Parameters:
text- A UText representing the textrangeStart- The start of the range of dictionary charactersrangeEnd- The end of the range of dictionary charactersfoundBreaks- Output of break positions. Positions are pushed. Pre-existing contents of the output stack are unaltered.- Returns:
- The number of breaks found
-