Class CharsetRecog_mbcs.CharsetRecog_sjis
java.lang.Object
com.ibm.icu.text.CharsetRecognizer
com.ibm.icu.text.CharsetRecog_mbcs
com.ibm.icu.text.CharsetRecog_mbcs.CharsetRecog_sjis
- Enclosing class:
CharsetRecog_mbcs
Shift-JIS charset recognizer.
-
Nested Class Summary
Nested classes/interfaces inherited from class CharsetRecog_mbcs
CharsetRecog_mbcs.CharsetRecog_big5, CharsetRecog_mbcs.CharsetRecog_euc, CharsetRecog_mbcs.CharsetRecog_gb_18030, CharsetRecog_mbcs.CharsetRecog_sjis, CharsetRecog_mbcs.iteratedChar -
Field Summary
Fields -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionGet the ISO language code for this charset.(package private) StringgetName()Get the IANA name of this charset.(package private) CharsetMatchmatch(CharsetDetector det) Test the match of this charset with the input text data which is obtained via the CharsetDetector object.(package private) booleanGet the next character (however many bytes it is) from the input data Subclasses for specific charset encodings must implement this function to get characters according to the rules of their encoding scheme.Methods inherited from class CharsetRecog_mbcs
match
-
Field Details
-
commonChars
static int[] commonChars
-
-
Constructor Details
-
CharsetRecog_sjis
CharsetRecog_sjis()
-
-
Method Details
-
nextChar
Description copied from class:CharsetRecog_mbcsGet the next character (however many bytes it is) from the input data Subclasses for specific charset encodings must implement this function to get characters according to the rules of their encoding scheme. This function is not a method of class iteratedChar only because that would require a lot of extra derived classes, which is awkward.- Specified by:
nextCharin classCharsetRecog_mbcs- Parameters:
it- The iteratedChar "struct" into which the returned char is placed.det- The charset detector, which is needed to get at the input byte data being iterated over.- Returns:
- True if a character was returned, false at end of input.
-
match
Description copied from class:CharsetRecognizerTest the match of this charset with the input text data which is obtained via the CharsetDetector object.- Specified by:
matchin classCharsetRecognizer- Parameters:
det- The CharsetDetector, which contains the input text to be checked for being in this charset.- Returns:
- A CharsetMatch object containing details of match with this charset, or null if there was no match.
-
getName
String getName()Description copied from class:CharsetRecog_mbcsGet the IANA name of this charset.- Specified by:
getNamein classCharsetRecog_mbcs- Returns:
- the charset name.
-
getLanguage
Description copied from class:CharsetRecognizerGet the ISO language code for this charset.- Overrides:
getLanguagein classCharsetRecognizer- Returns:
- the language code, or
nullif the language cannot be determined.
-