Package org.jcodings.specific
Class UTF16BEEncoding
- java.lang.Object
-
- org.jcodings.Encoding
-
- org.jcodings.AbstractEncoding
-
- org.jcodings.MultiByteEncoding
-
- org.jcodings.unicode.UnicodeEncoding
-
- org.jcodings.specific.UTF16BEEncoding
-
- All Implemented Interfaces:
java.lang.Cloneable
public final class UTF16BEEncoding extends UnicodeEncoding
-
-
Field Summary
Fields Modifier and Type Field Description static UTF16BEEncoding
INSTANCE
(package private) static int[]
UTF16EncLen
-
Constructor Summary
Constructors Modifier Constructor Description protected
UTF16BEEncoding()
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description int
codeToMbc(int code, byte[] bytes, int p)
Extracts code point into it's multibyte representationint
codeToMbcLength(int code)
Returns character length given a code point Oniguruma equivalent:code_to_mbclen
int[]
ctypeCodeRange(int ctype, IntHolder sbOut)
onigenc_utf16_32_get_ctype_code_rangeboolean
isNewLine(byte[] bytes, int p, int end)
onigenc_is_mbc_newline_0x0a / used also by multibyte encodingsboolean
isReverseMatchAllowed(byte[] bytes, int p, int end)
Returns true if it's safe to use reversal Boyer-Moore search fail fast algorithm Oniguruma equivalent:is_allowed_reverse_match
private static boolean
isSurrogate(int c)
private static boolean
isSurrogateFirst(int c)
private static boolean
isSurrogateSecond(int c)
int
leftAdjustCharHead(byte[] bytes, int p, int s, int end)
Seeks the previous character head in a stream Oniguruma equivalent:left_adjust_char_head
int
length(byte[] bytes, int p, int end)
Returns character length given stream, character position and stream end returns1
for singlebyte encodings or performs sanity validations for multibyte ones and returns the character length, missing characters in the stream otherwiseint
mbcCaseFold(int flag, byte[] bytes, IntHolder pp, int end, byte[] fold)
onigenc_ascii_mbc_case_foldint
mbcToCode(byte[] bytes, int p, int end)
Returns code point for a character Oniguruma equivalent:mbc_to_code
-
Methods inherited from class org.jcodings.unicode.UnicodeEncoding
applyAllCaseFold, caseFoldCodesByString, caseMap, ctypeCodeRange, getCharsetName, isCodeCType, isInCodeRange, propertyNameToCType
-
Methods inherited from class org.jcodings.MultiByteEncoding
isInRange, length, lengthForTwoUptoFour, mb2CodeToMbc, mb2CodeToMbcLength, mb2IsCodeCType, mb4CodeToMbc, mb4CodeToMbcLength, mb4IsCodeCType, mbnMbcCaseFold, mbnMbcToCode, missing, missing, safeLengthForUptoFour, safeLengthForUptoThree, safeLengthForUptoTwo, strCodeAt, strLength
-
Methods inherited from class org.jcodings.AbstractEncoding
asciiApplyAllCaseFold, asciiCaseFoldCodesByString, asciiMbcCaseFold, isCodeCTypeInternal
-
Methods inherited from class org.jcodings.Encoding
asciiToLower, asciiToUpper, digitVal, equals, getCharset, getIndex, getName, hashCode, isAlnum, isAlpha, isAscii, isAscii, isAsciiCompatible, isBlank, isCntrl, isDigit, isDummy, isFixedWidth, isGraph, isLower, isMbcAscii, isMbcCrnl, isMbcHead, isMbcWord, isNewLine, isPrint, isPunct, isSbWord, isSingleByte, isSpace, isUnicode, isUpper, isUTF8, isWord, isWordGraphPrint, isXDigit, load, load, maxLength, maxLengthDistance, mbcodeStartPosition, minLength, odigitVal, prevCharHead, rightAdjustCharHead, rightAdjustCharHeadWithPrev, setDummy, setName, setName, step, stepBack, strByteLengthNull, strLengthNull, strNCmp, toLowerCaseTable, toString, xdigitVal
-
-
-
-
Field Detail
-
UTF16EncLen
static final int[] UTF16EncLen
-
INSTANCE
public static final UTF16BEEncoding INSTANCE
-
-
Method Detail
-
length
public int length(byte[] bytes, int p, int end)
Description copied from class:Encoding
Returns character length given stream, character position and stream end returns1
for singlebyte encodings or performs sanity validations for multibyte ones and returns the character length, missing characters in the stream otherwise
-
isNewLine
public boolean isNewLine(byte[] bytes, int p, int end)
Description copied from class:AbstractEncoding
onigenc_is_mbc_newline_0x0a / used also by multibyte encodings- Overrides:
isNewLine
in classAbstractEncoding
-
mbcToCode
public int mbcToCode(byte[] bytes, int p, int end)
Description copied from class:Encoding
Returns code point for a character Oniguruma equivalent:mbc_to_code
-
codeToMbcLength
public int codeToMbcLength(int code)
Description copied from class:Encoding
Returns character length given a code point Oniguruma equivalent:code_to_mbclen
- Specified by:
codeToMbcLength
in classEncoding
-
codeToMbc
public int codeToMbc(int code, byte[] bytes, int p)
Description copied from class:Encoding
Extracts code point into it's multibyte representation
-
mbcCaseFold
public int mbcCaseFold(int flag, byte[] bytes, IntHolder pp, int end, byte[] fold)
Description copied from class:AbstractEncoding
onigenc_ascii_mbc_case_fold- Overrides:
mbcCaseFold
in classUnicodeEncoding
- Parameters:
flag
- case fold flagpp
- anIntHolder
that points at character headfold
- a buffer where to extract case folded character Oniguruma equivalent:mbc_case_fold
-
ctypeCodeRange
public int[] ctypeCodeRange(int ctype, IntHolder sbOut)
onigenc_utf16_32_get_ctype_code_range- Specified by:
ctypeCodeRange
in classEncoding
-
leftAdjustCharHead
public int leftAdjustCharHead(byte[] bytes, int p, int s, int end)
Description copied from class:Encoding
Seeks the previous character head in a stream Oniguruma equivalent:left_adjust_char_head
- Specified by:
leftAdjustCharHead
in classEncoding
- Parameters:
bytes
- byte streamp
- positions
- stopend
- end
-
isReverseMatchAllowed
public boolean isReverseMatchAllowed(byte[] bytes, int p, int end)
Description copied from class:Encoding
Returns true if it's safe to use reversal Boyer-Moore search fail fast algorithm Oniguruma equivalent:is_allowed_reverse_match
- Specified by:
isReverseMatchAllowed
in classEncoding
-
isSurrogateFirst
private static boolean isSurrogateFirst(int c)
-
isSurrogateSecond
private static boolean isSurrogateSecond(int c)
-
isSurrogate
private static boolean isSurrogate(int c)
-
-