Package org.jcodings.specific
Class BaseEUCJPEncoding
- java.lang.Object
-
- org.jcodings.Encoding
-
- org.jcodings.AbstractEncoding
-
- org.jcodings.MultiByteEncoding
-
- org.jcodings.EucEncoding
-
- org.jcodings.specific.BaseEUCJPEncoding
-
- All Implemented Interfaces:
java.lang.Cloneable
- Direct Known Subclasses:
EUCJPEncoding
,NonStrictEUCJPEncoding
abstract class BaseEUCJPEncoding extends EucEncoding
-
-
Field Summary
Fields Modifier and Type Field Description private static int[]
CR_Cyrillic
private static int[]
CR_Greek
private static int[]
CR_Han
private static int[]
CR_Hiragana
private static int[]
CR_Katakana
private static int[]
CR_Latin
private static CaseInsensitiveBytesHash<java.lang.Integer>
CTypeNameHash
(package private) static int[]
EUCJPEncLen
private static int[][]
PropertyList
-
Constructor Summary
Constructors Modifier Constructor Description protected
BaseEUCJPEncoding(int[][] Trans)
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description int
codeToMbc(int code, byte[] bytes, int p)
Extracts code point into it's multibyte representationint
codeToMbcLength(int code)
Returns character length given a code point Oniguruma equivalent:code_to_mbclen
int[]
ctypeCodeRange(int ctype, IntHolder sbOut)
Returns code range for a given character type Oniguruma equivalent:get_ctype_code_range
private static int
getLowerCase(int code)
boolean
isCodeCType(int code, int ctype)
Perform a check whether given code is of given character type (e.g.protected boolean
isLead(int c)
boolean
isReverseMatchAllowed(byte[] bytes, int p, int end)
Returns true if it's safe to use reversal Boyer-Moore search fail fast algorithm Oniguruma equivalent:is_allowed_reverse_match
int
mbcCaseFold(int flag, byte[] bytes, IntHolder pp, int end, byte[] lower)
onigenc_ascii_mbc_case_foldint
mbcToCode(byte[] bytes, int p, int end)
Returns code point for a character Oniguruma equivalent:mbc_to_code
int
propertyNameToCType(byte[] bytes, int p, int end)
onigenc_minimum_property_name_to_ctype notably overridden by unicode encodings-
Methods inherited from class org.jcodings.EucEncoding
leftAdjustCharHead
-
Methods inherited from class org.jcodings.MultiByteEncoding
caseMap, isInRange, length, lengthForTwoUptoFour, mb2CodeToMbc, mb2CodeToMbcLength, mb2IsCodeCType, mb4CodeToMbc, mb4CodeToMbcLength, mb4IsCodeCType, mbnMbcCaseFold, mbnMbcToCode, missing, missing, safeLengthForUptoFour, safeLengthForUptoThree, safeLengthForUptoTwo, strCodeAt, strLength
-
Methods inherited from class org.jcodings.AbstractEncoding
applyAllCaseFold, asciiApplyAllCaseFold, asciiCaseFoldCodesByString, asciiMbcCaseFold, caseFoldCodesByString, isCodeCTypeInternal, isNewLine
-
Methods inherited from class org.jcodings.Encoding
asciiToLower, asciiToUpper, digitVal, equals, getCharset, getCharsetName, getIndex, getName, hashCode, isAlnum, isAlpha, isAscii, isAscii, isAsciiCompatible, isBlank, isCntrl, isDigit, isDummy, isFixedWidth, isGraph, isLower, isMbcAscii, isMbcCrnl, isMbcHead, isMbcWord, isNewLine, isPrint, isPunct, isSbWord, isSingleByte, isSpace, isUnicode, isUpper, isUTF8, isWord, isWordGraphPrint, isXDigit, length, load, load, maxLength, maxLengthDistance, mbcodeStartPosition, minLength, odigitVal, prevCharHead, rightAdjustCharHead, rightAdjustCharHeadWithPrev, setDummy, setName, setName, step, stepBack, strByteLengthNull, strLengthNull, strNCmp, toLowerCaseTable, toString, xdigitVal
-
-
-
-
Field Detail
-
CR_Hiragana
private static final int[] CR_Hiragana
-
CR_Katakana
private static final int[] CR_Katakana
-
CR_Han
private static final int[] CR_Han
-
CR_Latin
private static final int[] CR_Latin
-
CR_Greek
private static final int[] CR_Greek
-
CR_Cyrillic
private static final int[] CR_Cyrillic
-
PropertyList
private static final int[][] PropertyList
-
CTypeNameHash
private static final CaseInsensitiveBytesHash<java.lang.Integer> CTypeNameHash
-
EUCJPEncLen
static final int[] EUCJPEncLen
-
-
Method Detail
-
mbcToCode
public int mbcToCode(byte[] bytes, int p, int end)
Description copied from class:Encoding
Returns code point for a character Oniguruma equivalent:mbc_to_code
-
codeToMbcLength
public int codeToMbcLength(int code)
Description copied from class:Encoding
Returns character length given a code point Oniguruma equivalent:code_to_mbclen
- Specified by:
codeToMbcLength
in classEncoding
-
codeToMbc
public int codeToMbc(int code, byte[] bytes, int p)
Description copied from class:Encoding
Extracts code point into it's multibyte representation
-
getLowerCase
private static int getLowerCase(int code)
-
mbcCaseFold
public int mbcCaseFold(int flag, byte[] bytes, IntHolder pp, int end, byte[] lower)
Description copied from class:AbstractEncoding
onigenc_ascii_mbc_case_fold- Overrides:
mbcCaseFold
in classAbstractEncoding
- Parameters:
flag
- case fold flagpp
- anIntHolder
that points at character headlower
- a buffer where to extract case folded character Oniguruma equivalent:mbc_case_fold
-
isLead
protected boolean isLead(int c)
- Specified by:
isLead
in classEucEncoding
-
isReverseMatchAllowed
public boolean isReverseMatchAllowed(byte[] bytes, int p, int end)
Description copied from class:Encoding
Returns true if it's safe to use reversal Boyer-Moore search fail fast algorithm Oniguruma equivalent:is_allowed_reverse_match
- Specified by:
isReverseMatchAllowed
in classEncoding
-
propertyNameToCType
public int propertyNameToCType(byte[] bytes, int p, int end)
Description copied from class:AbstractEncoding
onigenc_minimum_property_name_to_ctype notably overridden by unicode encodings- Overrides:
propertyNameToCType
in classAbstractEncoding
-
isCodeCType
public boolean isCodeCType(int code, int ctype)
Description copied from class:Encoding
Perform a check whether given code is of given character type (e.g. used by isWord(someByte) and similar methods)- Specified by:
isCodeCType
in classEncoding
- Parameters:
code
- a code point of a characterctype
- a character type to check against Oniguruma equivalent:is_code_ctype
-
ctypeCodeRange
public int[] ctypeCodeRange(int ctype, IntHolder sbOut)
Description copied from class:Encoding
Returns code range for a given character type Oniguruma equivalent:get_ctype_code_range
- Specified by:
ctypeCodeRange
in classEncoding
-
-