Class CharsetRecog_2022

    • Constructor Detail

      • CharsetRecog_2022

        CharsetRecog_2022()
    • Method Detail

      • match

        int match​(byte[] text,
                  int textLen,
                  byte[][] escapeSequences)
        Matching function shared among the 2022 detectors JP, CN and KR Counts up the number of legal an unrecognized escape sequences in the sample of text, and computes a score based on the total number & the proportion that fit the encoding.
        Parameters:
        text - the byte buffer containing text to analyse
        textLen - the size of the text in the byte.
        escapeSequences - the byte escape sequences to test for.
        Returns:
        match quality, in the range of 0-100.