Class CosineDistance

java.lang.Object
org.apache.commons.text.similarity.CosineDistance
All Implemented Interfaces:
EditDistance<Double>, SimilarityScore<Double>

public class CosineDistance extends Object implements EditDistance<Double>
Measures the cosine distance between two character sequences.

It utilizes the CosineSimilarity to compute the distance. Character sequences are converted into vectors through a simple tokenizer that works with a regular expression to split words in a sentence.

For further explanation about Cosine Similarity and Cosine Distance, refer to http://en.wikipedia.org/wiki/Cosine_similarity.

Since:
1.0
See Also:
  • Field Details

    • tokenizer

      private final Tokenizer<CharSequence> tokenizer
      Tokenizer used to convert the character sequence into a vector.
    • cosineSimilarity

      private final CosineSimilarity cosineSimilarity
      Cosine similarity.
  • Constructor Details

    • CosineDistance

      public CosineDistance()
  • Method Details