Class CosineDistance
java.lang.Object
org.apache.commons.text.similarity.CosineDistance
- All Implemented Interfaces:
EditDistance<Double>, SimilarityScore<Double>
Measures the cosine distance between two character sequences.
It uses CosineSimilarity to compute the distance, which is 1 minus the cosine similarity of the two inputs.
Character sequences are converted into vectors by a simple tokenizer that uses a regular expression
to split a sentence into words.
For further explanation about Cosine Similarity and Cosine Distance, refer to http://en.wikipedia.org/wiki/Cosine_similarity.
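The computation described above can be illustrated with a small, dependency-free sketch. It is not the library's implementation: the class name CosineDistanceSketch, the helper names, the `\w+` token pattern, and the handling of inputs with no tokens are all assumptions made for illustration.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration class; not part of Apache Commons Text.
public class CosineDistanceSketch {

    // Assumption: a token is a run of word characters. The library's tokenizer may differ.
    private static final Pattern WORD = Pattern.compile("\\w+");

    // Converts a character sequence into a term-frequency vector (token -> count).
    static Map<String, Integer> toVector(CharSequence text) {
        Map<String, Integer> counts = new HashMap<>();
        Matcher matcher = WORD.matcher(text);
        while (matcher.find()) {
            counts.merge(matcher.group(), 1, Integer::sum);
        }
        return counts;
    }

    // Euclidean norm of a term-frequency vector.
    static double norm(Map<String, Integer> vector) {
        double sumOfSquares = 0.0;
        for (int count : vector.values()) {
            sumOfSquares += (double) count * count;
        }
        return Math.sqrt(sumOfSquares);
    }

    // Cosine similarity: dot product divided by the product of the norms.
    static double similarity(Map<String, Integer> a, Map<String, Integer> b) {
        double dot = 0.0;
        for (Map.Entry<String, Integer> entry : a.entrySet()) {
            Integer other = b.get(entry.getKey());
            if (other != null) {
                dot += (double) entry.getValue() * other;
            }
        }
        double normA = norm(a);
        double normB = norm(b);
        // Assumption: a vector with no tokens is treated as having similarity 0.
        if (normA == 0.0 || normB == 0.0) {
            return 0.0;
        }
        return dot / (normA * normB);
    }

    // Cosine distance is 1 minus the cosine similarity.
    static double distance(CharSequence left, CharSequence right) {
        return 1.0 - similarity(toVector(left), toVector(right));
    }

    public static void main(String[] args) {
        // Identical token counts give distance 0.0.
        System.out.println(distance("the quick brown fox", "the quick brown fox"));
        // No shared tokens give distance 1.0.
        System.out.println(distance("hello world", "goodbye moon"));
        // Partial overlap gives a value strictly between 0 and 1.
        System.out.println(distance("the quick brown fox", "the quick red fox"));
    }
}
```

Because the vectors hold token counts, word order does not affect the result: in this sketch "brown fox" and "fox brown" are at distance 0 (up to floating-point rounding).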
- Since:
- 1.0
Field Summary
Fields
Modifier and Type | Field | Description
private final CosineSimilarity | cosineSimilarity | Cosine similarity.
private final Tokenizer<CharSequence> | tokenizer | Tokenizer used to convert the character sequence into a vector.
Constructor Summary
Constructors
CosineDistance()
Method Summary
Modifier and Type | Method | Description
Double | apply(CharSequence left, CharSequence right) | Compares two CharSequences.
-
Field Details
-
tokenizer
private final Tokenizer<CharSequence> tokenizer
Tokenizer used to convert the character sequence into a vector.
-
cosineSimilarity
private final CosineSimilarity cosineSimilarity
Cosine similarity.
-
-
Constructor Details
-
CosineDistance
public CosineDistance()
-
-
Method Details
-
apply
Description copied from interface: EditDistance
Compares two CharSequences.
- Specified by:
apply in interface EditDistance<Double>
- Specified by:
apply in interface SimilarityScore<Double>
- Parameters:
left - the first CharSequence
right - the second CharSequence
- Returns:
- The similarity score between two CharSequences
-