SimilarityCoefficient - Maple Help

StringTools[SimilarityCoefficient]

computes the similarity coefficient of two strings


Calling Sequence

SimilarityCoefficient( s, t )

SimilarityCoefficient( s, t, n )

Parameters

s - Maple string

t - Maple string

n - (optional) positive integer

Description

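• The SimilarityCoefficient(s, t) command computes the similarity coefficient of two strings s and t, defined as follows.

  Let N(S) denote the set of trigrams (substrings of length 3) of a string S. The similarity coefficient of s and t is $\dfrac{\operatorname{nops}(N(s) \cup N(t))}{\operatorname{nops}(N(s) \cap N(t))}$, that is, the number of distinct trigrams occurring in either string divided by the number occurring in both. A value of 1 means the two strings have identical trigram sets; larger values indicate less overlap. By convention, strings having no trigrams in common have similarity coefficient equal to infinity.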

• An optional argument n may be specified, causing the similarity coefficient to be computed from the n-grams of s and t instead of the default trigrams.

• In typical applications, n is taken to be either 2 or 3 (the default). Note that Maple computes this measure as an exact rational quantity, rather than a floating-point approximation. You can obtain a floating-point result by applying evalf to the result.

• All of the StringTools package commands treat strings as (null-terminated) sequences of 8-bit (ASCII) characters. Thus, there is no support for multibyte character encodings, such as Unicode encodings.
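
The definition given in this section can be illustrated outside Maple with a minimal Python sketch. This is not Maple's implementation: the helper names ngrams and similarity_coefficient are hypothetical, and the sketch works on Python characters rather than 8-bit characters, so it should only be expected to agree with Maple for ASCII input. It uses an exact Fraction to mirror the exact rational result described above.

```python
from fractions import Fraction
import math


def ngrams(s, n=3):
    """Return the set of distinct length-n substrings (n-grams) of s.

    Hypothetical helper; strings shorter than n yield the empty set.
    """
    return {s[i:i + n] for i in range(len(s) - n + 1)}


def similarity_coefficient(s, t, n=3):
    """Size of the union of the n-gram sets divided by the size of their intersection.

    Returns an exact Fraction, or math.inf when the strings share no n-grams
    (the convention stated in the Description).
    """
    a, b = ngrams(s, n), ngrams(t, n)
    common = a & b
    if not common:
        return math.inf
    return Fraction(len(a | b), len(common))


print(similarity_coefficient("Canada", "Canary"))         # 3
print(similarity_coefficient("Constance", "Connor", 2))   # 11/2
print(float(similarity_coefficient("Constance", "Connor", 2)))  # 5.5
print(similarity_coefficient("Constance", "Connor", 4))   # inf
```

Converting the Fraction with float plays the role that evalf plays in Maple: the exact value 11/2 becomes the floating-point value 5.5.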

Examples

with(StringTools):

SimilarityCoefficient("Canada", "Canary")

3

(1)

SimilarityCoefficient("Kline", "Cline")

2

(2)

SimilarityCoefficient("mathematics", "mathematische")

2

(3)

SimilarityCoefficient("Constance", "Connor", 1)

3

(4)

SimilarityCoefficient("Constance", "Connor", 2)

11/2

(5)

SimilarityCoefficient("Constance", "Connor", 3)

10

(6)

SimilarityCoefficient("Constance", "Connor", 4)

∞

(7)

SimilarityCoefficient("Constance", "Connor", 5)

∞

(8)

See Also

evalf

string

StringTools

StringTools[NGrams]