八个字母中取一就是 12.5%(距离通常相对短语来计算),因此采用这种方法的 CAT 工具用 87.5% 来表示这两个词之间的相似性。
One character in eight represents 12.5% (difference is usually measured against the source phrase), so a CAT tool using this method would indicate that the similarity between the two words is 87.5%.
相似性常常用 Levenshtein 距离来衡量,指的是俄国科学家 Vladimir Levenshtein 在 1965 年发明的算法。
Similarity is often measured using Levenshtein distance, which refers to the algorithm written by Russian scientist Vladimir Levenshtein in 1965.
通过矢量,开发人员可以使用各种指标(比如说曼哈顿距离、欧氏距离或余弦相似性)来计算两个项目之间的距离。
Given the vectors, one can calculate the distance between two items using measures such as the Manhattan distance, Euclidean distance, or cosine similarity.
应用推荐