public class JaroWinkler extends java.lang.Object implements Comparator
Title: Dataspace Framework
Description: An implementation of the Jaro-Winkler string similarity measure. The implementation follows the description in the paper "Evaluating String Comparator Performance for Record Linkage", by William E. Yancey, RESEARCH REPORT SERIES (Statistics #2005-05), US Bureau of the Census. http://www.census.gov/srd/papers/pdf/rrs2005-05.pdf
Copyright: Copyright (c) 2013
Company: StreamScape Technologies
Modifier and Type | Field and Description |
---|---|
static java.lang.String | NAME |
Constructor and Description |
---|
JaroWinkler() |
Modifier and Type | Method and Description |
---|---|
double | compare(java.lang.String s1, java.lang.String s2) |
boolean | isTokenized() Returns true if the comparator breaks string values up into tokens when comparing. |
static double | similarity(java.lang.String s1, java.lang.String s2) Returns a normalized score, where 0.0 means no similarity at all and 1.0 means full equality. |
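public double compare(java.lang.String s1, java.lang.String s2)
Specified by: compare in interface Comparator
public boolean isTokenized()
Description copied from interface: Comparator
Returns true if the comparator breaks string values up into tokens when comparing.
Specified by: isTokenized in interface Comparator
public static double similarity(java.lang.String s1, java.lang.String s2)
Returns a normalized score, where 0.0 means no similarity at all and 1.0 means full equality.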
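To illustrate the kind of normalized score that similarity returns, here is a minimal, self-contained sketch of the textbook Jaro-Winkler measure. It is not the StreamScape implementation. The class name JaroWinklerSketch, the prefix scale of 0.1, and the prefix cap of 4 characters are assumptions chosen for illustration, and the sketch omits any further adjustments the cited report may describe.

```java
// Hypothetical stand-in class; NOT the library's JaroWinkler.
final class JaroWinklerSketch {

    // Assumed constants: the commonly used Winkler prefix scale and prefix cap.
    private static final double PREFIX_SCALE = 0.1;
    private static final int MAX_PREFIX = 4;

    /** Returns a score in [0.0, 1.0]; 1.0 means the strings are equal. */
    static double similarity(String s1, String s2) {
        if (s1.equals(s2)) {
            return 1.0;
        }
        int len1 = s1.length();
        int len2 = s2.length();
        if (len1 == 0 || len2 == 0) {
            return 0.0;
        }

        // Characters match only if they are equal and within this distance.
        int window = Math.max(0, Math.max(len1, len2) / 2 - 1);
        boolean[] matched1 = new boolean[len1];
        boolean[] matched2 = new boolean[len2];
        int matches = 0;
        for (int i = 0; i < len1; i++) {
            int lo = Math.max(0, i - window);
            int hi = Math.min(len2 - 1, i + window);
            for (int j = lo; j <= hi; j++) {
                if (!matched2[j] && s1.charAt(i) == s2.charAt(j)) {
                    matched1[i] = true;
                    matched2[j] = true;
                    matches++;
                    break;
                }
            }
        }
        if (matches == 0) {
            return 0.0;
        }

        // Count matched characters that appear in a different order.
        int outOfOrder = 0;
        int k = 0;
        for (int i = 0; i < len1; i++) {
            if (!matched1[i]) {
                continue;
            }
            while (!matched2[k]) {
                k++;
            }
            if (s1.charAt(i) != s2.charAt(k)) {
                outOfOrder++;
            }
            k++;
        }
        int transpositions = outOfOrder / 2;

        double m = matches;
        double jaro = (m / len1 + m / len2 + (m - transpositions) / m) / 3.0;

        // Winkler adjustment: reward a shared prefix of up to MAX_PREFIX characters.
        int prefix = 0;
        int limit = Math.min(MAX_PREFIX, Math.min(len1, len2));
        while (prefix < limit && s1.charAt(prefix) == s2.charAt(prefix)) {
            prefix++;
        }
        return jaro + prefix * PREFIX_SCALE * (1.0 - jaro);
    }

    public static void main(String[] args) {
        System.out.println(similarity("MARTHA", "MARHTA"));
        System.out.println(similarity("DIXON", "DICKSONX"));
    }
}
```

Under these assumptions, "MARTHA" and "MARHTA" score roughly 0.961 and "DIXON" and "DICKSONX" roughly 0.813. Scores from the library's similarity method may differ wherever its implementation departs from this basic form.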
Copyright © 2015-2024 StreamScape Technologies. All rights reserved.