A method of identifying substantially similar data fields may include accessing a first/second text strings, performing an approximate string-matching algorithm that generates a first score, and comparing the first score to a first threshold. The method also includes performing an exact string-matching algorithm that generates a second score and comparing the second score to a second threshold. The method further includes generating a third score by combining a weighted first score with a weighted second score and comparing the third score to a third threshold. The method also includes determining whether the first text string is substantially similar to the second text string based on the three score comparisons.
展开▼