Objective: I need to consolidate the multiple home and auto insurance policies purchased by an Insured (i.e., the policyholder) and assign them to a single customer account for that Insured.
Problem: I have more than 100K policy records. An Insured can purchase one or more policies, but each policy is currently assigned its own unique customer account number, even when several policies belong to the same Insured. Furthermore, because of the way the information was entered into the database, I can't simply group the policies by the Insured's name, address, or some other field without heavy manual intervention: the same Insured's information may be entered differently from one record to the next (e.g., "Smith, J" vs. "Smith, James", or "1000 E highland, Sac, CA" vs. "East Highland, Sacramento, CA").
Question: Does anyone know of a tool/utility (hopefully free) that could be used to interrogate the records and, through some "fuzzy" matching algorithm, sort those 100K+ policy records into groups, each associated with a single Insured?
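
To illustrate the kind of matching I have in mind, here is a minimal Python sketch using only the standard library's `difflib`. It is not a real solution, just an outline under several assumptions: the field names (`policy_id`, `name`, `address`), the abbreviation map, the 0.8 threshold, and the third sample record are all made up for illustration. Only the first two records reuse the examples above.

```python
from difflib import SequenceMatcher

# Hypothetical abbreviation map for addresses; a real one would be far larger.
ADDRESS_ABBREVIATIONS = {"e": "east", "sac": "sacramento"}

def normalize(text, abbreviations=None):
    """Lowercase, replace punctuation with spaces, expand known abbreviations."""
    abbreviations = abbreviations or {}
    cleaned = "".join(ch if ch.isalnum() else " " for ch in text.lower())
    return " ".join(abbreviations.get(tok, tok) for tok in cleaned.split())

def similarity(a, b, abbreviations=None):
    """Similarity ratio in [0, 1] between two normalized strings."""
    return SequenceMatcher(
        None, normalize(a, abbreviations), normalize(b, abbreviations)
    ).ratio()

def group_policies(policies, threshold=0.8):
    """Group policy IDs whose averaged name/address similarity meets the threshold.

    Uses a small union-find so that matches chain together (A~B and B~C
    put A, B, and C in one group). The threshold is an arbitrary assumption.
    """
    parent = list(range(len(policies)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    for i in range(len(policies)):
        for j in range(i + 1, len(policies)):
            a, b = policies[i], policies[j]
            score = (
                similarity(a["name"], b["name"])
                + similarity(a["address"], b["address"], ADDRESS_ABBREVIATIONS)
            ) / 2
            if score >= threshold:
                parent[find(j)] = find(i)

    groups = {}
    for i, policy in enumerate(policies):
        groups.setdefault(find(i), []).append(policy["policy_id"])
    return list(groups.values())

# Made-up sample records; P1 and P2 reuse the examples from the problem above.
sample = [
    {"policy_id": "P1", "name": "Smith, J", "address": "1000 E highland, Sac, CA"},
    {"policy_id": "P2", "name": "Smith, James", "address": "East Highland, Sacramento, CA"},
    {"policy_id": "P3", "name": "Jones, Mary", "address": "22 Oak Street, Reno, NV"},
]

print(group_policies(sample))  # [['P1', 'P2'], ['P3']]
```

Comparing every pair like this is quadratic, so on 100K+ records it would need some form of pre-bucketing (for example, only comparing records that share a ZIP code or last-name initial) before it could be practical. That is part of why I am hoping an existing tool already handles this.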