Duplicate Elimination on Dirty Data

When maintaining a database, distinct entries may refer to the same real-world object, and thus be 'duplicates', without having exactly matching keys. Solving this problem requires a formal notion of what counts as a match (duplicate), which can vary by application.
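As an illustration only, one possible notion of a match is a string-similarity threshold over normalized values. The sketch below assumes records are plain strings; the names `normalize`, `is_match`, `eliminate_duplicates` and the threshold of 0.85 are hypothetical choices made for this example, not part of any specific published algorithm.

```python
from difflib import SequenceMatcher


def normalize(value: str) -> str:
    """Lowercase and collapse whitespace so trivial formatting differences vanish."""
    return " ".join(value.lower().split())


def is_match(a: str, b: str, threshold: float = 0.85) -> bool:
    """Hypothetical match predicate: similarity ratio of the normalized strings."""
    ratio = SequenceMatcher(None, normalize(a), normalize(b)).ratio()
    return ratio >= threshold


def eliminate_duplicates(records: list[str], threshold: float = 0.85) -> list[str]:
    """Keep the first record of each group of matches (naive pairwise scan)."""
    kept: list[str] = []
    for record in records:
        # Compare against every record kept so far; skip it if any one matches.
        if not any(is_match(record, k, threshold) for k in kept):
            kept.append(record)
    return kept


records = ["John Smith", "john  smith", "Jon Smith", "Alice Jones"]
print(eliminate_duplicates(records))  # → ['John Smith', 'Alice Jones']
```

This naive scan performs up to a quadratic number of comparisons in the number of records, and its result depends on the order of the input because the similarity predicate is not transitive. A different application could swap in a different `is_match` without changing the rest of the sketch.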

Parameters

  • n: number of records

Algorithms Table

Insufficient data to display table

Reductions Table

Insufficient data to display table

Other relevant algorithms

Insufficient data to display table