SQL Power Group Inc., a Toronto-based data warehousing vendor, has published the source code of Power Matchmaker, its open source data cleansing tool.
Designed to eliminate duplicates and build cross-references between sources and data tables, Power Matchmaker can be run against an entire database. It also validates and corrects address information. It builds cross reference tables for linking source system identifiers to target primary keys and merges duplicate data.
The vendor claims this is a low-cost alternative to SAS Institute’s Data Flux.
Last July, SQL Power Group published the source code to Power Architect, it’s data modelling tool.
See also BI Specialist takes data modelling tool open source