Clean data by using the SSIS DQS task

SQL Server Integration Services (SSIS) includes a DQS Transformation to correct data from a connected data source. The DQS Transformation Output column is populated with one of five status as listed below. You can use a Conditional Split Transformation to separate out the output records based on its status. You can apply different business logic to each stream or insert them to a table of file destination for manual processing.

To clean data using the SSIS DQS task follow these steps:
  • Open a new SSIS package in Visual Studio
  • Create a new Data Flow Task
  • Add a Source Component to extract the data to be cleansed
  • Add a DQS Cleansing transformation and connect it to the Source Component Output
  • Edit the DQS Cleansing transformation and set the Data Quality connection manager and Data Quality Knowledge Base to use for the cleansing activity
  • Click on the Mapping tab and map the Input Columns to the corresponding Knowledge Base Domain, then click OK
  • Add a Conditional Split transformation and connect it to the DQs Cleansing Output
  • Edit the Conditional Split transformation and add the following five outputs as shown








Add a destination component to each of the outputs from the Conditional Split transformation.

Comments

Popular Posts

Non-blocking, semi-blocking, and blocking transforms in SSIS

Implement additive, semi-additive, and non-additive measures

Implement an ETL solution that supports incremental data extraction