Clean data by using the SSIS DQS task
SQL Server Integration Services (SSIS) includes a DQS Transformation to correct data from a connected data source. The DQS Transformation Output column is populated with one of five status as listed below. You can use a Conditional Split Transformation to separate out the output records based on its status. You can apply different business logic to each stream or insert them to a table of file destination for manual processing.
To clean data using the SSIS DQS task follow these steps:
- Open a new SSIS package in Visual Studio
- Create a new Data Flow Task
- Add a Source Component to extract the data to be cleansed
- Add a DQS Cleansing transformation and connect it to the Source Component Output
- Edit the DQS Cleansing transformation and set the Data Quality connection manager and Data Quality Knowledge Base to use for the cleansing activity
- Click on the Mapping tab and map the Input Columns to the corresponding Knowledge Base Domain, then click OK
- Add a Conditional Split transformation and connect it to the DQs Cleansing Output
- Edit the Conditional Split transformation and add the following five outputs as shown
Add a destination component to each of the outputs from the Conditional Split transformation.
Comments
Post a Comment