Fuzzy Grouping Transformation


This transformation is mainly used to group similar rows of data and clean up duplicates so that the table stays standardized. It requires a connection to a SQL Server database, which the transformation algorithm uses for its temporary tables.
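
To make "similar data" concrete, here is a minimal Python sketch of scoring how alike two text values are. It uses the standard library's `difflib` as a stand-in; it is not the algorithm SSIS uses internally, and the `similarity` helper and the sample city names are assumptions made for illustration.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Return a ratio between 0 and 1; case and outer whitespace are ignored."""
    return SequenceMatcher(None, a.strip().lower(), b.strip().lower()).ratio()


if __name__ == "__main__":
    # Hypothetical City values: a misspelling should score far higher
    # than an unrelated city name.
    print(similarity("New York", "New Yrok"))
    print(similarity("New York", "Chicago"))
```

Values whose score is high enough are treated as the same item, which is the idea behind grouping duplicates.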
Step 1:

Click the Windows button, go to SQL Server 2008 R2, and run it as an administrator.


The window below appears.


Step 2:
In the view that opens, create a new project (the shortcut for New Project is Ctrl + Shift + N).


Step 3:
Give the project a name (e.g. IS) and click the OK button. The window below then appears.


Step 4:

Drag and drop the Data Flow Task.


Step 5: 

Edit the Data Flow Task; this opens the Data Flow.


Step 6:

Drag and drop the OLE DB source and edit it.


Step 7:

Provide an OLE DB connection, select the table from the given database, and select the columns that are required.



Step 8:
Add the Fuzzy Grouping transformation and configure it as described below.

Open the Fuzzy Grouping editor. In the Connection Manager, create an OLE DB connection for the temporary tables that the fuzzy algorithm creates, then select the columns AddressId and City.




Move to the Advanced tab, where you can change the names of the additional fuzzy columns _key_in, _key_out and _score. You can also change the similarity threshold for the fuzzy match.
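
The three extra columns are easier to follow with a small example. In the transformation's output, _key_in identifies each input row, _key_out holds the _key_in of the row chosen to represent its group, and _score is the similarity between the row and that representative. The Python sketch below imitates this with a simple greedy grouping; `fuzzy_group`, `similarity` and the sample values are hypothetical, and `difflib` is only a stand-in for the matching that SSIS performs.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Return a ratio between 0 and 1; case and outer whitespace are ignored."""
    return SequenceMatcher(None, a.strip().lower(), b.strip().lower()).ratio()


def fuzzy_group(values, threshold=0.8):
    """Greedy grouping: a value joins the most similar earlier representative
    if the score reaches the threshold, otherwise it starts its own group."""
    representatives = []  # (key, value) pairs for rows that represent a group
    rows = []
    for key_in, value in enumerate(values, start=1):
        key_out, score = key_in, 1.0  # default: the row represents itself
        candidates = [(similarity(value, rv), rk) for rk, rv in representatives]
        if candidates:
            best_score, best_key = max(candidates)
            if best_score >= threshold:
                key_out, score = best_key, best_score
        if key_out == key_in:
            representatives.append((key_in, value))
        rows.append({"City": value, "_key_in": key_in,
                     "_key_out": key_out, "_score": round(score, 3)})
    return rows


if __name__ == "__main__":
    # Hypothetical City values containing two misspellings.
    for row in fuzzy_group(["New York", "New Yrok", "Chicago", "Chicgo"]):
        print(row)
```

Raising the threshold makes the grouping stricter, so fewer rows are merged; lowering it merges more rows, at the risk of grouping values that are not real duplicates.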


Step 9:
Drag and drop the “DataReader Destination” onto the designer surface, map the target table, and add a “data viewer” to view the results. After that, execute the package.
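
Once the package has run, a typical next step is to keep one row per group. A minimal Python sketch of that idea follows, assuming the output rows are available as dictionaries; the `canonical_rows` helper and the sample rows are hypothetical and only illustrate the rule that a group's representative row has equal _key_in and _key_out values.

```python
def canonical_rows(rows):
    """Keep only the rows that represent their group (_key_in equals _key_out)."""
    return [row for row in rows if row["_key_in"] == row["_key_out"]]


if __name__ == "__main__":
    # Hypothetical output of the Fuzzy Grouping step for AddressId and City.
    sample = [
        {"AddressId": 1, "City": "New York", "_key_in": 1, "_key_out": 1, "_score": 1.0},
        {"AddressId": 2, "City": "New Yrok", "_key_in": 2, "_key_out": 1, "_score": 0.875},
        {"AddressId": 3, "City": "Chicago", "_key_in": 3, "_key_out": 3, "_score": 1.0},
    ]
    for row in canonical_rows(sample):
        print(row["AddressId"], row["City"])
```

Inside the package itself, the same filter can be expressed as a condition comparing _key_in with _key_out before the destination.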







