Skip to main content

Fuzzy Grouping Transformation


This task is mainly used to group some similar data in a row and cleaning the duplicates to maintain a standard of the table. This task requires a connection to the SQL database that the transformation algorithm requires to. 
Step 1:

Click Windows button and go to SQL Server 2008 R2 and run as a administrator.


 It shows  a bellow window.


 Step 2:
Go to a views and create a new project (Short cut of New Project is Ctrl + Shift + N).


Step 3:
Give a project name (e.g. IS) and click OK button and then shows a bellow window.


Step 4:

To drag and drop the Data Flow Task


Step 5: 

To edit the Data Flow Task then it open a Data Flow


Stem 6:

To dag and drop the OLEDB source and edit it.


Step 7:

To give a OLEDB connection and select the table from given data base and select which columns are required.



Step 8:
Add Fuzzy Grouping Task and configure as given below

Open the Fuzzy Grouping editor: In the connection Manger, create OLE DB connection string for the temp table creates for fuzzy algorithm and select the columns as AddressId ,City.




Move to Advance tab and can change the fuzzy additional column names _key_in, _key_out and _score. Also, we can change the similarity threshold for the fuzzy match.


 Step 9:
Drag and drop the “data reader destination” into designer surface then map the target table and add “data viewer” to view the report. After that execute the package.








Comments

Popular posts from this blog

SSRS INTERVIEW QUESTIONS

Q: What is SSRS? Ø   SSRS or SQL Server Reporting Service is a server-based report generation software systems from Microsoft and is part of Microsoft BI. Ø   It is used for preparing and delivering interactive and variety of reports. Ø   It is administered through an web based interface. Ø   Reporting services utilizes a web service interface for supporting and developing of customized reporting applications. Ø   SSRS lets you create very rich reports (Tabular/Graphical/Interactive) from various datasources with rich data visualization (Charts, Maps, sparklines) Ø   SSRS allows are reports to be exported in various formats (Excel, PDF, word etc) Q: Explain SSRS Architecture? Reporting services architecture comprises of integrated components. It is a multi-tiered, included with application, server and data layers. This architecture is scalable and modular. A single installation can be used across multiple computers. It includes the fo...

Exception deserializing the package "The process cannot access the file because it is being used by another process."

TITLE: Microsoft Visual Studio ------------------------------ Failed to start project ------------------------------ ADDITIONAL INFORMATION: Exception deserializing the package "The process cannot access the file 'E:\SSASCube\HistoricalDataLoad\HistoricalDataLoad\bin\Development\HistoricalDataLoad.ispac' because it is being used by another process.". (Microsoft.DataTransformationServices.VsIntegration) ------------------------------ The process cannot access the file 'E:\SSASCube\HistoricalDataLoad\HistoricalDataLoad\bin\Development\HistoricalDataLoad.ispac' because it is being used by another process. (mscorlib) ------------------------------ BUTTONS: OK ------------------------------ While running SSIS package i got the error “The process cannot access the file ‘*.ispac’ because it is being used by another process”. I tried to close SSDT and run it again but, I still got the same error while compiling. Then, after searching over internet, I got...

Failed to execute the package or element. Build errors were encountered

Error: TITLE: Microsoft Visual Studio ------------------------------ Failed to execute the package or element.   Build errors were encountered. For more information, see the Output window. ------------------------------ BUTTONS: OK ------------------------------   Solution: We tried to close SSDT and run it again but, we still got the same error while running SSIS package. Then, we need to follow bellow solution: Step 1: Go to Task Manager–> Details Tab. Step 2: Locate the process “ DtsDebugHost.exe “. Kill this process. There might be multiple instances of this process. Kill all of them. Step 3: Rerun SSIS package