Data Integration In
Data Mining
www.rootfacts.com
Key Points
Why Is Data Integration In Data Mining Important?
What are two major systems for data integration?
What are the Issues Of Data Integration in Data
Mining?
Why Is Data Integration In Data
Mining Important?
Data Integration is a data processing
technique that collects data from
different sources (such as data
cubes, multiple databases, and flat
files) and offers a unified view of the
data to the users.
Data integration in data mining
has to deal with issues such as duplicate
data, inconsistent data, and legacy systems.
Integration can be carried out manually,
through middleware, or through
applications.
What are two major systems for data integration?
There are primarily two major systems for data integration,
which are as follows:
Tight Coupling
Loose Coupling
Tight Coupling
In this method, the data warehouse is
treated as an information retrieval
component. Data from the different sources
is combined into a single physical location
through a process known as ETL, which
means Extraction, Transformation, and
Loading.
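
The ETL process described above can be sketched roughly as follows. This is a minimal illustration, not a production pipeline: the source records, the customers table, and the function names are all hypothetical, and an in-memory SQLite database stands in for the data warehouse.

```python
import sqlite3

# Hypothetical source records, standing in for a flat file and a database.
flat_file_rows = [
    {"cust_id": "1", "name": " alice ", "total": "10.50"},
    {"cust_id": "2", "name": "Bob", "total": "7.25"},
]
database_rows = [(3, "carol", 12.0)]


def extract():
    """Extract: gather raw records from every source."""
    for row in flat_file_rows:
        yield row["cust_id"], row["name"], row["total"]
    yield from database_rows


def transform(records):
    """Transform: coerce types and bring names into one format."""
    for cust_id, name, total in records:
        yield int(cust_id), name.strip().title(), float(total)


def load(records, conn):
    """Load: write the cleaned records into the warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS customers "
        "(id INTEGER PRIMARY KEY, name TEXT, total REAL)"
    )
    conn.executemany("INSERT INTO customers VALUES (?, ?, ?)", records)
    conn.commit()


# An in-memory SQLite database plays the role of the warehouse (assumption).
warehouse = sqlite3.connect(":memory:")
load(transform(extract()), warehouse)
rows = warehouse.execute(
    "SELECT id, name, total FROM customers ORDER BY id"
).fetchall()
```

After loading, every query runs against the single warehouse copy rather than against the original sources.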
Loose Coupling
In this method, an interface is offered
that takes a query from the user,
transforms it into a form the source
databases can understand, sends it
directly to the source databases, and
returns the result. The data stays in
the source databases.
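
A loosely coupled interface of this kind might look like the sketch below. It is only an illustration under stated assumptions: the two source databases, their table and column names, and the customers_in function are hypothetical, and in-memory SQLite databases stand in for the real sources.

```python
import sqlite3


def make_source(ddl, insert_sql, rows):
    """Build a hypothetical in-memory source database."""
    conn = sqlite3.connect(":memory:")
    conn.execute(ddl)
    conn.executemany(insert_sql, rows)
    return conn


# Two sources that hold similar data under different schemas (assumption).
sales_db = make_source(
    "CREATE TABLE clients (client_name TEXT, city TEXT)",
    "INSERT INTO clients VALUES (?, ?)",
    [("Alice", "Pune"), ("Bob", "Delhi")],
)
support_db = make_source(
    "CREATE TABLE users (full_name TEXT, location TEXT)",
    "INSERT INTO users VALUES (?, ?)",
    [("Carol", "Pune")],
)

# The interface keeps one translated query per source; no data is copied.
SOURCES = [
    (sales_db, "SELECT client_name FROM clients WHERE city = ?"),
    (support_db, "SELECT full_name FROM users WHERE location = ?"),
]


def customers_in(city):
    """Send the user's query to each source and merge the answers."""
    results = []
    for conn, sql in SOURCES:
        results.extend(name for (name,) in conn.execute(sql, (city,)))
    return sorted(results)


pune_customers = customers_in("Pune")
```

The user asks one question, and the interface rewrites it for each source's schema at query time.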
What are the Issues Of Data
Integration in Data Mining?
There are three main issues to consider
during data integration in data mining:
schema integration, redundancy, and
detection and resolution of data value
conflicts.
Some redundancies can be caught with the help of correlation analysis.
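
As a rough sketch of how correlation analysis can flag a redundant attribute, the example below computes the Pearson correlation coefficient between pairs of numeric columns and reports pairs above a cut-off. The column names, the sample values, and the 0.95 threshold are illustrative assumptions.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Hypothetical integrated data set: the price appears twice in different units.
columns = {
    "price_usd": [10.0, 20.0, 30.0, 40.0],
    "price_cents": [1000.0, 2000.0, 3000.0, 4000.0],
    "units_sold": [7.0, 3.0, 9.0, 4.0],
}


def redundant_pairs(cols, threshold=0.95):
    """Return attribute pairs whose absolute correlation meets the threshold."""
    names = list(cols)
    return [
        (a, b)
        for i, a in enumerate(names)
        for b in names[i + 1:]
        if abs(pearson(cols[a], cols[b])) >= threshold
    ]


flagged = redundant_pairs(columns)
```

A flagged pair is only a candidate: a strong correlation suggests that one attribute may be derivable from the other, and someone still has to confirm it.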
1. Schema Integration - Metadata from multiple sources has to be
integrated. Matching equivalent real-world entities across those
sources is known as the entity identification problem.
2. Redundancy - An attribute may be redundant if it can be derived
from another attribute or set of attributes. Inconsistencies in
attribute naming can also cause redundancies in the resulting data set.
3. Detection and resolution of data value
conflicts - This is the third critical issue in
data integration. Here the attribute values
collected from different sources may differ
for the same real-world entity. An attribute
in one system may be recorded at
a lower level of abstraction than
the “same” attribute
in another.
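
One way to detect data value conflicts is to convert the values from each source to a common representation and compare them per entity. The sketch below assumes a scenario that is not in the slides: two hypothetical sources record product weight, one in kilograms and one in pounds, and a difference above a chosen tolerance counts as a conflict.

```python
LB_TO_KG = 0.45359237  # kilograms per pound

# Hypothetical sources keyed by product ID: (value, unit).
source_a = {"P-1": (2.0, "kg"), "P-2": (5.0, "kg")}
source_b = {"P-1": (4.41, "lb"), "P-2": (9.0, "lb")}


def to_kg(value, unit):
    """Convert a weight to kilograms so both sources are comparable."""
    if unit == "kg":
        return value
    if unit == "lb":
        return value * LB_TO_KG
    raise ValueError(f"unknown unit: {unit}")


def find_conflicts(a, b, tolerance=0.05):
    """List entities whose converted values differ by more than tolerance."""
    conflicts = []
    for key in sorted(a.keys() & b.keys()):
        kg_a, kg_b = to_kg(*a[key]), to_kg(*b[key])
        if abs(kg_a - kg_b) > tolerance:
            conflicts.append((key, round(kg_a, 2), round(kg_b, 2)))
    return conflicts


conflicts = find_conflicts(source_a, source_b)
```

Here P-1 agrees once the units are aligned, while P-2 is reported because the two sources genuinely disagree about it.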
Contact Us
https://www.rootfacts.com/services/data-integration-in-data-mining/
contact@rootfacts.com
Thank you!
