SlideShare a Scribd company logo
1 of 13
Download to read offline
Data Integration In
Data Mining
www.rootfacts.com
Key Points
Why Is Data Integration In Data Mining Important?
What are two major systems for data integration?
What are the Issues Of Data Integration in Data
Mining?
Why Is Data Integration In Data
Mining Important?
Data Integration is a data processing
technique that collects data from
different sources (such as data
cubes, multiple databases, and flat
files) and offers a unified view of the
data to the users.
Data integration in data mining
connects with issues such as duplicate
data, inconsistent data, old systems,
etc. Manual data integration can be
achieved through middleware and
applications.
What are two major systems for data integration?
Tight Coupling
Loose Coupling
There are primarily 2 major systems for data integration
which are as follows:
Tight Coupling
In this method, the data warehouse is
treated as an information recovery
feature. The process is known as ETL
which means Extraction,
Transformation, and Loading.
Loose Coupling
In this method, an interface is offered
that listens to a query from the user
and transforms it to the source
database and then sends the query
directly to the reference databases
and obtains a great result.
What are the Issues Of Data
Integration in Data Mining?
There are no problems during data
integration in data mining: Schema
Integration, Redundancy, Detection and
explanation of data value disputes.
Some redundancies can be caught with the help of correlation analysis.
1. Schema Integration - It integrates metadata from multiple
sources and the real-world entities are matched with the entity
identification problem.
2. Redundancy - An attribute may be duplicative or obtain
redundancy. When the attributes are inconsistent, they may appear
as duplicates in the resulting data set.
3. Detection and explanation of data value
disputes - This is the third critical issue in
data integration. Here the attribute values
collected from different sources may vary
for the exact real-world entity. An attribute
collected in a system may be registered at
a lower level of generalisation as
compared with the “same” characteristic
in another.
Contact Us
https://www.rootfacts.com/services/data-
integration-in-data-mining/
contact@rootfacts.com
Thank you!

More Related Content

More from SophiaKelly6

Robotic Process Automation Service.pdf
Robotic Process Automation Service.pdfRobotic Process Automation Service.pdf
Robotic Process Automation Service.pdfSophiaKelly6
 
Automation Consultant Services.pdf
Automation Consultant Services.pdfAutomation Consultant Services.pdf
Automation Consultant Services.pdfSophiaKelly6
 
IoT Solution In Education.pdf
IoT Solution In Education.pdfIoT Solution In Education.pdf
IoT Solution In Education.pdfSophiaKelly6
 
IoT Solution in Manufacturing.pptx
IoT Solution in Manufacturing.pptxIoT Solution in Manufacturing.pptx
IoT Solution in Manufacturing.pptxSophiaKelly6
 
IoT in Supply Chain.pdf
IoT in Supply Chain.pdfIoT in Supply Chain.pdf
IoT in Supply Chain.pdfSophiaKelly6
 
IoT Service in AWS.pdf
IoT Service in AWS.pdfIoT Service in AWS.pdf
IoT Service in AWS.pdfSophiaKelly6
 
IoT Service Company.pdf
IoT Service Company.pdfIoT Service Company.pdf
IoT Service Company.pdfSophiaKelly6
 
Big Data Solution in Agriculture.pdf
Big Data Solution in Agriculture.pdfBig Data Solution in Agriculture.pdf
Big Data Solution in Agriculture.pdfSophiaKelly6
 
Big data solution in marketing.pdf
Big data solution in marketing.pdfBig data solution in marketing.pdf
Big data solution in marketing.pdfSophiaKelly6
 
Data Integration In Data Warehouse.pdf
Data Integration In Data Warehouse.pdfData Integration In Data Warehouse.pdf
Data Integration In Data Warehouse.pdfSophiaKelly6
 
Data Integration In DBMS.pdf
Data Integration In DBMS.pdfData Integration In DBMS.pdf
Data Integration In DBMS.pdfSophiaKelly6
 
Data Integration Consultancy.pdf
Data Integration Consultancy.pdfData Integration Consultancy.pdf
Data Integration Consultancy.pdfSophiaKelly6
 
Data Science In Deep Learning.pdf
Data Science In Deep Learning.pdfData Science In Deep Learning.pdf
Data Science In Deep Learning.pdfSophiaKelly6
 
Data Science In Pharmaceutical.pdf
Data Science In Pharmaceutical.pdfData Science In Pharmaceutical.pdf
Data Science In Pharmaceutical.pdfSophiaKelly6
 
Data Science In Manufacturing.pdf
Data Science In Manufacturing.pdfData Science In Manufacturing.pdf
Data Science In Manufacturing.pdfSophiaKelly6
 
Data Science In Gaming.pdf
Data Science In Gaming.pdfData Science In Gaming.pdf
Data Science In Gaming.pdfSophiaKelly6
 
Data Science Services.pdf
Data Science Services.pdfData Science Services.pdf
Data Science Services.pdfSophiaKelly6
 

More from SophiaKelly6 (18)

Robotic Process Automation Service.pdf
Robotic Process Automation Service.pdfRobotic Process Automation Service.pdf
Robotic Process Automation Service.pdf
 
Automation Consultant Services.pdf
Automation Consultant Services.pdfAutomation Consultant Services.pdf
Automation Consultant Services.pdf
 
IoT Solution In Education.pdf
IoT Solution In Education.pdfIoT Solution In Education.pdf
IoT Solution In Education.pdf
 
IoT Solution in Manufacturing.pptx
IoT Solution in Manufacturing.pptxIoT Solution in Manufacturing.pptx
IoT Solution in Manufacturing.pptx
 
IoT in Supply Chain.pdf
IoT in Supply Chain.pdfIoT in Supply Chain.pdf
IoT in Supply Chain.pdf
 
IoT Service in AWS.pdf
IoT Service in AWS.pdfIoT Service in AWS.pdf
IoT Service in AWS.pdf
 
IoT Service Company.pdf
IoT Service Company.pdfIoT Service Company.pdf
IoT Service Company.pdf
 
Big Data Solution in Agriculture.pdf
Big Data Solution in Agriculture.pdfBig Data Solution in Agriculture.pdf
Big Data Solution in Agriculture.pdf
 
Big data solution in marketing.pdf
Big data solution in marketing.pdfBig data solution in marketing.pdf
Big data solution in marketing.pdf
 
Big Data.pdf
Big Data.pdfBig Data.pdf
Big Data.pdf
 
Data Integration In Data Warehouse.pdf
Data Integration In Data Warehouse.pdfData Integration In Data Warehouse.pdf
Data Integration In Data Warehouse.pdf
 
Data Integration In DBMS.pdf
Data Integration In DBMS.pdfData Integration In DBMS.pdf
Data Integration In DBMS.pdf
 
Data Integration Consultancy.pdf
Data Integration Consultancy.pdfData Integration Consultancy.pdf
Data Integration Consultancy.pdf
 
Data Science In Deep Learning.pdf
Data Science In Deep Learning.pdfData Science In Deep Learning.pdf
Data Science In Deep Learning.pdf
 
Data Science In Pharmaceutical.pdf
Data Science In Pharmaceutical.pdfData Science In Pharmaceutical.pdf
Data Science In Pharmaceutical.pdf
 
Data Science In Manufacturing.pdf
Data Science In Manufacturing.pdfData Science In Manufacturing.pdf
Data Science In Manufacturing.pdf
 
Data Science In Gaming.pdf
Data Science In Gaming.pdfData Science In Gaming.pdf
Data Science In Gaming.pdf
 
Data Science Services.pdf
Data Science Services.pdfData Science Services.pdf
Data Science Services.pdf
 

Data Integration In Data Mining.pdf

  • 1. Data Integration In Data Mining www.rootfacts.com
  • 2. Key Points Why Is Data Integration In Data Mining Important? What are two major systems for data integration? What are the Issues Of Data Integration in Data Mining?
  • 3. Why Is Data Integration In Data Mining Important? Data Integration is a data processing technique that collects data from different sources (such as data cubes, multiple databases, and flat files) and offers a unified view of the data to the users.
  • 4. Data integration in data mining connects with issues such as duplicate data, inconsistent data, old systems, etc. Manual data integration can be achieved through middleware and applications.
  • 5. What are two major systems for data integration? Tight Coupling Loose Coupling There are primarily 2 major systems for data integration which are as follows:
  • 6. Tight Coupling In this method, the data warehouse is treated as an information recovery feature. The process is known as ETL which means Extraction, Transformation, and Loading.
  • 7. Loose Coupling In this method, an interface is offered that listens to a query from the user and transforms it to the source database and then sends the query directly to the reference databases and obtains a great result.
  • 8. What are the Issues Of Data Integration in Data Mining? There are no problems during data integration in data mining: Schema Integration, Redundancy, Detection and explanation of data value disputes.
  • 9. Some redundancies can be caught with the help of correlation analysis.
  • 10. 1. Schema Integration - It integrates metadata from multiple sources and the real-world entities are matched with the entity identification problem. 2. Redundancy - An attribute may be duplicative or obtain redundancy. When the attributes are inconsistent, they may appear as duplicates in the resulting data set.
  • 11. 3. Detection and explanation of data value disputes - This is the third critical issue in data integration. Here the attribute values collected from different sources may vary for the exact real-world entity. An attribute collected in a system may be registered at a lower level of generalisation as compared with the “same” characteristic in another.