SlideShare a Scribd company logo
1 of 13
Download to read offline
Data Integration In
Data Mining
www.rootfacts.com
Key Points
Why Is Data Integration In Data Mining Important?
What are two major systems for data integration?
What are the Issues Of Data Integration in Data
Mining?
Why Is Data Integration In Data
Mining Important?
Data Integration is a data processing
technique that collects data from
different sources (such as data
cubes, multiple databases, and flat
files) and offers a unified view of the
data to the users.
Data integration in data mining
connects with issues such as duplicate
data, inconsistent data, old systems,
etc. Manual data integration can be
achieved through middleware and
applications.
What are two major systems for data integration?
Tight Coupling
Loose Coupling
There are primarily 2 major systems for data integration
which are as follows:
Tight Coupling
In this method, the data warehouse is
treated as an information recovery
feature. The process is known as ETL
which means Extraction,
Transformation, and Loading.
Loose Coupling
In this method, an interface is offered
that listens to a query from the user
and transforms it to the source
database and then sends the query
directly to the reference databases
and obtains a great result.
What are the Issues Of Data
Integration in Data Mining?
There are no problems during data
integration in data mining: Schema
Integration, Redundancy, Detection and
explanation of data value disputes.
Some redundancies can be caught with the help of correlation analysis.
1. Schema Integration - It integrates metadata from multiple
sources and the real-world entities are matched with the entity
identification problem.
2. Redundancy - An attribute may be duplicative or obtain
redundancy. When the attributes are inconsistent, they may appear
as duplicates in the resulting data set.
3. Detection and explanation of data value
disputes - This is the third critical issue in
data integration. Here the attribute values
collected from different sources may vary
for the exact real-world entity. An attribute
collected in a system may be registered at
a lower level of generalisation as
compared with the “same” characteristic
in another.
Contact Us
https://www.rootfacts.com/services/data-
integration-in-data-mining/
contact@rootfacts.com
Thank you!

More Related Content

Similar to Data Integration In Data Mining.pdf

Machine learning topics machine learning algorithm into three main parts.
Machine learning topics  machine learning algorithm into three main parts.Machine learning topics  machine learning algorithm into three main parts.
Machine learning topics machine learning algorithm into three main parts.DurgaDeviP2
 
Database Systems - introduction
Database Systems - introductionDatabase Systems - introduction
Database Systems - introductionJananath Banuka
 
Lecture 1&2(rdbms-ii)
Lecture 1&2(rdbms-ii)Lecture 1&2(rdbms-ii)
Lecture 1&2(rdbms-ii)Ravinder Kamboj
 
Types Of Database For Flat File Database
Types Of Database For Flat File DatabaseTypes Of Database For Flat File Database
Types Of Database For Flat File DatabaseChristina Valadez
 
Role of Data Cleaning in Data Warehouse
Role of Data Cleaning in Data WarehouseRole of Data Cleaning in Data Warehouse
Role of Data Cleaning in Data WarehouseRamakant Soni
 
Week 2 Characteristics & Benefits of a Database & Types of Data Models
Week 2 Characteristics & Benefits of a Database & Types of Data ModelsWeek 2 Characteristics & Benefits of a Database & Types of Data Models
Week 2 Characteristics & Benefits of a Database & Types of Data Modelsoudesign
 
TID Chapter 10 Introduction To Database
TID Chapter 10 Introduction To DatabaseTID Chapter 10 Introduction To Database
TID Chapter 10 Introduction To DatabaseWanBK Leo
 
CHAPTER-4_RELATIONAL-DATABASE.pptx
CHAPTER-4_RELATIONAL-DATABASE.pptxCHAPTER-4_RELATIONAL-DATABASE.pptx
CHAPTER-4_RELATIONAL-DATABASE.pptxRiaBago
 
Data Integration in Multi-sources Information Systems
Data Integration in Multi-sources Information SystemsData Integration in Multi-sources Information Systems
Data Integration in Multi-sources Information Systemsijceronline
 
Database systems assignment 1
Database systems   assignment 1Database systems   assignment 1
Database systems assignment 1Nelson Kimathi
 
Asif nosql
Asif nosqlAsif nosql
Asif nosqlAsif Ali
 

Similar to Data Integration In Data Mining.pdf (20)

Machine learning topics machine learning algorithm into three main parts.
Machine learning topics  machine learning algorithm into three main parts.Machine learning topics  machine learning algorithm into three main parts.
Machine learning topics machine learning algorithm into three main parts.
 
Database Systems - introduction
Database Systems - introductionDatabase Systems - introduction
Database Systems - introduction
 
Lecture 1&2(rdbms-ii)
Lecture 1&2(rdbms-ii)Lecture 1&2(rdbms-ii)
Lecture 1&2(rdbms-ii)
 
Types Of Database For Flat File Database
Types Of Database For Flat File DatabaseTypes Of Database For Flat File Database
Types Of Database For Flat File Database
 
Role of Data Cleaning in Data Warehouse
Role of Data Cleaning in Data WarehouseRole of Data Cleaning in Data Warehouse
Role of Data Cleaning in Data Warehouse
 
Week 2 Characteristics & Benefits of a Database & Types of Data Models
Week 2 Characteristics & Benefits of a Database & Types of Data ModelsWeek 2 Characteristics & Benefits of a Database & Types of Data Models
Week 2 Characteristics & Benefits of a Database & Types of Data Models
 
TID Chapter 10 Introduction To Database
TID Chapter 10 Introduction To DatabaseTID Chapter 10 Introduction To Database
TID Chapter 10 Introduction To Database
 
CHAPTER-4_RELATIONAL-DATABASE.pptx
CHAPTER-4_RELATIONAL-DATABASE.pptxCHAPTER-4_RELATIONAL-DATABASE.pptx
CHAPTER-4_RELATIONAL-DATABASE.pptx
 
Data Preparation.pptx
Data Preparation.pptxData Preparation.pptx
Data Preparation.pptx
 
Data Integration in Multi-sources Information Systems
Data Integration in Multi-sources Information SystemsData Integration in Multi-sources Information Systems
Data Integration in Multi-sources Information Systems
 
Database systems Handbook 4th dbms by Muhammad Sharif.pdf
Database systems Handbook 4th  dbms by Muhammad Sharif.pdfDatabase systems Handbook 4th  dbms by Muhammad Sharif.pdf
Database systems Handbook 4th dbms by Muhammad Sharif.pdf
 
Database systems Handbook 4th dbms by Muhammad Sharif.pdf
Database systems Handbook 4th  dbms by Muhammad Sharif.pdfDatabase systems Handbook 4th  dbms by Muhammad Sharif.pdf
Database systems Handbook 4th dbms by Muhammad Sharif.pdf
 
Database systems Handbook 4th dbms by Muhammad Sharif.pdf
Database systems Handbook 4th  dbms by Muhammad Sharif.pdfDatabase systems Handbook 4th  dbms by Muhammad Sharif.pdf
Database systems Handbook 4th dbms by Muhammad Sharif.pdf
 
database ppt(2)
database ppt(2)database ppt(2)
database ppt(2)
 
Introduction abstract
Introduction abstractIntroduction abstract
Introduction abstract
 
Database systems assignment 1
Database systems   assignment 1Database systems   assignment 1
Database systems assignment 1
 
GROPSIKS.pptx
GROPSIKS.pptxGROPSIKS.pptx
GROPSIKS.pptx
 
Database management systems
Database management systemsDatabase management systems
Database management systems
 
Asif nosql
Asif nosqlAsif nosql
Asif nosql
 
Data cleaning
Data cleaningData cleaning
Data cleaning
 

More from Maria Mathe

Big data solution in healthcare.pdf
Big data solution in healthcare.pdfBig data solution in healthcare.pdf
Big data solution in healthcare.pdfMaria Mathe
 
Big Data Solution in Finance.pdf
Big Data Solution in Finance.pdfBig Data Solution in Finance.pdf
Big Data Solution in Finance.pdfMaria Mathe
 
RPA service in Healthcare.pdf
RPA service in Healthcare.pdfRPA service in Healthcare.pdf
RPA service in Healthcare.pdfMaria Mathe
 
RPA service in Accounting.pdf
RPA service in Accounting.pdfRPA service in Accounting.pdf
RPA service in Accounting.pdfMaria Mathe
 
RPA Service in Airline.pdf
RPA Service in Airline.pdfRPA Service in Airline.pdf
RPA Service in Airline.pdfMaria Mathe
 
RPA service in Business.pdf
RPA service in Business.pdfRPA service in Business.pdf
RPA service in Business.pdfMaria Mathe
 
RPA Service in Auditing.pdf
RPA Service in Auditing.pdfRPA Service in Auditing.pdf
RPA Service in Auditing.pdfMaria Mathe
 
Data Visualization Service.pdf
Data Visualization Service.pdfData Visualization Service.pdf
Data Visualization Service.pdfMaria Mathe
 
Data Mining Company.pdf
Data Mining Company.pdfData Mining Company.pdf
Data Mining Company.pdfMaria Mathe
 
Robotic Process Automation Service in Banking.pdf
Robotic Process Automation Service in Banking.pdfRobotic Process Automation Service in Banking.pdf
Robotic Process Automation Service in Banking.pdfMaria Mathe
 
Robotic Process Automation.pdf
Robotic Process Automation.pdfRobotic Process Automation.pdf
Robotic Process Automation.pdfMaria Mathe
 
Automation Consultant.pdf
Automation Consultant.pdfAutomation Consultant.pdf
Automation Consultant.pdfMaria Mathe
 
IoT Solution in Agriculture.pdf
IoT Solution in Agriculture.pdfIoT Solution in Agriculture.pdf
IoT Solution in Agriculture.pdfMaria Mathe
 
IoT In Education.pptx
IoT In Education.pptxIoT In Education.pptx
IoT In Education.pptxMaria Mathe
 
IoT In Healthcare.pdf
IoT In Healthcare.pdfIoT In Healthcare.pdf
IoT In Healthcare.pdfMaria Mathe
 
IoT in Supply Chain.pdf
IoT in Supply Chain.pdfIoT in Supply Chain.pdf
IoT in Supply Chain.pdfMaria Mathe
 
IoT Solution in Manufacturing.pdf
IoT Solution in Manufacturing.pdfIoT Solution in Manufacturing.pdf
IoT Solution in Manufacturing.pdfMaria Mathe
 
IoT Service in AWS.pdf
IoT Service in AWS.pdfIoT Service in AWS.pdf
IoT Service in AWS.pdfMaria Mathe
 
IoT Service company.pdf
IoT Service company.pdfIoT Service company.pdf
IoT Service company.pdfMaria Mathe
 
Big Data Solution in Agriculture.pdf
Big Data Solution in Agriculture.pdfBig Data Solution in Agriculture.pdf
Big Data Solution in Agriculture.pdfMaria Mathe
 

More from Maria Mathe (20)

Big data solution in healthcare.pdf
Big data solution in healthcare.pdfBig data solution in healthcare.pdf
Big data solution in healthcare.pdf
 
Big Data Solution in Finance.pdf
Big Data Solution in Finance.pdfBig Data Solution in Finance.pdf
Big Data Solution in Finance.pdf
 
RPA service in Healthcare.pdf
RPA service in Healthcare.pdfRPA service in Healthcare.pdf
RPA service in Healthcare.pdf
 
RPA service in Accounting.pdf
RPA service in Accounting.pdfRPA service in Accounting.pdf
RPA service in Accounting.pdf
 
RPA Service in Airline.pdf
RPA Service in Airline.pdfRPA Service in Airline.pdf
RPA Service in Airline.pdf
 
RPA service in Business.pdf
RPA service in Business.pdfRPA service in Business.pdf
RPA service in Business.pdf
 
RPA Service in Auditing.pdf
RPA Service in Auditing.pdfRPA Service in Auditing.pdf
RPA Service in Auditing.pdf
 
Data Visualization Service.pdf
Data Visualization Service.pdfData Visualization Service.pdf
Data Visualization Service.pdf
 
Data Mining Company.pdf
Data Mining Company.pdfData Mining Company.pdf
Data Mining Company.pdf
 
Robotic Process Automation Service in Banking.pdf
Robotic Process Automation Service in Banking.pdfRobotic Process Automation Service in Banking.pdf
Robotic Process Automation Service in Banking.pdf
 
Robotic Process Automation.pdf
Robotic Process Automation.pdfRobotic Process Automation.pdf
Robotic Process Automation.pdf
 
Automation Consultant.pdf
Automation Consultant.pdfAutomation Consultant.pdf
Automation Consultant.pdf
 
IoT Solution in Agriculture.pdf
IoT Solution in Agriculture.pdfIoT Solution in Agriculture.pdf
IoT Solution in Agriculture.pdf
 
IoT In Education.pptx
IoT In Education.pptxIoT In Education.pptx
IoT In Education.pptx
 
IoT In Healthcare.pdf
IoT In Healthcare.pdfIoT In Healthcare.pdf
IoT In Healthcare.pdf
 
IoT in Supply Chain.pdf
IoT in Supply Chain.pdfIoT in Supply Chain.pdf
IoT in Supply Chain.pdf
 
IoT Solution in Manufacturing.pdf
IoT Solution in Manufacturing.pdfIoT Solution in Manufacturing.pdf
IoT Solution in Manufacturing.pdf
 
IoT Service in AWS.pdf
IoT Service in AWS.pdfIoT Service in AWS.pdf
IoT Service in AWS.pdf
 
IoT Service company.pdf
IoT Service company.pdfIoT Service company.pdf
IoT Service company.pdf
 
Big Data Solution in Agriculture.pdf
Big Data Solution in Agriculture.pdfBig Data Solution in Agriculture.pdf
Big Data Solution in Agriculture.pdf
 

Recently uploaded

The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 

Recently uploaded (20)

The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 

Data Integration In Data Mining.pdf

  • 1. Data Integration In Data Mining www.rootfacts.com
  • 2. Key Points Why Is Data Integration In Data Mining Important? What are two major systems for data integration? What are the Issues Of Data Integration in Data Mining?
  • 3. Why Is Data Integration In Data Mining Important? Data Integration is a data processing technique that collects data from different sources (such as data cubes, multiple databases, and flat files) and offers a unified view of the data to the users.
  • 4. Data integration in data mining connects with issues such as duplicate data, inconsistent data, old systems, etc. Manual data integration can be achieved through middleware and applications.
  • 5. What are two major systems for data integration? Tight Coupling Loose Coupling There are primarily 2 major systems for data integration which are as follows:
  • 6. Tight Coupling In this method, the data warehouse is treated as an information recovery feature. The process is known as ETL which means Extraction, Transformation, and Loading.
  • 7. Loose Coupling In this method, an interface is offered that listens to a query from the user and transforms it to the source database and then sends the query directly to the reference databases and obtains a great result.
  • 8. What are the Issues Of Data Integration in Data Mining? There are no problems during data integration in data mining: Schema Integration, Redundancy, Detection and explanation of data value disputes.
  • 9. Some redundancies can be caught with the help of correlation analysis.
  • 10. 1. Schema Integration - It integrates metadata from multiple sources and the real-world entities are matched with the entity identification problem. 2. Redundancy - An attribute may be duplicative or obtain redundancy. When the attributes are inconsistent, they may appear as duplicates in the resulting data set.
  • 11. 3. Detection and explanation of data value disputes - This is the third critical issue in data integration. Here the attribute values collected from different sources may vary for the exact real-world entity. An attribute collected in a system may be registered at a lower level of generalisation as compared with the “same” characteristic in another.