SlideShare a Scribd company logo
1 of 6
Download to read offline
A Comprehensive Study on Data Mining Process with
Distribution
Kethavarapu Uma Pavan Kumar1
umapawan.dwh@gmail.com--9014998455
Muppaneni Satyanvesh2
anvesh.chowdaryit123@gmail.com
1
M.Tech CSE Nalanda institute of Engineering and technology
Sattenapalli, Guntur (ANDHRA PRADESH)
2
B.Tech IT EVM College of Engineering and technology
Narsaraopet, Guntur (ANDHRA PRADESH)
ABSTRACT
In this paper we are going to present the concept of distributed
data mining and the advantages of combined architecture which involves the
concept of distribution as a backbone and the process of mining. To
demonstrate the Distributed Data Mining concept we are presenting
architecture with various levels of users, data, and networks. In this we also
have had the usage of DMQL (Distributed Mining Query Language) and
converter mechanism. After getting the required data we mentioned the
security layer in the architecture so as to oppose the malicious software or data.
The entire architecture depends on the usage of cloud computing which
involves the migration of the different kinds of network data mining process
KEYWORDS:-
Distribution, Mining, DMQL
(Distributed Mining Query Language),
Cloud Computing, Converter.
DISTRIBUTION INTRODUCTION:
The term distribution
playing a vital role while handling the
common data between varieties of
users. With distribution it is possible to
integrate different locations of the data
and also possible to broad cast the
common data to multiple flavors of the
users. Nowadays the distribution is
mostly used along with general
networks, databases, operating systems,
datawarehouse and in datamining.
In case of networks the
individual networks are just sending and
receiving the information. But if we
adopt the property of distribution for
this network that will be converted as
distributed systems. A distributed
system allows the user to process the
applications under a single system
image. Because of this it is possible to
interact with any of the system in the
communication network and the user is
having the feel of common usage of the
systems event though he is interacting
with different systems.
In case of databases a
single system is taking the responsibility
of all the systems those are connected to
it, such system is known as server. But
the problem with this type of centralized
databases is lack of reliability and
availability of the data. If the property
of the distribution is integrated with
normal databases then it possible to
overcome the lack of reliability and
availability of the data. The distributed
databases now playing an important role
in the management of the data such as
transactions, concurrency control and
query optimization.
The generic operating
system may be integrated with the
property of distribution so as to get
benefits of versatile process
management, memory management and
maintaining communication with the
help of distributed algorithms.
Data warehouse is a large
collection of data repository which
provides the uniform data to the user by
integrating various formats of the data
from different sources such as XML,
ERP, FLAT files (XL worksheets,
COBAL files, documents). The
distribution of datawarehouse yields
more benefits to the users who are
scattered geographically as a result it is
possible to get the required data by
different users located at different
continents.
DATA MINING:
Mining is a process of
searching for the required data from
larger databases. This is also known as
knowledge discovery in databases.
Because of this mining only it is
possible to get the most interestingness
patterns form the repositories. The
repository may be a database or else
may be datawarehouse. Getting only the
required data by avoiding the
unnecessary data is typical aspect in
case of searching for knowledge. The
mining process allows the user to
minimize the complexity the search
process in such a way that by providing
number of algorithms. Accessing the
required data in the fastest manner is the
most striking advantage of mining
process. Searching for the required data
in databases and datawarehouse is done
by online analytical processing (OLAP).
The same thing may be done through
datamining also.
The usage of OLAP
requires the functional knowledge to the
user. For example the company CEO
may want to access the previous season
sales then the CEO must know about the
season information and as well as the
products that which he wants to get the
sales information. OLAP having some
limitations while generating reports
with respect to the user requirements.
The limitations is, the user need to have
the idea about the context of the query.
This limitation can be solved by the
usage of datamining process.
COMBINING MINING PROCESS
WITH DISTRIBUTION:-
The distribution
concentrates on sharing the data from
various contents and it is possible to
replicate the same data to multiple
systems located at various client places
so the process of remote login, remote
accessing and remote computation are
done through this distribution. The
backbone of distribution is basically
LAN (Local Area Network) and it may
ranges from LAN to cloud computing.
The mining process is meant for
grabbing the most related data with
respect to the user given query. The
basic mining process involves a limited
environment such as a single server
with single or multiple databases or data
warehouses. If we implement the
mining process in case of distribution it
will give the distributed search patterns
and those patterns are more valuable
and most useful when compared with
normal mining process.
In the reference architecture the
data is gathered from various sources
such as databases, data warehouses,
FLAT files and ERP’s and that data is
avoid to distributed network
environment. The user may vary from
normal end user to MD’s, CEO’s of the
company. The user initially send his
request to the DMQL (Data Mining
Query Language) interface which is
very much similar to SQL (Structured
Query Language). So a converter is
required so as to serve the purpose of
different kinds of users. The converter
just transforms the user given query
according to the requirements of the
mining process and after that it will
search for all the available sources so as
to get the most interestingness pattern.
Cloud is representation of different
topologies of the network and it will
facilitate the integration these many
kinds of network so as to filter the
interestingness pattern from the
available source. The architecture
involving the security mechanism so as
to oppose the malicious program, code,
software’s or data into the system by the
means of antispyware and other
mechanism.
COMPLEXITIES:-
• Mixing up of mining process with
distribution is some what difficult
process.
• Gathering the required data from
unlimited source is also a tough task.
• Conversion of various formats is
also complex.
• Integrating different kinds of data
and presenting that data into the user
requested format is also not that
much of easy.
REQUIREMENTS
• Cloud computing
• Data bases or data warehouses or
Flat files or ERP’s
• Converters
• DMQL interface
• Users of various levels
•
TYPES OF MINING PROCESS
The process of mining
supports various formats of the
search process which are involving
text mining, web mining, web
content mining web usage mining,
spatial mining, multimedia data
mining depending on the
requirement it is possible to use the
corresponding mining process.
ADDED BENEFITS IN MINING
PROCESS IN CASE OF
DISTRIBUTION
The main benefit of mixing up the
distribution with respect to mining process
is the availability and reliability of the data
as a single system image
REAL TIME USAGE OF DATA
MINING WITH DISTRIBUTION
In general mining process such
as search engines like Google and other
browsing techniques while using
internet and other public networks
generally the mining process involves
the distribution by default, so almost all
general net based search process follows
the distribution mechanism so as to
access the required pattern from bulk
source.
PROS AND CONS
PROS
• The main purpose of Distributed
data mining is grabbing the
interestingness patterns from
variety of sources which is not
possible in normal mining
process
• Serving various levels of users is
possible through this distributed
data mining.
• Working with various kinds of
data formats is also possible
CONS
• Its architecture it is not possible
to locate where the exact data is.
• Unnecessary data is crept into
the user requested query.
• Sometimes it may not possible
to the converter to transform the
source data into user required
format.
CONCLUSION:-
Finally we conclude that the
discussion regarding with distributed
data mining provides the user to get the
interestingness patterns from the both
sources and it also provides handling of
various formats of the data and as well
integration of those data into a common
format. We also conclude that the
architecture basically provides the
required information by combining with
various sources and to process this there
may have some complexities and other
problem for getting the exact data.
REFERNCES
1. Data Mining concepts and
principles paulraj ponnaiah
2. Data warehousing and mining by
Alex berson
3. Data Warehousing techniques by
Michel han and kamber
4. www.altavista.com
5. Gathered from user groups and
blogs

More Related Content

Similar to A Comprehensive Study On Data Mining Process With Distribution

A Secure and Dynamic Multi-keyword Ranked Search Scheme over Encrypted Cloud ...
A Secure and Dynamic Multi-keyword Ranked Search Scheme over Encrypted Cloud ...A Secure and Dynamic Multi-keyword Ranked Search Scheme over Encrypted Cloud ...
A Secure and Dynamic Multi-keyword Ranked Search Scheme over Encrypted Cloud ...1crore projects
 
A Reconfigurable Component-Based Problem Solving Environment
A Reconfigurable Component-Based Problem Solving EnvironmentA Reconfigurable Component-Based Problem Solving Environment
A Reconfigurable Component-Based Problem Solving EnvironmentSheila Sinclair
 
11700220085_DDBMS.pptx
11700220085_DDBMS.pptx11700220085_DDBMS.pptx
11700220085_DDBMS.pptxSouvikRoy8783
 
Implementation of Agent Based Dynamic Distributed Service
Implementation of Agent Based Dynamic Distributed ServiceImplementation of Agent Based Dynamic Distributed Service
Implementation of Agent Based Dynamic Distributed ServiceCSCJournals
 
Ieee projects-2014-bulk-ieee-projects-2015-title-list-for-me-be-mphil-final-y...
Ieee projects-2014-bulk-ieee-projects-2015-title-list-for-me-be-mphil-final-y...Ieee projects-2014-bulk-ieee-projects-2015-title-list-for-me-be-mphil-final-y...
Ieee projects-2014-bulk-ieee-projects-2015-title-list-for-me-be-mphil-final-y...birdsking
 
DISTRIBUTED SYSTEM.docx
DISTRIBUTED SYSTEM.docxDISTRIBUTED SYSTEM.docx
DISTRIBUTED SYSTEM.docxvinaypandey170
 
Assisting Migration and Evolution of Relational Legacy Databases
Assisting Migration and Evolution of Relational Legacy DatabasesAssisting Migration and Evolution of Relational Legacy Databases
Assisting Migration and Evolution of Relational Legacy DatabasesGihan Wikramanayake
 
Distributed database management system
Distributed database management systemDistributed database management system
Distributed database management systemVinay D. Patel
 
A Survey of File Replication Techniques In Grid Systems
A Survey of File Replication Techniques In Grid SystemsA Survey of File Replication Techniques In Grid Systems
A Survey of File Replication Techniques In Grid SystemsEditor IJCATR
 
Ijcatr04071003
Ijcatr04071003Ijcatr04071003
Ijcatr04071003Editor IJCATR
 
A Survey of File Replication Techniques In Grid Systems
A Survey of File Replication Techniques In Grid SystemsA Survey of File Replication Techniques In Grid Systems
A Survey of File Replication Techniques In Grid SystemsEditor IJCATR
 
Scalable and adaptive data replica placement for geo distributed cloud storages
Scalable and adaptive data replica placement for geo distributed cloud storagesScalable and adaptive data replica placement for geo distributed cloud storages
Scalable and adaptive data replica placement for geo distributed cloud storagesVenkat Projects
 
data-mesh_whitepaper_dec2021.pdf
data-mesh_whitepaper_dec2021.pdfdata-mesh_whitepaper_dec2021.pdf
data-mesh_whitepaper_dec2021.pdfssuser18927d
 
Ranking Efficient Attribute Based Keyword Searching Over Encrypted Data Along...
Ranking Efficient Attribute Based Keyword Searching Over Encrypted Data Along...Ranking Efficient Attribute Based Keyword Searching Over Encrypted Data Along...
Ranking Efficient Attribute Based Keyword Searching Over Encrypted Data Along...IRJET Journal
 
Iaetsd enhancement of performance and security in bigdata processing
Iaetsd enhancement of performance and security in bigdata processingIaetsd enhancement of performance and security in bigdata processing
Iaetsd enhancement of performance and security in bigdata processingIaetsd Iaetsd
 
An asynchronous replication model to improve data available into a heterogene...
An asynchronous replication model to improve data available into a heterogene...An asynchronous replication model to improve data available into a heterogene...
An asynchronous replication model to improve data available into a heterogene...Alexander Decker
 
Fragmentation of Data in Large-Scale System For Ideal Performance and Security
Fragmentation of Data in Large-Scale System For Ideal Performance and SecurityFragmentation of Data in Large-Scale System For Ideal Performance and Security
Fragmentation of Data in Large-Scale System For Ideal Performance and SecurityEditor IJCATR
 

Similar to A Comprehensive Study On Data Mining Process With Distribution (20)

A Secure and Dynamic Multi-keyword Ranked Search Scheme over Encrypted Cloud ...
A Secure and Dynamic Multi-keyword Ranked Search Scheme over Encrypted Cloud ...A Secure and Dynamic Multi-keyword Ranked Search Scheme over Encrypted Cloud ...
A Secure and Dynamic Multi-keyword Ranked Search Scheme over Encrypted Cloud ...
 
A Reconfigurable Component-Based Problem Solving Environment
A Reconfigurable Component-Based Problem Solving EnvironmentA Reconfigurable Component-Based Problem Solving Environment
A Reconfigurable Component-Based Problem Solving Environment
 
11700220085_DDBMS.pptx
11700220085_DDBMS.pptx11700220085_DDBMS.pptx
11700220085_DDBMS.pptx
 
Implementation of Agent Based Dynamic Distributed Service
Implementation of Agent Based Dynamic Distributed ServiceImplementation of Agent Based Dynamic Distributed Service
Implementation of Agent Based Dynamic Distributed Service
 
Ieee projects-2014-bulk-ieee-projects-2015-title-list-for-me-be-mphil-final-y...
Ieee projects-2014-bulk-ieee-projects-2015-title-list-for-me-be-mphil-final-y...Ieee projects-2014-bulk-ieee-projects-2015-title-list-for-me-be-mphil-final-y...
Ieee projects-2014-bulk-ieee-projects-2015-title-list-for-me-be-mphil-final-y...
 
DISTRIBUTED SYSTEM.docx
DISTRIBUTED SYSTEM.docxDISTRIBUTED SYSTEM.docx
DISTRIBUTED SYSTEM.docx
 
50620130101004
5062013010100450620130101004
50620130101004
 
Grid Presentation
Grid PresentationGrid Presentation
Grid Presentation
 
Assisting Migration and Evolution of Relational Legacy Databases
Assisting Migration and Evolution of Relational Legacy DatabasesAssisting Migration and Evolution of Relational Legacy Databases
Assisting Migration and Evolution of Relational Legacy Databases
 
Distributed database management system
Distributed database management systemDistributed database management system
Distributed database management system
 
A Survey of File Replication Techniques In Grid Systems
A Survey of File Replication Techniques In Grid SystemsA Survey of File Replication Techniques In Grid Systems
A Survey of File Replication Techniques In Grid Systems
 
Ijcatr04071003
Ijcatr04071003Ijcatr04071003
Ijcatr04071003
 
A Survey of File Replication Techniques In Grid Systems
A Survey of File Replication Techniques In Grid SystemsA Survey of File Replication Techniques In Grid Systems
A Survey of File Replication Techniques In Grid Systems
 
Data Mesh
Data MeshData Mesh
Data Mesh
 
Scalable and adaptive data replica placement for geo distributed cloud storages
Scalable and adaptive data replica placement for geo distributed cloud storagesScalable and adaptive data replica placement for geo distributed cloud storages
Scalable and adaptive data replica placement for geo distributed cloud storages
 
data-mesh_whitepaper_dec2021.pdf
data-mesh_whitepaper_dec2021.pdfdata-mesh_whitepaper_dec2021.pdf
data-mesh_whitepaper_dec2021.pdf
 
Ranking Efficient Attribute Based Keyword Searching Over Encrypted Data Along...
Ranking Efficient Attribute Based Keyword Searching Over Encrypted Data Along...Ranking Efficient Attribute Based Keyword Searching Over Encrypted Data Along...
Ranking Efficient Attribute Based Keyword Searching Over Encrypted Data Along...
 
Iaetsd enhancement of performance and security in bigdata processing
Iaetsd enhancement of performance and security in bigdata processingIaetsd enhancement of performance and security in bigdata processing
Iaetsd enhancement of performance and security in bigdata processing
 
An asynchronous replication model to improve data available into a heterogene...
An asynchronous replication model to improve data available into a heterogene...An asynchronous replication model to improve data available into a heterogene...
An asynchronous replication model to improve data available into a heterogene...
 
Fragmentation of Data in Large-Scale System For Ideal Performance and Security
Fragmentation of Data in Large-Scale System For Ideal Performance and SecurityFragmentation of Data in Large-Scale System For Ideal Performance and Security
Fragmentation of Data in Large-Scale System For Ideal Performance and Security
 

More from Lori Mitchell

Expository Essay Reflection Paper In. Online assignment writing service.
Expository Essay Reflection Paper In. Online assignment writing service.Expository Essay Reflection Paper In. Online assignment writing service.
Expository Essay Reflection Paper In. Online assignment writing service.Lori Mitchell
 
Handwriting Without Tears Paper WITH Picture Han
Handwriting Without Tears Paper WITH Picture HanHandwriting Without Tears Paper WITH Picture Han
Handwriting Without Tears Paper WITH Picture HanLori Mitchell
 
My Mother Childhood Essay. Essay On My Mot
My Mother Childhood Essay. Essay On My MotMy Mother Childhood Essay. Essay On My Mot
My Mother Childhood Essay. Essay On My MotLori Mitchell
 
Stephen King Quote If You Want To Be A Writer, You Must Do Two Things
Stephen King Quote If You Want To Be A Writer, You Must Do Two ThingsStephen King Quote If You Want To Be A Writer, You Must Do Two Things
Stephen King Quote If You Want To Be A Writer, You Must Do Two ThingsLori Mitchell
 
500 Word Essay - Example, Length And Writing Tips At
500 Word Essay - Example, Length And Writing Tips At500 Word Essay - Example, Length And Writing Tips At
500 Word Essay - Example, Length And Writing Tips AtLori Mitchell
 
Pin On Educational Purposes. Online assignment writing service.
Pin On Educational Purposes. Online assignment writing service.Pin On Educational Purposes. Online assignment writing service.
Pin On Educational Purposes. Online assignment writing service.Lori Mitchell
 
English Essay Form 1 - JerryldOneal. Online assignment writing service.
English Essay Form 1 - JerryldOneal. Online assignment writing service.English Essay Form 1 - JerryldOneal. Online assignment writing service.
English Essay Form 1 - JerryldOneal. Online assignment writing service.Lori Mitchell
 
Nurse Practitioner Personal Statement Sample That Can
Nurse Practitioner Personal Statement Sample That CanNurse Practitioner Personal Statement Sample That Can
Nurse Practitioner Personal Statement Sample That CanLori Mitchell
 
As An English Teacher, Writin. Online assignment writing service.
As An English Teacher, Writin. Online assignment writing service.As An English Teacher, Writin. Online assignment writing service.
As An English Teacher, Writin. Online assignment writing service.Lori Mitchell
 
How To Write An Good Literature Essays Online
How To Write An Good Literature Essays OnlineHow To Write An Good Literature Essays Online
How To Write An Good Literature Essays OnlineLori Mitchell
 
EDEXCEL Imaginative Writing Questions Teachin
EDEXCEL Imaginative Writing Questions TeachinEDEXCEL Imaginative Writing Questions Teachin
EDEXCEL Imaginative Writing Questions TeachinLori Mitchell
 
College Essay Essay In High School. Online assignment writing service.
College Essay Essay In High School. Online assignment writing service.College Essay Essay In High School. Online assignment writing service.
College Essay Essay In High School. Online assignment writing service.Lori Mitchell
 
Cheap Essay Writing Services - Avail Best Essay Help AuthorST
Cheap Essay Writing Services - Avail Best Essay Help AuthorSTCheap Essay Writing Services - Avail Best Essay Help AuthorST
Cheap Essay Writing Services - Avail Best Essay Help AuthorSTLori Mitchell
 
Scientific Research Paper Format Template - Writinght
Scientific Research Paper Format Template - WritinghtScientific Research Paper Format Template - Writinght
Scientific Research Paper Format Template - WritinghtLori Mitchell
 
Tips For Writing A Research Paper Research4Life
Tips For Writing A Research Paper Research4LifeTips For Writing A Research Paper Research4Life
Tips For Writing A Research Paper Research4LifeLori Mitchell
 
Printable Birthday Stationary Pics BirthdayStationary
Printable Birthday Stationary Pics BirthdayStationaryPrintable Birthday Stationary Pics BirthdayStationary
Printable Birthday Stationary Pics BirthdayStationaryLori Mitchell
 
Samples Of Persuasive Essays For High School St
Samples Of Persuasive Essays For High School StSamples Of Persuasive Essays For High School St
Samples Of Persuasive Essays For High School StLori Mitchell
 
The Elements Of Writing A Song Songwriting Basics,
The Elements Of Writing A Song Songwriting Basics,The Elements Of Writing A Song Songwriting Basics,
The Elements Of Writing A Song Songwriting Basics,Lori Mitchell
 
If They Give You Lined Paper, Write Sideways. By Daniel Quinn
If They Give You Lined Paper, Write Sideways. By Daniel QuinnIf They Give You Lined Paper, Write Sideways. By Daniel Quinn
If They Give You Lined Paper, Write Sideways. By Daniel QuinnLori Mitchell
 
Robot Writing Paper - 3 Styles By Pink Posy Paperi
Robot Writing Paper - 3 Styles By Pink Posy PaperiRobot Writing Paper - 3 Styles By Pink Posy Paperi
Robot Writing Paper - 3 Styles By Pink Posy PaperiLori Mitchell
 

More from Lori Mitchell (20)

Expository Essay Reflection Paper In. Online assignment writing service.
Expository Essay Reflection Paper In. Online assignment writing service.Expository Essay Reflection Paper In. Online assignment writing service.
Expository Essay Reflection Paper In. Online assignment writing service.
 
Handwriting Without Tears Paper WITH Picture Han
Handwriting Without Tears Paper WITH Picture HanHandwriting Without Tears Paper WITH Picture Han
Handwriting Without Tears Paper WITH Picture Han
 
My Mother Childhood Essay. Essay On My Mot
My Mother Childhood Essay. Essay On My MotMy Mother Childhood Essay. Essay On My Mot
My Mother Childhood Essay. Essay On My Mot
 
Stephen King Quote If You Want To Be A Writer, You Must Do Two Things
Stephen King Quote If You Want To Be A Writer, You Must Do Two ThingsStephen King Quote If You Want To Be A Writer, You Must Do Two Things
Stephen King Quote If You Want To Be A Writer, You Must Do Two Things
 
500 Word Essay - Example, Length And Writing Tips At
500 Word Essay - Example, Length And Writing Tips At500 Word Essay - Example, Length And Writing Tips At
500 Word Essay - Example, Length And Writing Tips At
 
Pin On Educational Purposes. Online assignment writing service.
Pin On Educational Purposes. Online assignment writing service.Pin On Educational Purposes. Online assignment writing service.
Pin On Educational Purposes. Online assignment writing service.
 
English Essay Form 1 - JerryldOneal. Online assignment writing service.
English Essay Form 1 - JerryldOneal. Online assignment writing service.English Essay Form 1 - JerryldOneal. Online assignment writing service.
English Essay Form 1 - JerryldOneal. Online assignment writing service.
 
Nurse Practitioner Personal Statement Sample That Can
Nurse Practitioner Personal Statement Sample That CanNurse Practitioner Personal Statement Sample That Can
Nurse Practitioner Personal Statement Sample That Can
 
As An English Teacher, Writin. Online assignment writing service.
As An English Teacher, Writin. Online assignment writing service.As An English Teacher, Writin. Online assignment writing service.
As An English Teacher, Writin. Online assignment writing service.
 
How To Write An Good Literature Essays Online
How To Write An Good Literature Essays OnlineHow To Write An Good Literature Essays Online
How To Write An Good Literature Essays Online
 
EDEXCEL Imaginative Writing Questions Teachin
EDEXCEL Imaginative Writing Questions TeachinEDEXCEL Imaginative Writing Questions Teachin
EDEXCEL Imaginative Writing Questions Teachin
 
College Essay Essay In High School. Online assignment writing service.
College Essay Essay In High School. Online assignment writing service.College Essay Essay In High School. Online assignment writing service.
College Essay Essay In High School. Online assignment writing service.
 
Cheap Essay Writing Services - Avail Best Essay Help AuthorST
Cheap Essay Writing Services - Avail Best Essay Help AuthorSTCheap Essay Writing Services - Avail Best Essay Help AuthorST
Cheap Essay Writing Services - Avail Best Essay Help AuthorST
 
Scientific Research Paper Format Template - Writinght
Scientific Research Paper Format Template - WritinghtScientific Research Paper Format Template - Writinght
Scientific Research Paper Format Template - Writinght
 
Tips For Writing A Research Paper Research4Life
Tips For Writing A Research Paper Research4LifeTips For Writing A Research Paper Research4Life
Tips For Writing A Research Paper Research4Life
 
Printable Birthday Stationary Pics BirthdayStationary
Printable Birthday Stationary Pics BirthdayStationaryPrintable Birthday Stationary Pics BirthdayStationary
Printable Birthday Stationary Pics BirthdayStationary
 
Samples Of Persuasive Essays For High School St
Samples Of Persuasive Essays For High School StSamples Of Persuasive Essays For High School St
Samples Of Persuasive Essays For High School St
 
The Elements Of Writing A Song Songwriting Basics,
The Elements Of Writing A Song Songwriting Basics,The Elements Of Writing A Song Songwriting Basics,
The Elements Of Writing A Song Songwriting Basics,
 
If They Give You Lined Paper, Write Sideways. By Daniel Quinn
If They Give You Lined Paper, Write Sideways. By Daniel QuinnIf They Give You Lined Paper, Write Sideways. By Daniel Quinn
If They Give You Lined Paper, Write Sideways. By Daniel Quinn
 
Robot Writing Paper - 3 Styles By Pink Posy Paperi
Robot Writing Paper - 3 Styles By Pink Posy PaperiRobot Writing Paper - 3 Styles By Pink Posy Paperi
Robot Writing Paper - 3 Styles By Pink Posy Paperi
 

Recently uploaded

Types of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxTypes of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxEyham Joco
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementmkooblal
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentInMediaRes1
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfMahmoud M. Sallam
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxsocialsciencegdgrohi
 
CELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxCELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxJiesonDelaCerna
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceSamikshaHamane
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxAvyJaneVismanos
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...M56BOOKSTORE PRODUCT/SERVICE
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 

Recently uploaded (20)

Types of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxTypes of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptx
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of management
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media Component
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdf
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
 
CELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxCELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptx
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptx
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 

A Comprehensive Study On Data Mining Process With Distribution

  • 1. A Comprehensive Study on Data Mining Process with Distribution Kethavarapu Uma Pavan Kumar1 umapawan.dwh@gmail.com--9014998455 Muppaneni Satyanvesh2 anvesh.chowdaryit123@gmail.com 1 M.Tech CSE Nalanda institute of Engineering and technology Sattenapalli, Guntur (ANDHRA PRADESH) 2 B.Tech IT EVM College of Engineering and technology Narsaraopet, Guntur (ANDHRA PRADESH) ABSTRACT In this paper we are going to present the concept of distributed data mining and the advantages of combined architecture which involves the concept of distribution as a backbone and the process of mining. To demonstrate the Distributed Data Mining concept we are presenting architecture with various levels of users, data, and networks. In this we also have had the usage of DMQL (Distributed Mining Query Language) and converter mechanism. After getting the required data we mentioned the security layer in the architecture so as to oppose the malicious software or data. The entire architecture depends on the usage of cloud computing which involves the migration of the different kinds of network data mining process
  • 2. KEYWORDS:- Distribution, Mining, DMQL (Distributed Mining Query Language), Cloud Computing, Converter. DISTRIBUTION INTRODUCTION: The term distribution playing a vital role while handling the common data between varieties of users. With distribution it is possible to integrate different locations of the data and also possible to broad cast the common data to multiple flavors of the users. Nowadays the distribution is mostly used along with general networks, databases, operating systems, datawarehouse and in datamining. In case of networks the individual networks are just sending and receiving the information. But if we adopt the property of distribution for this network that will be converted as distributed systems. A distributed system allows the user to process the applications under a single system image. Because of this it is possible to interact with any of the system in the communication network and the user is having the feel of common usage of the systems event though he is interacting with different systems. In case of databases a single system is taking the responsibility of all the systems those are connected to it, such system is known as server. But the problem with this type of centralized databases is lack of reliability and availability of the data. If the property of the distribution is integrated with normal databases then it possible to overcome the lack of reliability and availability of the data. The distributed databases now playing an important role in the management of the data such as transactions, concurrency control and query optimization. The generic operating system may be integrated with the property of distribution so as to get benefits of versatile process management, memory management and maintaining communication with the help of distributed algorithms. Data warehouse is a large collection of data repository which provides the uniform data to the user by integrating various formats of the data from different sources such as XML, ERP, FLAT files (XL worksheets, COBAL files, documents). The distribution of datawarehouse yields more benefits to the users who are scattered geographically as a result it is possible to get the required data by different users located at different continents.
  • 3. DATA MINING: Mining is a process of searching for the required data from larger databases. This is also known as knowledge discovery in databases. Because of this mining only it is possible to get the most interestingness patterns form the repositories. The repository may be a database or else may be datawarehouse. Getting only the required data by avoiding the unnecessary data is typical aspect in case of searching for knowledge. The mining process allows the user to minimize the complexity the search process in such a way that by providing number of algorithms. Accessing the required data in the fastest manner is the most striking advantage of mining process. Searching for the required data in databases and datawarehouse is done by online analytical processing (OLAP). The same thing may be done through datamining also. The usage of OLAP requires the functional knowledge to the user. For example the company CEO may want to access the previous season sales then the CEO must know about the season information and as well as the products that which he wants to get the sales information. OLAP having some limitations while generating reports with respect to the user requirements. The limitations is, the user need to have the idea about the context of the query. This limitation can be solved by the usage of datamining process. COMBINING MINING PROCESS WITH DISTRIBUTION:- The distribution concentrates on sharing the data from various contents and it is possible to replicate the same data to multiple systems located at various client places so the process of remote login, remote accessing and remote computation are done through this distribution. The backbone of distribution is basically LAN (Local Area Network) and it may ranges from LAN to cloud computing. The mining process is meant for grabbing the most related data with respect to the user given query. The basic mining process involves a limited environment such as a single server with single or multiple databases or data warehouses. If we implement the mining process in case of distribution it will give the distributed search patterns and those patterns are more valuable and most useful when compared with normal mining process.
  • 4.
  • 5. In the reference architecture the data is gathered from various sources such as databases, data warehouses, FLAT files and ERP’s and that data is avoid to distributed network environment. The user may vary from normal end user to MD’s, CEO’s of the company. The user initially send his request to the DMQL (Data Mining Query Language) interface which is very much similar to SQL (Structured Query Language). So a converter is required so as to serve the purpose of different kinds of users. The converter just transforms the user given query according to the requirements of the mining process and after that it will search for all the available sources so as to get the most interestingness pattern. Cloud is representation of different topologies of the network and it will facilitate the integration these many kinds of network so as to filter the interestingness pattern from the available source. The architecture involving the security mechanism so as to oppose the malicious program, code, software’s or data into the system by the means of antispyware and other mechanism. COMPLEXITIES:- • Mixing up of mining process with distribution is some what difficult process. • Gathering the required data from unlimited source is also a tough task. • Conversion of various formats is also complex. • Integrating different kinds of data and presenting that data into the user requested format is also not that much of easy. REQUIREMENTS • Cloud computing • Data bases or data warehouses or Flat files or ERP’s • Converters • DMQL interface • Users of various levels • TYPES OF MINING PROCESS The process of mining supports various formats of the search process which are involving text mining, web mining, web content mining web usage mining, spatial mining, multimedia data mining depending on the requirement it is possible to use the corresponding mining process. ADDED BENEFITS IN MINING PROCESS IN CASE OF DISTRIBUTION The main benefit of mixing up the distribution with respect to mining process is the availability and reliability of the data as a single system image REAL TIME USAGE OF DATA MINING WITH DISTRIBUTION In general mining process such as search engines like Google and other browsing techniques while using internet and other public networks generally the mining process involves the distribution by default, so almost all
  • 6. general net based search process follows the distribution mechanism so as to access the required pattern from bulk source. PROS AND CONS PROS • The main purpose of Distributed data mining is grabbing the interestingness patterns from variety of sources which is not possible in normal mining process • Serving various levels of users is possible through this distributed data mining. • Working with various kinds of data formats is also possible CONS • Its architecture it is not possible to locate where the exact data is. • Unnecessary data is crept into the user requested query. • Sometimes it may not possible to the converter to transform the source data into user required format. CONCLUSION:- Finally we conclude that the discussion regarding with distributed data mining provides the user to get the interestingness patterns from the both sources and it also provides handling of various formats of the data and as well integration of those data into a common format. We also conclude that the architecture basically provides the required information by combining with various sources and to process this there may have some complexities and other problem for getting the exact data. REFERNCES 1. Data Mining concepts and principles paulraj ponnaiah 2. Data warehousing and mining by Alex berson 3. Data Warehousing techniques by Michel han and kamber 4. www.altavista.com 5. Gathered from user groups and blogs