SlideShare a Scribd company logo
1 of 5
Download to read offline
IOSR Journal of Computer Engineering (IOSR-JCE)
e-ISSN: 2278-0661,p-ISSN: 2278-8727, Volume 18, Issue 3, Ver. IV (May-Jun. 2016), PP 13-17
www.iosrjournals.org
DOI: 10.9790/0661-1803041317 www.iosrjournals.org 13 | Page
Migrate and Map: A Framework to Access Data from Mysql,
Mongodb or Hbase Using Mysql Queries
Piyush Nikam1
, Tushar Patil2
, Gayatri Hungund3
, Ankit Pagar4
,
Aditi Talegaonkar5
, Ms. Soudamini Pawar6
1,2,3,4,5,6
(Department Of Computer Engineering, D. Y. P. C. O. E., Savitribai Phule Pune University, India)
Abstract: Due to ever-increasing amount of data, scalability factor of the databases becomes a major
constraint. Moreover, traditional relational databases fix the user’s perspective to view data in tabular format.
Parallel databases emerged to be one of the solutions to tackle the scalability constraint which was later
replaced by more scalable and efficient solution called NoSQL databases. By using the advantages of NoSQL
we proposed a framework to migrate data from MySQL to NoSQL databases that is HBase and MongoDB. The
framework will be used to map the MySQL query to HBase and MongoDB query format and fetch the data
faster by introducing a decision maker that selects the database from which data can be fetched at a relatively
faster rate, thus elevating the system performance.
Keywords: HBase, NoSQL, MapReduce, MongoDB, MySQL.
I. Introduction
In the early years accessing Relational Database Management System (RDBMS) for data manipulation
was the most popular way as it followed the traditional Atomicity – Consistency – Isolation – Durability (ACID)
properties and provides support for storing substantial amount of data. Another advantage of having RDBMS is
referential integrity which is a property of data which, when satisfied, requires every value of one attribute
(column) of a relation (table) to exist as a value of another attribute in a different table. Also, advantage of using
traditional RDBMS is having fine grain locking mechanism. But considering the scenario where the data is
changing dynamically, data manipulation and handling becomes a difficult task when it comes to usage of
RDBMS. NoSQL on the other hand, is a more efficient and scalable solution as compared to RDBMS. NoSQL
basically provides an advantage of being schema less for processing of structured and unstructured data. It also
provides MapReduce facility and robust operational tools. Considering all these advantages of using NoSQL
this paper proposes a hybrid framework that migrates all the data stored in relational databases such as MySQL
into a corresponding NoSQL database format. It also uses popular NoSQL databases such as MongoDB and
HBase. The framework will firstly migrate the data stored in MySQL database to MongoDB and HBase.
Further, the MySQL queries entered by user are mapped to MongoDB and HBase formatand datais retrieved
from the MySQL or NoSQL database selected by the decision maker.
The Technologies Used Are Highlighted As Follows:
1.1. Mongodb:
MongoDB is one of the newest databases introduced by 10gen. It is open source and document type
database in which the data is stored in the form of key and value pairs. MongoDB is a schema less database and
it provides support to advanced features such as MapReduce and aggregation tools. The biggest advantage of
using MongoDB is that it can store files of any size without complicating memory stack.
1.2. Hbase:
HBase is a NoSQL database that provides real time access to big data and efficient big data analytics. It
is flexible as it can combine a wide variety of different structures and schemas. HBase is popularly used due to
the speed of accessing the data. HBase is used for real time data analysis as it is user friendly and also most
importantly, HBase is fault tolerant.
1.3. Javafx:
JavaFX is a set of media packages that is used to create rich client application acrossvarious platforms.
It is a library which is written as Java API. JavaFX applicationcan use APIs from any Java libraries and can use
these Java libraries to access nativesystem capabilities. The JavaFX APIs are fully integrated features of Java SE
RuntimeEnvironment and Java Development Kit (JDK). It is cross-platform compatible.FXML or JavaFX
Scene Builder are used to interactively design graphical user interface.Existing Swing applications can be
Migrate And Map: A Framework To Access Data From Mysql, Mongodb Or Hbase Using Mysql..
DOI: 10.9790/0661-1803041317 www.iosrjournals.org 14 | Page
updated with JavaFX features. JavaFXprovides all major UI controls that are required to develop rich
applications.
II. Need To Switch From SQL To Nosql
Scalability of NoSQL is higher than that of relational database systems. Large amount of data is stored
by horizontal scaling. NoSQL databases can handle data more efficiently as compared to relational database
system. This is done by storing the data in distributed manner. NoSQL databases do not have fixed structure to
store data. As there is no structure, different types of data can be stored. Caching facility is provided by NoSQL
databases to increase the retrieval speed and efficiency of the system.
III. General mapping techniques
IV. The entered MySQL query is converted to parse tree and then this tree is mapped to NoSQL interface.
V. Entered MySQL query is checked by query graph model. S-H and H-H rules are applied by query rewriter
to query generated model. Directed acyclic graph is obtained from this query based model with the help of
Query compiler. Plan evaluator is used to select optimum Directed Acyclic Graph (DAG) from obtained
DAGs.
VI. Using object relational mapping a data mapping module can be created which maps MySQL queries to
NoSQL queries.
IV. Related work
Wu Chung Chung et. al. [1] have proposed a framework Jackhare that uses Druid API for
implementation of scanner, parser and code generator. JDBC is used for connecting to databases. After entering
the query, WHERE clause in the query is verified first and then, the query is checked for names of the tables and
rows for which the WHERE clause is used. Finally, the data fields are sent as output in the SELECT statement.
Hadoop Map Reduce jobs are used, in which, output of the Map phase is Key-value pairs and Reducer combines
the output according to keys.
Julian Rith et. al. [2] have proposed a middleware implemented using c# and is based on .NET
framework for query mapping from MySQL to NoSQL in which the queries are divided into two categories
which are Data Definition Language(DDL) statements and CRUD operations. The query given as input is
converted into intermediate parse tree representation and further, this parse tree is mapped to NoSQL format.
ANTLR is used for building parse tree.
Xu et. al. [3] proposed a tool which increases probability of optimization of mapping rules using query
rewriting to equivalent HiveQL queries. It uses cost estimation to select best method from the different HiveQL
query versions which is based on data between mapper and reducer in network. Hive optimizer looks for
chances to merge jobs after which QMapper estimator estimates cost of MapReduce DAGs. Further, it selects
the method with minimum cost by measuring intermediate data between Maps and Reduces.
Rupali Arora et. al. [4] have proposed an algorithm for migrating data from RDBMS to MongoDB by
establishing the connection among the databases after which the database is selected by the user from existing
relational databases and corresponding collection is created in MongoDB. The table having its embedded
document created in MongoDB is chosen from the selected list of tables. Pentoho tool is used to load data into
MongoDB.
Leonardo Rocha et.al. [5] have proposed a NoSQL layer containing a mapping module to which user
enters a query as input in MySQL format to retrieve data from NoSQL databases. Mapping is done according to
types of statements. Each statement has separate java code which maps MySQL query to MongoDB. The results
are stored and converted to string format. The mapped query is executed on MongoDB and result is forwarded
to application.
V. Comparative study
Table1: Comparative study between MySQL, MongoDB and HBase
Sr. No. Factor MySQL MongoDB HBase
1. Schema Structured Dynamic Dynamic
2. Scalability Vertical Horizontal Horizontal
3. Transactions Complex Simple Simple
4. Data Locality No Yes Yes
5. MapReduce Support No Yes Yes
6. Replication No Master-Slave replication Scalable replication
VI. Architecture
To utilize the advantage of horizontal scaling also known as sharding, and processing BigData, the
Migrate and Map framework consists of a front end GUI and back end to migrate data stored in existing MySQL
Migrate And Map: A Framework To Access Data From Mysql, Mongodb Or Hbase Using Mysql..
DOI: 10.9790/0661-1803041317 www.iosrjournals.org 15 | Page
database to MongoDB and HBase and a decision maker. The user enters a query in MySQL format and this
query is provided as input to decision maker which is programmed to predict the database on which the query
can be executed more efficiently which in turn will increase the overall efficiency of the system. Once the
database is selected by decision maker, the query is mapped to corresponding database query format and the
data is retrieved from the specific NoSQL database.
a.Gui
The GUI is designed using JavaFX which is a set of graphics packages that provides various classes
which in turn provide various methods written in native Java for creation of rich looking graphical user
interfaces. The WebKitHTML technology provided by JavaFX can be used for embedding web pages within
JavaFX application. A designer can code in FXML or use JavaFX Scene Builder to interactively design the
GUI. FXML is an XML based directive markup language for constructing a JavaFX application user interface.
Figure1: architecture diagram – mapping module
Figure 2: architecture diagram – migrate module
b. Jdbc
Java database connectivity is achieved using JDBC driver which is open source application
programming interface and is used to establish connectivity amongst databases to perform data manipulation
operations. JDBC allows the existence of multiple implementations which can be used by same application.
JDBC is used in the proposed framework to establish connectivity between Java and MySQL. It is also used to
fetch the table and database metadata which is required for converting tables from MySQL format to
intermediate form which is csv format.
c. Mongomigrate
The MongoMigrate module is programmed to auto-detect all the databases in MySQL and
corresponding tables that are to be migrated to MongoDB. Database metadata is extracted to make an ArrayList
Migrate And Map: A Framework To Access Data From Mysql, Mongodb Or Hbase Using Mysql..
DOI: 10.9790/0661-1803041317 www.iosrjournals.org 16 | Page
of available databases which are selected one at a time. Tables are enlisted and added to ArrayList of tables. The
selected table is converted to csv file which is then migrated to MongoDB one table at a time. One table in
MySQL corresponds to one collection in MongoDB. The implementation uses multithreading for concurrent
execution of making csv files and migration of data to MongoDB.
Figure 3: migration from MySQL to MongoDB
d. Mongo Map
In migrate module (MongoDB) we have migrated data from MySQL to MongoDB. The tables
migrated from MySQL are stored as a collection in MongoDB. In this module query (Select) fired in MySQL
format is tokenized. As there can be many parameters to select query, “,” is used as a delimiter. The query
mapper takes these tokens as a input and retrieves the data from MongoDB.
Exaple1: select * from student;
As the mapper will encounter a “*”, it means that there is nothing to tokenize and hence it will display the data
stored entire collection.
Ecample2: select name, rollno from student;
In this query when the mapper will encounter a “,” query gets tokenized. Thus only name and rollno. will be
retrieved.
Example3: select name, rollno from student where rollno>2;
It will display name and rollno whose rollno is greater than 2. Here not only “,” is checked but also operators
from “where clause” is checked. When the operator “=”, “>” ,”<”, “>=”, “<=” is found it tokenized the where
clause and gives the result accordingly.
e. Hbasemigrate
The HBase migrate module migrates data from MySQL to HBase. This module is also programmed to
auto detect and migrate all the available MySQL databases and corresponding MySQL tables. To import data
into HBase, the data is first converted into csv format and then imported to HBase.
Figure 4: migration from MySQL to HBase
f. Hbasemap
This module maps the query from MySQL to HBase query format. The query is first tokenized then
according to different tokens the query is mapped to corresponding HBase query format and fired on HBase to
retrieve the data from HBase
VII. Observations
Table 2: Query mapping time
Sr. No. Query Type Mapping time (ms)
1. select * from <table name> 13
2. select <column name(s)> from <table name> 16
3. select * from <table name> where <field = value> 50
Migrate And Map: A Framework To Access Data From Mysql, Mongodb Or Hbase Using Mysql..
DOI: 10.9790/0661-1803041317 www.iosrjournals.org 17 | Page
4. select <column name(s)> from <table name> where <field = value> 21
5. select * from <table name> where <field > value> 45
6. select <column name(s)> from <table name> where <field > value> 38
7. select * from <table name> where <field < value> 24
8. select <column name(s)> from <table name> where <field < value> 37
9. select * form <table name> where <field >= value> 39
10. select <column name(s)> from <table name> where <field >= value> 41
11. select * from <table name> where <field <= value> 28
12. select <column name(s)> from <table name> where <field <= value> 35
13. select * from <table name> where <field1 operator value1 and field2
operator value2>
21
14. select <column name(s)> from <table name> where <field1operator
value1 and field2 operator value2>
32
15. select * from <table name> where <field1 operator value1 or field2
operator value2>
27
16. select <column name(s)> from <table name> where <field1 operator
value1 or field2 operator value2>
27
17. select avg(column) from <table name> 1
18. select avg(column name) from <table name> group by <column name> 1
19. select max(column) from <table name> 1
20. select max(column name) from <table name> group by <column name> 1
21. select min(column) from <table name> 1
22. select min(column name) from <table name> group by <column name> 1
23. select count(column) from <table name> 2
The Table 2 denotes the time taken by the proposed framework to map the MySQL queries to
MongoDB and HBase query format.
The time required for mapping is in milliseconds which denotes that in comparison to executing
NoSQL queries, MySQL queries can be mapped to NoSQL query format and fired on NoSQL databases. The
proposed framework is advantageous as the query execution time is more by just fraction of milliseconds which
encourages the use of NoSQL databases and on the other hand, reduces the learning curve of user.
Another advantage of using the proposed framework is that the query mapping time remains constant
irrespective of size of data.
VIII. Conclusion
The paper proposes a hybrid architecture in which the data is migrated from traditional relational
database that is MySQL, to MongoDB and HBase databases to address the scalability issue in existing relational
database systems. The framework processes and serves the queries fired by the user in MySQL format to
retrieve required data from MySQL or NoSQL databases. The data retrieval is done according to the decision
making mechanism handled by the decision maker.
References
[1]. Chung, W. C., Lin, H. P., Chen, S. C., Jiang, M. F., & Chung, Y. C. (2014). JackHare: a framework for SQL to NoSQL translation
using MapReduce. Automated Software Engineering, 21(4), 489-508;
[2]. Rith, J., Lehmayr, P. S., & Meyer-Wegener, K. (2014, March). Speaking in toungues: SQL access to NoSQL systems. In
Proceedings of the 29th Annual ACM Symposium on Applied Computing (pp. 855-857). ACM.
[3]. Xu, Y., & Hu, S. (2013, May). Qmapper: a tool for sql optimization on hive using query rewriting. In Proceedings of the 22nd
international conference on World Wide Web companion (pp. 211-212). International World Wide Web Conferences Steering
Committee.
[4]. Arora, R., & Aggarwal, R. R. (2013). An Algorithm for Transformation of Data from MySQL to NoSQL (MongoDB). International
Journal of Advanced Studies in Computer Science and Engineering (IJASCSE), 2(1).
[5]. Rocha, L., Vale, F., Cirilo, E., Barbosa, D., & Mourao, F. (2015). A Framework for migrating Relational Datasets to NoSQL.
Procedia Computer Science, 51, 2593-2602.

More Related Content

What's hot

Survey of Parallel Data Processing in Context with MapReduce
Survey of Parallel Data Processing in Context with MapReduce Survey of Parallel Data Processing in Context with MapReduce
Survey of Parallel Data Processing in Context with MapReduce cscpconf
 
Unstructured Datasets Analysis: Thesaurus Model
Unstructured Datasets Analysis: Thesaurus ModelUnstructured Datasets Analysis: Thesaurus Model
Unstructured Datasets Analysis: Thesaurus ModelEditor IJCATR
 
An efficient data mining framework on hadoop using java persistence api
An efficient data mining framework on hadoop using java persistence apiAn efficient data mining framework on hadoop using java persistence api
An efficient data mining framework on hadoop using java persistence apiJoão Gabriel Lima
 
Performance Comparison of HBase and Cassandra
Performance Comparison of HBase and CassandraPerformance Comparison of HBase and Cassandra
Performance Comparison of HBase and CassandraYashIyengar
 
Big Data Analysis and Its Scheduling Policy – Hadoop
Big Data Analysis and Its Scheduling Policy – HadoopBig Data Analysis and Its Scheduling Policy – Hadoop
Big Data Analysis and Its Scheduling Policy – HadoopIOSR Journals
 
Sparkr sigmod
Sparkr sigmodSparkr sigmod
Sparkr sigmodwaqasm86
 
MAP REDUCE BASED ON CLOAK DHT DATA REPLICATION EVALUATION
MAP REDUCE BASED ON CLOAK DHT DATA REPLICATION EVALUATIONMAP REDUCE BASED ON CLOAK DHT DATA REPLICATION EVALUATION
MAP REDUCE BASED ON CLOAK DHT DATA REPLICATION EVALUATIONijdms
 
Hadoop mapreduce and yarn frame work- unit5
Hadoop mapreduce and yarn frame work-  unit5Hadoop mapreduce and yarn frame work-  unit5
Hadoop mapreduce and yarn frame work- unit5RojaT4
 
Performance Benchmarking of Key-Value Store NoSQL Databases
Performance Benchmarking of Key-Value Store NoSQL Databases Performance Benchmarking of Key-Value Store NoSQL Databases
Performance Benchmarking of Key-Value Store NoSQL Databases IJECEIAES
 
Enhancing Big Data Analysis by using Map-reduce Technique
Enhancing Big Data Analysis by using Map-reduce TechniqueEnhancing Big Data Analysis by using Map-reduce Technique
Enhancing Big Data Analysis by using Map-reduce TechniquejournalBEEI
 
Big data technology unit 3
Big data technology unit 3Big data technology unit 3
Big data technology unit 3RojaT4
 
Trends in Computer Science and Information Technology
Trends in Computer Science and Information TechnologyTrends in Computer Science and Information Technology
Trends in Computer Science and Information Technologypeertechzpublication
 
Information processing architectures
Information processing architecturesInformation processing architectures
Information processing architecturesRaji Gogulapati
 

What's hot (17)

Survey of Parallel Data Processing in Context with MapReduce
Survey of Parallel Data Processing in Context with MapReduce Survey of Parallel Data Processing in Context with MapReduce
Survey of Parallel Data Processing in Context with MapReduce
 
Unstructured Datasets Analysis: Thesaurus Model
Unstructured Datasets Analysis: Thesaurus ModelUnstructured Datasets Analysis: Thesaurus Model
Unstructured Datasets Analysis: Thesaurus Model
 
An efficient data mining framework on hadoop using java persistence api
An efficient data mining framework on hadoop using java persistence apiAn efficient data mining framework on hadoop using java persistence api
An efficient data mining framework on hadoop using java persistence api
 
Performance Comparison of HBase and Cassandra
Performance Comparison of HBase and CassandraPerformance Comparison of HBase and Cassandra
Performance Comparison of HBase and Cassandra
 
banian
banianbanian
banian
 
Big Data Analysis and Its Scheduling Policy – Hadoop
Big Data Analysis and Its Scheduling Policy – HadoopBig Data Analysis and Its Scheduling Policy – Hadoop
Big Data Analysis and Its Scheduling Policy – Hadoop
 
Sparkr sigmod
Sparkr sigmodSparkr sigmod
Sparkr sigmod
 
MAP REDUCE BASED ON CLOAK DHT DATA REPLICATION EVALUATION
MAP REDUCE BASED ON CLOAK DHT DATA REPLICATION EVALUATIONMAP REDUCE BASED ON CLOAK DHT DATA REPLICATION EVALUATION
MAP REDUCE BASED ON CLOAK DHT DATA REPLICATION EVALUATION
 
Hadoop mapreduce and yarn frame work- unit5
Hadoop mapreduce and yarn frame work-  unit5Hadoop mapreduce and yarn frame work-  unit5
Hadoop mapreduce and yarn frame work- unit5
 
Performance Benchmarking of Key-Value Store NoSQL Databases
Performance Benchmarking of Key-Value Store NoSQL Databases Performance Benchmarking of Key-Value Store NoSQL Databases
Performance Benchmarking of Key-Value Store NoSQL Databases
 
Enhancing Big Data Analysis by using Map-reduce Technique
Enhancing Big Data Analysis by using Map-reduce TechniqueEnhancing Big Data Analysis by using Map-reduce Technique
Enhancing Big Data Analysis by using Map-reduce Technique
 
GridGain & Hadoop: Differences & Synergies
GridGain & Hadoop: Differences & SynergiesGridGain & Hadoop: Differences & Synergies
GridGain & Hadoop: Differences & Synergies
 
Big data technology unit 3
Big data technology unit 3Big data technology unit 3
Big data technology unit 3
 
Trends in Computer Science and Information Technology
Trends in Computer Science and Information TechnologyTrends in Computer Science and Information Technology
Trends in Computer Science and Information Technology
 
the rising no sql technology
the rising no sql technologythe rising no sql technology
the rising no sql technology
 
Couch db
Couch dbCouch db
Couch db
 
Information processing architectures
Information processing architecturesInformation processing architectures
Information processing architectures
 

Viewers also liked

Research Presentation (MSc. Business Informatics) Short Version v
Research Presentation (MSc. Business Informatics) Short Version vResearch Presentation (MSc. Business Informatics) Short Version v
Research Presentation (MSc. Business Informatics) Short Version vRobert Bradbury
 
Madcom analyzes the need for broadband in eastern pa
Madcom analyzes the need for broadband in eastern paMadcom analyzes the need for broadband in eastern pa
Madcom analyzes the need for broadband in eastern paRich Frank
 
PNAS-2016-Gielen-1606927113
PNAS-2016-Gielen-1606927113PNAS-2016-Gielen-1606927113
PNAS-2016-Gielen-1606927113Raphaelle Hours
 
Annual report draft-v2
Annual report draft-v2Annual report draft-v2
Annual report draft-v2Anil Krishna
 
Psicologadelmexicanoeneltrabajo 150301191810-conversion-gate02
Psicologadelmexicanoeneltrabajo 150301191810-conversion-gate02Psicologadelmexicanoeneltrabajo 150301191810-conversion-gate02
Psicologadelmexicanoeneltrabajo 150301191810-conversion-gate02Marioly Ch
 
Performance Improvement of IEEE 802.22 WRAN Physical Layer
Performance Improvement of IEEE 802.22 WRAN Physical LayerPerformance Improvement of IEEE 802.22 WRAN Physical Layer
Performance Improvement of IEEE 802.22 WRAN Physical LayerIOSR Journals
 
LordJeshuainheritancemay2016
LordJeshuainheritancemay2016LordJeshuainheritancemay2016
LordJeshuainheritancemay2016Lord Jesus Christ
 
Urus APIP (Angka Pengenal Importir Produsen)
Urus APIP (Angka Pengenal Importir Produsen) Urus APIP (Angka Pengenal Importir Produsen)
Urus APIP (Angka Pengenal Importir Produsen) legalservice
 

Viewers also liked (20)

E1803062731
E1803062731E1803062731
E1803062731
 
Research Presentation (MSc. Business Informatics) Short Version v
Research Presentation (MSc. Business Informatics) Short Version vResearch Presentation (MSc. Business Informatics) Short Version v
Research Presentation (MSc. Business Informatics) Short Version v
 
Como registrarse
Como registrarseComo registrarse
Como registrarse
 
Madcom analyzes the need for broadband in eastern pa
Madcom analyzes the need for broadband in eastern paMadcom analyzes the need for broadband in eastern pa
Madcom analyzes the need for broadband in eastern pa
 
A010430107
A010430107A010430107
A010430107
 
PNAS-2016-Gielen-1606927113
PNAS-2016-Gielen-1606927113PNAS-2016-Gielen-1606927113
PNAS-2016-Gielen-1606927113
 
Annual report draft-v2
Annual report draft-v2Annual report draft-v2
Annual report draft-v2
 
F0813137
F0813137F0813137
F0813137
 
Psicologadelmexicanoeneltrabajo 150301191810-conversion-gate02
Psicologadelmexicanoeneltrabajo 150301191810-conversion-gate02Psicologadelmexicanoeneltrabajo 150301191810-conversion-gate02
Psicologadelmexicanoeneltrabajo 150301191810-conversion-gate02
 
G017234450
G017234450G017234450
G017234450
 
B010241114
B010241114B010241114
B010241114
 
G010234652
G010234652G010234652
G010234652
 
Performance Improvement of IEEE 802.22 WRAN Physical Layer
Performance Improvement of IEEE 802.22 WRAN Physical LayerPerformance Improvement of IEEE 802.22 WRAN Physical Layer
Performance Improvement of IEEE 802.22 WRAN Physical Layer
 
D010312230
D010312230D010312230
D010312230
 
LordJeshuainheritancemay2016
LordJeshuainheritancemay2016LordJeshuainheritancemay2016
LordJeshuainheritancemay2016
 
Urus APIP (Angka Pengenal Importir Produsen)
Urus APIP (Angka Pengenal Importir Produsen) Urus APIP (Angka Pengenal Importir Produsen)
Urus APIP (Angka Pengenal Importir Produsen)
 
E1303021829
E1303021829E1303021829
E1303021829
 
O0181397100
O0181397100O0181397100
O0181397100
 
D012531923
D012531923D012531923
D012531923
 
F011113741
F011113741F011113741
F011113741
 

Similar to C1803041317

Database Integrated Analytics using R InitialExperiences wi
Database Integrated Analytics using R InitialExperiences wiDatabase Integrated Analytics using R InitialExperiences wi
Database Integrated Analytics using R InitialExperiences wiOllieShoresna
 
A survey on data mining and analysis in hadoop and mongo db
A survey on data mining and analysis in hadoop and mongo dbA survey on data mining and analysis in hadoop and mongo db
A survey on data mining and analysis in hadoop and mongo dbAlexander Decker
 
Quantitative Performance Evaluation of Cloud-Based MySQL (Relational) Vs. Mon...
Quantitative Performance Evaluation of Cloud-Based MySQL (Relational) Vs. Mon...Quantitative Performance Evaluation of Cloud-Based MySQL (Relational) Vs. Mon...
Quantitative Performance Evaluation of Cloud-Based MySQL (Relational) Vs. Mon...Darshan Gorasiya
 
An Algorithm to synchronize the local database with cloud Database
An Algorithm to synchronize the local database with cloud DatabaseAn Algorithm to synchronize the local database with cloud Database
An Algorithm to synchronize the local database with cloud DatabaseAM Publications
 
NoSQL Databases Introduction - UTN 2013
NoSQL Databases Introduction - UTN 2013NoSQL Databases Introduction - UTN 2013
NoSQL Databases Introduction - UTN 2013Facundo Farias
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)ijceronline
 
MONGODB VS MYSQL: A COMPARATIVE STUDY OF PERFORMANCE IN SUPER MARKET MANAGEME...
MONGODB VS MYSQL: A COMPARATIVE STUDY OF PERFORMANCE IN SUPER MARKET MANAGEME...MONGODB VS MYSQL: A COMPARATIVE STUDY OF PERFORMANCE IN SUPER MARKET MANAGEME...
MONGODB VS MYSQL: A COMPARATIVE STUDY OF PERFORMANCE IN SUPER MARKET MANAGEME...ijcsity
 
MONGODB VS MYSQL: A COMPARATIVE STUDY OF PERFORMANCE IN SUPER MARKET MANAGEME...
MONGODB VS MYSQL: A COMPARATIVE STUDY OF PERFORMANCE IN SUPER MARKET MANAGEME...MONGODB VS MYSQL: A COMPARATIVE STUDY OF PERFORMANCE IN SUPER MARKET MANAGEME...
MONGODB VS MYSQL: A COMPARATIVE STUDY OF PERFORMANCE IN SUPER MARKET MANAGEME...ijcsity
 
MONGODB VS MYSQL: A COMPARATIVE STUDY OF PERFORMANCE IN SUPER MARKET MANAGEME...
MONGODB VS MYSQL: A COMPARATIVE STUDY OF PERFORMANCE IN SUPER MARKET MANAGEME...MONGODB VS MYSQL: A COMPARATIVE STUDY OF PERFORMANCE IN SUPER MARKET MANAGEME...
MONGODB VS MYSQL: A COMPARATIVE STUDY OF PERFORMANCE IN SUPER MARKET MANAGEME...ijcsity
 
Bridging the gap between the semantic web and big data: answering SPARQL que...
Bridging the gap between the semantic web and big data:  answering SPARQL que...Bridging the gap between the semantic web and big data:  answering SPARQL que...
Bridging the gap between the semantic web and big data: answering SPARQL que...IJECEIAES
 
A Survey on Data Mapping Strategy for data stored in the storage cloud 111
A Survey on Data Mapping Strategy for data stored in the storage cloud  111A Survey on Data Mapping Strategy for data stored in the storage cloud  111
A Survey on Data Mapping Strategy for data stored in the storage cloud 111NavNeet KuMar
 
Big_SQL_3.0_Whitepaper
Big_SQL_3.0_WhitepaperBig_SQL_3.0_Whitepaper
Big_SQL_3.0_WhitepaperScott Gray
 
Hadoop Integration with Microstrategy
Hadoop Integration with Microstrategy Hadoop Integration with Microstrategy
Hadoop Integration with Microstrategy snehal parikh
 
Vol 10 No 1 - February 2014
Vol 10 No 1 - February 2014Vol 10 No 1 - February 2014
Vol 10 No 1 - February 2014ijcsbi
 
Beginner's guide to Mongodb and NoSQL
Beginner's guide to Mongodb and NoSQL  Beginner's guide to Mongodb and NoSQL
Beginner's guide to Mongodb and NoSQL Maulin Shah
 
MAP-REDUCE IMPLEMENTATIONS: SURVEY AND PERFORMANCE COMPARISON
MAP-REDUCE IMPLEMENTATIONS: SURVEY AND PERFORMANCE COMPARISONMAP-REDUCE IMPLEMENTATIONS: SURVEY AND PERFORMANCE COMPARISON
MAP-REDUCE IMPLEMENTATIONS: SURVEY AND PERFORMANCE COMPARISONijcsit
 
Design architecture based on web
Design architecture based on webDesign architecture based on web
Design architecture based on webcsandit
 

Similar to C1803041317 (20)

Database Integrated Analytics using R InitialExperiences wi
Database Integrated Analytics using R InitialExperiences wiDatabase Integrated Analytics using R InitialExperiences wi
Database Integrated Analytics using R InitialExperiences wi
 
A survey on data mining and analysis in hadoop and mongo db
A survey on data mining and analysis in hadoop and mongo dbA survey on data mining and analysis in hadoop and mongo db
A survey on data mining and analysis in hadoop and mongo db
 
Quantitative Performance Evaluation of Cloud-Based MySQL (Relational) Vs. Mon...
Quantitative Performance Evaluation of Cloud-Based MySQL (Relational) Vs. Mon...Quantitative Performance Evaluation of Cloud-Based MySQL (Relational) Vs. Mon...
Quantitative Performance Evaluation of Cloud-Based MySQL (Relational) Vs. Mon...
 
An Algorithm to synchronize the local database with cloud Database
An Algorithm to synchronize the local database with cloud DatabaseAn Algorithm to synchronize the local database with cloud Database
An Algorithm to synchronize the local database with cloud Database
 
NoSQL Databases Introduction - UTN 2013
NoSQL Databases Introduction - UTN 2013NoSQL Databases Introduction - UTN 2013
NoSQL Databases Introduction - UTN 2013
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
MONGODB VS MYSQL: A COMPARATIVE STUDY OF PERFORMANCE IN SUPER MARKET MANAGEME...
MONGODB VS MYSQL: A COMPARATIVE STUDY OF PERFORMANCE IN SUPER MARKET MANAGEME...MONGODB VS MYSQL: A COMPARATIVE STUDY OF PERFORMANCE IN SUPER MARKET MANAGEME...
MONGODB VS MYSQL: A COMPARATIVE STUDY OF PERFORMANCE IN SUPER MARKET MANAGEME...
 
MONGODB VS MYSQL: A COMPARATIVE STUDY OF PERFORMANCE IN SUPER MARKET MANAGEME...
MONGODB VS MYSQL: A COMPARATIVE STUDY OF PERFORMANCE IN SUPER MARKET MANAGEME...MONGODB VS MYSQL: A COMPARATIVE STUDY OF PERFORMANCE IN SUPER MARKET MANAGEME...
MONGODB VS MYSQL: A COMPARATIVE STUDY OF PERFORMANCE IN SUPER MARKET MANAGEME...
 
MONGODB VS MYSQL: A COMPARATIVE STUDY OF PERFORMANCE IN SUPER MARKET MANAGEME...
MONGODB VS MYSQL: A COMPARATIVE STUDY OF PERFORMANCE IN SUPER MARKET MANAGEME...MONGODB VS MYSQL: A COMPARATIVE STUDY OF PERFORMANCE IN SUPER MARKET MANAGEME...
MONGODB VS MYSQL: A COMPARATIVE STUDY OF PERFORMANCE IN SUPER MARKET MANAGEME...
 
Bridging the gap between the semantic web and big data: answering SPARQL que...
Bridging the gap between the semantic web and big data:  answering SPARQL que...Bridging the gap between the semantic web and big data:  answering SPARQL que...
Bridging the gap between the semantic web and big data: answering SPARQL que...
 
Big Data & Hadoop
Big Data & HadoopBig Data & Hadoop
Big Data & Hadoop
 
A Survey on Data Mapping Strategy for data stored in the storage cloud 111
A Survey on Data Mapping Strategy for data stored in the storage cloud  111A Survey on Data Mapping Strategy for data stored in the storage cloud  111
A Survey on Data Mapping Strategy for data stored in the storage cloud 111
 
Big_SQL_3.0_Whitepaper
Big_SQL_3.0_WhitepaperBig_SQL_3.0_Whitepaper
Big_SQL_3.0_Whitepaper
 
Hadoop Integration with Microstrategy
Hadoop Integration with Microstrategy Hadoop Integration with Microstrategy
Hadoop Integration with Microstrategy
 
NoSQL Basics and MongDB
NoSQL Basics and  MongDBNoSQL Basics and  MongDB
NoSQL Basics and MongDB
 
Vol 10 No 1 - February 2014
Vol 10 No 1 - February 2014Vol 10 No 1 - February 2014
Vol 10 No 1 - February 2014
 
Beginner's guide to Mongodb and NoSQL
Beginner's guide to Mongodb and NoSQL  Beginner's guide to Mongodb and NoSQL
Beginner's guide to Mongodb and NoSQL
 
Big data analytics
Big data analyticsBig data analytics
Big data analytics
 
MAP-REDUCE IMPLEMENTATIONS: SURVEY AND PERFORMANCE COMPARISON
MAP-REDUCE IMPLEMENTATIONS: SURVEY AND PERFORMANCE COMPARISONMAP-REDUCE IMPLEMENTATIONS: SURVEY AND PERFORMANCE COMPARISON
MAP-REDUCE IMPLEMENTATIONS: SURVEY AND PERFORMANCE COMPARISON
 
Design architecture based on web
Design architecture based on webDesign architecture based on web
Design architecture based on web
 

More from IOSR Journals (20)

A011140104
A011140104A011140104
A011140104
 
M0111397100
M0111397100M0111397100
M0111397100
 
L011138596
L011138596L011138596
L011138596
 
K011138084
K011138084K011138084
K011138084
 
J011137479
J011137479J011137479
J011137479
 
I011136673
I011136673I011136673
I011136673
 
G011134454
G011134454G011134454
G011134454
 
H011135565
H011135565H011135565
H011135565
 
F011134043
F011134043F011134043
F011134043
 
E011133639
E011133639E011133639
E011133639
 
D011132635
D011132635D011132635
D011132635
 
C011131925
C011131925C011131925
C011131925
 
B011130918
B011130918B011130918
B011130918
 
A011130108
A011130108A011130108
A011130108
 
I011125160
I011125160I011125160
I011125160
 
H011124050
H011124050H011124050
H011124050
 
G011123539
G011123539G011123539
G011123539
 
F011123134
F011123134F011123134
F011123134
 
E011122530
E011122530E011122530
E011122530
 
D011121524
D011121524D011121524
D011121524
 

Recently uploaded

Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetEnjoy Anytime
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Hyundai Motor Group
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 

Recently uploaded (20)

Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 

C1803041317

  • 1. IOSR Journal of Computer Engineering (IOSR-JCE) e-ISSN: 2278-0661,p-ISSN: 2278-8727, Volume 18, Issue 3, Ver. IV (May-Jun. 2016), PP 13-17 www.iosrjournals.org DOI: 10.9790/0661-1803041317 www.iosrjournals.org 13 | Page Migrate and Map: A Framework to Access Data from Mysql, Mongodb or Hbase Using Mysql Queries Piyush Nikam1 , Tushar Patil2 , Gayatri Hungund3 , Ankit Pagar4 , Aditi Talegaonkar5 , Ms. Soudamini Pawar6 1,2,3,4,5,6 (Department Of Computer Engineering, D. Y. P. C. O. E., Savitribai Phule Pune University, India) Abstract: Due to ever-increasing amount of data, scalability factor of the databases becomes a major constraint. Moreover, traditional relational databases fix the user’s perspective to view data in tabular format. Parallel databases emerged to be one of the solutions to tackle the scalability constraint which was later replaced by more scalable and efficient solution called NoSQL databases. By using the advantages of NoSQL we proposed a framework to migrate data from MySQL to NoSQL databases that is HBase and MongoDB. The framework will be used to map the MySQL query to HBase and MongoDB query format and fetch the data faster by introducing a decision maker that selects the database from which data can be fetched at a relatively faster rate, thus elevating the system performance. Keywords: HBase, NoSQL, MapReduce, MongoDB, MySQL. I. Introduction In the early years accessing Relational Database Management System (RDBMS) for data manipulation was the most popular way as it followed the traditional Atomicity – Consistency – Isolation – Durability (ACID) properties and provides support for storing substantial amount of data. Another advantage of having RDBMS is referential integrity which is a property of data which, when satisfied, requires every value of one attribute (column) of a relation (table) to exist as a value of another attribute in a different table. Also, advantage of using traditional RDBMS is having fine grain locking mechanism. But considering the scenario where the data is changing dynamically, data manipulation and handling becomes a difficult task when it comes to usage of RDBMS. NoSQL on the other hand, is a more efficient and scalable solution as compared to RDBMS. NoSQL basically provides an advantage of being schema less for processing of structured and unstructured data. It also provides MapReduce facility and robust operational tools. Considering all these advantages of using NoSQL this paper proposes a hybrid framework that migrates all the data stored in relational databases such as MySQL into a corresponding NoSQL database format. It also uses popular NoSQL databases such as MongoDB and HBase. The framework will firstly migrate the data stored in MySQL database to MongoDB and HBase. Further, the MySQL queries entered by user are mapped to MongoDB and HBase formatand datais retrieved from the MySQL or NoSQL database selected by the decision maker. The Technologies Used Are Highlighted As Follows: 1.1. Mongodb: MongoDB is one of the newest databases introduced by 10gen. It is open source and document type database in which the data is stored in the form of key and value pairs. MongoDB is a schema less database and it provides support to advanced features such as MapReduce and aggregation tools. The biggest advantage of using MongoDB is that it can store files of any size without complicating memory stack. 1.2. Hbase: HBase is a NoSQL database that provides real time access to big data and efficient big data analytics. It is flexible as it can combine a wide variety of different structures and schemas. HBase is popularly used due to the speed of accessing the data. HBase is used for real time data analysis as it is user friendly and also most importantly, HBase is fault tolerant. 1.3. Javafx: JavaFX is a set of media packages that is used to create rich client application acrossvarious platforms. It is a library which is written as Java API. JavaFX applicationcan use APIs from any Java libraries and can use these Java libraries to access nativesystem capabilities. The JavaFX APIs are fully integrated features of Java SE RuntimeEnvironment and Java Development Kit (JDK). It is cross-platform compatible.FXML or JavaFX Scene Builder are used to interactively design graphical user interface.Existing Swing applications can be
  • 2. Migrate And Map: A Framework To Access Data From Mysql, Mongodb Or Hbase Using Mysql.. DOI: 10.9790/0661-1803041317 www.iosrjournals.org 14 | Page updated with JavaFX features. JavaFXprovides all major UI controls that are required to develop rich applications. II. Need To Switch From SQL To Nosql Scalability of NoSQL is higher than that of relational database systems. Large amount of data is stored by horizontal scaling. NoSQL databases can handle data more efficiently as compared to relational database system. This is done by storing the data in distributed manner. NoSQL databases do not have fixed structure to store data. As there is no structure, different types of data can be stored. Caching facility is provided by NoSQL databases to increase the retrieval speed and efficiency of the system. III. General mapping techniques IV. The entered MySQL query is converted to parse tree and then this tree is mapped to NoSQL interface. V. Entered MySQL query is checked by query graph model. S-H and H-H rules are applied by query rewriter to query generated model. Directed acyclic graph is obtained from this query based model with the help of Query compiler. Plan evaluator is used to select optimum Directed Acyclic Graph (DAG) from obtained DAGs. VI. Using object relational mapping a data mapping module can be created which maps MySQL queries to NoSQL queries. IV. Related work Wu Chung Chung et. al. [1] have proposed a framework Jackhare that uses Druid API for implementation of scanner, parser and code generator. JDBC is used for connecting to databases. After entering the query, WHERE clause in the query is verified first and then, the query is checked for names of the tables and rows for which the WHERE clause is used. Finally, the data fields are sent as output in the SELECT statement. Hadoop Map Reduce jobs are used, in which, output of the Map phase is Key-value pairs and Reducer combines the output according to keys. Julian Rith et. al. [2] have proposed a middleware implemented using c# and is based on .NET framework for query mapping from MySQL to NoSQL in which the queries are divided into two categories which are Data Definition Language(DDL) statements and CRUD operations. The query given as input is converted into intermediate parse tree representation and further, this parse tree is mapped to NoSQL format. ANTLR is used for building parse tree. Xu et. al. [3] proposed a tool which increases probability of optimization of mapping rules using query rewriting to equivalent HiveQL queries. It uses cost estimation to select best method from the different HiveQL query versions which is based on data between mapper and reducer in network. Hive optimizer looks for chances to merge jobs after which QMapper estimator estimates cost of MapReduce DAGs. Further, it selects the method with minimum cost by measuring intermediate data between Maps and Reduces. Rupali Arora et. al. [4] have proposed an algorithm for migrating data from RDBMS to MongoDB by establishing the connection among the databases after which the database is selected by the user from existing relational databases and corresponding collection is created in MongoDB. The table having its embedded document created in MongoDB is chosen from the selected list of tables. Pentoho tool is used to load data into MongoDB. Leonardo Rocha et.al. [5] have proposed a NoSQL layer containing a mapping module to which user enters a query as input in MySQL format to retrieve data from NoSQL databases. Mapping is done according to types of statements. Each statement has separate java code which maps MySQL query to MongoDB. The results are stored and converted to string format. The mapped query is executed on MongoDB and result is forwarded to application. V. Comparative study Table1: Comparative study between MySQL, MongoDB and HBase Sr. No. Factor MySQL MongoDB HBase 1. Schema Structured Dynamic Dynamic 2. Scalability Vertical Horizontal Horizontal 3. Transactions Complex Simple Simple 4. Data Locality No Yes Yes 5. MapReduce Support No Yes Yes 6. Replication No Master-Slave replication Scalable replication VI. Architecture To utilize the advantage of horizontal scaling also known as sharding, and processing BigData, the Migrate and Map framework consists of a front end GUI and back end to migrate data stored in existing MySQL
  • 3. Migrate And Map: A Framework To Access Data From Mysql, Mongodb Or Hbase Using Mysql.. DOI: 10.9790/0661-1803041317 www.iosrjournals.org 15 | Page database to MongoDB and HBase and a decision maker. The user enters a query in MySQL format and this query is provided as input to decision maker which is programmed to predict the database on which the query can be executed more efficiently which in turn will increase the overall efficiency of the system. Once the database is selected by decision maker, the query is mapped to corresponding database query format and the data is retrieved from the specific NoSQL database. a.Gui The GUI is designed using JavaFX which is a set of graphics packages that provides various classes which in turn provide various methods written in native Java for creation of rich looking graphical user interfaces. The WebKitHTML technology provided by JavaFX can be used for embedding web pages within JavaFX application. A designer can code in FXML or use JavaFX Scene Builder to interactively design the GUI. FXML is an XML based directive markup language for constructing a JavaFX application user interface. Figure1: architecture diagram – mapping module Figure 2: architecture diagram – migrate module b. Jdbc Java database connectivity is achieved using JDBC driver which is open source application programming interface and is used to establish connectivity amongst databases to perform data manipulation operations. JDBC allows the existence of multiple implementations which can be used by same application. JDBC is used in the proposed framework to establish connectivity between Java and MySQL. It is also used to fetch the table and database metadata which is required for converting tables from MySQL format to intermediate form which is csv format. c. Mongomigrate The MongoMigrate module is programmed to auto-detect all the databases in MySQL and corresponding tables that are to be migrated to MongoDB. Database metadata is extracted to make an ArrayList
  • 4. Migrate And Map: A Framework To Access Data From Mysql, Mongodb Or Hbase Using Mysql.. DOI: 10.9790/0661-1803041317 www.iosrjournals.org 16 | Page of available databases which are selected one at a time. Tables are enlisted and added to ArrayList of tables. The selected table is converted to csv file which is then migrated to MongoDB one table at a time. One table in MySQL corresponds to one collection in MongoDB. The implementation uses multithreading for concurrent execution of making csv files and migration of data to MongoDB. Figure 3: migration from MySQL to MongoDB d. Mongo Map In migrate module (MongoDB) we have migrated data from MySQL to MongoDB. The tables migrated from MySQL are stored as a collection in MongoDB. In this module query (Select) fired in MySQL format is tokenized. As there can be many parameters to select query, “,” is used as a delimiter. The query mapper takes these tokens as a input and retrieves the data from MongoDB. Exaple1: select * from student; As the mapper will encounter a “*”, it means that there is nothing to tokenize and hence it will display the data stored entire collection. Ecample2: select name, rollno from student; In this query when the mapper will encounter a “,” query gets tokenized. Thus only name and rollno. will be retrieved. Example3: select name, rollno from student where rollno>2; It will display name and rollno whose rollno is greater than 2. Here not only “,” is checked but also operators from “where clause” is checked. When the operator “=”, “>” ,”<”, “>=”, “<=” is found it tokenized the where clause and gives the result accordingly. e. Hbasemigrate The HBase migrate module migrates data from MySQL to HBase. This module is also programmed to auto detect and migrate all the available MySQL databases and corresponding MySQL tables. To import data into HBase, the data is first converted into csv format and then imported to HBase. Figure 4: migration from MySQL to HBase f. Hbasemap This module maps the query from MySQL to HBase query format. The query is first tokenized then according to different tokens the query is mapped to corresponding HBase query format and fired on HBase to retrieve the data from HBase VII. Observations Table 2: Query mapping time Sr. No. Query Type Mapping time (ms) 1. select * from <table name> 13 2. select <column name(s)> from <table name> 16 3. select * from <table name> where <field = value> 50
  • 5. Migrate And Map: A Framework To Access Data From Mysql, Mongodb Or Hbase Using Mysql.. DOI: 10.9790/0661-1803041317 www.iosrjournals.org 17 | Page 4. select <column name(s)> from <table name> where <field = value> 21 5. select * from <table name> where <field > value> 45 6. select <column name(s)> from <table name> where <field > value> 38 7. select * from <table name> where <field < value> 24 8. select <column name(s)> from <table name> where <field < value> 37 9. select * form <table name> where <field >= value> 39 10. select <column name(s)> from <table name> where <field >= value> 41 11. select * from <table name> where <field <= value> 28 12. select <column name(s)> from <table name> where <field <= value> 35 13. select * from <table name> where <field1 operator value1 and field2 operator value2> 21 14. select <column name(s)> from <table name> where <field1operator value1 and field2 operator value2> 32 15. select * from <table name> where <field1 operator value1 or field2 operator value2> 27 16. select <column name(s)> from <table name> where <field1 operator value1 or field2 operator value2> 27 17. select avg(column) from <table name> 1 18. select avg(column name) from <table name> group by <column name> 1 19. select max(column) from <table name> 1 20. select max(column name) from <table name> group by <column name> 1 21. select min(column) from <table name> 1 22. select min(column name) from <table name> group by <column name> 1 23. select count(column) from <table name> 2 The Table 2 denotes the time taken by the proposed framework to map the MySQL queries to MongoDB and HBase query format. The time required for mapping is in milliseconds which denotes that in comparison to executing NoSQL queries, MySQL queries can be mapped to NoSQL query format and fired on NoSQL databases. The proposed framework is advantageous as the query execution time is more by just fraction of milliseconds which encourages the use of NoSQL databases and on the other hand, reduces the learning curve of user. Another advantage of using the proposed framework is that the query mapping time remains constant irrespective of size of data. VIII. Conclusion The paper proposes a hybrid architecture in which the data is migrated from traditional relational database that is MySQL, to MongoDB and HBase databases to address the scalability issue in existing relational database systems. The framework processes and serves the queries fired by the user in MySQL format to retrieve required data from MySQL or NoSQL databases. The data retrieval is done according to the decision making mechanism handled by the decision maker. References [1]. Chung, W. C., Lin, H. P., Chen, S. C., Jiang, M. F., & Chung, Y. C. (2014). JackHare: a framework for SQL to NoSQL translation using MapReduce. Automated Software Engineering, 21(4), 489-508; [2]. Rith, J., Lehmayr, P. S., & Meyer-Wegener, K. (2014, March). Speaking in toungues: SQL access to NoSQL systems. In Proceedings of the 29th Annual ACM Symposium on Applied Computing (pp. 855-857). ACM. [3]. Xu, Y., & Hu, S. (2013, May). Qmapper: a tool for sql optimization on hive using query rewriting. In Proceedings of the 22nd international conference on World Wide Web companion (pp. 211-212). International World Wide Web Conferences Steering Committee. [4]. Arora, R., & Aggarwal, R. R. (2013). An Algorithm for Transformation of Data from MySQL to NoSQL (MongoDB). International Journal of Advanced Studies in Computer Science and Engineering (IJASCSE), 2(1). [5]. Rocha, L., Vale, F., Cirilo, E., Barbosa, D., & Mourao, F. (2015). A Framework for migrating Relational Datasets to NoSQL. Procedia Computer Science, 51, 2593-2602.