Atul S et al. Int. Journal of Engineering Research and Application www.ijera.com
ISSN : 2248-9622, Vol. 7, Issue 4, ( Part -1) April 2017, pp.42-44
www.ijera.com DOI: 10.9790/9622-0704014244 42 | P a g e
Big Data and Big Data Management (BDM) with Current
Technologies – Review
Atul S, Desale Girish B, and Patil Swati P
Department of Computer Science & IT, JET’s Z. B. Patil College, Dhule, Maharashtra, INDIA
Department of Computer Science, S.S.V.P.S’s Science College, Dhule, Maharashtra, INDIA
ABSTRACT
The emerging phenomenon called "Big Data" is driving numerous changes in businesses and many other
organizations, domains, and fields. Many of them are struggling just to manage massive data sets. Big
data management combines two things, "big data" and "data management", which work together to
achieve business and technology goals. In the past few years, data generation has increased
tremendously due to the digitization of data. Day by day, the number of new computer tools and
technologies for transmitting data among computers through the Internet is increasing. The relevance
and importance of big data for decision making, performance improvement, and similar uses have emerged
very quickly in all areas. Big data management also faces numerous challenges; common complexities
include low organizational maturity relative to big data, weak business support, and the need to learn new
technology approaches. This paper discusses the impact of Big Data and issues related to data management
using current technologies.
Keywords: Big Data, Big Data Management (BDM), e-Business, Electronic Data Interchange (EDI),
e-Governance, e-Commerce, Hadoop, ETL.
I. INTRODUCTION
We are familiar with the terms e-Business,
Electronic Data Interchange (EDI), e-Governance, and
e-Commerce (hereafter referred to collectively as the
electronic era). These are all about automating as many
organizational processes as possible using computer
software or applications and the Internet. That is why
many companies and organizations have deployed
automated computer applications. Today, it is
assumed that an organization of any size or
complexity has several computer applications for
the sake of efficiency and competitiveness. A
consequence of this electronic era is that
organizations of every size and domain now have
massive volumes of data to manage, and that data is
growing day by day with extreme velocity and
variety, in line with the 3Vs concept of Big Data:
Volume, Variety, and Velocity. Although
organizations have the skills for structured data
(which is what comes out of most operational
applications), today's unprecedented data volume
and speed of generation make big data
management a challenge. [1]
Big Data generating resources:
Big Data can be simply defined by
explaining the 3Vs (volume, velocity, and variety),
which are the driving dimensions of the Big Data
concept. Gartner analyst Doug Laney introduced the
famous 3Vs concept in his 2001 META Group
publication, "3D data management: Controlling
Fig.1: Schematic representation of the 3Vs of Big
Data
1. Volume: This essentially concerns the large
quantities of data that are generated continuously.
Initially, storing such data was difficult because of
high storage costs. This data can be classified as
structured, unstructured, or semi-structured. [6]
2. Velocity: In earlier times, data was processed
in batches. However, this was possible only when the
incoming data rate was slower than the batch
processing rate. Now the speed of data generation
and transmission is extremely high. For example,
Facebook [7] generates 2.7 billion like actions per day
and 300 million photos, among other things, roughly
amounting to 2.5 million pieces of content each
day, while Google now processes over 1.2 trillion
searches per year worldwide [8].
3. Variety: Today data is losing its structure, and
formats now include documents, databases, Excel
tables, pictures, videos, and audio in hundreds of
formats. Structure can no longer be imposed before
data analysis as it once was. Data can be of any type:
structured, semi-structured, or unstructured.
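The velocity condition in item 2 can be made concrete with a little arithmetic. The sketch below is illustrative only: the function names and the example rates of 100 and 150 records per second are assumptions, not figures from the cited sources. It converts the daily like count quoted above into an average per-second rate and shows how a backlog builds up once data arrives faster than a batch system can process it.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def events_per_second(events_per_day):
    """Convert a daily event count into an average per-second rate."""
    return events_per_day / SECONDS_PER_DAY


def backlog(arrival_rate, processing_rate, seconds):
    """Records left unprocessed after `seconds`, starting from an empty queue.

    Both rates are in records per second. Batch processing keeps up only
    while arrival_rate <= processing_rate; otherwise the backlog grows
    linearly with time.
    """
    return max(0.0, (arrival_rate - processing_rate) * seconds)


# 2.7 billion like actions per day, as quoted in the text.
likes_rate = events_per_second(2_700_000_000)

# Hypothetical rates: the processor keeps up only in the first case.
keeps_up = backlog(arrival_rate=100, processing_rate=150, seconds=60)
falls_behind = backlog(arrival_rate=150, processing_rate=100, seconds=60)
```

Under these assumed rates, the second case accumulates unprocessed records every second, which is the situation in which batch processing alone stops being sufficient.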
Today's data sources may include [2]:
• Traditional enterprise data from operational
systems.
• Student test data.
• Social media data.
• Institution marketing data.
• Financial business forecast data.
• Web site browsing pattern data.
• Campus sensor data.
• Data gathered from mobile devices.
Fig.2: Data generation on Internet
"Big data comes from many sources, in many
formats. Some industries have large, valuable stores
of unstructured data, typically in the form of human
language text. For example, the claims process in
insurance generates many textual descriptions of
accidents and other losses, plus the related people,
locations, and events. Most insurance companies
process this unstructured big data using technologies
for natural language processing (NLP), often in the
form of text analytics. The output from NLP may
feed into older applications for risk and fraud
analytics or actuarial calculations, which benefit
from the larger data sample provided via NLP" [1].
"Sensors are coming online in great numbers as a
significant source for big data. For example, robots
have been in use for years in manufacturing, but now
they have additional sensors so they can perform
quality assurance as well as assembly. For decades,
mechanical gauges have been common in many
industries (such as chemicals and utilities), but now
the gauges are replaced by digital sensors to provide
real-time monitoring and analysis. GPS and RFID
signals now emanate from mobile devices and
assets—ranging from smart phones to trucks to
shipping pallets—so all these can be tracked and
controlled precisely" [1].
Big Data:
• Includes many varieties of content formats
such as text, audio, video, photographs,
images, PDFs, PPTs, etc.
• Can involve many electronic transactions such as
sharing files through e-mail, sharing through
social media, file transfers, etc.
• Has various levels of engagement on
social media (share, comment, like, tweet,
retweet, tag, etc.).
• Supports scalable communication among people
(one-to-one, one-to-many, many-to-many)
through various online forums and portals,
involving experts or people of similar or
different communities, on a number of
issues.
• Is device independent in its generation: it can be
generated through different mobile phones and
smartphones, palmtops, PCs, etc.
• Is also generated through real-time transactions,
such as weather data input from various
electronic devices or sensors at various
geographic locations and from satellites, which
produce a huge amount of digital input per second
for statistical analysis.
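The variety of content formats listed above can be illustrated with a small sketch that sorts incoming files into structured, semi-structured, and unstructured groups. The extension-to-category mapping and the sample file names are assumptions made for this example; real systems would normally inspect the content rather than rely on the file extension.

```python
from collections import Counter
from pathlib import PurePath

# Assumed mapping from file extension to structure category.
CATEGORY_BY_EXTENSION = {
    ".csv": "structured",
    ".xlsx": "structured",
    ".json": "semi-structured",
    ".xml": "semi-structured",
    ".txt": "unstructured",
    ".pdf": "unstructured",
    ".ppt": "unstructured",
    ".jpg": "unstructured",
    ".mp3": "unstructured",
    ".mp4": "unstructured",
}


def classify(filename):
    """Return the structure category for a file name, or "unknown"."""
    suffix = PurePath(filename).suffix.lower()
    return CATEGORY_BY_EXTENSION.get(suffix, "unknown")


def summarize(filenames):
    """Count how many files fall into each structure category."""
    return Counter(classify(name) for name in filenames)


# Hypothetical file names standing in for mixed incoming data.
sample = ["grades.csv", "tweets.json", "lecture.ppt", "campus.MP4", "notes"]
summary = summarize(sample)
```

A summary of this kind gives a rough picture of how much of a data set fits a fixed schema and how much needs other handling.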
BDM Technologies in use:
As data sets come in various formats and
electronic data generation proceeds with large
volume, velocity, and variety every second, it
has become difficult for current DBMSs and RDBMSs
to store, manipulate, and extract the required
information in real time. Among current Big Data
Management technologies, the most commonly used
framework is Hadoop. Hadoop is a combination of
several components such as the Hadoop Distributed
File System (HDFS), Pig, Hive, and HBase. "Apache
Hadoop is an open-source software framework for
storage and large-scale processing of data-sets on
clusters of commodity hardware. There are mainly
five building blocks inside this runtime environment
(from bottom to top)". [9]
Fig.3: Hadoop Architecture [9]
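The processing model most closely associated with Hadoop is MapReduce. The sketch below imitates its three stages (map, shuffle, reduce) in a single Python process to count words. It is only an illustration of the idea, not the Hadoop API: the function names are our own, and a real cluster would distribute the documents and the grouped keys across many machines.

```python
from collections import defaultdict


def map_phase(document):
    """Emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Group values by key, as a framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values for one key into a single total."""
    return key, sum(values)


def word_count(documents):
    """Run map, shuffle, and reduce over a list of documents."""
    pairs = [pair for doc in documents for pair in map_phase(doc)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


counts = word_count(["big data big", "data management"])
```

Because each map call touches only one document and each reduce call touches only one key, both stages can in principle run in parallel, which is what makes the model suitable for clusters of commodity hardware.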
Limitations for Current Technologies:
The rise of big data has brought management
limitations with it. Even five years ago, a company
could leverage a DBMS such as Oracle for a data
warehouse. However, Oracle was built at a time when
databases rarely exceeded a few gigabytes in size.
Along with other legacy DBMSs, it cannot perform at
the scale now required [10]. Hadoop, meanwhile, is
quite new and unknown to a large number of people,
and it has been something of a mystery to the
business world. Today it is widely recognised in the
technology world, but most people are still unaware
of, or confused about, what it is actually used for.
Broadly, practitioners consider that it was developed
to facilitate data analytics, certain forms of
batch-oriented distributed data processing, and ETL
(Extract, Transform, Load) operations. Hadoop
nevertheless has major limitations in its analytic
functionality: its database-like qualities are not a
replacement for a true analytic platform, and its
fundamentals were not designed to facilitate highly
interactive analytics [10]. HDFS was implemented to
speed up the processing of web documents and to
apply the MapReduce framework to this processing,
where there was no need for a schema; because it was
built to operate on clusters of arbitrary size, there
was also no need for a particular storage scheme. In
addition, there was no optimizer to examine the data
flow and its nature automatically, and there were no
checkpoints for data recovery. This implies that the
output extracted from a Hadoop cluster is not
guaranteed to be complete and fully accurate [10].
The lack of expertise in handling the Hadoop
framework precisely for BDM is the biggest issue
today.
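The ETL operations mentioned above can be sketched on a very small scale. The example below is a minimal illustration using only the Python standard library; the sample records, the table name, and the cleaning rule (dropping rows whose score is not a whole number) are assumptions made for this sketch and do not come from the cited sources.

```python
import csv
import io
import sqlite3

# Hypothetical raw input with inconsistent spacing and one bad value.
RAW_CSV = """student,score
alice, 82
bob,not available
carol,91
"""


def extract(text):
    """Extract: read raw CSV text into a list of dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: normalize names and keep only rows with a numeric score."""
    clean = []
    for row in rows:
        score = row["score"].strip()
        if score.isdigit():
            clean.append((row["student"].strip().title(), int(score)))
    return clean


def load(records, connection):
    """Load: write the cleaned records into a database table."""
    connection.execute("CREATE TABLE scores (student TEXT, score INTEGER)")
    connection.executemany("INSERT INTO scores VALUES (?, ?)", records)
    connection.commit()


connection = sqlite3.connect(":memory:")
load(transform(extract(RAW_CSV)), connection)
loaded = connection.execute(
    "SELECT student, score FROM scores ORDER BY student"
).fetchall()
```

At big data scale the same three stages are typically run as distributed batch jobs, which is one of the workloads the text attributes to Hadoop.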
II. CONCLUSIONS
In this paper, we have tried to present the
significance of Big Data and the limitations of
current technologies in analyzing it. The major
issues in processing Big Data are the quality of
output resulting from open-source components in
analytical and management processes, the lack of
expertise required to operate the current Hadoop
framework, the compatibility of current technology,
and security.
REFERENCES
[1]. TDWI Research – TDWI Best Practices
Report by Philip Russom, Fourth Quarter
2013.
[2]. Big Data and Social Media to Improve the
Quality of Higher Education - Dr. Savita
Kumari (Sheoran), Assistant Professor, Dept.
of CS&A, Indira Gandhi University, Meerpur,
Rewari (Haryana), INDIA - 122502. IJCSMC,
Vol. 5, Issue 3, March 2016, pg. 179–185.
[3]. http://www.3rootsstudios.com/is-big-data-under-utilized
[4]. http://www.exist.com/wp-content/uploads/2014/10/3Vsbigdata.png
[5]. http://www.forbes.com/sites/gartnergroup/2013/03/27/gartners-big-data-definition-consists-of-three-parts-not-to-be-confused-with-three-vs
[6]. Big Data – Concepts, Applications,
Challenges and Future Scope. Samiddha
Mukherjee, Ravi Shaw, Information
Technology, Institute of Engineering &
Management, Kolkata, India. International
Journal of Advanced Research in Computer
and Communication Engineering, Vol. 5,
Issue 2, February 2016. DOI
10.17148/IJARCCE.2016.5215. ISSN (Online)
2278-1021, ISSN (Print) 2319-5940.
[7]. http://www.internetlivestats.com/twitter-statistics/
[8]. http://www.internetlivestats.com/google-search-statistics/
[9]. http://ercoppa.github.io/HadoopInternals/HadoopArchitectureOverview.html
[10]. https://www.em360tech.com/wp-content/files_mf/1360922634PARACCEL1.pdf

More Related Content

What's hot

IRJET- A Comparative Study on Big Data Analytics Approaches and Tools
IRJET- A Comparative Study on Big Data Analytics Approaches and ToolsIRJET- A Comparative Study on Big Data Analytics Approaches and Tools
IRJET- A Comparative Study on Big Data Analytics Approaches and ToolsIRJET Journal
 
Big Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyBig Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyRohit Dubey
 
Big data analysis using map/reduce
Big data analysis using map/reduceBig data analysis using map/reduce
Big data analysis using map/reduceRenuSuren
 
IRJET- A Scenario on Big Data
IRJET- A Scenario on Big DataIRJET- A Scenario on Big Data
IRJET- A Scenario on Big DataIRJET Journal
 
big data Big Things
big data Big Thingsbig data Big Things
big data Big Thingspateelhs
 
Big data Analytics
Big data AnalyticsBig data Analytics
Big data AnalyticsTUSHAR GARG
 
Big data data lake and beyond
Big data data lake and beyond Big data data lake and beyond
Big data data lake and beyond Rajesh Kumar
 
Big data analytics, research report
Big data analytics, research reportBig data analytics, research report
Big data analytics, research reportJULIO GONZALEZ SANZ
 
Big Data & Hadoop Introduction
Big Data & Hadoop IntroductionBig Data & Hadoop Introduction
Big Data & Hadoop IntroductionJayant Mukherjee
 
Big data by Mithlesh sadh
Big data by Mithlesh sadhBig data by Mithlesh sadh
Big data by Mithlesh sadhMithlesh Sadh
 
Presentation About Big Data (DBMS)
Presentation About Big Data (DBMS)Presentation About Big Data (DBMS)
Presentation About Big Data (DBMS)SiamAhmed16
 
Core concepts and Key technologies - Big Data Analytics
Core concepts and Key technologies - Big Data AnalyticsCore concepts and Key technologies - Big Data Analytics
Core concepts and Key technologies - Big Data AnalyticsKaniska Mandal
 
big data overview ppt
big data overview pptbig data overview ppt
big data overview pptVIKAS KATARE
 

What's hot (20)

IRJET- A Comparative Study on Big Data Analytics Approaches and Tools
IRJET- A Comparative Study on Big Data Analytics Approaches and ToolsIRJET- A Comparative Study on Big Data Analytics Approaches and Tools
IRJET- A Comparative Study on Big Data Analytics Approaches and Tools
 
Big data
Big dataBig data
Big data
 
Big Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyBig Data PPT by Rohit Dubey
Big Data PPT by Rohit Dubey
 
Big data
Big dataBig data
Big data
 
Big data analysis using map/reduce
Big data analysis using map/reduceBig data analysis using map/reduce
Big data analysis using map/reduce
 
IRJET- A Scenario on Big Data
IRJET- A Scenario on Big DataIRJET- A Scenario on Big Data
IRJET- A Scenario on Big Data
 
Chapter 1 big data
Chapter 1 big dataChapter 1 big data
Chapter 1 big data
 
big data Big Things
big data Big Thingsbig data Big Things
big data Big Things
 
Big data Analytics
Big data AnalyticsBig data Analytics
Big data Analytics
 
Big data data lake and beyond
Big data data lake and beyond Big data data lake and beyond
Big data data lake and beyond
 
Big data analytics, research report
Big data analytics, research reportBig data analytics, research report
Big data analytics, research report
 
Big Data & Hadoop Introduction
Big Data & Hadoop IntroductionBig Data & Hadoop Introduction
Big Data & Hadoop Introduction
 
Big Data
Big DataBig Data
Big Data
 
Big data by Mithlesh sadh
Big data by Mithlesh sadhBig data by Mithlesh sadh
Big data by Mithlesh sadh
 
Presentation About Big Data (DBMS)
Presentation About Big Data (DBMS)Presentation About Big Data (DBMS)
Presentation About Big Data (DBMS)
 
Big_data_ppt
Big_data_ppt Big_data_ppt
Big_data_ppt
 
R180305120123
R180305120123R180305120123
R180305120123
 
Big data-ppt
Big data-pptBig data-ppt
Big data-ppt
 
Core concepts and Key technologies - Big Data Analytics
Core concepts and Key technologies - Big Data AnalyticsCore concepts and Key technologies - Big Data Analytics
Core concepts and Key technologies - Big Data Analytics
 
big data overview ppt
big data overview pptbig data overview ppt
big data overview ppt
 

Similar to Big Data and Big Data Management (BDM) with current Technologies –Review

Big Data Testing Using Hadoop Platform
Big Data Testing Using Hadoop PlatformBig Data Testing Using Hadoop Platform
Big Data Testing Using Hadoop PlatformIRJET Journal
 
High level view of cloud security
High level view of cloud securityHigh level view of cloud security
High level view of cloud securitycsandit
 
HIGH LEVEL VIEW OF CLOUD SECURITY: ISSUES AND SOLUTIONS
HIGH LEVEL VIEW OF CLOUD SECURITY: ISSUES AND SOLUTIONSHIGH LEVEL VIEW OF CLOUD SECURITY: ISSUES AND SOLUTIONS
HIGH LEVEL VIEW OF CLOUD SECURITY: ISSUES AND SOLUTIONScscpconf
 
Big Data with Hadoop – For Data Management, Processing and Storing
Big Data with Hadoop – For Data Management, Processing and StoringBig Data with Hadoop – For Data Management, Processing and Storing
Big Data with Hadoop – For Data Management, Processing and StoringIRJET Journal
 
BIG Data and Methodology-A review
BIG Data and Methodology-A reviewBIG Data and Methodology-A review
BIG Data and Methodology-A reviewShilpa Soi
 
An Encyclopedic Overview Of Big Data Analytics
An Encyclopedic Overview Of Big Data AnalyticsAn Encyclopedic Overview Of Big Data Analytics
An Encyclopedic Overview Of Big Data AnalyticsAudrey Britton
 
Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...
Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...
Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...IJSRD
 
Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...
Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...
Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...IJSRD
 
Big Data Handling Technologies ICCCS 2014_Love Arora _GNDU
Big Data Handling Technologies ICCCS 2014_Love Arora _GNDU Big Data Handling Technologies ICCCS 2014_Love Arora _GNDU
Big Data Handling Technologies ICCCS 2014_Love Arora _GNDU Love Arora
 
Big Data Systems: Past, Present & (Possibly) Future with @techmilind
Big Data Systems: Past, Present &  (Possibly) Future with @techmilindBig Data Systems: Past, Present &  (Possibly) Future with @techmilind
Big Data Systems: Past, Present & (Possibly) Future with @techmilindEMC
 
A Review Paper on Big Data and Hadoop for Data Science
A Review Paper on Big Data and Hadoop for Data ScienceA Review Paper on Big Data and Hadoop for Data Science
A Review Paper on Big Data and Hadoop for Data Scienceijtsrd
 
The book of elephant tattoo
The book of elephant tattooThe book of elephant tattoo
The book of elephant tattooMohamed Magdy
 
IRJET- Systematic Review: Progression Study on BIG DATA articles
IRJET- Systematic Review: Progression Study on BIG DATA articlesIRJET- Systematic Review: Progression Study on BIG DATA articles
IRJET- Systematic Review: Progression Study on BIG DATA articlesIRJET Journal
 
SECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTING
SECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTINGSECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTING
SECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTINGIJNSA Journal
 
Big data seminor
Big data seminorBig data seminor
Big data seminorberasrujana
 
An Overview of BigData
An Overview of BigDataAn Overview of BigData
An Overview of BigDataValarmathi V
 

Similar to Big Data and Big Data Management (BDM) with current Technologies –Review (20)

Big Data Testing Using Hadoop Platform
Big Data Testing Using Hadoop PlatformBig Data Testing Using Hadoop Platform
Big Data Testing Using Hadoop Platform
 
High level view of cloud security
High level view of cloud securityHigh level view of cloud security
High level view of cloud security
 
HIGH LEVEL VIEW OF CLOUD SECURITY: ISSUES AND SOLUTIONS
HIGH LEVEL VIEW OF CLOUD SECURITY: ISSUES AND SOLUTIONSHIGH LEVEL VIEW OF CLOUD SECURITY: ISSUES AND SOLUTIONS
HIGH LEVEL VIEW OF CLOUD SECURITY: ISSUES AND SOLUTIONS
 
Big Data with Hadoop – For Data Management, Processing and Storing
Big Data with Hadoop – For Data Management, Processing and StoringBig Data with Hadoop – For Data Management, Processing and Storing
Big Data with Hadoop – For Data Management, Processing and Storing
 
BIG Data and Methodology-A review
BIG Data and Methodology-A reviewBIG Data and Methodology-A review
BIG Data and Methodology-A review
 
Big data
Big dataBig data
Big data
 
An Encyclopedic Overview Of Big Data Analytics
An Encyclopedic Overview Of Big Data AnalyticsAn Encyclopedic Overview Of Big Data Analytics
An Encyclopedic Overview Of Big Data Analytics
 
Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...
Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...
Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...
 
Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...
Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...
Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...
 
Big Data Handling Technologies ICCCS 2014_Love Arora _GNDU
Big Data Handling Technologies ICCCS 2014_Love Arora _GNDU Big Data Handling Technologies ICCCS 2014_Love Arora _GNDU
Big Data Handling Technologies ICCCS 2014_Love Arora _GNDU
 
Big Data Systems: Past, Present & (Possibly) Future with @techmilind
Big Data Systems: Past, Present &  (Possibly) Future with @techmilindBig Data Systems: Past, Present &  (Possibly) Future with @techmilind
Big Data Systems: Past, Present & (Possibly) Future with @techmilind
 
Big Data: Issues and Challenges
Big Data: Issues and ChallengesBig Data: Issues and Challenges
Big Data: Issues and Challenges
 
A Review Paper on Big Data and Hadoop for Data Science
A Review Paper on Big Data and Hadoop for Data ScienceA Review Paper on Big Data and Hadoop for Data Science
A Review Paper on Big Data and Hadoop for Data Science
 
Big Data
Big DataBig Data
Big Data
 
The book of elephant tattoo
The book of elephant tattooThe book of elephant tattoo
The book of elephant tattoo
 
IRJET- Systematic Review: Progression Study on BIG DATA articles
IRJET- Systematic Review: Progression Study on BIG DATA articlesIRJET- Systematic Review: Progression Study on BIG DATA articles
IRJET- Systematic Review: Progression Study on BIG DATA articles
 
SECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTING
SECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTINGSECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTING
SECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTING
 
Big data seminor
Big data seminorBig data seminor
Big data seminor
 
An Overview of BigData
An Overview of BigDataAn Overview of BigData
An Overview of BigData
 
Complete-SRS.doc
Complete-SRS.docComplete-SRS.doc
Complete-SRS.doc
 

Recently uploaded

Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerAnamika Sarkar
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2RajaP95
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx959SahilShah
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AIabhishek36461
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learningmisbanausheenparvam
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...srsj9000
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineeringmalavadedarshan25
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionDr.Costas Sachpazis
 
Introduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxIntroduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxvipinkmenon1
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLDeelipZope
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort servicejennyeacort
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 

Recently uploaded (20)

Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AI
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learning
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineering
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
 
Introduction to Microprocesso programming and interfacing.pptx

Big Data and Big Data Management (BDM) with current Technologies –Review

Atul S et al. Int. Journal of Engineering Research and Application, www.ijera.com
ISSN: 2248-9622, Vol. 7, Issue 4 (Part 1), April 2017, pp. 42-44
DOI: 10.9790/9622-0704014244
RESEARCH ARTICLE, OPEN ACCESS

Atul S, Desale Girish B, and Patil Swati P
Department of Computer Science & IT, JET's Z. B. Patil College, Dhule, Maharashtra, INDIA
Department of Computer Science, S.S.V.P.S's Science College, Dhule, Maharashtra, INDIA

ABSTRACT
The emerging phenomenon called "Big Data" is driving numerous changes in businesses and in many other organizations, domains and fields, and many of them are struggling just to manage their massive data sets. Big data management is about two things, "Big Data" and "Data Management", and these work together to achieve business and technology goals. In the past few years data generation has increased tremendously due to the digitization of data. Day by day, the number of computer tools and technologies for transmitting data among computers over the Internet is increasing. The relevance and importance of big data for decision making, performance improvement and similar uses have grown rapidly in all areas. Big data management also faces numerous challenges; common difficulties include low organizational maturity relative to big data, weak business support, and the need to learn new technology approaches. This paper discusses the impact of Big Data and the issues related to data management using current technologies.

Keywords: Big Data, Big Data Management (BDM), e-Business, Electronic Data Interchange (EDI), e-Governance, e-Commerce, Hadoop, ETL.

I. INTRODUCTION
We are familiar with the terms e-Business, Electronic Data Interchange (EDI), e-Governance and e-Commerce (hereafter referred to collectively as the electronic era). These are all about automating as many organizational processes as possible using computer software, applications and the Internet. That is why many companies and organizations have deployed automated computer applications. Today, it is assumed that an organization of any size or complexity has several computer applications for the sake of efficiency and competitiveness. A consequence of this electronic era is that organizations of every size and domain now have massive volumes of data to manage, and this data is growing day by day with extreme velocity and variety, in line with the 3Vs of Big Data: Volume, Variety and Velocity. Although organizations have the skills for structured data (which is what comes out of most operational applications), today's unprecedented data volume and speed of generation make big data management a challenge [1].

Big Data generating resources:
Big Data can be defined simply through the 3Vs (volume, velocity and variety), which are the driving dimensions of the Big Data concept. Gartner analyst Doug Laney introduced the famous 3Vs concept in his 2001 META Group publication, "3D data management: Controlling ...".

Fig.1: Schematic representation of the 3Vs of Big Data

1. Volume: This essentially concerns the large quantities of data that are generated continuously. Initially, storing such data was difficult because of high storage costs. This data can be classified as structured, unstructured or semi-structured [6].

2. Velocity: In earlier times, data was processed in batches. However, this was possible only when the incoming data rate was slower than the batch processing rate. Now the speed of data generation and transmission is extremely high. For example, Facebook [7] generates 2.7 billion like actions and 300 million photos per day, among other things, roughly amounting to 2.5 million pieces of content each day, while Google now processes over 1.2 trillion searches per year worldwide [8].

3. Variety: Today data is losing its structure, and it arrives as documents, databases, Excel tables, pictures, videos and audio in hundreds of formats. Structure can no longer be imposed as before for data analysis. Data can be of any type: structured, semi-structured or unstructured.

Today's data sources may include [2]:
• Traditional enterprise data from operational systems.
• Student test data.
• Social media data.
• Institution marketing data.
• Financial business forecast data.
• Web site browsing pattern data.
• Campus sensor data.
• Data gathered from mobile devices.

Fig.2: Data generation on Internet

"Big data comes from many sources, in many formats. Some industries have large, valuable stores of unstructured data, typically in the form of human language text. For example, the claims process in insurance generates many textual descriptions of accidents and other losses, plus the related people, locations, and events. Most insurance companies process this unstructured big data using technologies for natural language processing (NLP), often in the form of text analytics. The output from NLP may feed into older applications for risk and fraud analytics or actuarial calculations, which benefit from the larger data sample provided via NLP" [1].

"Sensors are coming online in great numbers as a significant source for big data. For example, robots have been in use for years in manufacturing, but now they have additional sensors so they can perform quality assurance as well as assembly.
For decades, mechanical gauges have been common in many industries (such as chemicals and utilities), but now the gauges are replaced by digital sensors to provide real-time monitoring and analysis. GPS and RFID signals now emanate from mobile devices and assets, ranging from smart phones to trucks to shipping pallets, so all these can be tracked and controlled precisely" [1].

Big Data:
• Includes several varieties of content formats such as text, audio, video, photographs, images, PDFs, PPTs, etc.
• Can involve several kinds of electronic transactions, such as sharing files through e-mail, sharing through social media, file transfers, etc.
• Has various levels of engagement on social media (share, comment, like, tweet, retweet, tag, etc.).
• Supports scalable communication among people (one-to-one, one-to-many, many-to-many) through various online forums and portals, involving experts or people of similar or different communities, on a number of issues.
• Is device independent in its generation: it can be generated through different mobiles and smart phones, palmtops, PCs, etc.
• Is also generated through real-time transactions, such as weather data inputs from various electronic devices or sensors at various geographic locations and from satellites, which produce a huge amount of digital input per second for statistical analysis.

BDM Technologies in use:
As data sets come in various formats and data is being generated electronically in large volume, velocity and variety every second, it has become difficult for current DBMSs and RDBMSs to store, manipulate and extract the required information from it in real time. Among current Big Data Management technologies, the most commonly used framework is Hadoop. Hadoop combines several components, such as the Hadoop Distributed File System (HDFS), Pig, Hive and HBase. "Apache Hadoop is an open-source software framework for storage and large-scale processing of data-sets on clusters of commodity hardware. There are mainly five building blocks inside this runtime environment (from bottom to top)" [9].

Fig.3: Hadoop Architecture [9]
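
To make the batch-oriented processing model behind Hadoop more concrete, the following is a minimal single-machine sketch of the MapReduce pattern (map, shuffle, reduce) applied to a word count. It is an illustration in plain Python and does not use Hadoop's API; the function names map_phase, shuffle_phase, reduce_phase and word_count, as well as the sample documents, are assumptions made for this example only.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(document: str) -> Iterator[Tuple[str, int]]:
    """Emit a (word, 1) pair for every word in one input document."""
    for word in document.lower().split():
        yield word, 1


def shuffle_phase(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Group all emitted values by key, as a cluster would do between map and reduce."""
    grouped: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return dict(grouped)


def reduce_phase(grouped: Dict[str, List[int]]) -> Dict[str, int]:
    """Combine the values of each key into a single count."""
    return {key: sum(values) for key, values in grouped.items()}


def word_count(documents: Iterable[str]) -> Dict[str, int]:
    """Run the three phases in sequence on an in-memory list of documents."""
    pairs = (pair for doc in documents for pair in map_phase(doc))
    return reduce_phase(shuffle_phase(pairs))


if __name__ == "__main__":
    # Hypothetical sample input, chosen only for illustration.
    print(word_count(["big data", "Big Data management"]))
```

In a real cluster the map and reduce steps would run in parallel on many machines over data stored in a distributed file system; this sketch runs them sequentially in one process only to show how the phases fit together.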
Limitations for Current Technologies:
The rise of big data has brought limitations and management issues with it. Even five years ago, a company could leverage a DBMS such as Oracle for a data warehouse. However, Oracle was built at a time when databases rarely exceeded a few gigabytes in size. Along with other legacy DBMSs, it cannot perform at the scale now required [10].

Because Hadoop is very new and unknown to a large number of people, it has been something of a mystery to the business world. Today it is highly recognised in the technology world, but most people are still unaware of, or confused about, what it is actually used for. Some technical practitioners consider that it was developed to facilitate data analytics and certain forms of batch-oriented distributed data processing and ETL (Extract, Transform, Load) operations. However, Hadoop has major limitations in its analytic functionality: its database-like qualities are not a replacement for a true analytic platform, and it was not fundamentally designed to facilitate highly interactive analytics [10].

HDFS was implemented to speed up the processing of various web documents and to apply the MapReduce framework to this processing. There was no need for a schema, and because it was built to operate on clusters of arbitrary size, there was no need for a tailored storage scheme. There was also no optimizer to examine the data flow and its nature automatically, and there were no checkpoints for data recovery. This implies that the output extracted from a Hadoop cluster is not guaranteed to be perfect or 100% complete [10]. The lack of expertise in handling the Hadoop framework precisely for BDM is the biggest issue today.

II. CONCLUSIONS
In this paper, we have tried to present the significance of Big Data and the limitations of current technologies in analyzing it. The major issues in processing Big Data are the quality of output due to open source components in the analytical and management processes, the lack of expertise required to operate the current Hadoop framework, the compatibility of the current technology, and security.

REFERENCES
[1]. Philip Russom, TDWI Best Practices Report, TDWI Research, Fourth Quarter 2013.
[2]. Dr. Savita Kumari (Sheoran), Assistant Professor, Dept. of CS&A, Indira Gandhi University Meerpur, Rewari (Haryana), INDIA - 122502, "Big Data and Social Media to Improve the Quality of Higher Education", IJCSMC, Vol. 5, Issue 3, March 2016, pp. 179-185.
[3]. http://www.3rootsstudios.com/is-big-data-under-utilized
[4]. http://www.exist.com/wp-content/uploads/2014/10/3Vsbigdata.png
[5]. http://www.forbes.com/sites/gartnergroup/2013/03/27/gartners-big-data-definition-consists-of-three-parts-not-to-be-confused-with-three-vs
[6]. Samiddha Mukherjee, Ravi Shaw, Information Technology, Institute of Engineering & Management, Kolkata, India, "Big Data - Concepts, Applications, Challenges and Future Scope", International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE), Vol. 5, Issue 2, February 2016, DOI 10.17148/IJARCCE.2016.5215, ISSN (Online) 2278-1021, ISSN (Print) 2319-5940.
[7]. http://www.internetlivestats.com/twitter-statistics/
[8]. http://www.internetlivestats.com/google-search-statistics/
[9]. http://ercoppa.github.io/HadoopInternals/HadoopArchitectureOverview.html
[10]. https://www.em360tech.com/wp-content/files_mf/1360922634PARACCEL1.pdf