SlideShare a Scribd company logo
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 5094
Analysis of Big Data Technology and its Challenges
Dr Ajay Pratap
Assistant Professor, AIIT, Amity University Uttar Pradesh, Lucknow, UP, India
---------------------------------------------------------------------***----------------------------------------------------------------------
Abstract - In recent decades, the internet application and
communication have seen a lot of growth and reputation in
the Information Technology sector. These internet based
applications and related communication are generating the
large size and different variety of difficult multifaceted
structure data regularly which is known as big data. This is
the era of massive automatic data collection, systematically
obtaining various measurements without knowing relevancy
of data. For instance, E-commerce transactions may include
the data of online buying, selling, investing and exploration
based activities which has very complex structure and high in
dimensional as well. The traditional data storage and
management techniques are not adequate to store and
analyses these high volume data. In this survey paper we are
discussing the big data architecture for data analysis. Paper
also provides the overview of the big data challenges and
various technologies to handle the Big Data.
Key Words: Data Analysis, Big Data, Architecture, High
Volume Data, Unstructured Data, Semi-Structured Data
1. INTRODUCTION
In today’s era people and system are using the web based
application and it causes generation of large size of data in
an exponential manner. Exabyte(EB)andPetabytes(PB) are
the measuring units for the size of data. This amount of data
growth is due to advancements in the fields of
communication, digital sensors, computations, and storage
of data for business analytics. The Big Data term had been
devised by a researcher, Roger Magoulas. In last few
decades, data has been grown in massive order in various
fields and moreover multiple types of data are also
emerging. As per the statists of International Data
Corporation (IDC), the overall data volume created in the
world was 1.8ZB for year 2011. This volume is approx. nine
times larger within next five years. Data or Information will
be the fuel for 21st century as all possible domains such as
health care, marketing, disease control & prevention, smart
city and business intelligence applications are going to
generate and use a large amount of data and information.
If we compare big data concept with other traditional
datasets and its processes, big data includes semistructured
and unstructured data. Big data technologyanditsanalytical
processes are used to provide the descriptionaboutmassive
datasets generated from various real time systems. Big data
is also useful for getting details about new prospects for
determining new values, for in-depth understanding of the
hidden values, and also some real time situations such as
organizing and manipulating big datasets exponentially.
The volume of data and information is growing rapidly and
also some challenging issues. Big data visualization process
is another vital problem in big data analytics. Improvement
in the field of Information Technology has generated more
and more data on daily basis that causes the problems of
gathering andintegratingdata fromdistributeddata sources.
Some other upcoming technologiessuchascloudcomputing,
Internet of Things (IoT) and Data Centers also promote the
growth of data. Cloud computing technology provides the
standard for storing and retrievingthedata fromthe bigdata
assets. IoT technology is based on use of sensors to collect
and transmit the data to be stored and processed in the
cloud storage.
2. BIG DATA ANALYTICS
Knowledge Discovery in Databases (KDD) refers to the
broad process of finding knowledge in data, and emphasizes
the "high-level" application of particular data mining
methods. Fundamentally, data processing is seen as the
collecting, processing, and management of data for
producing new information for end users [8]. Big Data
analysis is done in four steps which are Acquisition,
Assembly, Analyze and Action which are known as 4 A’s of
big data analysis.
Fig-1: Steps of Data Analysis
2.1 Acquisition Process
In Big Dataarchitecture,theacquisitioncomponenthasto
obtain high speed datafromvariousdatasourceswhichdeals
with variousaccesscontrolprotocolsalso.Insomeconditions
of generation of dataareimportant,andsometimescapturing
the metadata and storing them with the corresponding data
is also important forfurther analysis. Itiswhereafiltercould
be recognized to store only data which could be helpful or
underdone data with a lesser degree of uncertainty [9].
2.2 Assembly Process
This is the second step of analysis where the data of
structured or semi-structured category have to cleaned,
brought into the computable mode. After this they are
Acquisition Assembly
Analyze Action
Data
Source
Apply
Privacy
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 5095
integrated and stored at desired location. This is a sort of
extraction,transformationandloading(ETL)processdoneon
data. At this point the architecture has to deal with various
data formats and must be able to parse them and extract the
actual information like named entities, relation between
them, etc [9]. Complete cleaning of data in big data
environment is not entirely guaranteed.
2.3 Analysis Process
This is the point where we have run queries, perform
modeling, and building new algorithms for new cases.
Analyzing of data is donebyminingtechniquewhichrequires
integrated, cleaned, trustworthy data.
2.4 Action Process
In this step we interpret the results obtained in Analysis
step to tale some valuable decisions for the organization or
enterprise. At the same time it is also very important to
understand and verify outputs obtained at user’s end.
2.5 Privacy
Privacy of big data and results obtained by analysis
process are very important issueandnotputtingmucheffort
on privacy may cause many serious problems at theanalysis
of data and sometimes at the creation of data also. To
overcome with these problems the big data architecture
must have following properties:
 Big data infrastructure be linear scalable
 Must be able to handle high throughput multi
formatted data
 Auto recoverable
 Fault tolerant
 Higher degree of parallelism and distributed data
processing
3. BIG DATA ANALYTICS INFRASTRUCTURE
Fig-2 depicts the overall layer based infrastructure of big
data analytics. Various layers present in the big data
analytics are as follows:
 Data Layers
 Analytics Layer
 Integration Layer
 Decision Layer
 Data Governance Layer
Fig-2: Big Data Infrastructure for Business Analytics
3.1 Data Layer
Thislayerconsistsofstructureddata,semi-structuredand
unstructured based data. Unstructured data is stored using
NoSQL databases such as MongoDB and Cassandra. Example
of unstructured data and semi-structured data are data
streamed from the web world, social media domain, IoT
sensors and other operational systems. We can use software
tools such as Hive, HBase, Storm and Spark in this layer.
3.2 Analytics Layer
In this layer, we can implement the dynamic data
analytics and then deploy the real time values. This layer has
building a model developing environment and it keeps on
modifying the local data in regular interval. Layer is
responsible for improving the performance of the analytical
engine.
3.3 Integration Layer
End user applications and analytical engine are being
integrated in this layer. This layer includes rules engine and
an API that is being used for dynamic data analytics.
3.4 Decision Layer
This layer includes interface applications for end user
such as mobile app or desktop applications for interactive
Decision
Layer
Business
Intelligence
Web /Mobile
based
Application
Application
Software
Data
Layer
Other Data
Sources
Big Data
Application
Software
Data
WarehouseIntegration
Layer
Revo DeployR Web Server
Business
Rules
Engine
Analytics
Layer
Local Data Mart
Intelligence
Revolution R Enterprise
Development Enviren
Revolution R Enterprise
Production Environment
Data Governance Layer
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 5096
web applications and business intelligence software. In this
layer, people interact with the system to perform some
decision based task.
3.4 Data Governance Layer
Data governance is a collection of processes, policies,
roles, and standards that ensure the effective and efficient
use of information which enablestheorganizationtoachieve
its goals. This layer includes all key data governance
functions, such as business semantics glossary, reference
data accelerator and data stewardship manager.
All five layer described aboveareassociated withdifferent
sets of end users in real time and enables a crucial phase of
real time data analytics implementation.
4. BIG DATATECHNOLOGIESFORDATAANALYTICS
Big data management includes the organization and
manipulation of huge volumes of structured data, semi-
structured data and unstructured data. Main goal of big data
management is to take care of three major factors which are
quality of high level data, availability of data for business
intelligence and big data analytics applications.
Various tools are available for big data management from
data acquisition, data storage to data visualization. In this
section we will describe major big data management tools
related to data analysis.
4.1 Hadoop
This is an open source platform for handling big data
and its analytics. Hadoop is user friendly and provides
flexible environment to work with different data sources.
Tasks such as gathering various sources of data or accessing
of data from a database in order to run process-oriented
machine learning process can be done using this platform.
This tool also provides differenttypesofapplicationssuchas
location based data from weather, traffic sensors and social
media data.
4.2 Map Reduce
This environment permits larger jobs implementation
scalability against group of server. Map Reduce
implementation has two major tasks:
(a) The Map Task: It converts input dataset is into a
different set of value pairs.
(b) The Reduce Task: It combines several outputs of the
Map task to form reduced tuples.
4.3 PIG
It is an analytical tool that attempts to make the Hadoop
closer to the developers and business users. PIG permits the
query execution over data that is storedona Hadoopinstead
of a SQL.
4.4 WibiData
This tool was developed for the enterprises to
personalize their customer experiences. It is combination of
web analytics with Hadoop. It works as a top layer over the
HBase and allows the websites to explore better processes
with their user data. It allows real time responses to user,
such as recommendations and data related to personalized
content, and decisions.
4.5 Hive
Hive permits predictable business applications to run
SQL queries against a Hadoop cluster. It provides SQL-like
bridge that was developed by Facebook and,thenithasbeen
made open source. Hive is a high level perception of the
Hadoop and it allows all to make queries against data stored
in a Hadoop storage medium easily.
4.6 Rapidminer
It provides an integrated platform for various things
such as machine learning, data/text mining and other data
analysis task such as predictive analytics and business
analytics. Rapidminer is used for both business and
commercial applications as well as for education, research,
training, application development, and rapid prototyping. It
supports all steps of the data mining process which includes
dataset preparation, validation, results visualization and
optimization.
5. BIG DATA CHALLENGES
There are many crucial challenges need to be focused
while handling of Big Data and its analytical process [6].
Some of those challenges are being discussed:
5.1 Storage Related:
As we know that the size of secondary storage device in
computing devices is in the range of Terabytes (TB). The
amount of data produced via internet is also of very large
amount and it is measured in terms of Exabyte (EB) So the
traditional relational databasemanagementsoftwaresuchas
Oracle and MySQL are not able to store or process such kind
of Big Data. To give the solution for such issue, databases
uses NoSQL based databases such as Cassandra and
MongoDB which can handle unstructured and semi-
structured data.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 5097
5.2 Data Life Cycle Management
This process decides the selection of data for storing
purpose which is used for the analytical processes.Thereare
many challenges and one of them is that the existing storage
system is not able to support such huge amount of data.
Therefore, an effective model is needed that makes the life
cycle management system better.
5.3 Data representation
Main goal of data representation is to make data more
important for data analytics and user analysis. Many
datasets have definite levels of heterogeneity in terms of
their structure, semantics, type, organization, granularity
and accessibility. Improper data representation technique
may reduce the value of the data originality and even
disturbs effective data analysis process [23]. Henceeffective
data representation is highly required for easy analysis
process.
5.4 Redundancy Reduction and Data Compression
Redundancy is one of the major problems in the field of
database system and analysis. Redundancy reduction and
data compression are operational methodfordecreasing the
cost of the system by reducing the data redundancyanddata
compression.
5.5 Analysis
Big data is generated from various types of online
activities and transactions which vary in structure and the
volume. Data Analysis is very difficult in case of high volume
data. To handle this situation, a special scalable architecture
is used to process the data in a distributed environment.
Data are distributed into fragmentsandhandledina number
of computers systems in the network and then processed
data is combined manner.
5.6 Reporting
Displaying the statistical data in the form of values is
known as Reporting. In case of high volume of data
traditional reporting tools and methods becomechallenging
to understand and implement. In these situations the
statistical reports must be represented in such a manner so
that can be easily understood.
5.7 Data Confidentiality
Data confidentiality is another big issue for big data
because the service providers and ownersofthedata are not
able to maintain and analyze such huge datasets in effective
manner. In this situation they depend on professionals or
sometimes third-party tools for the analysis of datasets,
which may cause potential safety risks. So, data
confidentiality is important issue for the researchers.
6. CONCLUSION
During the composition of this survey paper, various
concepts of big data including the concepts of big data
analytics, big data analytics techniques, data visualization
and big data analysis algorithm have been studied. The
literature survey also gives the overview of the possible
research opportunities in big data environment. Some of
them are as follows:
(i) Applying scheduling methods for handling big data
computation in cloud based platform
(ii) Issues related to data privacy and data security in big
data environment.
(iii) Reduction of data redundancyanddata compressionin
big data environment.
(iv) Handling independent and identically distributed
variables in big data
REFERENCES
[1] H.V. Jagadish, D. Agarwal, P.Bernstein, Challengesand
Opportunities in Big Data, The Community Research
Association, 2015.
[2] K. Krishnan, Data warehousing in the age of big data,
in: The Morgan Kaufmann Series on Business
Intelligence, Elsevier Science, 2013.
[3] Vallabh Dhoot, Shubham Gawande, Pooja Kanawade
and Akanksha Lekhwani, Efficient Dimensionality
Reduction for Big Data Using Clustering Technique,
Imperial Journal of Interdisciplinary Research (IJIR),
Vol-2, Issue-5, 2016, ISSN: 2454-1362 3.
[4] Gantz J, Reinsel D, Extracting value from chaos.IDC
iView, 2011, pp 1–12
[5] Cheikh Kacfah Emani, Nadine Cullot, Christophe
Nicolle, Understandable Big Data: A survey, Mobile
New Applications 2014, 171-209.
[6] Chen H, Chiang RHL, Storey VC. Business intelligence
and analytics: from big data to big impact. MIS
Quart.2012; 36(4):1165–88.
[7] K. Krishnan, Data warehousing in the age of big data,
in the Morgan Kaufmano series on Business
Intelligence, Elsevier Science, 2013.
[8] Kitchin R. The real-time city? Big data and smart
urbanism. Geo J. 2014, 79(1), pp: 1–14.
[9] Katrina Sin and Loganathan Muthu,Applicationsofbig
data in education data mining and learninganalytics –
A literature Review,ICTACTJournal onsoftcomputing
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 5098
special issue on soft computing models for big data,
July 2015, Vol:05, Iss: 04, pp: 1035-1049
[10] Mayer-Schonberger V, Cukier K, Big data:a revolution
that will transform how we live, work, and think.
Boston: Houghton Mifflin Harcourt; 2013.
[11] Cheikh Kacfah Emani, Nadine Cullot, Christophe
Nicolle, UnderstandableBigData:ASurvey,Computer
Science Review, 2015, Vol: 17, pp: 71-80
[12] K. Davis, D. Patterson, “Ethics of Big Data: Balancing
Risk and innovation”, O’Reilly Media, 2012.
[13] Mike Barlow, Real-Time Big Data Analytics: Emerging
Architecture, ISBN: 978-1-449-36421-2, 2013
[14] Shuhui Jiang, Xueming Qian, Tao Mei, Yun Fu,
Personalized Travel Sequence recommendation on
Multisource Big Social Media,2016,IEEETransactions
on Big Data,Vol.2, Issue:1 2.
[15] http://www.techrepublic.com/blog/big-data-
analytics/10- emerging-technologies-for-big-data

More Related Content

What's hot

Data Warehouse: A Primer
Data Warehouse: A PrimerData Warehouse: A Primer
Data Warehouse: A Primer
IJRTEMJOURNAL
 
IRJET- Scope of Big Data Analytics in Industrial Domain
IRJET- Scope of Big Data Analytics in Industrial DomainIRJET- Scope of Big Data Analytics in Industrial Domain
IRJET- Scope of Big Data Analytics in Industrial Domain
IRJET Journal
 
6. ijece guideforauthors 2012_2 eidt sat
6. ijece guideforauthors 2012_2 eidt sat6. ijece guideforauthors 2012_2 eidt sat
6. ijece guideforauthors 2012_2 eidt sat
IAESIJEECS
 
Data Quality - Standards and Application to Open Data
Data Quality - Standards and Application to Open DataData Quality - Standards and Application to Open Data
Data Quality - Standards and Application to Open Data
Marco Torchiano
 
IRJET- Big Data Management and Growth Enhancement
IRJET- Big Data Management and Growth EnhancementIRJET- Big Data Management and Growth Enhancement
IRJET- Big Data Management and Growth Enhancement
IRJET Journal
 
A Study on Big Data Privacy Protection Models using Data Masking Methods
A Study on Big Data Privacy Protection Models using Data Masking Methods A Study on Big Data Privacy Protection Models using Data Masking Methods
A Study on Big Data Privacy Protection Models using Data Masking Methods
IJECEIAES
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
Sana Alvi
 
Introduction to visualizing Big Data
Introduction to visualizing Big DataIntroduction to visualizing Big Data
Introduction to visualizing Big Data
Dawit Nida
 
Hadoop Overview
Hadoop OverviewHadoop Overview
Hadoop Overview
Gregg Barrett
 
A CASE STUDY OF INNOVATION OF AN INFORMATION COMMUNICATION SYSTEM AND UPGRADE...
A CASE STUDY OF INNOVATION OF AN INFORMATION COMMUNICATION SYSTEM AND UPGRADE...A CASE STUDY OF INNOVATION OF AN INFORMATION COMMUNICATION SYSTEM AND UPGRADE...
A CASE STUDY OF INNOVATION OF AN INFORMATION COMMUNICATION SYSTEM AND UPGRADE...
ijaia
 
A frame work for data warehouse for nigerian police force
A frame work for data warehouse for nigerian police forceA frame work for data warehouse for nigerian police force
A frame work for data warehouse for nigerian police force
Alexander Decker
 
IRJET- Advances in Data Mining: Healthcare Applications
IRJET- Advances in Data Mining: Healthcare ApplicationsIRJET- Advances in Data Mining: Healthcare Applications
IRJET- Advances in Data Mining: Healthcare Applications
IRJET Journal
 
Full Paper: Analytics: Key to go from generating big data to deriving busines...
Full Paper: Analytics: Key to go from generating big data to deriving busines...Full Paper: Analytics: Key to go from generating big data to deriving busines...
Full Paper: Analytics: Key to go from generating big data to deriving busines...
Piyush Malik
 
An Comprehensive Study of Big Data Environment and its Challenges.
An Comprehensive Study of Big Data Environment and its Challenges.An Comprehensive Study of Big Data Environment and its Challenges.
An Comprehensive Study of Big Data Environment and its Challenges.
ijceronline
 
Electronics health records and business analytics a cloud based approach
Electronics health records and business analytics a cloud based approachElectronics health records and business analytics a cloud based approach
Electronics health records and business analytics a cloud based approach
IAEME Publication
 
Big Data Analytics: Recent Achievements and New Challenges
Big Data Analytics: Recent Achievements and New ChallengesBig Data Analytics: Recent Achievements and New Challenges
Big Data Analytics: Recent Achievements and New Challenges
Editor IJCATR
 
Big Data: Opportunities, Strategy and Challenges
Big Data: Opportunities, Strategy and ChallengesBig Data: Opportunities, Strategy and Challenges
Big Data: Opportunities, Strategy and Challenges
Gregg Barrett
 
Healthcare intel it 443835 443835
Healthcare intel it 443835 443835Healthcare intel it 443835 443835
Healthcare intel it 443835 443835Liberteks
 
BRIDGING DATA SILOS USING BIG DATA INTEGRATION
BRIDGING DATA SILOS USING BIG DATA INTEGRATIONBRIDGING DATA SILOS USING BIG DATA INTEGRATION
BRIDGING DATA SILOS USING BIG DATA INTEGRATION
ijmnct
 
The effect of technology-organization-environment on adoption decision of bi...
The effect of technology-organization-environment on  adoption decision of bi...The effect of technology-organization-environment on  adoption decision of bi...
The effect of technology-organization-environment on adoption decision of bi...
IJECEIAES
 

What's hot (20)

Data Warehouse: A Primer
Data Warehouse: A PrimerData Warehouse: A Primer
Data Warehouse: A Primer
 
IRJET- Scope of Big Data Analytics in Industrial Domain
IRJET- Scope of Big Data Analytics in Industrial DomainIRJET- Scope of Big Data Analytics in Industrial Domain
IRJET- Scope of Big Data Analytics in Industrial Domain
 
6. ijece guideforauthors 2012_2 eidt sat
6. ijece guideforauthors 2012_2 eidt sat6. ijece guideforauthors 2012_2 eidt sat
6. ijece guideforauthors 2012_2 eidt sat
 
Data Quality - Standards and Application to Open Data
Data Quality - Standards and Application to Open DataData Quality - Standards and Application to Open Data
Data Quality - Standards and Application to Open Data
 
IRJET- Big Data Management and Growth Enhancement
IRJET- Big Data Management and Growth EnhancementIRJET- Big Data Management and Growth Enhancement
IRJET- Big Data Management and Growth Enhancement
 
A Study on Big Data Privacy Protection Models using Data Masking Methods
A Study on Big Data Privacy Protection Models using Data Masking Methods A Study on Big Data Privacy Protection Models using Data Masking Methods
A Study on Big Data Privacy Protection Models using Data Masking Methods
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
Introduction to visualizing Big Data
Introduction to visualizing Big DataIntroduction to visualizing Big Data
Introduction to visualizing Big Data
 
Hadoop Overview
Hadoop OverviewHadoop Overview
Hadoop Overview
 
A CASE STUDY OF INNOVATION OF AN INFORMATION COMMUNICATION SYSTEM AND UPGRADE...
A CASE STUDY OF INNOVATION OF AN INFORMATION COMMUNICATION SYSTEM AND UPGRADE...A CASE STUDY OF INNOVATION OF AN INFORMATION COMMUNICATION SYSTEM AND UPGRADE...
A CASE STUDY OF INNOVATION OF AN INFORMATION COMMUNICATION SYSTEM AND UPGRADE...
 
A frame work for data warehouse for nigerian police force
A frame work for data warehouse for nigerian police forceA frame work for data warehouse for nigerian police force
A frame work for data warehouse for nigerian police force
 
IRJET- Advances in Data Mining: Healthcare Applications
IRJET- Advances in Data Mining: Healthcare ApplicationsIRJET- Advances in Data Mining: Healthcare Applications
IRJET- Advances in Data Mining: Healthcare Applications
 
Full Paper: Analytics: Key to go from generating big data to deriving busines...
Full Paper: Analytics: Key to go from generating big data to deriving busines...Full Paper: Analytics: Key to go from generating big data to deriving busines...
Full Paper: Analytics: Key to go from generating big data to deriving busines...
 
An Comprehensive Study of Big Data Environment and its Challenges.
An Comprehensive Study of Big Data Environment and its Challenges.An Comprehensive Study of Big Data Environment and its Challenges.
An Comprehensive Study of Big Data Environment and its Challenges.
 
Electronics health records and business analytics a cloud based approach
Electronics health records and business analytics a cloud based approachElectronics health records and business analytics a cloud based approach
Electronics health records and business analytics a cloud based approach
 
Big Data Analytics: Recent Achievements and New Challenges
Big Data Analytics: Recent Achievements and New ChallengesBig Data Analytics: Recent Achievements and New Challenges
Big Data Analytics: Recent Achievements and New Challenges
 
Big Data: Opportunities, Strategy and Challenges
Big Data: Opportunities, Strategy and ChallengesBig Data: Opportunities, Strategy and Challenges
Big Data: Opportunities, Strategy and Challenges
 
Healthcare intel it 443835 443835
Healthcare intel it 443835 443835Healthcare intel it 443835 443835
Healthcare intel it 443835 443835
 
BRIDGING DATA SILOS USING BIG DATA INTEGRATION
BRIDGING DATA SILOS USING BIG DATA INTEGRATIONBRIDGING DATA SILOS USING BIG DATA INTEGRATION
BRIDGING DATA SILOS USING BIG DATA INTEGRATION
 
The effect of technology-organization-environment on adoption decision of bi...
The effect of technology-organization-environment on  adoption decision of bi...The effect of technology-organization-environment on  adoption decision of bi...
The effect of technology-organization-environment on adoption decision of bi...
 

Similar to IRJET- Analysis of Big Data Technology and its Challenges

CASE STUDY ON METHODS AND TOOLS FOR THE BIG DATA ANALYSIS
CASE STUDY ON METHODS AND TOOLS FOR THE BIG DATA ANALYSISCASE STUDY ON METHODS AND TOOLS FOR THE BIG DATA ANALYSIS
CASE STUDY ON METHODS AND TOOLS FOR THE BIG DATA ANALYSIS
IRJET Journal
 
Big data analytics in Business Management and Businesss Intelligence: A Lietr...
Big data analytics in Business Management and Businesss Intelligence: A Lietr...Big data analytics in Business Management and Businesss Intelligence: A Lietr...
Big data analytics in Business Management and Businesss Intelligence: A Lietr...
IRJET Journal
 
IRJET- Big Data: A Study
IRJET-  	  Big Data: A StudyIRJET-  	  Big Data: A Study
IRJET- Big Data: A Study
IRJET Journal
 
The Comparison of Big Data Strategies in Corporate Environment
The Comparison of Big Data Strategies in Corporate EnvironmentThe Comparison of Big Data Strategies in Corporate Environment
The Comparison of Big Data Strategies in Corporate Environment
IRJET Journal
 
Analysis of Big Data
Analysis of Big DataAnalysis of Big Data
Analysis of Big Data
IRJET Journal
 
An Overview of Data Lake
An Overview of Data LakeAn Overview of Data Lake
An Overview of Data Lake
IRJET Journal
 
Comparing and analyzing various method of data integration in big data
Comparing and analyzing various method of data integration in big dataComparing and analyzing various method of data integration in big data
Comparing and analyzing various method of data integration in big data
IRJET Journal
 
IRJET- A Study on Data Mining in Software
IRJET- A Study on Data Mining in SoftwareIRJET- A Study on Data Mining in Software
IRJET- A Study on Data Mining in Software
IRJET Journal
 
IRJET - Big Data Analysis its Challenges
IRJET - Big Data Analysis its ChallengesIRJET - Big Data Analysis its Challenges
IRJET - Big Data Analysis its Challenges
IRJET Journal
 
Complete-SRS.doc
Complete-SRS.docComplete-SRS.doc
Complete-SRS.doc
jadhavpravin920
 
IRJET-Unraveling the Data Structures of Big data, the HDFS Architecture and I...
IRJET-Unraveling the Data Structures of Big data, the HDFS Architecture and I...IRJET-Unraveling the Data Structures of Big data, the HDFS Architecture and I...
IRJET-Unraveling the Data Structures of Big data, the HDFS Architecture and I...
IRJET Journal
 
Big Data: Review, Classification and Analysis Survey
Big Data: Review, Classification and Analysis  SurveyBig Data: Review, Classification and Analysis  Survey
Big Data: Review, Classification and Analysis Survey
AM Publications,India
 
Big data is a broad term for data sets so large or complex that tr.docx
Big data is a broad term for data sets so large or complex that tr.docxBig data is a broad term for data sets so large or complex that tr.docx
Big data is a broad term for data sets so large or complex that tr.docx
hartrobert670
 
Data Science: A Revolution of Data
Data Science: A Revolution of DataData Science: A Revolution of Data
Data Science: A Revolution of Data
IRJET Journal
 
Sameer Kumar Das International Conference Paper 53
Sameer Kumar Das International Conference Paper 53Sameer Kumar Das International Conference Paper 53
Sameer Kumar Das International Conference Paper 53Mr.Sameer Kumar Das
 
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET Journal
 
Encroachment in Data Processing using Big Data Technology
Encroachment in Data Processing using Big Data TechnologyEncroachment in Data Processing using Big Data Technology
Encroachment in Data Processing using Big Data Technology
MangaiK4
 
IRJET- A Scenario on Big Data
IRJET- A Scenario on Big DataIRJET- A Scenario on Big Data
IRJET- A Scenario on Big Data
IRJET Journal
 
Smart Health Guide App
Smart Health Guide AppSmart Health Guide App
Smart Health Guide App
IRJET Journal
 
Information economics and big data
Information economics and big dataInformation economics and big data
Information economics and big data
Mark Albala
 

Similar to IRJET- Analysis of Big Data Technology and its Challenges (20)

CASE STUDY ON METHODS AND TOOLS FOR THE BIG DATA ANALYSIS
CASE STUDY ON METHODS AND TOOLS FOR THE BIG DATA ANALYSISCASE STUDY ON METHODS AND TOOLS FOR THE BIG DATA ANALYSIS
CASE STUDY ON METHODS AND TOOLS FOR THE BIG DATA ANALYSIS
 
Big data analytics in Business Management and Businesss Intelligence: A Lietr...
Big data analytics in Business Management and Businesss Intelligence: A Lietr...Big data analytics in Business Management and Businesss Intelligence: A Lietr...
Big data analytics in Business Management and Businesss Intelligence: A Lietr...
 
IRJET- Big Data: A Study
IRJET-  	  Big Data: A StudyIRJET-  	  Big Data: A Study
IRJET- Big Data: A Study
 
The Comparison of Big Data Strategies in Corporate Environment
The Comparison of Big Data Strategies in Corporate EnvironmentThe Comparison of Big Data Strategies in Corporate Environment
The Comparison of Big Data Strategies in Corporate Environment
 
Analysis of Big Data
Analysis of Big DataAnalysis of Big Data
Analysis of Big Data
 
An Overview of Data Lake
An Overview of Data LakeAn Overview of Data Lake
An Overview of Data Lake
 
Comparing and analyzing various method of data integration in big data
Comparing and analyzing various method of data integration in big dataComparing and analyzing various method of data integration in big data
Comparing and analyzing various method of data integration in big data
 
IRJET- A Study on Data Mining in Software
IRJET- A Study on Data Mining in SoftwareIRJET- A Study on Data Mining in Software
IRJET- A Study on Data Mining in Software
 
IRJET - Big Data Analysis its Challenges
IRJET - Big Data Analysis its ChallengesIRJET - Big Data Analysis its Challenges
IRJET - Big Data Analysis its Challenges
 
Complete-SRS.doc
Complete-SRS.docComplete-SRS.doc
Complete-SRS.doc
 
IRJET-Unraveling the Data Structures of Big data, the HDFS Architecture and I...
IRJET-Unraveling the Data Structures of Big data, the HDFS Architecture and I...IRJET-Unraveling the Data Structures of Big data, the HDFS Architecture and I...
IRJET-Unraveling the Data Structures of Big data, the HDFS Architecture and I...
 
Big Data: Review, Classification and Analysis Survey
Big Data: Review, Classification and Analysis  SurveyBig Data: Review, Classification and Analysis  Survey
Big Data: Review, Classification and Analysis Survey
 
Big data is a broad term for data sets so large or complex that tr.docx
Big data is a broad term for data sets so large or complex that tr.docxBig data is a broad term for data sets so large or complex that tr.docx
Big data is a broad term for data sets so large or complex that tr.docx
 
Data Science: A Revolution of Data
Data Science: A Revolution of DataData Science: A Revolution of Data
Data Science: A Revolution of Data
 
Sameer Kumar Das International Conference Paper 53
Sameer Kumar Das International Conference Paper 53Sameer Kumar Das International Conference Paper 53
Sameer Kumar Das International Conference Paper 53
 
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
 
Encroachment in Data Processing using Big Data Technology
Encroachment in Data Processing using Big Data TechnologyEncroachment in Data Processing using Big Data Technology
Encroachment in Data Processing using Big Data Technology
 
IRJET- A Scenario on Big Data
IRJET- A Scenario on Big DataIRJET- A Scenario on Big Data
IRJET- A Scenario on Big Data
 
Smart Health Guide App
Smart Health Guide AppSmart Health Guide App
Smart Health Guide App
 
Information economics and big data
Information economics and big dataInformation economics and big data
Information economics and big data
 

More from IRJET Journal

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
IRJET Journal
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
IRJET Journal
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
IRJET Journal
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
IRJET Journal
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
IRJET Journal
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
IRJET Journal
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
IRJET Journal
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
IRJET Journal
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
IRJET Journal
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
IRJET Journal
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
IRJET Journal
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
IRJET Journal
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
IRJET Journal
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
IRJET Journal
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
IRJET Journal
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
IRJET Journal
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
IRJET Journal
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
IRJET Journal
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
IRJET Journal
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
IRJET Journal
 

More from IRJET Journal (20)

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
 

Recently uploaded

CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
R&R Consult
 
WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234
AafreenAbuthahir2
 
MCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdfMCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdf
Osamah Alsalih
 
Gen AI Study Jams _ For the GDSC Leads in India.pdf
Gen AI Study Jams _ For the GDSC Leads in India.pdfGen AI Study Jams _ For the GDSC Leads in India.pdf
Gen AI Study Jams _ For the GDSC Leads in India.pdf
gdsczhcet
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
Kamal Acharya
 
Courier management system project report.pdf
Courier management system project report.pdfCourier management system project report.pdf
Courier management system project report.pdf
Kamal Acharya
 
addressing modes in computer architecture
addressing modes  in computer architectureaddressing modes  in computer architecture
addressing modes in computer architecture
ShahidSultan24
 
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdfWater Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation & Control
 
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
H.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdfH.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdf
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
MLILAB
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
Kamal Acharya
 
ethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.pptethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.ppt
Jayaprasanna4
 
Forklift Classes Overview by Intella Parts
Forklift Classes Overview by Intella PartsForklift Classes Overview by Intella Parts
Forklift Classes Overview by Intella Parts
Intella Parts
 
weather web application report.pdf
weather web application report.pdfweather web application report.pdf
weather web application report.pdf
Pratik Pawar
 
Automobile Management System Project Report.pdf
Automobile Management System Project Report.pdfAutomobile Management System Project Report.pdf
Automobile Management System Project Report.pdf
Kamal Acharya
 
ethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.pptethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.ppt
Jayaprasanna4
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
Kamal Acharya
 
Democratizing Fuzzing at Scale by Abhishek Arya
Democratizing Fuzzing at Scale by Abhishek AryaDemocratizing Fuzzing at Scale by Abhishek Arya
Democratizing Fuzzing at Scale by Abhishek Arya
abh.arya
 
power quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptxpower quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptx
ViniHema
 
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
MdTanvirMahtab2
 
Vaccine management system project report documentation..pdf
Vaccine management system project report documentation..pdfVaccine management system project report documentation..pdf
Vaccine management system project report documentation..pdf
Kamal Acharya
 

Recently uploaded (20)

CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
 
WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234
 
MCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdfMCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdf
 
Gen AI Study Jams _ For the GDSC Leads in India.pdf
Gen AI Study Jams _ For the GDSC Leads in India.pdfGen AI Study Jams _ For the GDSC Leads in India.pdf
Gen AI Study Jams _ For the GDSC Leads in India.pdf
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
 
Courier management system project report.pdf
Courier management system project report.pdfCourier management system project report.pdf
Courier management system project report.pdf
 
addressing modes in computer architecture
addressing modes  in computer architectureaddressing modes  in computer architecture
addressing modes in computer architecture
 
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdfWater Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdf
 
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
H.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdfH.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdf
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
 
ethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.pptethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.ppt
 
Forklift Classes Overview by Intella Parts
Forklift Classes Overview by Intella PartsForklift Classes Overview by Intella Parts
Forklift Classes Overview by Intella Parts
 
weather web application report.pdf
weather web application report.pdfweather web application report.pdf
weather web application report.pdf
 
Automobile Management System Project Report.pdf
Automobile Management System Project Report.pdfAutomobile Management System Project Report.pdf
Automobile Management System Project Report.pdf
 
ethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.pptethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.ppt
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
 
Democratizing Fuzzing at Scale by Abhishek Arya
Democratizing Fuzzing at Scale by Abhishek AryaDemocratizing Fuzzing at Scale by Abhishek Arya
Democratizing Fuzzing at Scale by Abhishek Arya
 
power quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptxpower quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptx
 
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
 
Vaccine management system project report documentation..pdf
Vaccine management system project report documentation..pdfVaccine management system project report documentation..pdf
Vaccine management system project report documentation..pdf
 

IRJET- Analysis of Big Data Technology and its Challenges

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 5094 Analysis of Big Data Technology and its Challenges Dr Ajay Pratap Assistant Professor, AIIT, Amity University Uttar Pradesh, Lucknow, UP, India ---------------------------------------------------------------------***---------------------------------------------------------------------- Abstract - In recent decades, the internet application and communication have seen a lot of growth and reputation in the Information Technology sector. These internet based applications and related communication are generating the large size and different variety of difficult multifaceted structure data regularly which is known as big data. This is the era of massive automatic data collection, systematically obtaining various measurements without knowing relevancy of data. For instance, E-commerce transactions may include the data of online buying, selling, investing and exploration based activities which has very complex structure and high in dimensional as well. The traditional data storage and management techniques are not adequate to store and analyses these high volume data. In this survey paper we are discussing the big data architecture for data analysis. Paper also provides the overview of the big data challenges and various technologies to handle the Big Data. Key Words: Data Analysis, Big Data, Architecture, High Volume Data, Unstructured Data, Semi-Structured Data 1. INTRODUCTION In today’s era people and system are using the web based application and it causes generation of large size of data in an exponential manner. Exabyte(EB)andPetabytes(PB) are the measuring units for the size of data. This amount of data growth is due to advancements in the fields of communication, digital sensors, computations, and storage of data for business analytics. The Big Data term had been devised by a researcher, Roger Magoulas. In last few decades, data has been grown in massive order in various fields and moreover multiple types of data are also emerging. As per the statists of International Data Corporation (IDC), the overall data volume created in the world was 1.8ZB for year 2011. This volume is approx. nine times larger within next five years. Data or Information will be the fuel for 21st century as all possible domains such as health care, marketing, disease control & prevention, smart city and business intelligence applications are going to generate and use a large amount of data and information. If we compare big data concept with other traditional datasets and its processes, big data includes semistructured and unstructured data. Big data technologyanditsanalytical processes are used to provide the descriptionaboutmassive datasets generated from various real time systems. Big data is also useful for getting details about new prospects for determining new values, for in-depth understanding of the hidden values, and also some real time situations such as organizing and manipulating big datasets exponentially. The volume of data and information is growing rapidly and also some challenging issues. Big data visualization process is another vital problem in big data analytics. Improvement in the field of Information Technology has generated more and more data on daily basis that causes the problems of gathering andintegratingdata fromdistributeddata sources. Some other upcoming technologiessuchascloudcomputing, Internet of Things (IoT) and Data Centers also promote the growth of data. Cloud computing technology provides the standard for storing and retrievingthedata fromthe bigdata assets. IoT technology is based on use of sensors to collect and transmit the data to be stored and processed in the cloud storage. 2. BIG DATA ANALYTICS Knowledge Discovery in Databases (KDD) refers to the broad process of finding knowledge in data, and emphasizes the "high-level" application of particular data mining methods. Fundamentally, data processing is seen as the collecting, processing, and management of data for producing new information for end users [8]. Big Data analysis is done in four steps which are Acquisition, Assembly, Analyze and Action which are known as 4 A’s of big data analysis. Fig-1: Steps of Data Analysis 2.1 Acquisition Process In Big Dataarchitecture,theacquisitioncomponenthasto obtain high speed datafromvariousdatasourceswhichdeals with variousaccesscontrolprotocolsalso.Insomeconditions of generation of dataareimportant,andsometimescapturing the metadata and storing them with the corresponding data is also important forfurther analysis. Itiswhereafiltercould be recognized to store only data which could be helpful or underdone data with a lesser degree of uncertainty [9]. 2.2 Assembly Process This is the second step of analysis where the data of structured or semi-structured category have to cleaned, brought into the computable mode. After this they are Acquisition Assembly Analyze Action Data Source Apply Privacy
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 5095 integrated and stored at desired location. This is a sort of extraction,transformationandloading(ETL)processdoneon data. At this point the architecture has to deal with various data formats and must be able to parse them and extract the actual information like named entities, relation between them, etc [9]. Complete cleaning of data in big data environment is not entirely guaranteed. 2.3 Analysis Process This is the point where we have run queries, perform modeling, and building new algorithms for new cases. Analyzing of data is donebyminingtechniquewhichrequires integrated, cleaned, trustworthy data. 2.4 Action Process In this step we interpret the results obtained in Analysis step to tale some valuable decisions for the organization or enterprise. At the same time it is also very important to understand and verify outputs obtained at user’s end. 2.5 Privacy Privacy of big data and results obtained by analysis process are very important issueandnotputtingmucheffort on privacy may cause many serious problems at theanalysis of data and sometimes at the creation of data also. To overcome with these problems the big data architecture must have following properties:  Big data infrastructure be linear scalable  Must be able to handle high throughput multi formatted data  Auto recoverable  Fault tolerant  Higher degree of parallelism and distributed data processing 3. BIG DATA ANALYTICS INFRASTRUCTURE Fig-2 depicts the overall layer based infrastructure of big data analytics. Various layers present in the big data analytics are as follows:  Data Layers  Analytics Layer  Integration Layer  Decision Layer  Data Governance Layer Fig-2: Big Data Infrastructure for Business Analytics 3.1 Data Layer Thislayerconsistsofstructureddata,semi-structuredand unstructured based data. Unstructured data is stored using NoSQL databases such as MongoDB and Cassandra. Example of unstructured data and semi-structured data are data streamed from the web world, social media domain, IoT sensors and other operational systems. We can use software tools such as Hive, HBase, Storm and Spark in this layer. 3.2 Analytics Layer In this layer, we can implement the dynamic data analytics and then deploy the real time values. This layer has building a model developing environment and it keeps on modifying the local data in regular interval. Layer is responsible for improving the performance of the analytical engine. 3.3 Integration Layer End user applications and analytical engine are being integrated in this layer. This layer includes rules engine and an API that is being used for dynamic data analytics. 3.4 Decision Layer This layer includes interface applications for end user such as mobile app or desktop applications for interactive Decision Layer Business Intelligence Web /Mobile based Application Application Software Data Layer Other Data Sources Big Data Application Software Data WarehouseIntegration Layer Revo DeployR Web Server Business Rules Engine Analytics Layer Local Data Mart Intelligence Revolution R Enterprise Development Enviren Revolution R Enterprise Production Environment Data Governance Layer
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 5096 web applications and business intelligence software. In this layer, people interact with the system to perform some decision based task. 3.4 Data Governance Layer Data governance is a collection of processes, policies, roles, and standards that ensure the effective and efficient use of information which enablestheorganizationtoachieve its goals. This layer includes all key data governance functions, such as business semantics glossary, reference data accelerator and data stewardship manager. All five layer described aboveareassociated withdifferent sets of end users in real time and enables a crucial phase of real time data analytics implementation. 4. BIG DATATECHNOLOGIESFORDATAANALYTICS Big data management includes the organization and manipulation of huge volumes of structured data, semi- structured data and unstructured data. Main goal of big data management is to take care of three major factors which are quality of high level data, availability of data for business intelligence and big data analytics applications. Various tools are available for big data management from data acquisition, data storage to data visualization. In this section we will describe major big data management tools related to data analysis. 4.1 Hadoop This is an open source platform for handling big data and its analytics. Hadoop is user friendly and provides flexible environment to work with different data sources. Tasks such as gathering various sources of data or accessing of data from a database in order to run process-oriented machine learning process can be done using this platform. This tool also provides differenttypesofapplicationssuchas location based data from weather, traffic sensors and social media data. 4.2 Map Reduce This environment permits larger jobs implementation scalability against group of server. Map Reduce implementation has two major tasks: (a) The Map Task: It converts input dataset is into a different set of value pairs. (b) The Reduce Task: It combines several outputs of the Map task to form reduced tuples. 4.3 PIG It is an analytical tool that attempts to make the Hadoop closer to the developers and business users. PIG permits the query execution over data that is storedona Hadoopinstead of a SQL. 4.4 WibiData This tool was developed for the enterprises to personalize their customer experiences. It is combination of web analytics with Hadoop. It works as a top layer over the HBase and allows the websites to explore better processes with their user data. It allows real time responses to user, such as recommendations and data related to personalized content, and decisions. 4.5 Hive Hive permits predictable business applications to run SQL queries against a Hadoop cluster. It provides SQL-like bridge that was developed by Facebook and,thenithasbeen made open source. Hive is a high level perception of the Hadoop and it allows all to make queries against data stored in a Hadoop storage medium easily. 4.6 Rapidminer It provides an integrated platform for various things such as machine learning, data/text mining and other data analysis task such as predictive analytics and business analytics. Rapidminer is used for both business and commercial applications as well as for education, research, training, application development, and rapid prototyping. It supports all steps of the data mining process which includes dataset preparation, validation, results visualization and optimization. 5. BIG DATA CHALLENGES There are many crucial challenges need to be focused while handling of Big Data and its analytical process [6]. Some of those challenges are being discussed: 5.1 Storage Related: As we know that the size of secondary storage device in computing devices is in the range of Terabytes (TB). The amount of data produced via internet is also of very large amount and it is measured in terms of Exabyte (EB) So the traditional relational databasemanagementsoftwaresuchas Oracle and MySQL are not able to store or process such kind of Big Data. To give the solution for such issue, databases uses NoSQL based databases such as Cassandra and MongoDB which can handle unstructured and semi- structured data.
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 5097 5.2 Data Life Cycle Management This process decides the selection of data for storing purpose which is used for the analytical processes.Thereare many challenges and one of them is that the existing storage system is not able to support such huge amount of data. Therefore, an effective model is needed that makes the life cycle management system better. 5.3 Data representation Main goal of data representation is to make data more important for data analytics and user analysis. Many datasets have definite levels of heterogeneity in terms of their structure, semantics, type, organization, granularity and accessibility. Improper data representation technique may reduce the value of the data originality and even disturbs effective data analysis process [23]. Henceeffective data representation is highly required for easy analysis process. 5.4 Redundancy Reduction and Data Compression Redundancy is one of the major problems in the field of database system and analysis. Redundancy reduction and data compression are operational methodfordecreasing the cost of the system by reducing the data redundancyanddata compression. 5.5 Analysis Big data is generated from various types of online activities and transactions which vary in structure and the volume. Data Analysis is very difficult in case of high volume data. To handle this situation, a special scalable architecture is used to process the data in a distributed environment. Data are distributed into fragmentsandhandledina number of computers systems in the network and then processed data is combined manner. 5.6 Reporting Displaying the statistical data in the form of values is known as Reporting. In case of high volume of data traditional reporting tools and methods becomechallenging to understand and implement. In these situations the statistical reports must be represented in such a manner so that can be easily understood. 5.7 Data Confidentiality Data confidentiality is another big issue for big data because the service providers and ownersofthedata are not able to maintain and analyze such huge datasets in effective manner. In this situation they depend on professionals or sometimes third-party tools for the analysis of datasets, which may cause potential safety risks. So, data confidentiality is important issue for the researchers. 6. CONCLUSION During the composition of this survey paper, various concepts of big data including the concepts of big data analytics, big data analytics techniques, data visualization and big data analysis algorithm have been studied. The literature survey also gives the overview of the possible research opportunities in big data environment. Some of them are as follows: (i) Applying scheduling methods for handling big data computation in cloud based platform (ii) Issues related to data privacy and data security in big data environment. (iii) Reduction of data redundancyanddata compressionin big data environment. (iv) Handling independent and identically distributed variables in big data REFERENCES [1] H.V. Jagadish, D. Agarwal, P.Bernstein, Challengesand Opportunities in Big Data, The Community Research Association, 2015. [2] K. Krishnan, Data warehousing in the age of big data, in: The Morgan Kaufmann Series on Business Intelligence, Elsevier Science, 2013. [3] Vallabh Dhoot, Shubham Gawande, Pooja Kanawade and Akanksha Lekhwani, Efficient Dimensionality Reduction for Big Data Using Clustering Technique, Imperial Journal of Interdisciplinary Research (IJIR), Vol-2, Issue-5, 2016, ISSN: 2454-1362 3. [4] Gantz J, Reinsel D, Extracting value from chaos.IDC iView, 2011, pp 1–12 [5] Cheikh Kacfah Emani, Nadine Cullot, Christophe Nicolle, Understandable Big Data: A survey, Mobile New Applications 2014, 171-209. [6] Chen H, Chiang RHL, Storey VC. Business intelligence and analytics: from big data to big impact. MIS Quart.2012; 36(4):1165–88. [7] K. Krishnan, Data warehousing in the age of big data, in the Morgan Kaufmano series on Business Intelligence, Elsevier Science, 2013. [8] Kitchin R. The real-time city? Big data and smart urbanism. Geo J. 2014, 79(1), pp: 1–14. [9] Katrina Sin and Loganathan Muthu,Applicationsofbig data in education data mining and learninganalytics – A literature Review,ICTACTJournal onsoftcomputing
  • 5. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 5098 special issue on soft computing models for big data, July 2015, Vol:05, Iss: 04, pp: 1035-1049 [10] Mayer-Schonberger V, Cukier K, Big data:a revolution that will transform how we live, work, and think. Boston: Houghton Mifflin Harcourt; 2013. [11] Cheikh Kacfah Emani, Nadine Cullot, Christophe Nicolle, UnderstandableBigData:ASurvey,Computer Science Review, 2015, Vol: 17, pp: 71-80 [12] K. Davis, D. Patterson, “Ethics of Big Data: Balancing Risk and innovation”, O’Reilly Media, 2012. [13] Mike Barlow, Real-Time Big Data Analytics: Emerging Architecture, ISBN: 978-1-449-36421-2, 2013 [14] Shuhui Jiang, Xueming Qian, Tao Mei, Yun Fu, Personalized Travel Sequence recommendation on Multisource Big Social Media,2016,IEEETransactions on Big Data,Vol.2, Issue:1 2. [15] http://www.techrepublic.com/blog/big-data- analytics/10- emerging-technologies-for-big-data