SlideShare a Scribd company logo
1 of 43
Dr Bassam Hammo.
Presented by: Isra Zaitoun 8131797.
and Diala Alwedyan 8131807.
• What Is Big Data?
• Big Data recourse
• Big Data Important
• Big Data challenge
• Big Data Character
• Big Data Architecture
• Big Data Technology
• What is cloud computing?
• Clouding Model
• What is fundamental Service model
• what ClAaas (Cloud Analysis as a service)?
• what is data analytic?
• what’s scientific workflow systems means?
2
Big data refers to data sets
whose size is beyond the
ability of typical database
software tools to capture,
store, manage, and analyze.
McKinsey Global Institute (MGI) definition
3 2 min
4 1 min
 Soares classifies typical sources for ‘big data’
into 5 categories:
1. Web and social media
2. Machine-to machine
3. Big transaction data
4. Biometrics
5. Human-generated data
5 2 min
6 1 min
-Government
In 2012, the Obama administration announced the Big
Data Research and Development Initiative
-Private Sector
- Walmart handles more than 1 million customer
transactions every hour,
Science
- Large Synoptic Survey Telescope will generate
140 Terabyte of data every 5 days.
7 1 min
Volume
• Data
quantity
Velocity
• Data
Speed
Variety
• Data
Types
Veracity
• Messiness
8 2 min
Heterogeneity and
Incompleteness
Scale
Timeliness
9 2 min
10
 HADOOP
 Map resource
11 1 min
12
1 min
Cloud computing involves computing
over a network, where a program or
application may run on many
connected computers at the same
time.
It specifically refers to a computing
hardware machine or group of
computing hardware machines
commonly referred as a server
connected through a
communication.
13 1 min
Public Model
Private Model
Hybrid Model
14 1 min
A cloud is called a "public cloud" when the
services are rendered over a network that is
open for public use.
15 1 min
Private cloud is cloud infrastructure operated
solely for a single organization, whether
managed internally or by a third-party and
hosted internally or externally
16 1 min
Hybrid cloud is a composition of two or more
clouds (private, community or public) that
remain distinct entities but are bound
together, offering the benefits of multiple
deployment models
17 1 min
18 2 min
1- Scalable hardware.
2- platform required with Specific software tools.
3- Workflow management system to facilitate the
execution of a big data analytics.
19 2 min
 If you want to discover knowledge and make a
decision, you should make data analysis In
different data and applications.
20 2 min
1. Big data analytics is challenge depends on hardware
and software resources.
2. Lack analytic software to different applications.
3. It’s hard to estimate the resources to analysis or
workflow.
4. Management of data in the cloud.
5. Execute analytic workflows.
What is the solution?
The paper propose: Cloud Analysis
As a service which provide platform
in the cloud.
21 3 min
 The paper shows:
Taxonomy for analytic workflow management
systems
to show important features in systems.
22 2 min
 Data analytics provides decision support in
Financial by enabling complex computations to
generate knowledge, insights, and experimental
proofs.
 The complexity computation will done through data
analysis.
What is
Data analytics?
23 2 min
 Recently the data that needs to be analyzed is
Become big, therefore researcher design analytic
system to solve problem of big data.
 The analytic requires several revisions before they
can be used as final jobs.
Data analytics
24 2 min
What is the Applications for Big Data
Analytics?
FinanceTraffic Control
25 2 min
 Scientific workflow systems: is tool for many
application, enabling the execution of
complex analysis on distributed resources.
What is scientific workflow system????
Is analytic workflow management System
exist today, which execute a series of
computational or data manipulation steps or a
workflow, in a scientific application.
26 3 min
 Analytic Workflow:
Consist of sequence of data and integration tasks
or exploratory analytic jobs.
such as: definition and execution of machine
learning models.
 A task in a workflow can also be another workflow.
 A workflow management system is necessary for
efficient definition and execution of analytic
workflows. 27 3 min
 Software tools for analytic jobs will different
depending on:
1- the data type.
2- size.
3- domain.
4-business goals.
 The analytic tools to the data sources is important
especially for big data, to avoid data transfer time
and network cost. 28 2 min
 There are systems different depending on:
1- In their focus on data domain.
2- Workflow types.
3- Representations.
4- Execution.
5- Support for collaboration and visualization.
29
1- Taverna and XBaya support distributed execution of
workflows using 3rd party web and data services.
2- to analysis of large data, grid technology is used for
distributed mapping of workflows by WINGS, DAGMan and
Kepler.
3 min
 The most successful workflow are Taverna, XBaya.
 The arrows encodes the data dependences from one activity in the
workflow to another.
 Different systems use different approaches to encapsulating an
activity.
 In Taverna and Xbaya these are web services.
30 2 min
• As illustrated in Figure, an activity have multiple
inputs and produce multiple outputs and the job
workflow engine is to create an activity as soon as its
inputs are all available.
• Directed acyclic graph representation of a workflow.
Activit
y
31 2 min
32
 XBaya control structures for conditionals
and map reduce style iteration layered
on top of a data flow graph.
1 min
 Taxonomies: is classification of the analytic
workflow systems to identify the features of CLAaaS.
Workflow System
There are seven features of the taxonomy are:
2- Security
1- Structure
5- Visualization
3- Workflow Design
4- Representation
7- Execution
6- Collaboration
Workflow systems: are task
based, each task represents a
data processing or analysis by
software or workflow.
33 3 min
- The information flow can be:
1- Control
2- Data or
3- Hybrid of the two.
 Simple workflows have: sequential, parallel or type
selection structures.
 Complex workflows Include: sub-workflows or
iterative tasks.
Structure
Content
Control
Flow
Component
structure
Layout
Data Hybrid
Complexsimple
Parallel Sequential Selection
Loop Workflow
Service
Application
Data
Access
control
34 2 min
 For sensitive data, the encryption is used to analysis
encrypt data.
 The existing systems only have some sort of access
control.
 Before analysis sensitive data, Anonymization will
done.
 Data anonymization is converts clear text data into a non
readable text.
Security
Data Access control
35 3 min
 Many existing workflow systems such as WINGS,
Kepler, Triana, Vistrails and Xbaya provide graphical
workflow tools.
 Graphical workflows are convert to other
representations for storage and execution.
 Taverna provides a hierarchical workflow view where
 high level view can be expanded to see component
details.
Workflow
design
Method Assistance Verification
Graphical
Description Hybrid
36 3 min
 Users specify constraints in Resource Description
Framework (RDF) in WINGS, making it a hybrid
design method.
 The constraints allow verification of the
workflows.
37 2 min
Workflows can be represented graphically using:
1- Object-based Modeling Language (UML).
2- Graph-based DAX (Extensible Markup Language representation
of Directed Acyclic Graph).
3- Event-based BPMN (Business Process Model and Notation),
which are easier to construct for small workflows using GUI tools.
 High level scripting languages such as Ruby and Python are
used to automatically generate the low level complex
structures in workflows. 38 3 min
39 4 min
 Effective visualization can add big value to analytics
and can vary based on the resulting data types.
 Image data should have a good resolution unlike
charts or lists.
 By using Visualization models, the user can
predefined or defined intelligently by the system
based on the data types as: in Kepler and VisTrails.
40 3 min
 Collaboration is very important for correct
interpretation of the results and specification of
effective visualization models.
 The commercial analytic software provide
collaboration functionality, the open source workflow
systems have little or no support for online
collaboration.
 Results, data and workflows are shared and published
online, for example, in MyExperiment and BioMart.41 2 min
 Interdependent tasks where the output from one serves
as the input for another have to be executed sequentially.
 Independent tasks can take advantage of distributed
parallel execution to: maximize resource utilization and
minimize the execution time.
 Workflow execution using cloud resources is currently
being explored due to the scalability required for big
data. 42
 Markus Maier,”Towards a Big Data Reference,”1-Architecture
 Master thesis, 13th October 2013.
 Nrusimham Ammu, 2 Mohd Irfanuddin,” Big Data Challenges”, nternational Journal of
Advanced Trends in Computer Science and Engineering, Vol.2 , No.1, Pages : 613 - 615 (2013)
 Special Issue of ICACSE 2013 - Held on 7-8 January, 2013 in Lords Institute of Engineering and
Technology, Hyderabad
 https://www.youtube.com/watch?v=D4ZQxBPtyHg&hd=1
 http://en.wikipedia.org/wiki/Cloud_telephony
 http://en.wikipedia.org/wiki/Big_data
 http://d2i.indiana.edu/sites/default/files/1-s2.0-s1877050912001755-main.pdf
 https://www.youtube.com/watch?v=Ft6yz0SObIA
 https://www.youtube.com/watch?v=arVoQxjIxUU
 PROGRAMMING E-SCIENCE GATEWAYS – ResearchGate
 http://computer.howstuffworks.com/cloud-computing/cloud-computing.htm
 http://www.slideshare.net/leelashine/hadoop-29386155?qid=cb3ea67b-c33e-4e2b-
8f79-fdca7b6a1b63&v=default&b=&from_search=5
 http://en.wikipedia.org/wiki/Weka_(machine_learning)
 http://en.wikipedia.org/wiki/Data_mining
 http://en.wikipedia.org/wiki/Taxonomy
 http://siliconangle.com/blog/2013/04/03/big-data-traffic-jam-smarter-lights-
happy-drivers/
43

More Related Content

What's hot

A Comparative Study: Taxonomy of High Performance Computing (HPC)
A Comparative Study: Taxonomy of High Performance Computing (HPC) A Comparative Study: Taxonomy of High Performance Computing (HPC)
A Comparative Study: Taxonomy of High Performance Computing (HPC) IJECEIAES
 
Ieee transactions on 2018 network and service management
Ieee transactions on 2018 network and service managementIeee transactions on 2018 network and service management
Ieee transactions on 2018 network and service managementtsysglobalsolutions
 
IRJET- Providing In-Database Analytic Functionalities to Mysql : A Proposed S...
IRJET- Providing In-Database Analytic Functionalities to Mysql : A Proposed S...IRJET- Providing In-Database Analytic Functionalities to Mysql : A Proposed S...
IRJET- Providing In-Database Analytic Functionalities to Mysql : A Proposed S...IRJET Journal
 
Ensuring Distributed Accountability in the Cloud
Ensuring Distributed Accountability in the CloudEnsuring Distributed Accountability in the Cloud
Ensuring Distributed Accountability in the CloudSuraj Mehta
 
Centralized Data Verification Scheme for Encrypted Cloud Data Services
Centralized Data Verification Scheme for Encrypted Cloud Data ServicesCentralized Data Verification Scheme for Encrypted Cloud Data Services
Centralized Data Verification Scheme for Encrypted Cloud Data ServicesEditor IJMTER
 
Privacy Preserving Public Auditing and Data Integrity for Secure Cloud Storag...
Privacy Preserving Public Auditing and Data Integrity for Secure Cloud Storag...Privacy Preserving Public Auditing and Data Integrity for Secure Cloud Storag...
Privacy Preserving Public Auditing and Data Integrity for Secure Cloud Storag...INFOGAIN PUBLICATION
 
Distributed computing file
Distributed computing fileDistributed computing file
Distributed computing fileberasrujana
 
A Novel privacy preserving public auditing for shared data in cloud
A Novel privacy preserving public auditing for shared data in cloudA Novel privacy preserving public auditing for shared data in cloud
A Novel privacy preserving public auditing for shared data in cloudJAVVAJI VENKATA RAO
 
DISTRIBUTED AND BIG DATA STORAGE MANAGEMENT IN GRID COMPUTING
DISTRIBUTED AND BIG DATA STORAGE MANAGEMENT IN GRID COMPUTINGDISTRIBUTED AND BIG DATA STORAGE MANAGEMENT IN GRID COMPUTING
DISTRIBUTED AND BIG DATA STORAGE MANAGEMENT IN GRID COMPUTINGijgca
 
Using BIG DATA implementations onto Software Defined Networking
Using BIG DATA implementations onto Software Defined NetworkingUsing BIG DATA implementations onto Software Defined Networking
Using BIG DATA implementations onto Software Defined NetworkingIJCSIS Research Publications
 
Adaptive Delegation Authority Enhancement to Hasbe for Efficient Access Contr...
Adaptive Delegation Authority Enhancement to Hasbe for Efficient Access Contr...Adaptive Delegation Authority Enhancement to Hasbe for Efficient Access Contr...
Adaptive Delegation Authority Enhancement to Hasbe for Efficient Access Contr...ijsrd.com
 
DATABASE SYSTEMS PERFORMANCE EVALUATION FOR IOT APPLICATIONS
DATABASE SYSTEMS PERFORMANCE EVALUATION FOR IOT APPLICATIONSDATABASE SYSTEMS PERFORMANCE EVALUATION FOR IOT APPLICATIONS
DATABASE SYSTEMS PERFORMANCE EVALUATION FOR IOT APPLICATIONSijdms
 
Bt9002 grid computing 1
Bt9002 grid computing 1Bt9002 grid computing 1
Bt9002 grid computing 1Techglyphs
 
Applications of SOA and Web Services in Grid Computing
Applications of SOA and Web Services in Grid ComputingApplications of SOA and Web Services in Grid Computing
Applications of SOA and Web Services in Grid Computingyht4ever
 
CYBER INFRASTRUCTURE AS A SERVICE TO EMPOWER MULTIDISCIPLINARY, DATA-DRIVEN S...
CYBER INFRASTRUCTURE AS A SERVICE TO EMPOWER MULTIDISCIPLINARY, DATA-DRIVEN S...CYBER INFRASTRUCTURE AS A SERVICE TO EMPOWER MULTIDISCIPLINARY, DATA-DRIVEN S...
CYBER INFRASTRUCTURE AS A SERVICE TO EMPOWER MULTIDISCIPLINARY, DATA-DRIVEN S...ijcsit
 
Data migration system in heterogeneous database
Data migration system in heterogeneous databaseData migration system in heterogeneous database
Data migration system in heterogeneous databaseeSAT Publishing House
 

What's hot (20)

A Comparative Study: Taxonomy of High Performance Computing (HPC)
A Comparative Study: Taxonomy of High Performance Computing (HPC) A Comparative Study: Taxonomy of High Performance Computing (HPC)
A Comparative Study: Taxonomy of High Performance Computing (HPC)
 
Ieee transactions on 2018 network and service management
Ieee transactions on 2018 network and service managementIeee transactions on 2018 network and service management
Ieee transactions on 2018 network and service management
 
IRJET- Providing In-Database Analytic Functionalities to Mysql : A Proposed S...
IRJET- Providing In-Database Analytic Functionalities to Mysql : A Proposed S...IRJET- Providing In-Database Analytic Functionalities to Mysql : A Proposed S...
IRJET- Providing In-Database Analytic Functionalities to Mysql : A Proposed S...
 
Ensuring Distributed Accountability in the Cloud
Ensuring Distributed Accountability in the CloudEnsuring Distributed Accountability in the Cloud
Ensuring Distributed Accountability in the Cloud
 
Centralized Data Verification Scheme for Encrypted Cloud Data Services
Centralized Data Verification Scheme for Encrypted Cloud Data ServicesCentralized Data Verification Scheme for Encrypted Cloud Data Services
Centralized Data Verification Scheme for Encrypted Cloud Data Services
 
[IJCT-V3I2P32] Authors: Amarbir Singh, Palwinder Singh
[IJCT-V3I2P32] Authors: Amarbir Singh, Palwinder Singh[IJCT-V3I2P32] Authors: Amarbir Singh, Palwinder Singh
[IJCT-V3I2P32] Authors: Amarbir Singh, Palwinder Singh
 
Privacy Preserving Public Auditing and Data Integrity for Secure Cloud Storag...
Privacy Preserving Public Auditing and Data Integrity for Secure Cloud Storag...Privacy Preserving Public Auditing and Data Integrity for Secure Cloud Storag...
Privacy Preserving Public Auditing and Data Integrity for Secure Cloud Storag...
 
Distributed computing file
Distributed computing fileDistributed computing file
Distributed computing file
 
A Novel privacy preserving public auditing for shared data in cloud
A Novel privacy preserving public auditing for shared data in cloudA Novel privacy preserving public auditing for shared data in cloud
A Novel privacy preserving public auditing for shared data in cloud
 
DISTRIBUTED AND BIG DATA STORAGE MANAGEMENT IN GRID COMPUTING
DISTRIBUTED AND BIG DATA STORAGE MANAGEMENT IN GRID COMPUTINGDISTRIBUTED AND BIG DATA STORAGE MANAGEMENT IN GRID COMPUTING
DISTRIBUTED AND BIG DATA STORAGE MANAGEMENT IN GRID COMPUTING
 
50620130101004
5062013010100450620130101004
50620130101004
 
Using BIG DATA implementations onto Software Defined Networking
Using BIG DATA implementations onto Software Defined NetworkingUsing BIG DATA implementations onto Software Defined Networking
Using BIG DATA implementations onto Software Defined Networking
 
IoTReport
IoTReportIoTReport
IoTReport
 
Adaptive Delegation Authority Enhancement to Hasbe for Efficient Access Contr...
Adaptive Delegation Authority Enhancement to Hasbe for Efficient Access Contr...Adaptive Delegation Authority Enhancement to Hasbe for Efficient Access Contr...
Adaptive Delegation Authority Enhancement to Hasbe for Efficient Access Contr...
 
DATABASE SYSTEMS PERFORMANCE EVALUATION FOR IOT APPLICATIONS
DATABASE SYSTEMS PERFORMANCE EVALUATION FOR IOT APPLICATIONSDATABASE SYSTEMS PERFORMANCE EVALUATION FOR IOT APPLICATIONS
DATABASE SYSTEMS PERFORMANCE EVALUATION FOR IOT APPLICATIONS
 
Cloud Storage and Security
Cloud Storage and SecurityCloud Storage and Security
Cloud Storage and Security
 
Bt9002 grid computing 1
Bt9002 grid computing 1Bt9002 grid computing 1
Bt9002 grid computing 1
 
Applications of SOA and Web Services in Grid Computing
Applications of SOA and Web Services in Grid ComputingApplications of SOA and Web Services in Grid Computing
Applications of SOA and Web Services in Grid Computing
 
CYBER INFRASTRUCTURE AS A SERVICE TO EMPOWER MULTIDISCIPLINARY, DATA-DRIVEN S...
CYBER INFRASTRUCTURE AS A SERVICE TO EMPOWER MULTIDISCIPLINARY, DATA-DRIVEN S...CYBER INFRASTRUCTURE AS A SERVICE TO EMPOWER MULTIDISCIPLINARY, DATA-DRIVEN S...
CYBER INFRASTRUCTURE AS A SERVICE TO EMPOWER MULTIDISCIPLINARY, DATA-DRIVEN S...
 
Data migration system in heterogeneous database
Data migration system in heterogeneous databaseData migration system in heterogeneous database
Data migration system in heterogeneous database
 

Similar to Big data ppt diala

Unit i introduction to grid computing
Unit i   introduction to grid computingUnit i   introduction to grid computing
Unit i introduction to grid computingsudha kar
 
Big Data in Distributed Analytics,Cybersecurity And Digital Forensics
Big Data in Distributed Analytics,Cybersecurity And Digital ForensicsBig Data in Distributed Analytics,Cybersecurity And Digital Forensics
Big Data in Distributed Analytics,Cybersecurity And Digital ForensicsSherinMariamReji05
 
How do data analysts work with big data and distributed computing frameworks.pdf
How do data analysts work with big data and distributed computing frameworks.pdfHow do data analysts work with big data and distributed computing frameworks.pdf
How do data analysts work with big data and distributed computing frameworks.pdfSoumodeep Nanee Kundu
 
A Comprehensive Study On Data Mining Process With Distribution
A Comprehensive Study On Data Mining Process With DistributionA Comprehensive Study On Data Mining Process With Distribution
A Comprehensive Study On Data Mining Process With DistributionLori Mitchell
 
Apache Kafka and the Data Mesh | Ben Stopford and Michael Noll, Confluent
Apache Kafka and the Data Mesh | Ben Stopford and Michael Noll, ConfluentApache Kafka and the Data Mesh | Ben Stopford and Michael Noll, Confluent
Apache Kafka and the Data Mesh | Ben Stopford and Michael Noll, ConfluentHostedbyConfluent
 
IRJET-Implementation of Threshold based Cryptographic Technique over Cloud Co...
IRJET-Implementation of Threshold based Cryptographic Technique over Cloud Co...IRJET-Implementation of Threshold based Cryptographic Technique over Cloud Co...
IRJET-Implementation of Threshold based Cryptographic Technique over Cloud Co...IRJET Journal
 
Grid and cluster_computing_chapter1
Grid and cluster_computing_chapter1Grid and cluster_computing_chapter1
Grid and cluster_computing_chapter1Bharath Kumar
 
Notes on Current trends in IT (1) (1).pdf
Notes on Current trends in IT (1) (1).pdfNotes on Current trends in IT (1) (1).pdf
Notes on Current trends in IT (1) (1).pdfKarishma Chaudhary
 
Analyzing Systems Using Data Flow Diagrams
Analyzing Systems Using Data Flow DiagramsAnalyzing Systems Using Data Flow Diagrams
Analyzing Systems Using Data Flow DiagramsChristina Valadez
 
Ranking Efficient Attribute Based Keyword Searching Over Encrypted Data Along...
Ranking Efficient Attribute Based Keyword Searching Over Encrypted Data Along...Ranking Efficient Attribute Based Keyword Searching Over Encrypted Data Along...
Ranking Efficient Attribute Based Keyword Searching Over Encrypted Data Along...IRJET Journal
 
(Web User Interfaces track) "Getting the Query Right: User Interface Design o...
(Web User Interfaces track) "Getting the Query Right: User Interface Design o...(Web User Interfaces track) "Getting the Query Right: User Interface Design o...
(Web User Interfaces track) "Getting the Query Right: User Interface Design o...icwe2015
 
HOST AND NETWORK SECURITY by ThesisScientist.com
HOST AND NETWORK SECURITY by ThesisScientist.comHOST AND NETWORK SECURITY by ThesisScientist.com
HOST AND NETWORK SECURITY by ThesisScientist.comProf Ansari
 
A New Way Of Distributed Or Cloud Computing
A New Way Of Distributed Or Cloud ComputingA New Way Of Distributed Or Cloud Computing
A New Way Of Distributed Or Cloud ComputingAshley Lovato
 
IRJET- Data Analytics & Visualization using Qlik
IRJET- Data Analytics & Visualization using QlikIRJET- Data Analytics & Visualization using Qlik
IRJET- Data Analytics & Visualization using QlikIRJET Journal
 
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...theijes
 
5. the grid implementing production grid
5. the grid implementing production grid5. the grid implementing production grid
5. the grid implementing production gridDr Sandeep Kumar Poonia
 
Implementation and Review Paper of Secure and Dynamic Multi Keyword Search in...
Implementation and Review Paper of Secure and Dynamic Multi Keyword Search in...Implementation and Review Paper of Secure and Dynamic Multi Keyword Search in...
Implementation and Review Paper of Secure and Dynamic Multi Keyword Search in...IRJET Journal
 

Similar to Big data ppt diala (20)

Unit i introduction to grid computing
Unit i   introduction to grid computingUnit i   introduction to grid computing
Unit i introduction to grid computing
 
Big Data in Distributed Analytics,Cybersecurity And Digital Forensics
Big Data in Distributed Analytics,Cybersecurity And Digital ForensicsBig Data in Distributed Analytics,Cybersecurity And Digital Forensics
Big Data in Distributed Analytics,Cybersecurity And Digital Forensics
 
How do data analysts work with big data and distributed computing frameworks.pdf
How do data analysts work with big data and distributed computing frameworks.pdfHow do data analysts work with big data and distributed computing frameworks.pdf
How do data analysts work with big data and distributed computing frameworks.pdf
 
A Comprehensive Study On Data Mining Process With Distribution
A Comprehensive Study On Data Mining Process With DistributionA Comprehensive Study On Data Mining Process With Distribution
A Comprehensive Study On Data Mining Process With Distribution
 
Apache Kafka and the Data Mesh | Ben Stopford and Michael Noll, Confluent
Apache Kafka and the Data Mesh | Ben Stopford and Michael Noll, ConfluentApache Kafka and the Data Mesh | Ben Stopford and Michael Noll, Confluent
Apache Kafka and the Data Mesh | Ben Stopford and Michael Noll, Confluent
 
FR.pptx
FR.pptxFR.pptx
FR.pptx
 
IRJET-Implementation of Threshold based Cryptographic Technique over Cloud Co...
IRJET-Implementation of Threshold based Cryptographic Technique over Cloud Co...IRJET-Implementation of Threshold based Cryptographic Technique over Cloud Co...
IRJET-Implementation of Threshold based Cryptographic Technique over Cloud Co...
 
Grid and cluster_computing_chapter1
Grid and cluster_computing_chapter1Grid and cluster_computing_chapter1
Grid and cluster_computing_chapter1
 
Report_Internships
Report_InternshipsReport_Internships
Report_Internships
 
Notes on Current trends in IT (1) (1).pdf
Notes on Current trends in IT (1) (1).pdfNotes on Current trends in IT (1) (1).pdf
Notes on Current trends in IT (1) (1).pdf
 
Grid computing 12
Grid computing 12Grid computing 12
Grid computing 12
 
Analyzing Systems Using Data Flow Diagrams
Analyzing Systems Using Data Flow DiagramsAnalyzing Systems Using Data Flow Diagrams
Analyzing Systems Using Data Flow Diagrams
 
Ranking Efficient Attribute Based Keyword Searching Over Encrypted Data Along...
Ranking Efficient Attribute Based Keyword Searching Over Encrypted Data Along...Ranking Efficient Attribute Based Keyword Searching Over Encrypted Data Along...
Ranking Efficient Attribute Based Keyword Searching Over Encrypted Data Along...
 
(Web User Interfaces track) "Getting the Query Right: User Interface Design o...
(Web User Interfaces track) "Getting the Query Right: User Interface Design o...(Web User Interfaces track) "Getting the Query Right: User Interface Design o...
(Web User Interfaces track) "Getting the Query Right: User Interface Design o...
 
HOST AND NETWORK SECURITY by ThesisScientist.com
HOST AND NETWORK SECURITY by ThesisScientist.comHOST AND NETWORK SECURITY by ThesisScientist.com
HOST AND NETWORK SECURITY by ThesisScientist.com
 
A New Way Of Distributed Or Cloud Computing
A New Way Of Distributed Or Cloud ComputingA New Way Of Distributed Or Cloud Computing
A New Way Of Distributed Or Cloud Computing
 
IRJET- Data Analytics & Visualization using Qlik
IRJET- Data Analytics & Visualization using QlikIRJET- Data Analytics & Visualization using Qlik
IRJET- Data Analytics & Visualization using Qlik
 
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
 
5. the grid implementing production grid
5. the grid implementing production grid5. the grid implementing production grid
5. the grid implementing production grid
 
Implementation and Review Paper of Secure and Dynamic Multi Keyword Search in...
Implementation and Review Paper of Secure and Dynamic Multi Keyword Search in...Implementation and Review Paper of Secure and Dynamic Multi Keyword Search in...
Implementation and Review Paper of Secure and Dynamic Multi Keyword Search in...
 

Recently uploaded

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 

Recently uploaded (20)

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 

Big data ppt diala

  • 1. Dr Bassam Hammo. Presented by: Isra Zaitoun 8131797. and Diala Alwedyan 8131807.
  • 2. • What Is Big Data? • Big Data recourse • Big Data Important • Big Data challenge • Big Data Character • Big Data Architecture • Big Data Technology • What is cloud computing? • Clouding Model • What is fundamental Service model • what ClAaas (Cloud Analysis as a service)? • what is data analytic? • what’s scientific workflow systems means? 2
  • 3. Big data refers to data sets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze. McKinsey Global Institute (MGI) definition 3 2 min
  • 5.  Soares classifies typical sources for ‘big data’ into 5 categories: 1. Web and social media 2. Machine-to machine 3. Big transaction data 4. Biometrics 5. Human-generated data 5 2 min
  • 7. -Government In 2012, the Obama administration announced the Big Data Research and Development Initiative -Private Sector - Walmart handles more than 1 million customer transactions every hour, Science - Large Synoptic Survey Telescope will generate 140 Terabyte of data every 5 days. 7 1 min
  • 8. Volume • Data quantity Velocity • Data Speed Variety • Data Types Veracity • Messiness 8 2 min
  • 10. 10
  • 11.  HADOOP  Map resource 11 1 min
  • 13. Cloud computing involves computing over a network, where a program or application may run on many connected computers at the same time. It specifically refers to a computing hardware machine or group of computing hardware machines commonly referred as a server connected through a communication. 13 1 min
  • 15. A cloud is called a "public cloud" when the services are rendered over a network that is open for public use. 15 1 min
  • 16. Private cloud is cloud infrastructure operated solely for a single organization, whether managed internally or by a third-party and hosted internally or externally 16 1 min
  • 17. Hybrid cloud is a composition of two or more clouds (private, community or public) that remain distinct entities but are bound together, offering the benefits of multiple deployment models 17 1 min
  • 19. 1- Scalable hardware. 2- platform required with Specific software tools. 3- Workflow management system to facilitate the execution of a big data analytics. 19 2 min
  • 20.  If you want to discover knowledge and make a decision, you should make data analysis In different data and applications. 20 2 min
  • 21. 1. Big data analytics is challenge depends on hardware and software resources. 2. Lack analytic software to different applications. 3. It’s hard to estimate the resources to analysis or workflow. 4. Management of data in the cloud. 5. Execute analytic workflows. What is the solution? The paper propose: Cloud Analysis As a service which provide platform in the cloud. 21 3 min
  • 22.  The paper shows: Taxonomy for analytic workflow management systems to show important features in systems. 22 2 min
  • 23.  Data analytics provides decision support in Financial by enabling complex computations to generate knowledge, insights, and experimental proofs.  The complexity computation will done through data analysis. What is Data analytics? 23 2 min
  • 24.  Recently the data that needs to be analyzed is Become big, therefore researcher design analytic system to solve problem of big data.  The analytic requires several revisions before they can be used as final jobs. Data analytics 24 2 min
  • 25. What is the Applications for Big Data Analytics? FinanceTraffic Control 25 2 min
  • 26.  Scientific workflow systems: is tool for many application, enabling the execution of complex analysis on distributed resources. What is scientific workflow system???? Is analytic workflow management System exist today, which execute a series of computational or data manipulation steps or a workflow, in a scientific application. 26 3 min
  • 27.  Analytic Workflow: Consist of sequence of data and integration tasks or exploratory analytic jobs. such as: definition and execution of machine learning models.  A task in a workflow can also be another workflow.  A workflow management system is necessary for efficient definition and execution of analytic workflows. 27 3 min
  • 28.  Software tools for analytic jobs will different depending on: 1- the data type. 2- size. 3- domain. 4-business goals.  The analytic tools to the data sources is important especially for big data, to avoid data transfer time and network cost. 28 2 min
  • 29.  There are systems different depending on: 1- In their focus on data domain. 2- Workflow types. 3- Representations. 4- Execution. 5- Support for collaboration and visualization. 29 1- Taverna and XBaya support distributed execution of workflows using 3rd party web and data services. 2- to analysis of large data, grid technology is used for distributed mapping of workflows by WINGS, DAGMan and Kepler. 3 min
  • 30.  The most successful workflow are Taverna, XBaya.  The arrows encodes the data dependences from one activity in the workflow to another.  Different systems use different approaches to encapsulating an activity.  In Taverna and Xbaya these are web services. 30 2 min
  • 31. • As illustrated in Figure, an activity have multiple inputs and produce multiple outputs and the job workflow engine is to create an activity as soon as its inputs are all available. • Directed acyclic graph representation of a workflow. Activit y 31 2 min
  • 32. 32  XBaya control structures for conditionals and map reduce style iteration layered on top of a data flow graph. 1 min
  • 33.  Taxonomies: is classification of the analytic workflow systems to identify the features of CLAaaS. Workflow System There are seven features of the taxonomy are: 2- Security 1- Structure 5- Visualization 3- Workflow Design 4- Representation 7- Execution 6- Collaboration Workflow systems: are task based, each task represents a data processing or analysis by software or workflow. 33 3 min
  • 34. - The information flow can be: 1- Control 2- Data or 3- Hybrid of the two.  Simple workflows have: sequential, parallel or type selection structures.  Complex workflows Include: sub-workflows or iterative tasks. Structure Content Control Flow Component structure Layout Data Hybrid Complexsimple Parallel Sequential Selection Loop Workflow Service Application Data Access control 34 2 min
  • 35.  For sensitive data, the encryption is used to analysis encrypt data.  The existing systems only have some sort of access control.  Before analysis sensitive data, Anonymization will done.  Data anonymization is converts clear text data into a non readable text. Security Data Access control 35 3 min
  • 36.  Many existing workflow systems such as WINGS, Kepler, Triana, Vistrails and Xbaya provide graphical workflow tools.  Graphical workflows are convert to other representations for storage and execution.  Taverna provides a hierarchical workflow view where  high level view can be expanded to see component details. Workflow design Method Assistance Verification Graphical Description Hybrid 36 3 min
  • 37.  Users specify constraints in Resource Description Framework (RDF) in WINGS, making it a hybrid design method.  The constraints allow verification of the workflows. 37 2 min
  • 38. Workflows can be represented graphically using: 1- Object-based Modeling Language (UML). 2- Graph-based DAX (Extensible Markup Language representation of Directed Acyclic Graph). 3- Event-based BPMN (Business Process Model and Notation), which are easier to construct for small workflows using GUI tools.  High level scripting languages such as Ruby and Python are used to automatically generate the low level complex structures in workflows. 38 3 min
  • 40.  Effective visualization can add big value to analytics and can vary based on the resulting data types.  Image data should have a good resolution unlike charts or lists.  By using Visualization models, the user can predefined or defined intelligently by the system based on the data types as: in Kepler and VisTrails. 40 3 min
  • 41.  Collaboration is very important for correct interpretation of the results and specification of effective visualization models.  The commercial analytic software provide collaboration functionality, the open source workflow systems have little or no support for online collaboration.  Results, data and workflows are shared and published online, for example, in MyExperiment and BioMart.41 2 min
  • 42.  Interdependent tasks where the output from one serves as the input for another have to be executed sequentially.  Independent tasks can take advantage of distributed parallel execution to: maximize resource utilization and minimize the execution time.  Workflow execution using cloud resources is currently being explored due to the scalability required for big data. 42
  • 43.  Markus Maier,”Towards a Big Data Reference,”1-Architecture  Master thesis, 13th October 2013.  Nrusimham Ammu, 2 Mohd Irfanuddin,” Big Data Challenges”, nternational Journal of Advanced Trends in Computer Science and Engineering, Vol.2 , No.1, Pages : 613 - 615 (2013)  Special Issue of ICACSE 2013 - Held on 7-8 January, 2013 in Lords Institute of Engineering and Technology, Hyderabad  https://www.youtube.com/watch?v=D4ZQxBPtyHg&hd=1  http://en.wikipedia.org/wiki/Cloud_telephony  http://en.wikipedia.org/wiki/Big_data  http://d2i.indiana.edu/sites/default/files/1-s2.0-s1877050912001755-main.pdf  https://www.youtube.com/watch?v=Ft6yz0SObIA  https://www.youtube.com/watch?v=arVoQxjIxUU  PROGRAMMING E-SCIENCE GATEWAYS – ResearchGate  http://computer.howstuffworks.com/cloud-computing/cloud-computing.htm  http://www.slideshare.net/leelashine/hadoop-29386155?qid=cb3ea67b-c33e-4e2b- 8f79-fdca7b6a1b63&v=default&b=&from_search=5  http://en.wikipedia.org/wiki/Weka_(machine_learning)  http://en.wikipedia.org/wiki/Data_mining  http://en.wikipedia.org/wiki/Taxonomy  http://siliconangle.com/blog/2013/04/03/big-data-traffic-jam-smarter-lights- happy-drivers/ 43

Editor's Notes

  1. Explain well. Quote practical examples