Big Data
Aisha Siddiqa
aasiddiqa@gmail.com
C4MCCR,
Faculty of Computer Science and Information Technology,
University of Malaya,
Kuala Lumpur, Malaysia.
What is Big Data
• Data is data, so what makes it “Big”?
• Big Data is a field of computing that extracts value
from data sets too large to be analyzed with traditional
computing techniques
• Storage
• Processing
• Visualization
What is new
                    Traditional data                  Big Data
Data type           Employee records, bank records    Web search, data mining, scientific and medical databases
Data accumulation   Staff                             Users, machines
Processing          Centralized                       Parallel
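The contrast between centralized and parallel processing can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names and the sample lines are hypothetical, and a thread pool stands in for the cluster of machines a real Big Data system would use. The idea is to split the data into chunks, process each chunk independently, and merge the partial results.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def count_words(lines):
    """Count word occurrences in one chunk of lines (the per-worker step)."""
    counts = Counter()
    for line in lines:
        counts.update(line.lower().split())
    return counts


def centralized_count(lines):
    """Centralized processing: a single worker scans the whole data set."""
    return count_words(lines)


def parallel_count(lines, workers=4):
    """Parallel processing: split the data, count each chunk, merge the results."""
    chunk_size = max(1, -(-len(lines) // workers))  # ceiling division
    chunks = [lines[i:i + chunk_size] for i in range(0, len(lines), chunk_size)]
    total = Counter()
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Each chunk is counted independently; partial counts are then merged.
        for partial in pool.map(count_words, chunks):
            total.update(partial)
    return total


# Hypothetical sample data, invented for illustration.
sample_lines = [
    "big data needs storage",
    "big data needs processing",
    "big data needs visualization",
]

print(parallel_count(sample_lines, workers=2) == centralized_count(sample_lines))
```

Both functions give the same counts. The parallel version only pays off when the data set is large enough that splitting the work outweighs the cost of coordinating the workers.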
Explosion of Big Data (I)
Explosion of Big Data (II)
Features of Big Data
Volume
Velocity
Variety
Veracity
Variability
Value
Complexity
Real Statistics of Big Data (I)
Facebook:
• Collecting about 600 petabytes
of data per day
• An average user creates 90
pieces of content each month
• More than 500 million active
users
Twitter:
• 9,401 tweets per second
• 1 billion tweets in less than 2
days
• 50 million users from the past
year
[Figure: Facebook users by year, 2008 to 2015; vertical axis from 0 to 1,500, units not shown]
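The two Twitter figures above can be checked against each other with simple arithmetic. The sketch below assumes the quoted rate of 9,401 tweets per second holds steady for the whole day, which is a simplification; the variable names are illustrative only.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400 seconds

tweets_per_second = 9_401       # rate quoted on the slide
target_tweets = 1_000_000_000   # one billion tweets

# Assumes a constant rate over the day (a simplification).
tweets_per_day = tweets_per_second * SECONDS_PER_DAY
days_to_target = target_tweets / tweets_per_day

print(f"{tweets_per_day:,} tweets per day")
print(f"{days_to_target:.2f} days to reach one billion tweets")
```

At that rate the daily total is about 812 million tweets, so one billion tweets takes roughly 1.23 days, consistent with the "less than 2 days" claim.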
Real Statistics of Big Data (II)
• 49,252 Google searches
per second
• 187 million new users per
month
• 300 hours of video
uploaded per minute
• Over 1 billion users
Real Statistics of Big Data (V)
Future of Big Data
Big Data is for Smart Organizations
• Every single bit is valuable
• Only smart organizations recognize the need to keep and
process Big Data for better decision making and for staying
competitive in:
– Customers
– Products
– Services
Big Data for R&D
• Data now extends beyond structured, relational databases
• New approaches to data management are needed in hardware,
storage, networking and computing:
– Virtualization
– Cloud
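The point about data going beyond structured, relational databases can be illustrated with a small sketch. The records below are hypothetical and invented for this example. Each one has a different shape, so no single fixed set of table columns fits them all, yet they can still be read and grouped without declaring a schema first.

```python
import json

# Hypothetical records, invented for illustration: each has a different shape.
raw_records = [
    '{"user": "u1", "type": "search", "query": "big data"}',
    '{"user": "u2", "type": "upload", "video": {"minutes": 12, "format": "mp4"}}',
    '{"sensor": "s7", "type": "reading", "values": [21.5, 21.7, 22.0]}',
]

records = [json.loads(line) for line in raw_records]

# A relational table needs its columns fixed in advance; here the full set
# of fields is only known after the data has been read.
all_fields = sorted({field for record in records for field in record})
print(all_fields)

# Group records by their "type" field without a predefined schema.
by_type = {}
for record in records:
    by_type.setdefault(record["type"], []).append(record)

print(sorted(by_type))
```

Nested objects and lists, such as the video details and the sensor values, are the kind of content that does not map cleanly onto flat rows and columns.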
Big Data Management
• Functional Requirements
Big Data Architecture
