SlideShare a Scribd company logo
1 of 20
CPS 216: Advanced Database
Systems (Data-intensive
Computing Systems)
Shivnath Babu
A Brief History
Relational database
management systems
Time
1975-
1985
1985-
1995
1995-
2005
2005-
2010
2020
Let us first see what a
relational database
system is
Data Management
Data
Query
Query Query
User/Application
DataBase Management System (DBMS)
Example: At a Company
ID Name DeptID Salary โ€ฆ
10 Nemo 12 120K โ€ฆ
20 Dory 156 79K โ€ฆ
40 Gill 89 76K โ€ฆ
52 Ray 34 85K โ€ฆ
โ€ฆ โ€ฆ โ€ฆ โ€ฆ โ€ฆ
ID Name โ€ฆ
12 IT โ€ฆ
34 Accounts โ€ฆ
89 HR โ€ฆ
156 Marketing โ€ฆ
โ€ฆ โ€ฆ โ€ฆ
Employee Department
Query 1: Is there an employee named โ€œNemoโ€?
Query 2: What is โ€œNemoโ€™sโ€ salary?
Query 3: How many departments are there in the company?
Query 4: What is the name of โ€œNemoโ€™sโ€ department?
Query 5: How many employees are there in the
โ€œAccountsโ€ department?
DataBase Management System (DBMS)
High-level
Query Q
DBMS
Data
Answer
Translates Q into
best execution plan
for current conditions,
runs plan
Example: Store that Sells Cars
Make Model OwnerID
Honda Accord 12
Toyota Camry 34
Mini Cooper 89
Honda Accord 156
โ€ฆ โ€ฆ โ€ฆ
ID Name Age
12 Nemo 22
34 Ray 42
89 Gill 36
156 Dory 21
โ€ฆ โ€ฆ โ€ฆ
Cars Owners
Filter (Make = Honda and
Model = Accord)
Join (Cars.OwnerID = Owners.ID)
Make Model OwnerID ID Name Age
Honda Accord 12 12 Nemo 22
Honda Accord 156 156 Dory 21
Owners of
Honda Accords
who are <=
23 years old
Filter (Age <= 23)
DataBase Management System (DBMS)
High-level
Query Q
DBMS
Data
Answer
Translates Q into
best execution plan
for current conditions,
runs plan
Keeps data safe
and correct
despite failures,
concurrent
updates, online
processing, etc.
A Brief History
Relational database
management systems
Time
1975-
1985
1985-
1995
1995-
2005
2005-
2010
2020
Semi-structured and
unstructured data (Web)
Hardware developments
Developments in
system software
Changes in
data sizes
Assumptions and
requirements changed
over time
Big Data: How much data?
๏‚ข Google processes 20 PB a day (2008)
๏‚ข Wayback Machine has 3 PB + 100 TB/month (3/2009)
๏‚ข eBay has 6.5 PB of user data + 50 TB/day (5/2009)
๏‚ข Facebook has 36 PB of user data + 80-90 TB/day (6/2010)
๏‚ข CERNโ€™s LHC: 15 PB a year (any day now)
๏‚ข LSST: 6-10 PB a year (~2015)
640K ought to be
enough for anybody.
From http://www.umiacs.umd.edu/~jimmylin/
From: http://www.cs.duke.edu/smdb10/
NEW REALITIES
TB disks < $100
Everything is data
Rise of data-driven culture
Very publicly espoused
by Google, Wired, etc.
Sloan Digital Sky Survey,
Terraserver, etc.
The quest for knowledge used to
begin with grand theories.
Now it begins with massive
amounts of data.
Welcome to the Petabyte Age.
From: http://db.cs.berkeley.edu/jmh/
THE NEW PRACTITIONERS
Hal Varian, UC Berkeley, Chief Economist @ Google
โ€œLooking for a career where your
services will be in high demand?
โ€ฆ Provide a scarce, complementary
service to something that is getting
ubiquitous and cheap.
So whatโ€™s ubiquitous and cheap?
Data.
And what is complementary to data?
Analysis.
the sexy job in
the next ten
years will be
statisticians
From: http://db.cs.berkeley.edu/jmh/
THE NEW PRACTITIONERS
Aggressively Datavorous
Statistically savvy
Diverse in training, tools
From: http://db.cs.berkeley.edu/jmh/
FOX AUDIENCE
NETWORK
โ€ข Greenplum parallel DB
โ€ข 42 Sun X4500s (โ€œThumperโ€) each
with:
โ€ข 48 500GB drives
โ€ข 16GB RAM
โ€ข 2 dual-core Opterons
โ€ข Big and growing
โ€ข 200 TB data (mirrored)
โ€ข Fact table of 1.5 trillion rows
โ€ข Growing 5TB per day
โ€ข 4-7 Billion rows per day
โ€ข Also extensive use of R
and Hadoop
As reported by FAN, Feb, 2009
From: http://db.cs.berkeley.edu/jmh/
Yahoo! runs a 4000
node Hadoop cluster
(probably the largest).
Overall, there are
38,000 nodes running
Hadoop at Yahoo!
A SCENARIO FROM FAN
Open-ended question about
statistical densities
(distributions)
How many female WWF
fans under the age of 30
visited the Toyota
community over the last 4
days and saw a Class A ad?
How are these people
similar to those that
visited Nissan?
From: http://db.cs.berkeley.edu/jmh/
MULTILINGUAL
DEVELOPMENT
SQL or MapReduce
Sequential code in a
variety of languages
Perl
Python
Java
R
Mix and Match!
From: http://db.cs.berkeley.edu/jmh/
From: http://outsideinnovation.blogs.com/pseybold/2009/03/-sun-will-shine-in-blue-cloud.html
Teaching/Learning Methodology
Relational database
management systems
Time
1975-
1985
1985-
1995
1995-
2005
2005-
2010
2020
Semi-structured and
unstructured data (Web)
Hardware developments
Developments in
system software
Changes in
data sizes
Assumptions and
requirements changed
over time
Course Outline
โ€ข Principles of query processing (30%)
โ€“ Indexes
โ€“ Query execution plans and operators
โ€“ Query optimization
โ€ข Data storage (10%)
โ€“ Databases Vs. filesystems (Google/Hadoop Distributed FileSystem)
โ€“ Flash memory and Solid State Drives
โ€ข Scalable data processing (35%)
โ€“ Parallel query plans and operators
โ€“ Systems based on MapReduce
โ€“ Scalable key-value stores
โ€ข Concurrency control and recovery (15%)
โ€“ Consistency models for data (ACID, BASE, Serializability)
โ€“ Write-ahead logging
โ€ข Information retrieval and Data mining (10%)
โ€“ Web search (Google PageRank, inverted indexes)
โ€“ Association rules and clustering
Course Logistics
โ€ข Web: http://www.cs.duke.edu/courses/fall10/cps216
โ€ข TA: Gang Luo
โ€ข References:
โ€“ Hadoop: The Definitive Guide, by Tom White
โ€“ Database Systems: The Complete Book, by H. Garcia-
Molina, J. D. Ullman, and J. Widom
โ€ข Grading:
โ€“ Project 35% (Hopefully, on Amazon Cloud!)
โ€“ Homework Assignments 15%
โ€“ Midterm 25%
โ€“ Final 25%

More Related Content

Similar to 01_intro.pfwekfmwEKFHswfuhSWFYUGsduyfgSWY6FEWUFpt

HadoopWorkshopJuly2014
HadoopWorkshopJuly2014HadoopWorkshopJuly2014
HadoopWorkshopJuly2014Dieter De Witte
ย 
Research issues in the big data and its Challenges
Research issues in the big data and its ChallengesResearch issues in the big data and its Challenges
Research issues in the big data and its ChallengesKathirvel Ayyaswamy
ย 
Big Tools for Big Data
Big Tools for Big DataBig Tools for Big Data
Big Tools for Big DataLewis Crawford
ย 
On Big Data
On Big DataOn Big Data
On Big Dataarttan2001
ย 
Data analytics & its Trends
Data analytics & its TrendsData analytics & its Trends
Data analytics & its TrendsDr.K.Sreenivas Rao
ย 
Petascale Analytics - The World of Big Data Requires Big Analytics
Petascale Analytics - The World of Big Data Requires Big AnalyticsPetascale Analytics - The World of Big Data Requires Big Analytics
Petascale Analytics - The World of Big Data Requires Big AnalyticsHeiko Joerg Schick
ย 
Spivack Blogtalk 2008
Spivack Blogtalk 2008Spivack Blogtalk 2008
Spivack Blogtalk 2008Blogtalk 2008
ย 
Semantic Web Science
Semantic Web ScienceSemantic Web Science
Semantic Web ScienceJames Hendler
ย 
GalvanizeU Seattle: Eleven Almost-Truisms About Data
GalvanizeU Seattle: Eleven Almost-Truisms About DataGalvanizeU Seattle: Eleven Almost-Truisms About Data
GalvanizeU Seattle: Eleven Almost-Truisms About DataPaco Nathan
ย 
Big Data for One Big Family
Big Data for One Big FamilyBig Data for One Big Family
Big Data for One Big FamilyMatt Asay
ย 
Introduction to question answering for linked data & big data
Introduction to question answering for linked data & big dataIntroduction to question answering for linked data & big data
Introduction to question answering for linked data & big dataAndre Freitas
ย 
Big Data By Vijay Bhaskar Semwal
Big Data By Vijay Bhaskar SemwalBig Data By Vijay Bhaskar Semwal
Big Data By Vijay Bhaskar SemwalIIIT Allahabad
ย 
History and Trend of Big Data and Deep Learning
History and Trend of Big Data and Deep LearningHistory and Trend of Big Data and Deep Learning
History and Trend of Big Data and Deep LearningJongwook Woo
ย 
From DARPA to Shakespeare: All the Data we Can Handle
From DARPA to Shakespeare: All the Data we Can Handle From DARPA to Shakespeare: All the Data we Can Handle
From DARPA to Shakespeare: All the Data we Can Handle Kimberly Hoffman
ย 
Hadoop journey
Hadoop journey Hadoop journey
Hadoop journey Jack Son
ย 
Introduction to Big Data and AI for Business Analytics and Prediction
Introduction to Big Data and AI for Business Analytics and PredictionIntroduction to Big Data and AI for Business Analytics and Prediction
Introduction to Big Data and AI for Business Analytics and PredictionJongwook Woo
ย 
Data Lake,beyond the Data Warehouse
Data Lake,beyond the Data WarehouseData Lake,beyond the Data Warehouse
Data Lake,beyond the Data WarehouseData Science Thailand
ย 
(Re-) Discovering Lost Web Pages
(Re-) Discovering Lost Web Pages(Re-) Discovering Lost Web Pages
(Re-) Discovering Lost Web PagesMichael Nelson
ย 
(Big) Data (Science) Skills
(Big) Data (Science) Skills(Big) Data (Science) Skills
(Big) Data (Science) SkillsOscar Corcho
ย 

Similar to 01_intro.pfwekfmwEKFHswfuhSWFYUGsduyfgSWY6FEWUFpt (20)

HadoopWorkshopJuly2014
HadoopWorkshopJuly2014HadoopWorkshopJuly2014
HadoopWorkshopJuly2014
ย 
Research issues in the big data and its Challenges
Research issues in the big data and its ChallengesResearch issues in the big data and its Challenges
Research issues in the big data and its Challenges
ย 
Big Tools for Big Data
Big Tools for Big DataBig Tools for Big Data
Big Tools for Big Data
ย 
On Big Data
On Big DataOn Big Data
On Big Data
ย 
Data analytics & its Trends
Data analytics & its TrendsData analytics & its Trends
Data analytics & its Trends
ย 
Database Essay
Database EssayDatabase Essay
Database Essay
ย 
Petascale Analytics - The World of Big Data Requires Big Analytics
Petascale Analytics - The World of Big Data Requires Big AnalyticsPetascale Analytics - The World of Big Data Requires Big Analytics
Petascale Analytics - The World of Big Data Requires Big Analytics
ย 
Spivack Blogtalk 2008
Spivack Blogtalk 2008Spivack Blogtalk 2008
Spivack Blogtalk 2008
ย 
Semantic Web Science
Semantic Web ScienceSemantic Web Science
Semantic Web Science
ย 
GalvanizeU Seattle: Eleven Almost-Truisms About Data
GalvanizeU Seattle: Eleven Almost-Truisms About DataGalvanizeU Seattle: Eleven Almost-Truisms About Data
GalvanizeU Seattle: Eleven Almost-Truisms About Data
ย 
Big Data for One Big Family
Big Data for One Big FamilyBig Data for One Big Family
Big Data for One Big Family
ย 
Introduction to question answering for linked data & big data
Introduction to question answering for linked data & big dataIntroduction to question answering for linked data & big data
Introduction to question answering for linked data & big data
ย 
Big Data By Vijay Bhaskar Semwal
Big Data By Vijay Bhaskar SemwalBig Data By Vijay Bhaskar Semwal
Big Data By Vijay Bhaskar Semwal
ย 
History and Trend of Big Data and Deep Learning
History and Trend of Big Data and Deep LearningHistory and Trend of Big Data and Deep Learning
History and Trend of Big Data and Deep Learning
ย 
From DARPA to Shakespeare: All the Data we Can Handle
From DARPA to Shakespeare: All the Data we Can Handle From DARPA to Shakespeare: All the Data we Can Handle
From DARPA to Shakespeare: All the Data we Can Handle
ย 
Hadoop journey
Hadoop journey Hadoop journey
Hadoop journey
ย 
Introduction to Big Data and AI for Business Analytics and Prediction
Introduction to Big Data and AI for Business Analytics and PredictionIntroduction to Big Data and AI for Business Analytics and Prediction
Introduction to Big Data and AI for Business Analytics and Prediction
ย 
Data Lake,beyond the Data Warehouse
Data Lake,beyond the Data WarehouseData Lake,beyond the Data Warehouse
Data Lake,beyond the Data Warehouse
ย 
(Re-) Discovering Lost Web Pages
(Re-) Discovering Lost Web Pages(Re-) Discovering Lost Web Pages
(Re-) Discovering Lost Web Pages
ย 
(Big) Data (Science) Skills
(Big) Data (Science) Skills(Big) Data (Science) Skills
(Big) Data (Science) Skills
ย 

More from WrushabhShirsat3

unit-iipart-1.WDQWDQWDQWDQWDQWDQWDQWDQWDQWDppt
unit-iipart-1.WDQWDQWDQWDQWDQWDQWDQWDQWDQWDpptunit-iipart-1.WDQWDQWDQWDQWDQWDQWDQWDQWDQWDppt
unit-iipart-1.WDQWDQWDQWDQWDQWDQWDQWDQWDQWDpptWrushabhShirsat3
ย 
chapter1-convehisudhiusdiudiudsiusdiuddsdshdibsdiubdsjxkjxjntionalsoftwareman...
chapter1-convehisudhiusdiudiudsiusdiuddsdshdibsdiubdsjxkjxjntionalsoftwareman...chapter1-convehisudhiusdiudiudsiusdiuddsdshdibsdiubdsjxkjxjntionalsoftwareman...
chapter1-convehisudhiusdiudiudsiusdiuddsdshdibsdiubdsjxkjxjntionalsoftwareman...WrushabhShirsat3
ย 
chapter7.pptfuifbsdiufbsiudfiudfiufeiufiuf
chapter7.pptfuifbsdiufbsiudfiudfiufeiufiufchapter7.pptfuifbsdiufbsiudfiudfiufeiufiuf
chapter7.pptfuifbsdiufbsiudfiudfiufeiufiufWrushabhShirsat3
ย 
m150c1.pptfefgefuygfiuffwefnefjufbwefbweiufwiuefiuefhiuefhwiuefhwuiefhuwiefhwiue
m150c1.pptfefgefuygfiuffwefnefjufbwefbweiufwiuefiuefhiuefhwiuefhwuiefhuwiefhwiuem150c1.pptfefgefuygfiuffwefnefjufbwefbweiufwiuefiuefhiuefhwiuefhwuiefhuwiefhwiue
m150c1.pptfefgefuygfiuffwefnefjufbwefbweiufwiuefiuefhiuefhwiuefhwuiefhuwiefhwiueWrushabhShirsat3
ย 
xjtrutdctrd5454drxxresersestryugyufy6rythgfytfyt
xjtrutdctrd5454drxxresersestryugyufy6rythgfytfytxjtrutdctrd5454drxxresersestryugyufy6rythgfytfyt
xjtrutdctrd5454drxxresersestryugyufy6rythgfytfytWrushabhShirsat3
ย 
jhbuhbhujnhyubhbuybuybuybbuhyybuybuybuybybyubyubybybb
jhbuhbhujnhyubhbuybuybuybbuhyybuybuybuybybyubyubybybbjhbuhbhujnhyubhbuybuybuybbuhyybuybuybuybybyubyubybybb
jhbuhbhujnhyubhbuybuybuybbuhyybuybuybuybybyubyubybybbWrushabhShirsat3
ย 
dcvdhusdbsduvb0sdyvbsdyvbsdvysdvysdbvsydvdbvbyubdvbdvhvhvhvh
dcvdhusdbsduvb0sdyvbsdyvbsdvysdvysdbvsydvdbvbyubdvbdvhvhvhvhdcvdhusdbsduvb0sdyvbsdyvbsdvysdvysdbvsydvdbvbyubdvbdvhvhvhvh
dcvdhusdbsduvb0sdyvbsdyvbsdvysdvysdbvsydvdbvbyubdvbdvhvhvhvhWrushabhShirsat3
ย 
asdabuydvduyawdyuadauysdasuydyudayudayudaw
asdabuydvduyawdyuadauysdasuydyudayudayudawasdabuydvduyawdyuadauysdasuydyudayudayudaw
asdabuydvduyawdyuadauysdasuydyudayudayudawWrushabhShirsat3
ย 
lecture1-intro.ppt
lecture1-intro.pptlecture1-intro.ppt
lecture1-intro.pptWrushabhShirsat3
ย 
asabydqwfdytwdv.ppt
asabydqwfdytwdv.pptasabydqwfdytwdv.ppt
asabydqwfdytwdv.pptWrushabhShirsat3
ย 
Ch01_Introduction_to_SPM.pdf
Ch01_Introduction_to_SPM.pdfCh01_Introduction_to_SPM.pdf
Ch01_Introduction_to_SPM.pdfWrushabhShirsat3
ย 
chapter1-conventionalsoftwaremanagement1 (1).ppt
chapter1-conventionalsoftwaremanagement1 (1).pptchapter1-conventionalsoftwaremanagement1 (1).ppt
chapter1-conventionalsoftwaremanagement1 (1).pptWrushabhShirsat3
ย 

More from WrushabhShirsat3 (20)

unit-iipart-1.WDQWDQWDQWDQWDQWDQWDQWDQWDQWDppt
unit-iipart-1.WDQWDQWDQWDQWDQWDQWDQWDQWDQWDpptunit-iipart-1.WDQWDQWDQWDQWDQWDQWDQWDQWDQWDppt
unit-iipart-1.WDQWDQWDQWDQWDQWDQWDQWDQWDQWDppt
ย 
chapter1-convehisudhiusdiudiudsiusdiuddsdshdibsdiubdsjxkjxjntionalsoftwareman...
chapter1-convehisudhiusdiudiudsiusdiuddsdshdibsdiubdsjxkjxjntionalsoftwareman...chapter1-convehisudhiusdiudiudsiusdiuddsdshdibsdiubdsjxkjxjntionalsoftwareman...
chapter1-convehisudhiusdiudiudsiusdiuddsdshdibsdiubdsjxkjxjntionalsoftwareman...
ย 
chapter7.pptfuifbsdiufbsiudfiudfiufeiufiuf
chapter7.pptfuifbsdiufbsiudfiudfiufeiufiufchapter7.pptfuifbsdiufbsiudfiudfiufeiufiuf
chapter7.pptfuifbsdiufbsiudfiudfiufeiufiuf
ย 
m150c1.pptfefgefuygfiuffwefnefjufbwefbweiufwiuefiuefhiuefhwiuefhwuiefhuwiefhwiue
m150c1.pptfefgefuygfiuffwefnefjufbwefbweiufwiuefiuefhiuefhwiuefhwuiefhuwiefhwiuem150c1.pptfefgefuygfiuffwefnefjufbwefbweiufwiuefiuefhiuefhwiuefhwuiefhuwiefhwiue
m150c1.pptfefgefuygfiuffwefnefjufbwefbweiufwiuefiuefhiuefhwiuefhwuiefhuwiefhwiue
ย 
xjtrutdctrd5454drxxresersestryugyufy6rythgfytfyt
xjtrutdctrd5454drxxresersestryugyufy6rythgfytfytxjtrutdctrd5454drxxresersestryugyufy6rythgfytfyt
xjtrutdctrd5454drxxresersestryugyufy6rythgfytfyt
ย 
jhbuhbhujnhyubhbuybuybuybbuhyybuybuybuybybyubyubybybb
jhbuhbhujnhyubhbuybuybuybbuhyybuybuybuybybyubyubybybbjhbuhbhujnhyubhbuybuybuybbuhyybuybuybuybybyubyubybybb
jhbuhbhujnhyubhbuybuybuybbuhyybuybuybuybybyubyubybybb
ย 
dcvdhusdbsduvb0sdyvbsdyvbsdvysdvysdbvsydvdbvbyubdvbdvhvhvhvh
dcvdhusdbsduvb0sdyvbsdyvbsdvysdvysdbvsydvdbvbyubdvbdvhvhvhvhdcvdhusdbsduvb0sdyvbsdyvbsdvysdvysdbvsydvdbvbyubdvbdvhvhvhvh
dcvdhusdbsduvb0sdyvbsdyvbsdvysdvysdbvsydvdbvbyubdvbdvhvhvhvh
ย 
asdabuydvduyawdyuadauysdasuydyudayudayudaw
asdabuydvduyawdyuadauysdasuydyudayudayudawasdabuydvduyawdyuadauysdasuydyudayudayudaw
asdabuydvduyawdyuadauysdasuydyudayudayudaw
ย 
lecture1-intro.ppt
lecture1-intro.pptlecture1-intro.ppt
lecture1-intro.ppt
ย 
IntroT.ppt
IntroT.pptIntroT.ppt
IntroT.ppt
ย 
scan.pptx
scan.pptxscan.pptx
scan.pptx
ย 
2.pptx
2.pptx2.pptx
2.pptx
ย 
papp01.pptx
papp01.pptxpapp01.pptx
papp01.pptx
ย 
80410172053.pdf
80410172053.pdf80410172053.pdf
80410172053.pdf
ย 
asabydqwfdytwdv.ppt
asabydqwfdytwdv.pptasabydqwfdytwdv.ppt
asabydqwfdytwdv.ppt
ย 
Ch01_Introduction_to_SPM.pdf
Ch01_Introduction_to_SPM.pdfCh01_Introduction_to_SPM.pdf
Ch01_Introduction_to_SPM.pdf
ย 
chapter1.ppt
chapter1.pptchapter1.ppt
chapter1.ppt
ย 
chapter1-conventionalsoftwaremanagement1 (1).ppt
chapter1-conventionalsoftwaremanagement1 (1).pptchapter1-conventionalsoftwaremanagement1 (1).ppt
chapter1-conventionalsoftwaremanagement1 (1).ppt
ย 
Intro.ppt
Intro.pptIntro.ppt
Intro.ppt
ย 
ABCD.ppt
ABCD.pptABCD.ppt
ABCD.ppt
ย 

Recently uploaded

VIP Model Call Girls Charholi Budruk ( Pune ) Call ON 8005736733 Starting Fro...
VIP Model Call Girls Charholi Budruk ( Pune ) Call ON 8005736733 Starting Fro...VIP Model Call Girls Charholi Budruk ( Pune ) Call ON 8005736733 Starting Fro...
VIP Model Call Girls Charholi Budruk ( Pune ) Call ON 8005736733 Starting Fro...SUHANI PANDEY
ย 
RATING SYSTEMS- IGBC, GRIHA, LEED--.pptx
RATING  SYSTEMS- IGBC, GRIHA, LEED--.pptxRATING  SYSTEMS- IGBC, GRIHA, LEED--.pptx
RATING SYSTEMS- IGBC, GRIHA, LEED--.pptxJIT KUMAR GUPTA
ย 
VIP Call Girls Valsad 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Valsad 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Valsad 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Valsad 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698
ย 
VVIP Pune Call Girls Moshi WhatSapp Number 8005736733 With Elite Staff And Re...
VVIP Pune Call Girls Moshi WhatSapp Number 8005736733 With Elite Staff And Re...VVIP Pune Call Girls Moshi WhatSapp Number 8005736733 With Elite Staff And Re...
VVIP Pune Call Girls Moshi WhatSapp Number 8005736733 With Elite Staff And Re...SUHANI PANDEY
ย 
Cheap Call Girls in Dubai %(+971524965298 )# Dubai Call Girl Service By Rus...
Cheap Call Girls  in Dubai %(+971524965298 )#  Dubai Call Girl Service By Rus...Cheap Call Girls  in Dubai %(+971524965298 )#  Dubai Call Girl Service By Rus...
Cheap Call Girls in Dubai %(+971524965298 )# Dubai Call Girl Service By Rus...Escorts Call Girls
ย 
Call Girls Moshi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Moshi Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Moshi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Moshi Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
ย 
RA 7942:vThe Philippine Mining Act of 1995
RA 7942:vThe Philippine Mining Act of 1995RA 7942:vThe Philippine Mining Act of 1995
RA 7942:vThe Philippine Mining Act of 1995garthraymundo123
ย 
Hertwich_EnvironmentalImpacts_BuildingsGRO.pptx
Hertwich_EnvironmentalImpacts_BuildingsGRO.pptxHertwich_EnvironmentalImpacts_BuildingsGRO.pptx
Hertwich_EnvironmentalImpacts_BuildingsGRO.pptxEdgar Hertwich
ย 
Booking open Available Pune Call Girls Budhwar Peth 6297143586 Call Hot Indi...
Booking open Available Pune Call Girls Budhwar Peth  6297143586 Call Hot Indi...Booking open Available Pune Call Girls Budhwar Peth  6297143586 Call Hot Indi...
Booking open Available Pune Call Girls Budhwar Peth 6297143586 Call Hot Indi...Call Girls in Nagpur High Profile
ย 
Kondhwa ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Kondhwa ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Kondhwa ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Kondhwa ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...tanu pandey
ย 
VIP Model Call Girls Chakan ( Pune ) Call ON 8005736733 Starting From 5K to 2...
VIP Model Call Girls Chakan ( Pune ) Call ON 8005736733 Starting From 5K to 2...VIP Model Call Girls Chakan ( Pune ) Call ON 8005736733 Starting From 5K to 2...
VIP Model Call Girls Chakan ( Pune ) Call ON 8005736733 Starting From 5K to 2...SUHANI PANDEY
ย 
DENR EPR Law Compliance Updates April 2024
DENR EPR Law Compliance Updates April 2024DENR EPR Law Compliance Updates April 2024
DENR EPR Law Compliance Updates April 2024itadmin50
ย 
Call Girls Budhwar Peth Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Budhwar Peth Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Budhwar Peth Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Budhwar Peth Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
ย 
VVIP Pune Call Girls Wagholi WhatSapp Number 8005736733 With Elite Staff And ...
VVIP Pune Call Girls Wagholi WhatSapp Number 8005736733 With Elite Staff And ...VVIP Pune Call Girls Wagholi WhatSapp Number 8005736733 With Elite Staff And ...
VVIP Pune Call Girls Wagholi WhatSapp Number 8005736733 With Elite Staff And ...SUHANI PANDEY
ย 
GENUINE Babe,Call Girls IN Chhatarpur Delhi | +91-8377877756
GENUINE Babe,Call Girls IN Chhatarpur Delhi | +91-8377877756GENUINE Babe,Call Girls IN Chhatarpur Delhi | +91-8377877756
GENUINE Babe,Call Girls IN Chhatarpur Delhi | +91-8377877756dollysharma2066
ย 
Cyclone Case Study Odisha 1999 Super Cyclone in India.
Cyclone Case Study Odisha 1999 Super Cyclone in India.Cyclone Case Study Odisha 1999 Super Cyclone in India.
Cyclone Case Study Odisha 1999 Super Cyclone in India.cojitesh
ย 

Recently uploaded (20)

VIP Model Call Girls Charholi Budruk ( Pune ) Call ON 8005736733 Starting Fro...
VIP Model Call Girls Charholi Budruk ( Pune ) Call ON 8005736733 Starting Fro...VIP Model Call Girls Charholi Budruk ( Pune ) Call ON 8005736733 Starting Fro...
VIP Model Call Girls Charholi Budruk ( Pune ) Call ON 8005736733 Starting Fro...
ย 
Climate Change
Climate ChangeClimate Change
Climate Change
ย 
RATING SYSTEMS- IGBC, GRIHA, LEED--.pptx
RATING  SYSTEMS- IGBC, GRIHA, LEED--.pptxRATING  SYSTEMS- IGBC, GRIHA, LEED--.pptx
RATING SYSTEMS- IGBC, GRIHA, LEED--.pptx
ย 
Deforestation
DeforestationDeforestation
Deforestation
ย 
VIP Call Girls Valsad 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Valsad 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Valsad 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Valsad 7001035870 Whatsapp Number, 24/07 Booking
ย 
VVIP Pune Call Girls Moshi WhatSapp Number 8005736733 With Elite Staff And Re...
VVIP Pune Call Girls Moshi WhatSapp Number 8005736733 With Elite Staff And Re...VVIP Pune Call Girls Moshi WhatSapp Number 8005736733 With Elite Staff And Re...
VVIP Pune Call Girls Moshi WhatSapp Number 8005736733 With Elite Staff And Re...
ย 
Cheap Call Girls in Dubai %(+971524965298 )# Dubai Call Girl Service By Rus...
Cheap Call Girls  in Dubai %(+971524965298 )#  Dubai Call Girl Service By Rus...Cheap Call Girls  in Dubai %(+971524965298 )#  Dubai Call Girl Service By Rus...
Cheap Call Girls in Dubai %(+971524965298 )# Dubai Call Girl Service By Rus...
ย 
Call Girls Moshi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Moshi Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Moshi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Moshi Call Me 7737669865 Budget Friendly No Advance Booking
ย 
RA 7942:vThe Philippine Mining Act of 1995
RA 7942:vThe Philippine Mining Act of 1995RA 7942:vThe Philippine Mining Act of 1995
RA 7942:vThe Philippine Mining Act of 1995
ย 
(Anamika) VIP Call Girls Jammu Call Now 8617697112 Jammu Escorts 24x7
(Anamika) VIP Call Girls Jammu Call Now 8617697112 Jammu Escorts 24x7(Anamika) VIP Call Girls Jammu Call Now 8617697112 Jammu Escorts 24x7
(Anamika) VIP Call Girls Jammu Call Now 8617697112 Jammu Escorts 24x7
ย 
(INDIRA) Call Girl Katra Call Now 8617697112 Katra Escorts 24x7
(INDIRA) Call Girl Katra Call Now 8617697112 Katra Escorts 24x7(INDIRA) Call Girl Katra Call Now 8617697112 Katra Escorts 24x7
(INDIRA) Call Girl Katra Call Now 8617697112 Katra Escorts 24x7
ย 
Hertwich_EnvironmentalImpacts_BuildingsGRO.pptx
Hertwich_EnvironmentalImpacts_BuildingsGRO.pptxHertwich_EnvironmentalImpacts_BuildingsGRO.pptx
Hertwich_EnvironmentalImpacts_BuildingsGRO.pptx
ย 
Booking open Available Pune Call Girls Budhwar Peth 6297143586 Call Hot Indi...
Booking open Available Pune Call Girls Budhwar Peth  6297143586 Call Hot Indi...Booking open Available Pune Call Girls Budhwar Peth  6297143586 Call Hot Indi...
Booking open Available Pune Call Girls Budhwar Peth 6297143586 Call Hot Indi...
ย 
Kondhwa ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Kondhwa ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Kondhwa ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Kondhwa ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
ย 
VIP Model Call Girls Chakan ( Pune ) Call ON 8005736733 Starting From 5K to 2...
VIP Model Call Girls Chakan ( Pune ) Call ON 8005736733 Starting From 5K to 2...VIP Model Call Girls Chakan ( Pune ) Call ON 8005736733 Starting From 5K to 2...
VIP Model Call Girls Chakan ( Pune ) Call ON 8005736733 Starting From 5K to 2...
ย 
DENR EPR Law Compliance Updates April 2024
DENR EPR Law Compliance Updates April 2024DENR EPR Law Compliance Updates April 2024
DENR EPR Law Compliance Updates April 2024
ย 
Call Girls Budhwar Peth Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Budhwar Peth Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Budhwar Peth Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Budhwar Peth Call Me 7737669865 Budget Friendly No Advance Booking
ย 
VVIP Pune Call Girls Wagholi WhatSapp Number 8005736733 With Elite Staff And ...
VVIP Pune Call Girls Wagholi WhatSapp Number 8005736733 With Elite Staff And ...VVIP Pune Call Girls Wagholi WhatSapp Number 8005736733 With Elite Staff And ...
VVIP Pune Call Girls Wagholi WhatSapp Number 8005736733 With Elite Staff And ...
ย 
GENUINE Babe,Call Girls IN Chhatarpur Delhi | +91-8377877756
GENUINE Babe,Call Girls IN Chhatarpur Delhi | +91-8377877756GENUINE Babe,Call Girls IN Chhatarpur Delhi | +91-8377877756
GENUINE Babe,Call Girls IN Chhatarpur Delhi | +91-8377877756
ย 
Cyclone Case Study Odisha 1999 Super Cyclone in India.
Cyclone Case Study Odisha 1999 Super Cyclone in India.Cyclone Case Study Odisha 1999 Super Cyclone in India.
Cyclone Case Study Odisha 1999 Super Cyclone in India.
ย 

01_intro.pfwekfmwEKFHswfuhSWFYUGsduyfgSWY6FEWUFpt

  • 1. CPS 216: Advanced Database Systems (Data-intensive Computing Systems) Shivnath Babu
  • 2. A Brief History Relational database management systems Time 1975- 1985 1985- 1995 1995- 2005 2005- 2010 2020 Let us first see what a relational database system is
  • 4. Example: At a Company ID Name DeptID Salary โ€ฆ 10 Nemo 12 120K โ€ฆ 20 Dory 156 79K โ€ฆ 40 Gill 89 76K โ€ฆ 52 Ray 34 85K โ€ฆ โ€ฆ โ€ฆ โ€ฆ โ€ฆ โ€ฆ ID Name โ€ฆ 12 IT โ€ฆ 34 Accounts โ€ฆ 89 HR โ€ฆ 156 Marketing โ€ฆ โ€ฆ โ€ฆ โ€ฆ Employee Department Query 1: Is there an employee named โ€œNemoโ€? Query 2: What is โ€œNemoโ€™sโ€ salary? Query 3: How many departments are there in the company? Query 4: What is the name of โ€œNemoโ€™sโ€ department? Query 5: How many employees are there in the โ€œAccountsโ€ department?
  • 5. DataBase Management System (DBMS) High-level Query Q DBMS Data Answer Translates Q into best execution plan for current conditions, runs plan
  • 6. Example: Store that Sells Cars Make Model OwnerID Honda Accord 12 Toyota Camry 34 Mini Cooper 89 Honda Accord 156 โ€ฆ โ€ฆ โ€ฆ ID Name Age 12 Nemo 22 34 Ray 42 89 Gill 36 156 Dory 21 โ€ฆ โ€ฆ โ€ฆ Cars Owners Filter (Make = Honda and Model = Accord) Join (Cars.OwnerID = Owners.ID) Make Model OwnerID ID Name Age Honda Accord 12 12 Nemo 22 Honda Accord 156 156 Dory 21 Owners of Honda Accords who are <= 23 years old Filter (Age <= 23)
  • 7. DataBase Management System (DBMS) High-level Query Q DBMS Data Answer Translates Q into best execution plan for current conditions, runs plan Keeps data safe and correct despite failures, concurrent updates, online processing, etc.
  • 8. A Brief History Relational database management systems Time 1975- 1985 1985- 1995 1995- 2005 2005- 2010 2020 Semi-structured and unstructured data (Web) Hardware developments Developments in system software Changes in data sizes Assumptions and requirements changed over time
  • 9. Big Data: How much data? ๏‚ข Google processes 20 PB a day (2008) ๏‚ข Wayback Machine has 3 PB + 100 TB/month (3/2009) ๏‚ข eBay has 6.5 PB of user data + 50 TB/day (5/2009) ๏‚ข Facebook has 36 PB of user data + 80-90 TB/day (6/2010) ๏‚ข CERNโ€™s LHC: 15 PB a year (any day now) ๏‚ข LSST: 6-10 PB a year (~2015) 640K ought to be enough for anybody. From http://www.umiacs.umd.edu/~jimmylin/
  • 11. NEW REALITIES TB disks < $100 Everything is data Rise of data-driven culture Very publicly espoused by Google, Wired, etc. Sloan Digital Sky Survey, Terraserver, etc. The quest for knowledge used to begin with grand theories. Now it begins with massive amounts of data. Welcome to the Petabyte Age. From: http://db.cs.berkeley.edu/jmh/
  • 12. THE NEW PRACTITIONERS Hal Varian, UC Berkeley, Chief Economist @ Google โ€œLooking for a career where your services will be in high demand? โ€ฆ Provide a scarce, complementary service to something that is getting ubiquitous and cheap. So whatโ€™s ubiquitous and cheap? Data. And what is complementary to data? Analysis. the sexy job in the next ten years will be statisticians From: http://db.cs.berkeley.edu/jmh/
  • 13. THE NEW PRACTITIONERS Aggressively Datavorous Statistically savvy Diverse in training, tools From: http://db.cs.berkeley.edu/jmh/
  • 14. FOX AUDIENCE NETWORK โ€ข Greenplum parallel DB โ€ข 42 Sun X4500s (โ€œThumperโ€) each with: โ€ข 48 500GB drives โ€ข 16GB RAM โ€ข 2 dual-core Opterons โ€ข Big and growing โ€ข 200 TB data (mirrored) โ€ข Fact table of 1.5 trillion rows โ€ข Growing 5TB per day โ€ข 4-7 Billion rows per day โ€ข Also extensive use of R and Hadoop As reported by FAN, Feb, 2009 From: http://db.cs.berkeley.edu/jmh/ Yahoo! runs a 4000 node Hadoop cluster (probably the largest). Overall, there are 38,000 nodes running Hadoop at Yahoo!
  • 15. A SCENARIO FROM FAN Open-ended question about statistical densities (distributions) How many female WWF fans under the age of 30 visited the Toyota community over the last 4 days and saw a Class A ad? How are these people similar to those that visited Nissan? From: http://db.cs.berkeley.edu/jmh/
  • 16. MULTILINGUAL DEVELOPMENT SQL or MapReduce Sequential code in a variety of languages Perl Python Java R Mix and Match! From: http://db.cs.berkeley.edu/jmh/
  • 18. Teaching/Learning Methodology Relational database management systems Time 1975- 1985 1985- 1995 1995- 2005 2005- 2010 2020 Semi-structured and unstructured data (Web) Hardware developments Developments in system software Changes in data sizes Assumptions and requirements changed over time
  • 19. Course Outline โ€ข Principles of query processing (30%) โ€“ Indexes โ€“ Query execution plans and operators โ€“ Query optimization โ€ข Data storage (10%) โ€“ Databases Vs. filesystems (Google/Hadoop Distributed FileSystem) โ€“ Flash memory and Solid State Drives โ€ข Scalable data processing (35%) โ€“ Parallel query plans and operators โ€“ Systems based on MapReduce โ€“ Scalable key-value stores โ€ข Concurrency control and recovery (15%) โ€“ Consistency models for data (ACID, BASE, Serializability) โ€“ Write-ahead logging โ€ข Information retrieval and Data mining (10%) โ€“ Web search (Google PageRank, inverted indexes) โ€“ Association rules and clustering
  • 20. Course Logistics โ€ข Web: http://www.cs.duke.edu/courses/fall10/cps216 โ€ข TA: Gang Luo โ€ข References: โ€“ Hadoop: The Definitive Guide, by Tom White โ€“ Database Systems: The Complete Book, by H. Garcia- Molina, J. D. Ullman, and J. Widom โ€ข Grading: โ€“ Project 35% (Hopefully, on Amazon Cloud!) โ€“ Homework Assignments 15% โ€“ Midterm 25% โ€“ Final 25%