SlideShare a Scribd company logo
Big Data Meets Computer
Science
Jim Hendler
Tetherless World Professor of Computer, Web and
Data Sciences
Director, Rensselaer Institute for Data Exploration and
Applications
@jahendler
The Rensselaer “IDEA” (idea.rpi.edu)The Rensselaer “IDEA” (idea.rpi.edu)
The Rensselaer IDEA 3
… Across Applications (corresponding to Challenges Identified in the
Rensselaer Plan 2024)
Healthcare
Analytics
Business
Systems
Built and Natural
Environments
Virtual and
Augmented Reality
Cyber-
Resiliency
Policy, Ethics and
Open Government
Materials
Informatics
Data-driven
Physical/Life
Sciences
The Rensselaer IDEA 4
Developing a Comprehensive “Data Science” Research Agenda
P. Fox and J. Hendler, The Science of Data Science, Big Data, 2(2), in press
The Rensselaer IDEA
Graduate Projects in IDEA
• IDEA and CCI (HPC): technologies to enable
Rensselaer researchers to work with data at larger
scales and in new ways
• Population-scale cognitive computing models for
“human intensive” agent-based simulations
• IDEA and EMPAC (Performing arts center): provide
next generation data exploration tools
• Multi-person data visualization tools for big-data
applications
• IDEA and Watson: New direction in Cognitive
Computation
• How do we go from Question/Answering to Open Web
Data exploration?
• IDEA and CBIS (Ctr for Biotechnology &
Interdisciplinary Studies): Data-driven Informatics
• Can we couple semantics and big data to find new medical
uses for already approved drugs?
The Rensselaer IDEA
External Projects and partnerships
Emergency Room Care
Language and Agents
Largescale Healthcare Analytics
In Discussion Jumpstart (Proposal underway)
Built and Natural Biome data-driven
science and engineering
Cognitive Computing Collaborative
Research Initiative
Campus Data
Infrastructure
Metadata
• Title
• Author
• Author Email
• Licence
• Subject
• Keyword
• Data Type
Dataset
CDF
RPI Object Deposit RPI Research Network
RPI-ID Request RPI-ID Request
Share
Knowledge
Join
Network
Allocate a universal accessible RPI-ID
Register Metadata
Upload Any Data
RPI Research Object
Registration and Deposit
RPI Research Collaboration
and Community Network
Requires going Beyond
the Database
Discovery
Integrate
Visualize
Explain
Thinking outside the Database box
Strata talk, 2013 - https://www.youtube.com/watch?v=Cob5oltMGMc
At new scales (and in
new ways)
Fox and Hendler, Changing the Equation on Scientific Visualization,
Science, 2/11 - http://www.sciencemag.org/content/331/6018/705.short)
A Whole New World
• But what about undergraduate
education
– where do we train the students who can
take on projects needing
• statistics and analytics
• informatics
• data science challenges
• machine learning
• unstructured data
• cognitive computation
• …
Computer Science
Education?
• Programming is a necessary skill
– not sufficient
• and we mostly teach it wrong…
– (For my heresies about teaching programming, see
“Let’s Help Computer Science Students Crack the
Code, 3/13 http://chronicle.com/article/Lets-Help-Computer-Science/137649/ )
• The computing environment of today is nothing like
the computing environment of the 70s,
– but the curriculum hasn’t changed much since I was in
school – but the fundamentals are NOT all the same
– data-oriented computations involve graphs, memory
intensive algorithms, machine learning, …
Deploying these ideas at
RPI
• Innovation in the interdisciplinary Information
Technology Program
– Renamed Information Technology and Web
Science, 2011
• for more on Web Science, see
– Berners-Lee et al., Creating a Science of the World Wide Web,
Science, 2006,
https://www.sciencemag.org/content/313/5788/769.summary;
– Hendler et. al, Web Science: An interdisciplinary Approach to
Understanding the Web, CACM, 7/2008,
http://cacm.acm.org/magazines/2008/7/5366-web-science/fulltext
IT and Web Science
• First IT academic program in U.S.
• First web science degree program in
U.S.; First undergraduate web science
degree anywhere
• BS in ITWS (20 concentrations) and MS
in IT (10 concentrations)
• PhD in Multi-Disciplinary Sciences
• http://itws.rpi.edu
– I was Director 2008-2012
– Now directed by Peter Fox (whose slides I stole
for this section)
 
 
 
Technical Track Courses
 
 
 Concentrations
Computer Engineering
Track
1) ECSE-2610 Computer Components and Operations
2) ENGR-2350 Embedded Control
3) ECSE-2660 Computer Architecture, Networking and 
Operating Systems
Civil Engineering
Computer Hardware
Computer Networking (hardware focus)
Mechanical/Aeronautical  Eng.
Computer Science Track 1) CSCI-2200 Foundations of Computer Science
2) CSCI-2300 Introduction to Algorithms
3) CSCI-2500 Computer Organization
Cognitive Science
Computer Networking (software focus)
Information Security
Machine and Computational Learning
Information Systems Track 1) CSCI-2200 Foundation of Computer Science
2) CSCI-2500 Computer Organization
3) Four credits from the following:
• CSCI-2220 Programming in Java (2 credits)
• CSCI-2961 Program in Python (2 credits)
• CSCI-2300 Introduction to Algorithms (4 credits)
• ITWS-49XX Web Systems Development II (4 credits)
Arts
Communication
Economics
Entrepreneurship
Finance
Management Information 
  Systems
Medicine
Pre-law
Psychology
STS
Web Science Track 1) CSCI-2200 Foundations of Computer Science
2) CSCI-2500 Computer Organization
3) One of the following:
• CSCI-49XX Web Systems Development II
• Web/Data Course approved by ITWS Curriculum 
Committee
Data Science
Science Informatics 
Web Technologies
 
CHANGES TO THE MASTER’S IN
INFORMATION TECHNOLOGY
PROGRAM
• In Spring 2013 the MS in IT core curriculum was revised
to include Data Analytics.
• Networking core classes were replaced with Data
Analytics core classes: Data Science, Database Mining,
X-informatics, and Data Analytics (a new class offered in
Spring 2014).
• The MS in IT program also added two new
concentrations: Data Science and Analytics and
Information Dominance.
• The Information Dominance concentration was
developed for a new Navy program that will be educating
a select group of 5-10 naval officers a year with the skills
needed for military cyberspace operations. Two officers
started in Fall 2013 and three began in Spring 2014.
IT Core Area Course Number Course Title
Term(s)
Offered
Database Systems CSCI-4380 Database Systems Fall/Spring
Data Analytics ITWS-6350 Data Science Fall
Software Design and
Engineering
CSCI-4440 Software Design and Documentation Fall
ITWS-6400 X-Informatics Spring
Management of
Technology*
ITWS-6300
Business Issues for Engineers and Scientists
(Professional Track Only)
Fall/Spring
Human Computer
Interaction
COMM-6420 Foundations of HCI Usability Fall
COMM-696X Human Media Interaction Spring
MS in IT Required Core Courses
* For the research track, replace ITWS-6300 Business Issues for Engineers and Scientists with one of the two semester courses ITWS-
6980 Master’s Project or ITWS-6990 Master’s Thesis.
Advanced Core options for students who have previously completed a Core Course
IT Core Area Course Number Course Title
Term(s)
Offered
Database Systems
CSCI-6390 Database Mining Fall
ITWS-6350 Data Science Fall
ITWS-696X Semantic E-Science Fall
Data Analytics
CSCI-6390 Database Mining Fall
ITWS-6400 X-Informatics Spring
ITWX-696X Data Analytics Spring
Software Design
CSCI-6500 Distributed Computing Over the Internet Fall
ECSE-6780 Software Engineering II Fall
ITWS-696X Semantic E-Science Fall
Management of
Technology
MGMT-6080 Networks, Innovation and Value Creation Fall
MGMT-6140 Information Systems for Management Spring
Human Computer
Interaction
COMM-6620 Information Architecture Spring
COMM-6770 User-Centered Design Fall
COMM-696X Interactive Media Design Summer
Concentration Course Number Course Name Term(s)
Offered
Data
Science and
Analytics
Data and Information analytics extends analysis (descriptive and
predictive models to obtain knowledge from data) by using
insight from analyses to recommend action or to guide and
communicate decision-making. Thus, analytics is not so much
concerned with individual analyses or analysis steps, but with an
entire methodology. Key topics include: advanced statistical
computing theory, multivariate analysis, and application of
computer science courses such as data mining and machine
learning and change detection by uncovering unexpected
patterns in data.
Select two or three of the following courses:
ITWS-6350 Data Science Fall
ITWS-6400 X-Informatics Spring
ITWS-696X Data Analytics Spring
ITWS-696X Semantic E-Science Fall
ITWX-696X
Advanced Semantic
Technologies*
Spring
If only two of the above were chosen, select one more of
the following courses:
COMM-6620 Information Architecture Spring
CSCI-4020 Computer Algorithms Spring
CSCI-4150 Introduction to AI Fall
CSCI-6390 Database Mining Fall
CSCI-4220 or CSCI-
6220
Network Programming
or Parallel Algorithm
Design
Spring
ISYE-4220
Optimization Algorithms
and Applications
Fall
ISYE-6180
Knowledge Discovery
with Data Mining
Spring
MGMT-696X
Technology Foundations
for Business Analytics
Fall
MGMT-696X
Predictive Analytics
Using Social Media
Spring
Concentration Course Number Course Name Term(s)
Offered
Information
Dominance
The Information Dominance concentration prepares students for
careers designing, building, and managing secure information
systems and networks. The concentration includes advanced
study in encryption and network security, formal models and
policies for access control in databases and application systems,
secure coding techniques, and other related information
assurance topics. The combination of coursework provides
comprehensive coverage of issues and solutions for utilizing
high assurance systems for tactical decision-making. It
prepares students for careers ranging from secure information
systems analyst, to information security engineer, to field
information manager and chief information officer. It is also
appropriate for all IT professionals who want to enhance their
knowledge of how to use pervasive information in situational
awareness, operations scenarios, and decision-making.
Select two or three of the following courses:
ISYE-6180
Knowledge Discovery with Data
Mining
Spring
CSCI-6960
Cryptography and Network
Security I
Fall
ITWS-4370 Information System Security Spring
CSCI-4650 Networking Laboratory I
Fall/Spri
ng
MGMT-7760 Risk Management Fall
ISYE-4310
Ethics of Modeling for Industrial
Systems Engineering
Fall
If only two of the above were chosen, select one more of the
following courses:
CSCI-6390 Database Mining Fall
CSCI-6968
Cryptography and Network
Security II
Spring
CSCI-4660 Networking Laboratory II
Fall/Spri
ng
ECSE-6860
Evaluation Methods for Decision
Making
Fall
ISYE-6500
Information and Decision
Technologies for Industrial and
Service Systems
Fall/Spri
ng
CSCI-496X
Computational Analysis of
Social Processes
Fall
Two New MS in IT Concentrations
Also at RPI
• Data Science Research Center and Data Science
Education Center (dsrc.rpi.edu, 2009)
• http://www.rpi.edu/about/inside/issue/v4n17/datacente
r.html
– Over 45: research faculty, post-docs, grad students, staff,
undergraduates…
• Data is one of the Rensselaer Plan’s five thrusts
• Other key faculty
– Fran Berman (Center for Digital Society and RDA)
– Bulent Yener (DSRC Director)
– Peter Fox(ITWS Director)
More RPI Curriculua
• Environmental Science with Geoinformatics
concentration
• Bio, geo, chem, astro, materials - informatics
• GIS for Science
• Visualization (new summer program)
• Multi-disciplinary science program - PhD in
Data and Web Science
• DATUM: Data in Undergraduate Math! (Bennett)
• Missing – intermediate statistics
• Graphs – significant potential here – must teach!
5-6 years in…
• Science and interdisciplinary from the start!
– Not a question of: do we train scientists to be
technical/data people, or do we train technical
people to learn the science
– It’s a skill/ course level approach that is needed
• We teach methodology and principles over
technology
• Data science must be a skill, and natural like
using instruments, writing/using codes
• Team/ collaboration aspects are key
• Foundations and theory must be taught
– for data, as well as programming
Summary

More Related Content

What's hot

Big Data - Applications and Technologies Overview
Big Data - Applications and Technologies OverviewBig Data - Applications and Technologies Overview
Big Data - Applications and Technologies Overview
Sivashankar Ganapathy
 
Big Data - Insights & Challenges
Big Data - Insights & ChallengesBig Data - Insights & Challenges
Big Data - Insights & Challenges
Rupen Momaya
 
Big data privacy issues in public social media
Big data privacy issues in public social mediaBig data privacy issues in public social media
Big data privacy issues in public social media
Supriya Radhakrishna
 
The Pros and Cons of Big Data in an ePatient World
The Pros and Cons of Big Data in an ePatient WorldThe Pros and Cons of Big Data in an ePatient World
The Pros and Cons of Big Data in an ePatient World
PYA, P.C.
 
Tools and Methods for Big Data Analytics by Dahl Winters
Tools and Methods for Big Data Analytics by Dahl WintersTools and Methods for Big Data Analytics by Dahl Winters
Tools and Methods for Big Data Analytics by Dahl Winters
Melinda Thielbar
 
The future of big data analytics
The future of big data analyticsThe future of big data analytics
The future of big data analytics
Ahmed Banafa
 
Research issues in the big data and its Challenges
Research issues in the big data and its ChallengesResearch issues in the big data and its Challenges
Research issues in the big data and its Challenges
Kathirvel Ayyaswamy
 
The promise and challenge of Big Data
The promise and challenge of Big DataThe promise and challenge of Big Data
The promise and challenge of Big Data
The Marketing Distillery
 
Big Data’s Big Impact on Businesses
Big Data’s Big Impact on BusinessesBig Data’s Big Impact on Businesses
Big Data’s Big Impact on Businesses
CRISIL Limited
 
Big data 101
Big data 101Big data 101
Big data 101
Lars Marius Garshol
 
Ppt for Application of big data
Ppt for Application of big dataPpt for Application of big data
Ppt for Application of big data
Prashant Sharma
 
A Big Data Concept
A Big Data ConceptA Big Data Concept
A Big Data Concept
Dharmesh Tank
 
Data minig with Big data analysis
Data minig with Big data analysisData minig with Big data analysis
Data minig with Big data analysis
Poonam Kshirsagar
 
Big data ppt
Big data pptBig data ppt
Big data ppt
AKASH SIHAG
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
Napier University
 
Introduction to big data
Introduction to big dataIntroduction to big data
Introduction to big data
Richard Vidgen
 
An Introduction to Big Data
An Introduction to Big DataAn Introduction to Big Data
An Introduction to Big Data
eXascale Infolab
 
Big Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapBig Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and Roadmap
Srinath Perera
 
big data analytics in mobile cellular network
big data analytics in mobile cellular networkbig data analytics in mobile cellular network
big data analytics in mobile cellular network
shubham patil
 
Intro to Data Science Big Data
Intro to Data Science Big DataIntro to Data Science Big Data
Intro to Data Science Big Data
Indu Khemchandani
 

What's hot (20)

Big Data - Applications and Technologies Overview
Big Data - Applications and Technologies OverviewBig Data - Applications and Technologies Overview
Big Data - Applications and Technologies Overview
 
Big Data - Insights & Challenges
Big Data - Insights & ChallengesBig Data - Insights & Challenges
Big Data - Insights & Challenges
 
Big data privacy issues in public social media
Big data privacy issues in public social mediaBig data privacy issues in public social media
Big data privacy issues in public social media
 
The Pros and Cons of Big Data in an ePatient World
The Pros and Cons of Big Data in an ePatient WorldThe Pros and Cons of Big Data in an ePatient World
The Pros and Cons of Big Data in an ePatient World
 
Tools and Methods for Big Data Analytics by Dahl Winters
Tools and Methods for Big Data Analytics by Dahl WintersTools and Methods for Big Data Analytics by Dahl Winters
Tools and Methods for Big Data Analytics by Dahl Winters
 
The future of big data analytics
The future of big data analyticsThe future of big data analytics
The future of big data analytics
 
Research issues in the big data and its Challenges
Research issues in the big data and its ChallengesResearch issues in the big data and its Challenges
Research issues in the big data and its Challenges
 
The promise and challenge of Big Data
The promise and challenge of Big DataThe promise and challenge of Big Data
The promise and challenge of Big Data
 
Big Data’s Big Impact on Businesses
Big Data’s Big Impact on BusinessesBig Data’s Big Impact on Businesses
Big Data’s Big Impact on Businesses
 
Big data 101
Big data 101Big data 101
Big data 101
 
Ppt for Application of big data
Ppt for Application of big dataPpt for Application of big data
Ppt for Application of big data
 
A Big Data Concept
A Big Data ConceptA Big Data Concept
A Big Data Concept
 
Data minig with Big data analysis
Data minig with Big data analysisData minig with Big data analysis
Data minig with Big data analysis
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Introduction to big data
Introduction to big dataIntroduction to big data
Introduction to big data
 
An Introduction to Big Data
An Introduction to Big DataAn Introduction to Big Data
An Introduction to Big Data
 
Big Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapBig Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and Roadmap
 
big data analytics in mobile cellular network
big data analytics in mobile cellular networkbig data analytics in mobile cellular network
big data analytics in mobile cellular network
 
Intro to Data Science Big Data
Intro to Data Science Big DataIntro to Data Science Big Data
Intro to Data Science Big Data
 

Viewers also liked

Dai Big Data al Cognitive Computing - Pietro Leo
Dai Big Data al Cognitive Computing - Pietro LeoDai Big Data al Cognitive Computing - Pietro Leo
Dai Big Data al Cognitive Computing - Pietro Leo
Apulian ICT Living Labs
 
New Frontiers in IA: Design in the Era of Cognitive Computing
New Frontiers in IA: Design in the Era of Cognitive ComputingNew Frontiers in IA: Design in the Era of Cognitive Computing
New Frontiers in IA: Design in the Era of Cognitive Computing
Paul King
 
Computer Science Imperative
Computer Science ImperativeComputer Science Imperative
Computer Science Imperative
Hal Speed
 
Basic data structure and data operation
Basic data structure and data operationBasic data structure and data operation
Basic data structure and data operation
Mohsin Siddique
 
Point Placement Algorithms: An Experimental Study
Point Placement Algorithms: An Experimental StudyPoint Placement Algorithms: An Experimental Study
Point Placement Algorithms: An Experimental Study
CSCJournals
 
M.S. Thesis Defense
M.S. Thesis DefenseM.S. Thesis Defense
M.S. Thesis Defense
pbecker1987
 
Watson and Open Source Tools
Watson and Open Source ToolsWatson and Open Source Tools
Watson and Open Source Tools
Boulder Java User's Group
 
Ibm big data-platform
Ibm big data-platformIbm big data-platform
Ibm big data-platform
IBM Sverige
 
Basics of computer science
Basics of computer scienceBasics of computer science
Basics of computer science
Paul Schmidt
 
An introduction to Computer Technology
An introduction to Computer TechnologyAn introduction to Computer Technology
An introduction to Computer Technology
Steven Heath
 
Computer instructions
Computer instructionsComputer instructions
Computer instructions
Anuj Modi
 
Computer Science & Information Systems
Computer Science & Information SystemsComputer Science & Information Systems
Computer Science & Information Systems
Luis Borges Gouveia
 
Understanding the New World of Cognitive Computing
Understanding the New World of Cognitive ComputingUnderstanding the New World of Cognitive Computing
Understanding the New World of Cognitive Computing
DATAVERSITY
 
2.0 Introduction to Computer Science and Programming
2.0 Introduction to Computer Science and Programming2.0 Introduction to Computer Science and Programming
2.0 Introduction to Computer Science and Programming
Abdelrahman Hosny
 
Amuse UX 2015: Y.Vetrov — Platform Thinking
Amuse UX 2015: Y.Vetrov — Platform ThinkingAmuse UX 2015: Y.Vetrov — Platform Thinking
Amuse UX 2015: Y.Vetrov — Platform Thinking
Yury Vetrov
 
Interaction designers vs algorithms
Interaction designers vs algorithmsInteraction designers vs algorithms
Interaction designers vs algorithms
cxpartners
 
History of Computers
History of ComputersHistory of Computers
History of Computers
mshihab
 
IBM Watson Analytics Presentation
IBM Watson Analytics PresentationIBM Watson Analytics Presentation
IBM Watson Analytics Presentation
Ian Balina
 
simplification of boolean algebra
simplification of boolean algebrasimplification of boolean algebra
simplification of boolean algebra
mayannpolisticoLNU
 
Introduction to Computers
Introduction to ComputersIntroduction to Computers
Introduction to Computers
Samudin Kassan
 

Viewers also liked (20)

Dai Big Data al Cognitive Computing - Pietro Leo
Dai Big Data al Cognitive Computing - Pietro LeoDai Big Data al Cognitive Computing - Pietro Leo
Dai Big Data al Cognitive Computing - Pietro Leo
 
New Frontiers in IA: Design in the Era of Cognitive Computing
New Frontiers in IA: Design in the Era of Cognitive ComputingNew Frontiers in IA: Design in the Era of Cognitive Computing
New Frontiers in IA: Design in the Era of Cognitive Computing
 
Computer Science Imperative
Computer Science ImperativeComputer Science Imperative
Computer Science Imperative
 
Basic data structure and data operation
Basic data structure and data operationBasic data structure and data operation
Basic data structure and data operation
 
Point Placement Algorithms: An Experimental Study
Point Placement Algorithms: An Experimental StudyPoint Placement Algorithms: An Experimental Study
Point Placement Algorithms: An Experimental Study
 
M.S. Thesis Defense
M.S. Thesis DefenseM.S. Thesis Defense
M.S. Thesis Defense
 
Watson and Open Source Tools
Watson and Open Source ToolsWatson and Open Source Tools
Watson and Open Source Tools
 
Ibm big data-platform
Ibm big data-platformIbm big data-platform
Ibm big data-platform
 
Basics of computer science
Basics of computer scienceBasics of computer science
Basics of computer science
 
An introduction to Computer Technology
An introduction to Computer TechnologyAn introduction to Computer Technology
An introduction to Computer Technology
 
Computer instructions
Computer instructionsComputer instructions
Computer instructions
 
Computer Science & Information Systems
Computer Science & Information SystemsComputer Science & Information Systems
Computer Science & Information Systems
 
Understanding the New World of Cognitive Computing
Understanding the New World of Cognitive ComputingUnderstanding the New World of Cognitive Computing
Understanding the New World of Cognitive Computing
 
2.0 Introduction to Computer Science and Programming
2.0 Introduction to Computer Science and Programming2.0 Introduction to Computer Science and Programming
2.0 Introduction to Computer Science and Programming
 
Amuse UX 2015: Y.Vetrov — Platform Thinking
Amuse UX 2015: Y.Vetrov — Platform ThinkingAmuse UX 2015: Y.Vetrov — Platform Thinking
Amuse UX 2015: Y.Vetrov — Platform Thinking
 
Interaction designers vs algorithms
Interaction designers vs algorithmsInteraction designers vs algorithms
Interaction designers vs algorithms
 
History of Computers
History of ComputersHistory of Computers
History of Computers
 
IBM Watson Analytics Presentation
IBM Watson Analytics PresentationIBM Watson Analytics Presentation
IBM Watson Analytics Presentation
 
simplification of boolean algebra
simplification of boolean algebrasimplification of boolean algebra
simplification of boolean algebra
 
Introduction to Computers
Introduction to ComputersIntroduction to Computers
Introduction to Computers
 

Similar to Big Data and Computer Science Education

Big Data HPC Convergence and a bunch of other things
Big Data HPC Convergence and a bunch of other thingsBig Data HPC Convergence and a bunch of other things
Big Data HPC Convergence and a bunch of other things
Geoffrey Fox
 
Towards a Community-driven Data Science Body of Knowledge – Data Management S...
Towards a Community-driven Data Science Body of Knowledge – Data Management S...Towards a Community-driven Data Science Body of Knowledge – Data Management S...
Towards a Community-driven Data Science Body of Knowledge – Data Management S...
Research Data Alliance
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Dr. Sunil Kr. Pandey
 
Introduction to Data Science.pdf
Introduction to Data Science.pdfIntroduction to Data Science.pdf
Introduction to Data Science.pdf
University of Sindh
 
Data+Science : A First Course
Data+Science : A First CourseData+Science : A First Course
Data+Science : A First Course
Arnab Majumdar
 
Application statistics in software engineering
Application statistics in software engineeringApplication statistics in software engineering
Application statistics in software engineering
md emran
 
A New Paradigm on Analytic-Driven Information and Automation V2.pdf
A New Paradigm on Analytic-Driven Information and Automation V2.pdfA New Paradigm on Analytic-Driven Information and Automation V2.pdf
A New Paradigm on Analytic-Driven Information and Automation V2.pdf
ArmyTrilidiaDevegaSK
 
FDS_dept_ppt.pptx
FDS_dept_ppt.pptxFDS_dept_ppt.pptx
FDS_dept_ppt.pptx
SatyajitPatil42
 
A Deep Dissertion Of Data Science Related Issues And Its Applications
A Deep Dissertion Of Data Science  Related Issues And Its ApplicationsA Deep Dissertion Of Data Science  Related Issues And Its Applications
A Deep Dissertion Of Data Science Related Issues And Its Applications
Tracy Hill
 
The Analytics and Data Science Landscape
The Analytics and Data Science LandscapeThe Analytics and Data Science Landscape
The Analytics and Data Science Landscape
Philip Bourne
 
Data Science for Every Student at RPI
Data Science for Every Student at RPIData Science for Every Student at RPI
Data Science for Every Student at RPI
Steven Miller
 
Ch1IntroductiontoDataScience.pptx
Ch1IntroductiontoDataScience.pptxCh1IntroductiontoDataScience.pptx
Ch1IntroductiontoDataScience.pptx
AbderrahmanABID2
 
Building the Data Science Profession in Europe
Building the Data Science Profession in EuropeBuilding the Data Science Profession in Europe
Building the Data Science Profession in Europe
Steven Miller
 
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET Journal
 
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET Journal
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
Denodo
 
How to crack down big data?
How to crack down big data? How to crack down big data?
How to crack down big data?
Ta-Wei (David) Huang
 
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
Elvis Muyanja
 
Official resume titash_mandal_
Official resume titash_mandal_Official resume titash_mandal_
Official resume titash_mandal_
Titash Mandal
 
Cse 8th sem syllabus
Cse 8th sem syllabusCse 8th sem syllabus
Cse 8th sem syllabus
Akshatha Nair
 

Similar to Big Data and Computer Science Education (20)

Big Data HPC Convergence and a bunch of other things
Big Data HPC Convergence and a bunch of other thingsBig Data HPC Convergence and a bunch of other things
Big Data HPC Convergence and a bunch of other things
 
Towards a Community-driven Data Science Body of Knowledge – Data Management S...
Towards a Community-driven Data Science Body of Knowledge – Data Management S...Towards a Community-driven Data Science Body of Knowledge – Data Management S...
Towards a Community-driven Data Science Body of Knowledge – Data Management S...
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
 
Introduction to Data Science.pdf
Introduction to Data Science.pdfIntroduction to Data Science.pdf
Introduction to Data Science.pdf
 
Data+Science : A First Course
Data+Science : A First CourseData+Science : A First Course
Data+Science : A First Course
 
Application statistics in software engineering
Application statistics in software engineeringApplication statistics in software engineering
Application statistics in software engineering
 
A New Paradigm on Analytic-Driven Information and Automation V2.pdf
A New Paradigm on Analytic-Driven Information and Automation V2.pdfA New Paradigm on Analytic-Driven Information and Automation V2.pdf
A New Paradigm on Analytic-Driven Information and Automation V2.pdf
 
FDS_dept_ppt.pptx
FDS_dept_ppt.pptxFDS_dept_ppt.pptx
FDS_dept_ppt.pptx
 
A Deep Dissertion Of Data Science Related Issues And Its Applications
A Deep Dissertion Of Data Science  Related Issues And Its ApplicationsA Deep Dissertion Of Data Science  Related Issues And Its Applications
A Deep Dissertion Of Data Science Related Issues And Its Applications
 
The Analytics and Data Science Landscape
The Analytics and Data Science LandscapeThe Analytics and Data Science Landscape
The Analytics and Data Science Landscape
 
Data Science for Every Student at RPI
Data Science for Every Student at RPIData Science for Every Student at RPI
Data Science for Every Student at RPI
 
Ch1IntroductiontoDataScience.pptx
Ch1IntroductiontoDataScience.pptxCh1IntroductiontoDataScience.pptx
Ch1IntroductiontoDataScience.pptx
 
Building the Data Science Profession in Europe
Building the Data Science Profession in EuropeBuilding the Data Science Profession in Europe
Building the Data Science Profession in Europe
 
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
 
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
How to crack down big data?
How to crack down big data? How to crack down big data?
How to crack down big data?
 
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
 
Official resume titash_mandal_
Official resume titash_mandal_Official resume titash_mandal_
Official resume titash_mandal_
 
Cse 8th sem syllabus
Cse 8th sem syllabusCse 8th sem syllabus
Cse 8th sem syllabus
 

More from James Hendler

Knowing what AI Systems Don't know and Why it matters
Knowing what AI  Systems Don't know and Why it mattersKnowing what AI  Systems Don't know and Why it matters
Knowing what AI Systems Don't know and Why it matters
James Hendler
 
Exploring the Boundaries of Artificial Intelligence (or "Modern AI")
Exploring the Boundaries of Artificial Intelligence (or "Modern AI")Exploring the Boundaries of Artificial Intelligence (or "Modern AI")
Exploring the Boundaries of Artificial Intelligence (or "Modern AI")
James Hendler
 
Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)
James Hendler
 
Tragedy of the (Data) Commons
Tragedy of the (Data) CommonsTragedy of the (Data) Commons
Tragedy of the (Data) Commons
James Hendler
 
Knowledge Graph Semantics/Interoperability
Knowledge Graph Semantics/InteroperabilityKnowledge Graph Semantics/Interoperability
Knowledge Graph Semantics/Interoperability
James Hendler
 
The Future(s) of the World Wide Web
The Future(s) of the World Wide WebThe Future(s) of the World Wide Web
The Future(s) of the World Wide Web
James Hendler
 
Enhancing Precision Wellness with Personal Health Knowledge Graphs
Enhancing Precision Wellness with Personal Health Knowledge Graphs Enhancing Precision Wellness with Personal Health Knowledge Graphs
Enhancing Precision Wellness with Personal Health Knowledge Graphs
James Hendler
 
The Future of AI: Going Beyond Deep Learning, Watson, and the Semantic Web
The Future of AI: Going BeyondDeep Learning, Watson, and the Semantic WebThe Future of AI: Going BeyondDeep Learning, Watson, and the Semantic Web
The Future of AI: Going Beyond Deep Learning, Watson, and the Semantic Web
James Hendler
 
Capacity Building: Data Science in the University At Rensselaer Polytechnic ...
Capacity Building: Data Science in the University  At Rensselaer Polytechnic ...Capacity Building: Data Science in the University  At Rensselaer Polytechnic ...
Capacity Building: Data Science in the University At Rensselaer Polytechnic ...
James Hendler
 
Enhancing Precision Wellness with Knowledge Graphs and Semantic Analytics: O...
Enhancing Precision Wellness with  Knowledge Graphs and Semantic Analytics: O...Enhancing Precision Wellness with  Knowledge Graphs and Semantic Analytics: O...
Enhancing Precision Wellness with Knowledge Graphs and Semantic Analytics: O...
James Hendler
 
KR in the age of Deep Learning
KR in the age of Deep LearningKR in the age of Deep Learning
KR in the age of Deep Learning
James Hendler
 
Digital Archiving, The Semantic Web, and Modern AI
Digital Archiving, The Semantic Web, and Modern AIDigital Archiving, The Semantic Web, and Modern AI
Digital Archiving, The Semantic Web, and Modern AI
James Hendler
 
The Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of MetadataThe Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of Metadata
James Hendler
 
Social Machines - 2017 Update (University of Iowa)
Social Machines - 2017 Update (University of Iowa)Social Machines - 2017 Update (University of Iowa)
Social Machines - 2017 Update (University of Iowa)
James Hendler
 
Social Machines: The coming collision of Artificial Intelligence, Social Netw...
Social Machines: The coming collision of Artificial Intelligence, Social Netw...Social Machines: The coming collision of Artificial Intelligence, Social Netw...
Social Machines: The coming collision of Artificial Intelligence, Social Netw...
James Hendler
 
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
James Hendler
 
Wither OWL
Wither OWLWither OWL
Wither OWL
James Hendler
 
Artificial Intelligence: Existential Threat or Our Best Hope for the Future?
Artificial Intelligence: Existential Threat or Our Best Hope for the Future?Artificial Intelligence: Existential Threat or Our Best Hope for the Future?
Artificial Intelligence: Existential Threat or Our Best Hope for the Future?
James Hendler
 
On Beyond OWL: challenges for ontologies on the Web
On Beyond OWL: challenges for ontologies on the WebOn Beyond OWL: challenges for ontologies on the Web
On Beyond OWL: challenges for ontologies on the Web
James Hendler
 
Broad Data (India 2015)
Broad Data (India 2015)Broad Data (India 2015)
Broad Data (India 2015)
James Hendler
 

More from James Hendler (20)

Knowing what AI Systems Don't know and Why it matters
Knowing what AI  Systems Don't know and Why it mattersKnowing what AI  Systems Don't know and Why it matters
Knowing what AI Systems Don't know and Why it matters
 
Exploring the Boundaries of Artificial Intelligence (or "Modern AI")
Exploring the Boundaries of Artificial Intelligence (or "Modern AI")Exploring the Boundaries of Artificial Intelligence (or "Modern AI")
Exploring the Boundaries of Artificial Intelligence (or "Modern AI")
 
Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)
 
Tragedy of the (Data) Commons
Tragedy of the (Data) CommonsTragedy of the (Data) Commons
Tragedy of the (Data) Commons
 
Knowledge Graph Semantics/Interoperability
Knowledge Graph Semantics/InteroperabilityKnowledge Graph Semantics/Interoperability
Knowledge Graph Semantics/Interoperability
 
The Future(s) of the World Wide Web
The Future(s) of the World Wide WebThe Future(s) of the World Wide Web
The Future(s) of the World Wide Web
 
Enhancing Precision Wellness with Personal Health Knowledge Graphs
Enhancing Precision Wellness with Personal Health Knowledge Graphs Enhancing Precision Wellness with Personal Health Knowledge Graphs
Enhancing Precision Wellness with Personal Health Knowledge Graphs
 
The Future of AI: Going Beyond Deep Learning, Watson, and the Semantic Web
The Future of AI: Going BeyondDeep Learning, Watson, and the Semantic WebThe Future of AI: Going BeyondDeep Learning, Watson, and the Semantic Web
The Future of AI: Going Beyond Deep Learning, Watson, and the Semantic Web
 
Capacity Building: Data Science in the University At Rensselaer Polytechnic ...
Capacity Building: Data Science in the University  At Rensselaer Polytechnic ...Capacity Building: Data Science in the University  At Rensselaer Polytechnic ...
Capacity Building: Data Science in the University At Rensselaer Polytechnic ...
 
Enhancing Precision Wellness with Knowledge Graphs and Semantic Analytics: O...
Enhancing Precision Wellness with  Knowledge Graphs and Semantic Analytics: O...Enhancing Precision Wellness with  Knowledge Graphs and Semantic Analytics: O...
Enhancing Precision Wellness with Knowledge Graphs and Semantic Analytics: O...
 
KR in the age of Deep Learning
KR in the age of Deep LearningKR in the age of Deep Learning
KR in the age of Deep Learning
 
Digital Archiving, The Semantic Web, and Modern AI
Digital Archiving, The Semantic Web, and Modern AIDigital Archiving, The Semantic Web, and Modern AI
Digital Archiving, The Semantic Web, and Modern AI
 
The Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of MetadataThe Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of Metadata
 
Social Machines - 2017 Update (University of Iowa)
Social Machines - 2017 Update (University of Iowa)Social Machines - 2017 Update (University of Iowa)
Social Machines - 2017 Update (University of Iowa)
 
Social Machines: The coming collision of Artificial Intelligence, Social Netw...
Social Machines: The coming collision of Artificial Intelligence, Social Netw...Social Machines: The coming collision of Artificial Intelligence, Social Netw...
Social Machines: The coming collision of Artificial Intelligence, Social Netw...
 
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
 
Wither OWL
Wither OWLWither OWL
Wither OWL
 
Artificial Intelligence: Existential Threat or Our Best Hope for the Future?
Artificial Intelligence: Existential Threat or Our Best Hope for the Future?Artificial Intelligence: Existential Threat or Our Best Hope for the Future?
Artificial Intelligence: Existential Threat or Our Best Hope for the Future?
 
On Beyond OWL: challenges for ontologies on the Web
On Beyond OWL: challenges for ontologies on the WebOn Beyond OWL: challenges for ontologies on the Web
On Beyond OWL: challenges for ontologies on the Web
 
Broad Data (India 2015)
Broad Data (India 2015)Broad Data (India 2015)
Broad Data (India 2015)
 

Recently uploaded

Acumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdf
Acumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdfAcumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdf
Acumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdf
BrainSell Technologies
 
WhatsApp Image 2024-03-27 at 08.19.52_bfd93109.pdf
WhatsApp Image 2024-03-27 at 08.19.52_bfd93109.pdfWhatsApp Image 2024-03-27 at 08.19.52_bfd93109.pdf
WhatsApp Image 2024-03-27 at 08.19.52_bfd93109.pdf
ArgaBisma
 
(CISOPlatform Summit & SACON 2024) Digital Personal Data Protection Act.pdf
(CISOPlatform Summit & SACON 2024) Digital Personal Data Protection Act.pdf(CISOPlatform Summit & SACON 2024) Digital Personal Data Protection Act.pdf
(CISOPlatform Summit & SACON 2024) Digital Personal Data Protection Act.pdf
Priyanka Aash
 
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
aslasdfmkhan4750
 
How to build a generative AI solution A step-by-step guide (2).pdf
How to build a generative AI solution A step-by-step guide (2).pdfHow to build a generative AI solution A step-by-step guide (2).pdf
How to build a generative AI solution A step-by-step guide (2).pdf
ChristopherTHyatt
 
Best Practices for Effectively Running dbt in Airflow.pdf
Best Practices for Effectively Running dbt in Airflow.pdfBest Practices for Effectively Running dbt in Airflow.pdf
Best Practices for Effectively Running dbt in Airflow.pdf
Tatiana Al-Chueyr
 
CiscoIconsLibrary cours de réseau VLAN.ppt
CiscoIconsLibrary cours de réseau VLAN.pptCiscoIconsLibrary cours de réseau VLAN.ppt
CiscoIconsLibrary cours de réseau VLAN.ppt
moinahousna
 
Amul milk launches in US: Key details of its new products ...
Amul milk launches in US: Key details of its new products ...Amul milk launches in US: Key details of its new products ...
Amul milk launches in US: Key details of its new products ...
chetankumar9855
 
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and OllamaTirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Zilliz
 
How to Build a Profitable IoT Product.pptx
How to Build a Profitable IoT Product.pptxHow to Build a Profitable IoT Product.pptx
How to Build a Profitable IoT Product.pptx
Adam Dunkels
 
Implementations of Fused Deposition Modeling in real world
Implementations of Fused Deposition Modeling  in real worldImplementations of Fused Deposition Modeling  in real world
Implementations of Fused Deposition Modeling in real world
Emerging Tech
 
Opencast Summit 2024 — Opencast @ University of Münster
Opencast Summit 2024 — Opencast @ University of MünsterOpencast Summit 2024 — Opencast @ University of Münster
Opencast Summit 2024 — Opencast @ University of Münster
Matthias Neugebauer
 
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdf
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdfBT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdf
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdf
Neo4j
 
Recent Advancements in the NIST-JARVIS Infrastructure
Recent Advancements in the NIST-JARVIS InfrastructureRecent Advancements in the NIST-JARVIS Infrastructure
Recent Advancements in the NIST-JARVIS Infrastructure
KAMAL CHOUDHARY
 
July Patch Tuesday
July Patch TuesdayJuly Patch Tuesday
July Patch Tuesday
Ivanti
 
TrustArc Webinar - 2024 Data Privacy Trends: A Mid-Year Check-In
TrustArc Webinar - 2024 Data Privacy Trends: A Mid-Year Check-InTrustArc Webinar - 2024 Data Privacy Trends: A Mid-Year Check-In
TrustArc Webinar - 2024 Data Privacy Trends: A Mid-Year Check-In
TrustArc
 
Calgary MuleSoft Meetup APM and IDP .pptx
Calgary MuleSoft Meetup APM and IDP .pptxCalgary MuleSoft Meetup APM and IDP .pptx
Calgary MuleSoft Meetup APM and IDP .pptx
ishalveerrandhawa1
 
Introduction-to-the-IAM-Platform-Implementation-Plan.pptx
Introduction-to-the-IAM-Platform-Implementation-Plan.pptxIntroduction-to-the-IAM-Platform-Implementation-Plan.pptx
Introduction-to-the-IAM-Platform-Implementation-Plan.pptx
313mohammedarshad
 
The Role of IoT in Australian Mobile App Development - PDF Guide
The Role of IoT in Australian Mobile App Development - PDF GuideThe Role of IoT in Australian Mobile App Development - PDF Guide
The Role of IoT in Australian Mobile App Development - PDF Guide
Shiv Technolabs
 
find out more about the role of autonomous vehicles in facing global challenges
find out more about the role of autonomous vehicles in facing global challengesfind out more about the role of autonomous vehicles in facing global challenges
find out more about the role of autonomous vehicles in facing global challenges
huseindihon
 

Recently uploaded (20)

Acumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdf
Acumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdfAcumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdf
Acumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdf
 
WhatsApp Image 2024-03-27 at 08.19.52_bfd93109.pdf
WhatsApp Image 2024-03-27 at 08.19.52_bfd93109.pdfWhatsApp Image 2024-03-27 at 08.19.52_bfd93109.pdf
WhatsApp Image 2024-03-27 at 08.19.52_bfd93109.pdf
 
(CISOPlatform Summit & SACON 2024) Digital Personal Data Protection Act.pdf
(CISOPlatform Summit & SACON 2024) Digital Personal Data Protection Act.pdf(CISOPlatform Summit & SACON 2024) Digital Personal Data Protection Act.pdf
(CISOPlatform Summit & SACON 2024) Digital Personal Data Protection Act.pdf
 
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
 
How to build a generative AI solution A step-by-step guide (2).pdf
How to build a generative AI solution A step-by-step guide (2).pdfHow to build a generative AI solution A step-by-step guide (2).pdf
How to build a generative AI solution A step-by-step guide (2).pdf
 
Best Practices for Effectively Running dbt in Airflow.pdf
Best Practices for Effectively Running dbt in Airflow.pdfBest Practices for Effectively Running dbt in Airflow.pdf
Best Practices for Effectively Running dbt in Airflow.pdf
 
CiscoIconsLibrary cours de réseau VLAN.ppt
CiscoIconsLibrary cours de réseau VLAN.pptCiscoIconsLibrary cours de réseau VLAN.ppt
CiscoIconsLibrary cours de réseau VLAN.ppt
 
Amul milk launches in US: Key details of its new products ...
Amul milk launches in US: Key details of its new products ...Amul milk launches in US: Key details of its new products ...
Amul milk launches in US: Key details of its new products ...
 
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and OllamaTirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
 
How to Build a Profitable IoT Product.pptx
How to Build a Profitable IoT Product.pptxHow to Build a Profitable IoT Product.pptx
How to Build a Profitable IoT Product.pptx
 
Implementations of Fused Deposition Modeling in real world
Implementations of Fused Deposition Modeling  in real worldImplementations of Fused Deposition Modeling  in real world
Implementations of Fused Deposition Modeling in real world
 
Opencast Summit 2024 — Opencast @ University of Münster
Opencast Summit 2024 — Opencast @ University of MünsterOpencast Summit 2024 — Opencast @ University of Münster
Opencast Summit 2024 — Opencast @ University of Münster
 
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdf
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdfBT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdf
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdf
 
Recent Advancements in the NIST-JARVIS Infrastructure
Recent Advancements in the NIST-JARVIS InfrastructureRecent Advancements in the NIST-JARVIS Infrastructure
Recent Advancements in the NIST-JARVIS Infrastructure
 
July Patch Tuesday
July Patch TuesdayJuly Patch Tuesday
July Patch Tuesday
 
TrustArc Webinar - 2024 Data Privacy Trends: A Mid-Year Check-In
TrustArc Webinar - 2024 Data Privacy Trends: A Mid-Year Check-InTrustArc Webinar - 2024 Data Privacy Trends: A Mid-Year Check-In
TrustArc Webinar - 2024 Data Privacy Trends: A Mid-Year Check-In
 
Calgary MuleSoft Meetup APM and IDP .pptx
Calgary MuleSoft Meetup APM and IDP .pptxCalgary MuleSoft Meetup APM and IDP .pptx
Calgary MuleSoft Meetup APM and IDP .pptx
 
Introduction-to-the-IAM-Platform-Implementation-Plan.pptx
Introduction-to-the-IAM-Platform-Implementation-Plan.pptxIntroduction-to-the-IAM-Platform-Implementation-Plan.pptx
Introduction-to-the-IAM-Platform-Implementation-Plan.pptx
 
The Role of IoT in Australian Mobile App Development - PDF Guide
The Role of IoT in Australian Mobile App Development - PDF GuideThe Role of IoT in Australian Mobile App Development - PDF Guide
The Role of IoT in Australian Mobile App Development - PDF Guide
 
find out more about the role of autonomous vehicles in facing global challenges
find out more about the role of autonomous vehicles in facing global challengesfind out more about the role of autonomous vehicles in facing global challenges
find out more about the role of autonomous vehicles in facing global challenges
 

Big Data and Computer Science Education

  • 1. Big Data Meets Computer Science Jim Hendler Tetherless World Professor of Computer, Web and Data Sciences Director, Rensselaer Institute for Data Exploration and Applications @jahendler
  • 2. The Rensselaer “IDEA” (idea.rpi.edu)The Rensselaer “IDEA” (idea.rpi.edu)
  • 3. The Rensselaer IDEA 3 … Across Applications (corresponding to Challenges Identified in the Rensselaer Plan 2024) Healthcare Analytics Business Systems Built and Natural Environments Virtual and Augmented Reality Cyber- Resiliency Policy, Ethics and Open Government Materials Informatics Data-driven Physical/Life Sciences
  • 4. The Rensselaer IDEA 4 Developing a Comprehensive “Data Science” Research Agenda P. Fox and J. Hendler, The Science of Data Science, Big Data, 2(2), in press
  • 5. The Rensselaer IDEA Graduate Projects in IDEA • IDEA and CCI (HPC): technologies to enable Rensselaer researchers to work with data at larger scales and in new ways • Population-scale cognitive computing models for “human intensive” agent-based simulations • IDEA and EMPAC (Performing arts center): provide next generation data exploration tools • Multi-person data visualization tools for big-data applications • IDEA and Watson: New direction in Cognitive Computation • How do we go from Question/Answering to Open Web Data exploration? • IDEA and CBIS (Ctr for Biotechnology & Interdisciplinary Studies): Data-driven Informatics • Can we couple semantics and big data to find new medical uses for already approved drugs?
  • 6. The Rensselaer IDEA External Projects and partnerships Emergency Room Care Language and Agents Largescale Healthcare Analytics In Discussion Jumpstart (Proposal underway) Built and Natural Biome data-driven science and engineering Cognitive Computing Collaborative Research Initiative
  • 7. Campus Data Infrastructure Metadata • Title • Author • Author Email • Licence • Subject • Keyword • Data Type Dataset CDF RPI Object Deposit RPI Research Network RPI-ID Request RPI-ID Request Share Knowledge Join Network Allocate a universal accessible RPI-ID Register Metadata Upload Any Data RPI Research Object Registration and Deposit RPI Research Collaboration and Community Network
  • 8. Requires going Beyond the Database Discovery Integrate Visualize Explain Thinking outside the Database box Strata talk, 2013 - https://www.youtube.com/watch?v=Cob5oltMGMc
  • 9. At new scales (and in new ways) Fox and Hendler, Changing the Equation on Scientific Visualization, Science, 2/11 - http://www.sciencemag.org/content/331/6018/705.short)
  • 10. A Whole New World • But what about undergraduate education – where do we train the students who can take on projects needing • statistics and analytics • informatics • data science challenges • machine learning • unstructured data • cognitive computation • …
  • 11. Computer Science Education? • Programming is a necessary skill – not sufficient • and we mostly teach it wrong… – (For my heresies about teaching programming, see “Let’s Help Computer Science Students Crack the Code, 3/13 http://chronicle.com/article/Lets-Help-Computer-Science/137649/ ) • The computing environment of today is nothing like the computing environment of the 70s, – but the curriculum hasn’t changed much since I was in school – but the fundamentals are NOT all the same – data-oriented computations involve graphs, memory intensive algorithms, machine learning, …
  • 12. Deploying these ideas at RPI • Innovation in the interdisciplinary Information Technology Program – Renamed Information Technology and Web Science, 2011 • for more on Web Science, see – Berners-Lee et al., Creating a Science of the World Wide Web, Science, 2006, https://www.sciencemag.org/content/313/5788/769.summary; – Hendler et. al, Web Science: An interdisciplinary Approach to Understanding the Web, CACM, 7/2008, http://cacm.acm.org/magazines/2008/7/5366-web-science/fulltext
  • 13. IT and Web Science • First IT academic program in U.S. • First web science degree program in U.S.; First undergraduate web science degree anywhere • BS in ITWS (20 concentrations) and MS in IT (10 concentrations) • PhD in Multi-Disciplinary Sciences • http://itws.rpi.edu – I was Director 2008-2012 – Now directed by Peter Fox (whose slides I stole for this section)
  • 14.       Technical Track Courses      Concentrations Computer Engineering Track 1) ECSE-2610 Computer Components and Operations 2) ENGR-2350 Embedded Control 3) ECSE-2660 Computer Architecture, Networking and  Operating Systems Civil Engineering Computer Hardware Computer Networking (hardware focus) Mechanical/Aeronautical  Eng. Computer Science Track 1) CSCI-2200 Foundations of Computer Science 2) CSCI-2300 Introduction to Algorithms 3) CSCI-2500 Computer Organization Cognitive Science Computer Networking (software focus) Information Security Machine and Computational Learning Information Systems Track 1) CSCI-2200 Foundation of Computer Science 2) CSCI-2500 Computer Organization 3) Four credits from the following: • CSCI-2220 Programming in Java (2 credits) • CSCI-2961 Program in Python (2 credits) • CSCI-2300 Introduction to Algorithms (4 credits) • ITWS-49XX Web Systems Development II (4 credits) Arts Communication Economics Entrepreneurship Finance Management Information    Systems Medicine Pre-law Psychology STS Web Science Track 1) CSCI-2200 Foundations of Computer Science 2) CSCI-2500 Computer Organization 3) One of the following: • CSCI-49XX Web Systems Development II • Web/Data Course approved by ITWS Curriculum  Committee Data Science Science Informatics  Web Technologies  
  • 15. CHANGES TO THE MASTER’S IN INFORMATION TECHNOLOGY PROGRAM • In Spring 2013 the MS in IT core curriculum was revised to include Data Analytics. • Networking core classes were replaced with Data Analytics core classes: Data Science, Database Mining, X-informatics, and Data Analytics (a new class offered in Spring 2014). • The MS in IT program also added two new concentrations: Data Science and Analytics and Information Dominance. • The Information Dominance concentration was developed for a new Navy program that will be educating a select group of 5-10 naval officers a year with the skills needed for military cyberspace operations. Two officers started in Fall 2013 and three began in Spring 2014.
  • 16. IT Core Area Course Number Course Title Term(s) Offered Database Systems CSCI-4380 Database Systems Fall/Spring Data Analytics ITWS-6350 Data Science Fall Software Design and Engineering CSCI-4440 Software Design and Documentation Fall ITWS-6400 X-Informatics Spring Management of Technology* ITWS-6300 Business Issues for Engineers and Scientists (Professional Track Only) Fall/Spring Human Computer Interaction COMM-6420 Foundations of HCI Usability Fall COMM-696X Human Media Interaction Spring MS in IT Required Core Courses * For the research track, replace ITWS-6300 Business Issues for Engineers and Scientists with one of the two semester courses ITWS- 6980 Master’s Project or ITWS-6990 Master’s Thesis. Advanced Core options for students who have previously completed a Core Course IT Core Area Course Number Course Title Term(s) Offered Database Systems CSCI-6390 Database Mining Fall ITWS-6350 Data Science Fall ITWS-696X Semantic E-Science Fall Data Analytics CSCI-6390 Database Mining Fall ITWS-6400 X-Informatics Spring ITWX-696X Data Analytics Spring Software Design CSCI-6500 Distributed Computing Over the Internet Fall ECSE-6780 Software Engineering II Fall ITWS-696X Semantic E-Science Fall Management of Technology MGMT-6080 Networks, Innovation and Value Creation Fall MGMT-6140 Information Systems for Management Spring Human Computer Interaction COMM-6620 Information Architecture Spring COMM-6770 User-Centered Design Fall COMM-696X Interactive Media Design Summer
  • 17. Concentration Course Number Course Name Term(s) Offered Data Science and Analytics Data and Information analytics extends analysis (descriptive and predictive models to obtain knowledge from data) by using insight from analyses to recommend action or to guide and communicate decision-making. Thus, analytics is not so much concerned with individual analyses or analysis steps, but with an entire methodology. Key topics include: advanced statistical computing theory, multivariate analysis, and application of computer science courses such as data mining and machine learning and change detection by uncovering unexpected patterns in data. Select two or three of the following courses: ITWS-6350 Data Science Fall ITWS-6400 X-Informatics Spring ITWS-696X Data Analytics Spring ITWS-696X Semantic E-Science Fall ITWX-696X Advanced Semantic Technologies* Spring If only two of the above were chosen, select one more of the following courses: COMM-6620 Information Architecture Spring CSCI-4020 Computer Algorithms Spring CSCI-4150 Introduction to AI Fall CSCI-6390 Database Mining Fall CSCI-4220 or CSCI- 6220 Network Programming or Parallel Algorithm Design Spring ISYE-4220 Optimization Algorithms and Applications Fall ISYE-6180 Knowledge Discovery with Data Mining Spring MGMT-696X Technology Foundations for Business Analytics Fall MGMT-696X Predictive Analytics Using Social Media Spring Concentration Course Number Course Name Term(s) Offered Information Dominance The Information Dominance concentration prepares students for careers designing, building, and managing secure information systems and networks. The concentration includes advanced study in encryption and network security, formal models and policies for access control in databases and application systems, secure coding techniques, and other related information assurance topics. The combination of coursework provides comprehensive coverage of issues and solutions for utilizing high assurance systems for tactical decision-making. It prepares students for careers ranging from secure information systems analyst, to information security engineer, to field information manager and chief information officer. It is also appropriate for all IT professionals who want to enhance their knowledge of how to use pervasive information in situational awareness, operations scenarios, and decision-making. Select two or three of the following courses: ISYE-6180 Knowledge Discovery with Data Mining Spring CSCI-6960 Cryptography and Network Security I Fall ITWS-4370 Information System Security Spring CSCI-4650 Networking Laboratory I Fall/Spri ng MGMT-7760 Risk Management Fall ISYE-4310 Ethics of Modeling for Industrial Systems Engineering Fall If only two of the above were chosen, select one more of the following courses: CSCI-6390 Database Mining Fall CSCI-6968 Cryptography and Network Security II Spring CSCI-4660 Networking Laboratory II Fall/Spri ng ECSE-6860 Evaluation Methods for Decision Making Fall ISYE-6500 Information and Decision Technologies for Industrial and Service Systems Fall/Spri ng CSCI-496X Computational Analysis of Social Processes Fall Two New MS in IT Concentrations
  • 18. Also at RPI • Data Science Research Center and Data Science Education Center (dsrc.rpi.edu, 2009) • http://www.rpi.edu/about/inside/issue/v4n17/datacente r.html – Over 45: research faculty, post-docs, grad students, staff, undergraduates… • Data is one of the Rensselaer Plan’s five thrusts • Other key faculty – Fran Berman (Center for Digital Society and RDA) – Bulent Yener (DSRC Director) – Peter Fox(ITWS Director)
  • 19. More RPI Curriculua • Environmental Science with Geoinformatics concentration • Bio, geo, chem, astro, materials - informatics • GIS for Science • Visualization (new summer program) • Multi-disciplinary science program - PhD in Data and Web Science • DATUM: Data in Undergraduate Math! (Bennett) • Missing – intermediate statistics • Graphs – significant potential here – must teach!
  • 20. 5-6 years in… • Science and interdisciplinary from the start! – Not a question of: do we train scientists to be technical/data people, or do we train technical people to learn the science – It’s a skill/ course level approach that is needed • We teach methodology and principles over technology • Data science must be a skill, and natural like using instruments, writing/using codes • Team/ collaboration aspects are key • Foundations and theory must be taught – for data, as well as programming

Editor's Notes

  1. * ** ***