This document provides an introduction to data science. It defines data science as a multi-disciplinary field that uses scientific methods and processes to extract knowledge and insights from structured and unstructured data. The document discusses the importance and impact of data science on organizations and society. It also outlines common applications of data science and the roles and skills required for a career in data science.
2. LEARNING OBJECTIVES
• Understand the impact and importance of Data Science in society
• Reflect on its applications, importance and advantages
3. CONTENTS
• Why Study Data Science?
• How Does Data Science Impact Organizations?
• Application and Competitive Advantage of Data Science in Organizations
• Importance of Data Science to Society
• Road to Becoming a Data Scientist
4. WHY ARE WE TALKING ABOUT DATA SCIENCE?
Source: https://bit.ly/31HBHuQ
5. WHAT IS DATA SCIENCE?
• “Data Science is a new term. But in the same sense as Columbus discovered a NEW continent 1000 years ago.”
- Hector Garcia-Molina, Professor in the Departments of Computer Science and Electrical Engineering of Stanford University
6. WHAT IS DATA SCIENCE?
• a multi-disciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from structured and unstructured data.
Source: https://bit.ly/30dekJB
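As a minimal illustration of the definition above, the sketch below extracts the same kind of insight (a count per product) from a structured source and from an unstructured one. Everything in it is hypothetical and only for illustration: the sample data, the PRODUCTS vocabulary, and the function names are assumptions, not part of the original slides.

```python
import csv
import io
import re
from collections import Counter

# Hypothetical structured data: every record has named fields.
STRUCTURED = """customer,product
ana,laptop
ben,phone
cara,laptop
"""

# Hypothetical unstructured data: free text with no schema.
UNSTRUCTURED = "Loved the laptop. The phone arrived late, but the laptop is great!"

PRODUCTS = {"laptop", "phone"}  # assumed vocabulary of interest


def count_structured(text: str) -> Counter:
    """Count products by reading the 'product' column directly."""
    reader = csv.DictReader(io.StringIO(text))
    return Counter(row["product"] for row in reader)


def count_unstructured(text: str) -> Counter:
    """Count product mentions by tokenising the text and keeping known keywords."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return Counter(tok for tok in tokens if tok in PRODUCTS)


structured_counts = count_structured(STRUCTURED)
unstructured_counts = count_unstructured(UNSTRUCTURED)
print(structured_counts)
print(unstructured_counts)
```

With structured data the schema tells the code where to look; with unstructured data the code first has to impose some structure (here, simple tokenising and keyword matching) before anything can be counted.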
7. WHAT IS DATA SCIENCE?
• a "concept to unify statistics, data analysis, machine learning and their related methods" in order to "understand and analyze actual phenomena" with data.
• employs techniques and theories drawn from many fields within the context of mathematics, statistics, computer science, and information science.
Source: https://bit.ly/2YTRQ3w
9. WHAT IS DATA SCIENCE?
Fourth Paradigm of Science
• Thousands of years
- Empirical
• Last few hundred years
- Theoretical
• Last fifty years
- Computational
- “Query the world”
• Last twenty years
- eScience (Data Science)
- “Download the world”
10. WHAT IS DATA SCIENCE?
Data Science and others
• Statistics
• Big Data Analytics
• Business Analytics
• Business Intelligence
• Data(base) Management
• Visualization
• Machine Learning
• Data Mining
• Artificial Intelligence
• Predictive Modelling
11. WHAT IS DATA SCIENCE?
Big Data Science Tasks
• Facebook
• Amazon
• Google
• LinkedIn
• Netflix
• Rozetka
• Microsoft
12. WHAT IS DATA SCIENCE?
Regular Data Science
• Data Analysis
• Modelling / Statistics
• Engineering / Prototyping
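The three activities listed above can be sketched end to end on a toy dataset. This is only an illustrative sketch under stated assumptions: the ad-spend and sales figures are made up, the model is a plain least-squares line chosen for brevity, and predict_sales is a hypothetical helper name.

```python
from statistics import mean

# Hypothetical observations: ad spend (x) and resulting sales (y).
ad_spend = [1, 2, 3, 4]
sales = [3, 5, 7, 9]

# 1. Data analysis: summarise the data before modelling it.
mean_spend = mean(ad_spend)
mean_sales = mean(sales)

# 2. Modelling / statistics: fit a least-squares line y = slope * x + intercept.
covariance = sum((x - mean_spend) * (y - mean_sales) for x, y in zip(ad_spend, sales))
variance = sum((x - mean_spend) ** 2 for x in ad_spend)
slope = covariance / variance
intercept = mean_sales - slope * mean_spend


# 3. Engineering / prototyping: wrap the fitted model in a reusable function.
def predict_sales(spend: float) -> float:
    """Predict sales for a given ad spend using the fitted line."""
    return slope * spend + intercept


print(f"slope={slope}, intercept={intercept}")
print(f"predicted sales at spend 5: {predict_sales(5)}")
```

In practice each step would usually rely on dedicated libraries and far more data; the point here is only the progression from describing the data, to fitting a model, to packaging it for reuse.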
13. WHAT IS DATA SCIENCE?
What do people look for in a data scientist?
14. WHAT IS DATA SCIENCE?
What do people look for in a data scientist?
59. IMPORTANCE OF DATA SCIENCE
1. Data science helps brands to understand their customers in a much enhanced and empowered manner.
2. It allows brands to communicate their story in an engaging and powerful manner.
3. Big Data is a new field that is constantly growing and evolving.
60. IMPORTANCE OF DATA SCIENCE
4. Its findings and results can be applied to almost any sector like travel, healthcare and education among others.
5. Data science is accessible to almost all sectors.
62. REFERENCES
• https://slideplayer.com/slide/10398517/
• https://www.slideshare.net/ryanorban/how-to-become-a-data-scientist
• Dhar, V. (2013). "Data science and prediction". Communications of the ACM. 56 (12): 64–73. doi:10.1145/2500499.
• Hayashi, Chikio (1 January 1998). "What is Data Science? Fundamental Concepts and a Heuristic Example". In Hayashi, Chikio; Yajima, Keiji; Bock, Hans-Hermann; Ohsumi, Noboru; Tanaka, Yutaka; Baba, Yasumasa (eds.). Data Science, Classification, and Related Methods. Studies in Classification, Data Analysis, and Knowledge Organization. Springer Japan. pp. 40–51. doi:10.1007/978-4-431-65950-1_3. ISBN 9784431702085.
• Davenport, Thomas H.; Patil, DJ (October 2012), Data Scientist: The Sexiest Job of the 21st Century, Harvard Business Review
• Jeff Leek (12 December 2013). "The key word in "Data Science" is not Data, it is Science". Simply Statistics.
• https://www.analyticsvidhya.com/blog/2015/09/applications-data-science/
• https://www.edureka.co/blog/data-science-applications/
• https://dutchdatascienceweek.nl/2018/04/05/the-impact-of-data-science-on-society/
• https://www.educba.com/data-science-and-its-growing-importance/