SlideShare a Scribd company logo
Datasets, Data Models, and Useful
Facts In Drawing conclusions
If there's a delay and you have your phone keep in mind that
Facebook has a language setting called English(Pirate)
Banging your head against a wall burns 150 calories an hour.
By Michael Bostwick
 You've had some experience with technology.
 We cant cover it all, so be sure to read the manual.
 Google is a great search engine!
“Never memorize something that you can look up.”
― Albert Einstein
Feel free to ask questions or share insight.
Assumptions
Introduction
Data Sources
Example Models
Mistakes Using Data
Some Tools And Solutions
Overview
The study of where
information comes
from, what it
represents and how it
can be turned into a
valuable resource in
the creation of
business and IT
strategies.
Data Science
 The Great Chain Of Being By Plato
 Over time we moved to Porphyrian tree
 An example is someone's family tree
History Of Human Knowledge
Described in more detail
very well in a TED Talk by
Manuel Lima
More and More things are based
on Networks
http://cpan-explorer.org/
Networks Metaphor
From a Ted Talk By Eric Berlow and Sean Gourley
(this model expands to see connections, it processed language)
TED Talk Map
More Maps At socialexplorer.com
Mapping Of Social Ties
From A Ted Talk By Dave Troy About Social Mapping
 kdnuggets.com listing(and industry site)
 caesar0301 awesome public datasets
 data.gov
 code for kansas city github wiki
 reddit.com datasets
 blog.visual.ly data sources
 Open StreetMaps Data
 opensecrets.org a watchdog group
 wiki's linked data
 wiki data a nosql listing of different relationships
 trulia local map
Data Sets
Trulia Map Of KC
 Freedom of Information Act (FOIA)
 Missouri Sunshine Law
 Many states have similar laws.
 We the people petitions
 Media and Other Public Posting Options
 instigative journalism postings
 craigslist
Getting More And New Data
A theory that seeks
to explain how,
why, and at what
rate new ideas and
technology spread
through cultures.
Diffusion Of Innovation
 Innovation
 Any idea, practice, or object that is perceived as new by an
individual or other unit of adoption could be considered an
innovation available for study.
 Adopters
 In most case adopters are individuals, but can also be
organizations
 Communication channels
 Time
 Social system
Components Of Diffusion Of
Innovations
 Knowledge
 The individual first view into to the innovation
 Persuasion
 The individual is interested in the innovation and actively seeks related information/details.
 Decision
 The individual takes the concept of the change and weighs the advantages/disadvantages of
using the innovation and decides whether to adopt or reject the innovation.
 This stage is very hard to get metrics on.
 Implementation
 They use it, how much and where is based on the situation. During this stage the individual also
determines the usefulness of the innovation and may search for further information about it.
 Confirmation
 The individual his/her decision to continue using the innovation. This stage is both intrapersonal
(may cause cognitive dissonance) and interpersonal, confirmation the group has made the right
decision.
Adoption Process For Diffusion
Of Innovation
 We heard about it on the news
 Officials began to push the use of it
 Some people couldn't get it to work, so they contact
there friends and family
 Issues made some people upset
 Now its working, and lots of people use it
 Some people still just use there work insurance
Example Change.gov
We all should remember this from science class
Empirical Research Model
 The model describes the meaning of its instances.
 A conceptual schema is a high-level description of a
business's informational needs.
 Normally described with entity relationship(E.R)
diagrams
 Common in software development
Semantic Data Model
E.R. Diagram Sample
 A very good explanation is given
on ted
 The Math is lengthy
 Cities are very hard to kill
 All companies die
 Companies and cities are like animals
 Growth is linked to population
 The growth of population is linked to
other factors
Geoffrey West Urban Scaling Model
 pace of life decree's
as you get bigger
 double the size of a
city, and get a 15%
increase of things
Growth Is Hard To Sustain
 To avoid collapse, major
innovation take place
 Cycles of innovation to avoid
collapse, you have to keep
innovating, or collapse
Cycles Of Innovation
Please Feel Free To Share Models Or
Patterns
Anti patterns are also approaches that
shouldn’t be taken!
Open Invitation To Share Models
 UML
 Universal Modeling Language
 PlantUML
 draw.io
 Different subjects have there own models!
 Algorithms were excluded
 Open MIT Courses
 Khanacademy Algorithms
Making New Models
If we have data, let’s look at data. If all we have
are opinions, let’s go with mine.
– Jim Barksdale, former Netscape CEO
Torture the data, and it will confess to anything
– Ronald Coase, Economics, Nobel Prize Laureate
Data Can Be BAD!
 Business Failures
 Systemic Failure
 Loss Of Trust
 An example is the bad data used in the
autism drug study
Possible Harms
 False Correlation
 Emotional Appeal
 Question demographics
 Look at relationships instead
 Confirmation Bias
 Posta Hac Ergo Proctor Hoc
 (after which therefore because of which)
 after something not because of it
Common Mistakes
 Show Your work so others can see the
mistakes
 Showing your work also shows what you
might have missed
 Data has limits
 Data doesn't create meaning, people do
Use Critical Thinking
 Should get its own talk
 Good Starting Tour of M.I.
 Microsoft has a very cool cloud framework
out
 Python and R, are very common tools for
using machine learning
 Machine learning is great for making
predictive calls
Machine Learning
 Some databases have this feature built in!
 QGIS
 An open source GIS viewer
 ArchGIS
 A very common GIS application, that’s licensed
 Google Earth
 KML Files
 Open Street Maps
 Can take user submit information
 It also allows downloading of all of there data
Spatial Mapping
 Relation Database Management System
 ACID (Atomicity, Consistency, Isolation, Durability)
 Commonly SQL
 Standard Query Language
 Most RDMS systems have a bulk loading tool
 Lots Of Examples
 Oracle
 MS Sql Server
 Postgress
RDMS
 These databases duplicate data
 There great for large data sets
 Big Data is normally done by this
 There great for distributed information
 They can scale to handle endless data
 Lots of Open Source Options
 MongoDB
 CouchDB
Non Relational Databases
 R
 Python
 Perl
 Bash
 Windows Powershell
 PHP And Javascript
Scripting Languages
 C#
 C++ (and C)
 Java
There are a lot of solutions on github and sourceforge,
that can be changed and expanded
Some are frameworks like eclipse frameworks
Compiled Programming
Languages
Blender
Molecular flipbook
YouTube
Khan Academy
Amazon Web Services
Googles BigQuery
Other Tools
THEEND!

More Related Content

What's hot

Wither OWL
Wither OWLWither OWL
Wither OWL
James Hendler
 
Machine Learning for Non-technical People
Machine Learning for Non-technical PeopleMachine Learning for Non-technical People
Machine Learning for Non-technical People
indico data
 
Ict와 사회과학지식간 학제간 연구동향(23 march2013)
Ict와 사회과학지식간 학제간 연구동향(23 march2013)Ict와 사회과학지식간 학제간 연구동향(23 march2013)
Ict와 사회과학지식간 학제간 연구동향(23 march2013)
Han Woo PARK
 
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...
Matthew Lease
 
Key Challenges in Moderating Social Media: Accuracy, Cost, Scalability, and S...
Key Challenges in Moderating Social Media: Accuracy, Cost, Scalability, and S...Key Challenges in Moderating Social Media: Accuracy, Cost, Scalability, and S...
Key Challenges in Moderating Social Media: Accuracy, Cost, Scalability, and S...
Matthew Lease
 
Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...
Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...
Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...
Matthew Lease
 
Dm sei-tutorial-v7
Dm sei-tutorial-v7Dm sei-tutorial-v7
Dm sei-tutorial-v7
CS, NcState
 
Social Machines: The coming collision of Artificial Intelligence, Social Netw...
Social Machines: The coming collision of Artificial Intelligence, Social Netw...Social Machines: The coming collision of Artificial Intelligence, Social Netw...
Social Machines: The coming collision of Artificial Intelligence, Social Netw...
James Hendler
 
Introduction to Data Science and Large-scale Machine Learning
Introduction to Data Science and Large-scale Machine LearningIntroduction to Data Science and Large-scale Machine Learning
Introduction to Data Science and Large-scale Machine Learning
Nik Spirin
 
Deep Neural Networks for Machine Learning
Deep Neural Networks for Machine LearningDeep Neural Networks for Machine Learning
Deep Neural Networks for Machine Learning
Justin Beirold
 
Designing Human-AI Partnerships to Combat Misinfomation
Designing Human-AI Partnerships to Combat Misinfomation Designing Human-AI Partnerships to Combat Misinfomation
Designing Human-AI Partnerships to Combat Misinfomation
Matthew Lease
 
Why Watson Won: A cognitive perspective
Why Watson Won: A cognitive perspectiveWhy Watson Won: A cognitive perspective
Why Watson Won: A cognitive perspective
James Hendler
 
Lessons learned from building practical deep learning systems
Lessons learned from building practical deep learning systemsLessons learned from building practical deep learning systems
Lessons learned from building practical deep learning systems
Xavier Amatriain
 
The Other HPC: High Productivity Computing
The Other HPC: High Productivity ComputingThe Other HPC: High Productivity Computing
The Other HPC: High Productivity Computing
University of Washington
 
Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...
Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...
Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...
Matthew Lease
 
Semantic Web: The Inside Story
Semantic Web: The Inside StorySemantic Web: The Inside Story
Semantic Web: The Inside Story
James Hendler
 
Machine Learning: Opening the Pandora's Box - Dhiana Deva @ QCon São Paulo 2019
Machine Learning: Opening the Pandora's Box - Dhiana Deva @ QCon São Paulo 2019Machine Learning: Opening the Pandora's Box - Dhiana Deva @ QCon São Paulo 2019
Machine Learning: Opening the Pandora's Box - Dhiana Deva @ QCon São Paulo 2019
Dhiana Deva
 
Los Angeles R User Group meetup - useR! 2014 Best of List
Los Angeles R User Group meetup - useR! 2014 Best of ListLos Angeles R User Group meetup - useR! 2014 Best of List
Los Angeles R User Group meetup - useR! 2014 Best of List
amuletc
 
The Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of MetadataThe Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of Metadata
James Hendler
 
Search as Communication: Lessons from a Personal Journey
Search as Communication: Lessons from a Personal JourneySearch as Communication: Lessons from a Personal Journey
Search as Communication: Lessons from a Personal Journey
Daniel Tunkelang
 

What's hot (20)

Wither OWL
Wither OWLWither OWL
Wither OWL
 
Machine Learning for Non-technical People
Machine Learning for Non-technical PeopleMachine Learning for Non-technical People
Machine Learning for Non-technical People
 
Ict와 사회과학지식간 학제간 연구동향(23 march2013)
Ict와 사회과학지식간 학제간 연구동향(23 march2013)Ict와 사회과학지식간 학제간 연구동향(23 march2013)
Ict와 사회과학지식간 학제간 연구동향(23 march2013)
 
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...
 
Key Challenges in Moderating Social Media: Accuracy, Cost, Scalability, and S...
Key Challenges in Moderating Social Media: Accuracy, Cost, Scalability, and S...Key Challenges in Moderating Social Media: Accuracy, Cost, Scalability, and S...
Key Challenges in Moderating Social Media: Accuracy, Cost, Scalability, and S...
 
Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...
Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...
Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...
 
Dm sei-tutorial-v7
Dm sei-tutorial-v7Dm sei-tutorial-v7
Dm sei-tutorial-v7
 
Social Machines: The coming collision of Artificial Intelligence, Social Netw...
Social Machines: The coming collision of Artificial Intelligence, Social Netw...Social Machines: The coming collision of Artificial Intelligence, Social Netw...
Social Machines: The coming collision of Artificial Intelligence, Social Netw...
 
Introduction to Data Science and Large-scale Machine Learning
Introduction to Data Science and Large-scale Machine LearningIntroduction to Data Science and Large-scale Machine Learning
Introduction to Data Science and Large-scale Machine Learning
 
Deep Neural Networks for Machine Learning
Deep Neural Networks for Machine LearningDeep Neural Networks for Machine Learning
Deep Neural Networks for Machine Learning
 
Designing Human-AI Partnerships to Combat Misinfomation
Designing Human-AI Partnerships to Combat Misinfomation Designing Human-AI Partnerships to Combat Misinfomation
Designing Human-AI Partnerships to Combat Misinfomation
 
Why Watson Won: A cognitive perspective
Why Watson Won: A cognitive perspectiveWhy Watson Won: A cognitive perspective
Why Watson Won: A cognitive perspective
 
Lessons learned from building practical deep learning systems
Lessons learned from building practical deep learning systemsLessons learned from building practical deep learning systems
Lessons learned from building practical deep learning systems
 
The Other HPC: High Productivity Computing
The Other HPC: High Productivity ComputingThe Other HPC: High Productivity Computing
The Other HPC: High Productivity Computing
 
Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...
Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...
Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...
 
Semantic Web: The Inside Story
Semantic Web: The Inside StorySemantic Web: The Inside Story
Semantic Web: The Inside Story
 
Machine Learning: Opening the Pandora's Box - Dhiana Deva @ QCon São Paulo 2019
Machine Learning: Opening the Pandora's Box - Dhiana Deva @ QCon São Paulo 2019Machine Learning: Opening the Pandora's Box - Dhiana Deva @ QCon São Paulo 2019
Machine Learning: Opening the Pandora's Box - Dhiana Deva @ QCon São Paulo 2019
 
Los Angeles R User Group meetup - useR! 2014 Best of List
Los Angeles R User Group meetup - useR! 2014 Best of ListLos Angeles R User Group meetup - useR! 2014 Best of List
Los Angeles R User Group meetup - useR! 2014 Best of List
 
The Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of MetadataThe Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of Metadata
 
Search as Communication: Lessons from a Personal Journey
Search as Communication: Lessons from a Personal JourneySearch as Communication: Lessons from a Personal Journey
Search as Communication: Lessons from a Personal Journey
 

Viewers also liked

Экономим время и деньги. Тиражные решения и готовые интернет-магазины на «1С-...
Экономим время и деньги. Тиражные решения и готовые интернет-магазины на «1С-...Экономим время и деньги. Тиражные решения и готовые интернет-магазины на «1С-...
Экономим время и деньги. Тиражные решения и готовые интернет-магазины на «1С-...
advantika
 
215 braun content map (1)
215 braun content map (1)215 braun content map (1)
215 braun content map (1)
MrBraun4
 
Decreto 102 13-primera_infancia_rd
Decreto 102 13-primera_infancia_rdDecreto 102 13-primera_infancia_rd
Decreto 102 13-primera_infancia_rdPierinaCamiloC
 
Syandes Quiz 3 - Tommy
Syandes Quiz 3 - TommySyandes Quiz 3 - Tommy
Syandes Quiz 3 - Tommy
Tommy Ellema
 
Управление проектами, или как успеть в срок, не упуская мелочей
Управление проектами, или как успеть в срок, не упуская мелочейУправление проектами, или как успеть в срок, не упуская мелочей
Управление проектами, или как успеть в срок, не упуская мелочей
advantika
 
Ignite Presentation Slides
Ignite Presentation SlidesIgnite Presentation Slides
Ignite Presentation Slides
limari1212
 
Промопрезентация
ПромопрезентацияПромопрезентация
Промопрезентацияadvantika
 
Создать интернет магазин - легко! (Михаил Федотов)
Создать интернет магазин  - легко! (Михаил Федотов)Создать интернет магазин  - легко! (Михаил Федотов)
Создать интернет магазин - легко! (Михаил Федотов)advantika
 
Nathen mathews ignite slides
Nathen mathews ignite slidesNathen mathews ignite slides
Nathen mathews ignite slides
Nathen Mathews
 
Корпоративный портал. повышаем прозрачность и управляемость компании (Екатери...
Корпоративный портал. повышаем прозрачность и управляемость компании (Екатери...Корпоративный портал. повышаем прозрачность и управляемость компании (Екатери...
Корпоративный портал. повышаем прозрачность и управляемость компании (Екатери...advantika
 
Контент-маркетинг
Контент-маркетингКонтент-маркетинг
Контент-маркетинг
advantika
 
Вдохновляющие истории интернет-маркетинга (Екатерина Просвиркина)
Вдохновляющие истории интернет-маркетинга (Екатерина Просвиркина)Вдохновляющие истории интернет-маркетинга (Екатерина Просвиркина)
Вдохновляющие истории интернет-маркетинга (Екатерина Просвиркина)advantika
 
Современные технологии для интернет-магазинов
Современные технологии для интернет-магазиновСовременные технологии для интернет-магазинов
Современные технологии для интернет-магазинов
advantika
 
Контекст для интернет-магазинов (Ярослав Тхоренко)
Контекст для интернет-магазинов (Ярослав Тхоренко)Контекст для интернет-магазинов (Ярослав Тхоренко)
Контекст для интернет-магазинов (Ярослав Тхоренко)advantika
 
PENGARUH BAHAN TAMBAH (POLYMER P102) TERHADAP KUAT TEKAN DAN MODULUS ELASTISI...
PENGARUH BAHAN TAMBAH (POLYMER P102) TERHADAP KUAT TEKAN DAN MODULUS ELASTISI...PENGARUH BAHAN TAMBAH (POLYMER P102) TERHADAP KUAT TEKAN DAN MODULUS ELASTISI...
PENGARUH BAHAN TAMBAH (POLYMER P102) TERHADAP KUAT TEKAN DAN MODULUS ELASTISI...
Agil Handayani
 
Cоздать интернет-магазин? – Легко! Возможности платформы «1С-Битрикс» для упр...
Cоздать интернет-магазин? – Легко! Возможности платформы «1С-Битрикс» для упр...Cоздать интернет-магазин? – Легко! Возможности платформы «1С-Битрикс» для упр...
Cоздать интернет-магазин? – Легко! Возможности платформы «1С-Битрикс» для упр...
advantika
 

Viewers also liked (16)

Экономим время и деньги. Тиражные решения и готовые интернет-магазины на «1С-...
Экономим время и деньги. Тиражные решения и готовые интернет-магазины на «1С-...Экономим время и деньги. Тиражные решения и готовые интернет-магазины на «1С-...
Экономим время и деньги. Тиражные решения и готовые интернет-магазины на «1С-...
 
215 braun content map (1)
215 braun content map (1)215 braun content map (1)
215 braun content map (1)
 
Decreto 102 13-primera_infancia_rd
Decreto 102 13-primera_infancia_rdDecreto 102 13-primera_infancia_rd
Decreto 102 13-primera_infancia_rd
 
Syandes Quiz 3 - Tommy
Syandes Quiz 3 - TommySyandes Quiz 3 - Tommy
Syandes Quiz 3 - Tommy
 
Управление проектами, или как успеть в срок, не упуская мелочей
Управление проектами, или как успеть в срок, не упуская мелочейУправление проектами, или как успеть в срок, не упуская мелочей
Управление проектами, или как успеть в срок, не упуская мелочей
 
Ignite Presentation Slides
Ignite Presentation SlidesIgnite Presentation Slides
Ignite Presentation Slides
 
Промопрезентация
ПромопрезентацияПромопрезентация
Промопрезентация
 
Создать интернет магазин - легко! (Михаил Федотов)
Создать интернет магазин  - легко! (Михаил Федотов)Создать интернет магазин  - легко! (Михаил Федотов)
Создать интернет магазин - легко! (Михаил Федотов)
 
Nathen mathews ignite slides
Nathen mathews ignite slidesNathen mathews ignite slides
Nathen mathews ignite slides
 
Корпоративный портал. повышаем прозрачность и управляемость компании (Екатери...
Корпоративный портал. повышаем прозрачность и управляемость компании (Екатери...Корпоративный портал. повышаем прозрачность и управляемость компании (Екатери...
Корпоративный портал. повышаем прозрачность и управляемость компании (Екатери...
 
Контент-маркетинг
Контент-маркетингКонтент-маркетинг
Контент-маркетинг
 
Вдохновляющие истории интернет-маркетинга (Екатерина Просвиркина)
Вдохновляющие истории интернет-маркетинга (Екатерина Просвиркина)Вдохновляющие истории интернет-маркетинга (Екатерина Просвиркина)
Вдохновляющие истории интернет-маркетинга (Екатерина Просвиркина)
 
Современные технологии для интернет-магазинов
Современные технологии для интернет-магазиновСовременные технологии для интернет-магазинов
Современные технологии для интернет-магазинов
 
Контекст для интернет-магазинов (Ярослав Тхоренко)
Контекст для интернет-магазинов (Ярослав Тхоренко)Контекст для интернет-магазинов (Ярослав Тхоренко)
Контекст для интернет-магазинов (Ярослав Тхоренко)
 
PENGARUH BAHAN TAMBAH (POLYMER P102) TERHADAP KUAT TEKAN DAN MODULUS ELASTISI...
PENGARUH BAHAN TAMBAH (POLYMER P102) TERHADAP KUAT TEKAN DAN MODULUS ELASTISI...PENGARUH BAHAN TAMBAH (POLYMER P102) TERHADAP KUAT TEKAN DAN MODULUS ELASTISI...
PENGARUH BAHAN TAMBAH (POLYMER P102) TERHADAP KUAT TEKAN DAN MODULUS ELASTISI...
 
Cоздать интернет-магазин? – Легко! Возможности платформы «1С-Битрикс» для упр...
Cоздать интернет-магазин? – Легко! Возможности платформы «1С-Битрикс» для упр...Cоздать интернет-магазин? – Легко! Возможности платформы «1С-Битрикс» для упр...
Cоздать интернет-магазин? – Легко! Возможности платформы «1С-Битрикс» для упр...
 

Similar to Data Models And Details About Open Data

How Many Dimensions of Compatibility?: Discovering What's Right for Your Users
How Many Dimensions of Compatibility?: Discovering What's Right for Your Users How Many Dimensions of Compatibility?: Discovering What's Right for Your Users
How Many Dimensions of Compatibility?: Discovering What's Right for Your Users
Marliese Thomas
 
Reinventing Laboratory Data To Be Bigger, Smarter & Faster
Reinventing Laboratory Data To Be Bigger, Smarter & FasterReinventing Laboratory Data To Be Bigger, Smarter & Faster
Reinventing Laboratory Data To Be Bigger, Smarter & Faster
OSTHUS
 
Reasoning over big data
Reasoning over big dataReasoning over big data
Reasoning over big data
OSTHUS
 
Larissa
Larissa  Larissa
Larissa
Hilary Ip
 
The future of data analytics
The future of data analyticsThe future of data analytics
The future of data analytics
Edward Chenard
 
Getting into ai event slides
Getting into ai   event slidesGetting into ai   event slides
Getting into ai event slides
Matthew Miller
 
Ethics andtel
Ethics andtelEthics andtel
Ethics andtel
Laurence Cuffe
 
Overview of Data Science and AI
Overview of Data Science and AIOverview of Data Science and AI
Overview of Data Science and AI
johnstamford
 
Emerging Forms of Data and Analytics
Emerging Forms of Data and AnalyticsEmerging Forms of Data and Analytics
Emerging Forms of Data and Analytics
David De Roure
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Dr. Sunil Kr. Pandey
 
Data science training institute in hyderabad
Data science training institute in hyderabadData science training institute in hyderabad
Data science training institute in hyderabad
Kelly Technologies
 
Becoming Datacentric
Becoming DatacentricBecoming Datacentric
Becoming Datacentric
Timothy Cook
 
Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...
The Higher Education Academy
 
Oss swot
Oss swotOss swot
Oss swot
Bill Ott
 
Text analysis-semantic-search
Text analysis-semantic-searchText analysis-semantic-search
Text analysis-semantic-search
Diana Maynard
 
Framework for understanding data science.pdf
Framework for understanding data science.pdfFramework for understanding data science.pdf
Framework for understanding data science.pdf
Michael Brodie
 
ai_and_you_slide_template.pptx
ai_and_you_slide_template.pptxai_and_you_slide_template.pptx
ai_and_you_slide_template.pptx
ganeshjilo
 
Developing cognitive applications v1
Developing cognitive applications v1Developing cognitive applications v1
Developing cognitive applications v1
Harsha Srivatsa
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
Philip Bourne
 
2820181Phil 2 Puzzles and ParadoxesProf. Sven B.docx
2820181Phil 2 Puzzles and ParadoxesProf. Sven B.docx2820181Phil 2 Puzzles and ParadoxesProf. Sven B.docx
2820181Phil 2 Puzzles and ParadoxesProf. Sven B.docx
lorainedeserre
 

Similar to Data Models And Details About Open Data (20)

How Many Dimensions of Compatibility?: Discovering What's Right for Your Users
How Many Dimensions of Compatibility?: Discovering What's Right for Your Users How Many Dimensions of Compatibility?: Discovering What's Right for Your Users
How Many Dimensions of Compatibility?: Discovering What's Right for Your Users
 
Reinventing Laboratory Data To Be Bigger, Smarter & Faster
Reinventing Laboratory Data To Be Bigger, Smarter & FasterReinventing Laboratory Data To Be Bigger, Smarter & Faster
Reinventing Laboratory Data To Be Bigger, Smarter & Faster
 
Reasoning over big data
Reasoning over big dataReasoning over big data
Reasoning over big data
 
Larissa
Larissa  Larissa
Larissa
 
The future of data analytics
The future of data analyticsThe future of data analytics
The future of data analytics
 
Getting into ai event slides
Getting into ai   event slidesGetting into ai   event slides
Getting into ai event slides
 
Ethics andtel
Ethics andtelEthics andtel
Ethics andtel
 
Overview of Data Science and AI
Overview of Data Science and AIOverview of Data Science and AI
Overview of Data Science and AI
 
Emerging Forms of Data and Analytics
Emerging Forms of Data and AnalyticsEmerging Forms of Data and Analytics
Emerging Forms of Data and Analytics
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
 
Data science training institute in hyderabad
Data science training institute in hyderabadData science training institute in hyderabad
Data science training institute in hyderabad
 
Becoming Datacentric
Becoming DatacentricBecoming Datacentric
Becoming Datacentric
 
Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...
 
Oss swot
Oss swotOss swot
Oss swot
 
Text analysis-semantic-search
Text analysis-semantic-searchText analysis-semantic-search
Text analysis-semantic-search
 
Framework for understanding data science.pdf
Framework for understanding data science.pdfFramework for understanding data science.pdf
Framework for understanding data science.pdf
 
ai_and_you_slide_template.pptx
ai_and_you_slide_template.pptxai_and_you_slide_template.pptx
ai_and_you_slide_template.pptx
 
Developing cognitive applications v1
Developing cognitive applications v1Developing cognitive applications v1
Developing cognitive applications v1
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
2820181Phil 2 Puzzles and ParadoxesProf. Sven B.docx
2820181Phil 2 Puzzles and ParadoxesProf. Sven B.docx2820181Phil 2 Puzzles and ParadoxesProf. Sven B.docx
2820181Phil 2 Puzzles and ParadoxesProf. Sven B.docx
 

Recently uploaded

Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
Antonios Katsarakis
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
Fwdays
 
Must Know Postgres Extension for DBA and Developer during Migration
Must Know Postgres Extension for DBA and Developer during MigrationMust Know Postgres Extension for DBA and Developer during Migration
Must Know Postgres Extension for DBA and Developer during Migration
Mydbops
 
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
Jason Yip
 
Demystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through StorytellingDemystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through Storytelling
Enterprise Knowledge
 
Apps Break Data
Apps Break DataApps Break Data
Apps Break Data
Ivo Velitchkov
 
Christine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptxChristine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptx
christinelarrosa
 
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham HillinQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
LizaNolte
 
ScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking ReplicationScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking Replication
ScyllaDB
 
Day 2 - Intro to UiPath Studio Fundamentals
Day 2 - Intro to UiPath Studio FundamentalsDay 2 - Intro to UiPath Studio Fundamentals
Day 2 - Intro to UiPath Studio Fundamentals
UiPathCommunity
 
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
"Scaling RAG Applications to serve millions of users",  Kevin Goedecke"Scaling RAG Applications to serve millions of users",  Kevin Goedecke
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
Fwdays
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
Miro Wengner
 
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptxPRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
christinelarrosa
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
Fwdays
 
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
Fwdays
 
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectorsConnector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
DianaGray10
 
Christine's Product Research Presentation.pptx
Christine's Product Research Presentation.pptxChristine's Product Research Presentation.pptx
Christine's Product Research Presentation.pptx
christinelarrosa
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
ScyllaDB
 
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeckPoznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
FilipTomaszewski5
 
Session 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdfSession 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdf
UiPathCommunity
 

Recently uploaded (20)

Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
 
Must Know Postgres Extension for DBA and Developer during Migration
Must Know Postgres Extension for DBA and Developer during MigrationMust Know Postgres Extension for DBA and Developer during Migration
Must Know Postgres Extension for DBA and Developer during Migration
 
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
 
Demystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through StorytellingDemystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through Storytelling
 
Apps Break Data
Apps Break DataApps Break Data
Apps Break Data
 
Christine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptxChristine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptx
 
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham HillinQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
 
ScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking ReplicationScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking Replication
 
Day 2 - Intro to UiPath Studio Fundamentals
Day 2 - Intro to UiPath Studio FundamentalsDay 2 - Intro to UiPath Studio Fundamentals
Day 2 - Intro to UiPath Studio Fundamentals
 
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
"Scaling RAG Applications to serve millions of users",  Kevin Goedecke"Scaling RAG Applications to serve millions of users",  Kevin Goedecke
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
 
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptxPRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
 
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
 
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectorsConnector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
 
Christine's Product Research Presentation.pptx
Christine's Product Research Presentation.pptxChristine's Product Research Presentation.pptx
Christine's Product Research Presentation.pptx
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
 
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeckPoznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
 
Session 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdfSession 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdf
 

Data Models And Details About Open Data

  • 1. Datasets, Data Models, and Useful Facts In Drawing conclusions If there's a delay and you have your phone keep in mind that Facebook has a language setting called English(Pirate) Banging your head against a wall burns 150 calories an hour. By Michael Bostwick
  • 2.  You've had some experience with technology.  We cant cover it all, so be sure to read the manual.  Google is a great search engine! “Never memorize something that you can look up.” ― Albert Einstein Feel free to ask questions or share insight. Assumptions
  • 3. Introduction Data Sources Example Models Mistakes Using Data Some Tools And Solutions Overview
  • 4. The study of where information comes from, what it represents and how it can be turned into a valuable resource in the creation of business and IT strategies. Data Science
  • 5.
  • 6.  The Great Chain Of Being By Plato  Over time we moved to Porphyrian tree  An example is someone's family tree History Of Human Knowledge Described in more detail very well in a TED Talk by Manuel Lima
  • 7. More and More things are based on Networks http://cpan-explorer.org/ Networks Metaphor
  • 8. From a Ted Talk By Eric Berlow and Sean Gourley (this model expands to see connections, it processed language) TED Talk Map
  • 9. More Maps At socialexplorer.com Mapping Of Social Ties From A Ted Talk By Dave Troy About Social Mapping
  • 10.  kdnuggets.com listing(and industry site)  caesar0301 awesome public datasets  data.gov  code for kansas city github wiki  reddit.com datasets  blog.visual.ly data sources  Open StreetMaps Data  opensecrets.org a watchdog group  wiki's linked data  wiki data a nosql listing of different relationships  trulia local map Data Sets
  • 12.  Freedom of Information Act (FOIA)  Missouri Sunshine Law  Many states have similar laws.  We the people petitions  Media and Other Public Posting Options  instigative journalism postings  craigslist Getting More And New Data
  • 13. A theory that seeks to explain how, why, and at what rate new ideas and technology spread through cultures. Diffusion Of Innovation
  • 14.  Innovation  Any idea, practice, or object that is perceived as new by an individual or other unit of adoption could be considered an innovation available for study.  Adopters  In most case adopters are individuals, but can also be organizations  Communication channels  Time  Social system Components Of Diffusion Of Innovations
  • 15.  Knowledge  The individual first view into to the innovation  Persuasion  The individual is interested in the innovation and actively seeks related information/details.  Decision  The individual takes the concept of the change and weighs the advantages/disadvantages of using the innovation and decides whether to adopt or reject the innovation.  This stage is very hard to get metrics on.  Implementation  They use it, how much and where is based on the situation. During this stage the individual also determines the usefulness of the innovation and may search for further information about it.  Confirmation  The individual his/her decision to continue using the innovation. This stage is both intrapersonal (may cause cognitive dissonance) and interpersonal, confirmation the group has made the right decision. Adoption Process For Diffusion Of Innovation
  • 16.  We heard about it on the news  Officials began to push the use of it  Some people couldn't get it to work, so they contact there friends and family  Issues made some people upset  Now its working, and lots of people use it  Some people still just use there work insurance Example Change.gov
  • 17. We all should remember this from science class Empirical Research Model
  • 18.  The model describes the meaning of its instances.  A conceptual schema is a high-level description of a business's informational needs.  Normally described with entity relationship(E.R) diagrams  Common in software development Semantic Data Model
  • 20.  A very good explanation is given on ted  The Math is lengthy  Cities are very hard to kill  All companies die  Companies and cities are like animals  Growth is linked to population  The growth of population is linked to other factors Geoffrey West Urban Scaling Model
  • 21.  pace of life decree's as you get bigger  double the size of a city, and get a 15% increase of things Growth Is Hard To Sustain
  • 22.  To avoid collapse, major innovation take place  Cycles of innovation to avoid collapse, you have to keep innovating, or collapse Cycles Of Innovation
  • 23. Please Feel Free To Share Models Or Patterns Anti patterns are also approaches that shouldn’t be taken! Open Invitation To Share Models
  • 24.  UML  Universal Modeling Language  PlantUML  draw.io  Different subjects have there own models!  Algorithms were excluded  Open MIT Courses  Khanacademy Algorithms Making New Models
  • 25. If we have data, let’s look at data. If all we have are opinions, let’s go with mine. – Jim Barksdale, former Netscape CEO Torture the data, and it will confess to anything – Ronald Coase, Economics, Nobel Prize Laureate Data Can Be BAD!
  • 26.  Business Failures  Systemic Failure  Loss Of Trust  An example is the bad data used in the autism drug study Possible Harms
  • 27.  False Correlation  Emotional Appeal  Question demographics  Look at relationships instead  Confirmation Bias  Posta Hac Ergo Proctor Hoc  (after which therefore because of which)  after something not because of it Common Mistakes
  • 28.  Show Your work so others can see the mistakes  Showing your work also shows what you might have missed  Data has limits  Data doesn't create meaning, people do Use Critical Thinking
  • 29.  Should get its own talk  Good Starting Tour of M.I.  Microsoft has a very cool cloud framework out  Python and R, are very common tools for using machine learning  Machine learning is great for making predictive calls Machine Learning
  • 30.  Some databases have this feature built in!  QGIS  An open source GIS viewer  ArchGIS  A very common GIS application, that’s licensed  Google Earth  KML Files  Open Street Maps  Can take user submit information  It also allows downloading of all of there data Spatial Mapping
  • 31.  Relation Database Management System  ACID (Atomicity, Consistency, Isolation, Durability)  Commonly SQL  Standard Query Language  Most RDMS systems have a bulk loading tool  Lots Of Examples  Oracle  MS Sql Server  Postgress RDMS
  • 32.  These databases duplicate data  There great for large data sets  Big Data is normally done by this  There great for distributed information  They can scale to handle endless data  Lots of Open Source Options  MongoDB  CouchDB Non Relational Databases
  • 33.  R  Python  Perl  Bash  Windows Powershell  PHP And Javascript Scripting Languages
  • 34.  C#  C++ (and C)  Java There are a lot of solutions on github and sourceforge, that can be changed and expanded Some are frameworks like eclipse frameworks Compiled Programming Languages
  • 35. Blender Molecular flipbook YouTube Khan Academy Amazon Web Services Googles BigQuery Other Tools