As mandated in the Public Health Security and Bioterrorism Preparedness and Response Act of 2002, CDC’s BioSense program was launched in 2003 with the aim of establishing an integrated system of nationwide public health surveillance for the early detection and prompt assessment of potential bioterrorism-related illness. Over the following several years, as awareness grew about the limits of syndromic and related automated surveillance systems, including BioSense, in providing early and accurate epidemic alerts, increased emphasis was placed on their use in providing timely situation awareness throughout the course of public health emergencies. In practice, a key application of these systems has been their use in tracking the course of seasonal influenza and, in 2009, the impact of the H1N1 influenza pandemic. While retaining the original purpose of BioSense of early event (or threat) detection and characterization, we believe the most efficient and effective approach to achieve the program’s long-term business case is to build on existing systems and programs. This will have additional public health benefits that can improve the nation’s health at all times, including: 1. Public health situation awareness, 2. Routine public health practice, 3. Improving health outcomes and public health; and 4. Monitoring healthcare quality
As mandated in the Public Health Security and Bioterrorism Preparedness and Response Act of 2002, CDC’s BioSense program was launched in 2003 with the aim of establishing an integrated system of nationwide public health surveillance for the early detection and prompt assessment of potential bioterrorism-related illness. Over the following several years, as awareness grew about the limits of syndromic and related automated surveillance systems, including BioSense, in providing early and accurate epidemic alerts, increased emphasis was placed on their use in providing timely situation awareness throughout the course of public health emergencies. In practice, a key application of these systems has been their use in tracking the course of seasonal influenza and, in 2009, the impact of the H1N1 influenza pandemic. While retaining the original purpose of BioSense of early event (or threat) detection and characterization, we believe the most efficient and effective approach to achieve the program’s long-term business case is to build on existing systems and programs. This will have additional public health benefits that can improve the nation’s health at all times, including: 1. Public health situation awareness, 2. Routine public health practice, 3. Improving health outcomes and public health; and 4. Monitoring healthcare quality
BioSense 2.0: Public Health Surveillance Through Collaboration. Monday Biosecurity Meeting: Crowd-Sourcing for Outbreak and Agent Identification, The American Association for the Advancement of Science (AAAS) Center for Science, Technology, and Security Policy. Presented by Taha Kass-Hout, MD, MS on November 21, 2011, Noon-1:30pm, Abelson/Haskins Room (2nd Floor, AAAS, 1200 New York Avenue, NW, Washington, DC 20005)
Presenting precisionFDA for the first time at the Precision Medicine Coalition in Washington, DC on February 24, 2016
Any views or opinions expressed here do not necessarily represent the views of the FDA, HHS, or any other entity of the United States government. Furthermore, the use of any product names, trade names, images, or commercial sources is for identification purposes only, and does not imply endorsement or government sanction by the U.S. Department of Health and Human Services.
Riff: A Social Network and Collaborative Platform for Public Health Disease S...Taha Kass-Hout, MD, MS
A hybrid (event-based and indicator-based) platform designed to streamline the collaboration between domain experts and machine learning algorithms for detection, prediction and response to health-related events (such as disease outbreaks or pandemics). The platform helps synthesize health-related event indicators from a wide variety of information sources (structured and unstructured) into a consolidated picture for analysis, maintenance of “community-wide coherence”, and collaboration processes. The platform offers features to detect anomalies, visualize clusters of potential events, predict the rate and spread of a disease outbreak and provide decision makers with tools, methodologies and processes to investigate the event.
[FOSS4G Korea 2016] GeoHash를 이용한 지형도 변화탐지와 시계열 관리BJ Jang
연차별로 구축된 지형도를 PostGIS에 넣어 ST_GeoHash()함수를 이용해 지리적인 식별키를 생성하고 이를 이용해 각 객처별 변화를 탐지해 낸다. 이렇게 탐지한 변화정보를 이용해 지형도의 변화를 시계열적으로 구축하여, 원하는 시점의 자료를 조회하고, 변화내용을 분석하는 과정을 국토지리정보원의 실사례와 함께 설명한다.
Using Qualtrics to Create Automated Online TrainingsShalin Hai-Jew
When thinking about “transformational teaching and learning,” training would not be the first thing to come to mind.
The Qualtrics® research suite offers a number of design tools and features that enable the building of automated online trainings. There are the baseline features such as the ability to integrate multimedia, apply various question designs, enable accessibility features (like alt-texting), deliver a mobile experience, reach learners across distances, and provide basic security and data integrity features.
Other features actually make this tool phenomenally powerful. One is the ability to richly customize learning sequences—by learner profile, by performance (behavior), by selection, or a mix of factors. There is a feature that enables the scoring of learner responses and the ability to set a threshold for passing. This tool has a rich data analytics capability (including a light item analysis), including online analytics and even cross-tabulation analysis. A Qualtrics® API enables the recording of online assessment scores and learner behaviors, in an automated way to faculty / staff / student information systems.
Trainings are critical for effective workplace functioning and professional development. The same features in Qualtrics® that enable the effective building of automated trainings also enable the effective building of pre-learning modules or sequences for learners who need to refresh their skills for a new course. This digital slideshow introduces the use of Qualtrics® as a customizable training and pre-learning module tool.
Understanding Public Sentiment: Conducting a Related-Tags Content Network Ext...Shalin Hai-Jew
This presentation focuses on how to understand public sentiment through a related-tags content network analysis of public Flickr photos and videos. NodeXL is used to conduct data extractions and visualizations of user-tagged Flickr contents and the resulting “noisy” folksonomies. What mental connections may be made about particular issues based on analysis of text-annotated graphs?
Data Synchronization of Epi Info™ Using a Mesh4X Adapter: Presentation at the AMIA 2009 Annual Symposium-Demonstrations: Management of Populations.
Disclaimer: Any views or opinions expressed by the speaker do not necessarily represent the views of the CDC, HHS, or any other entity of the United States government. Furthermore, the use of any product names, trade names, images, or commercial sources is for identification purposes only, and does not imply endorsement or government sanction by the U.S. Department of Health and Human Services.
Hashtag Conversations,Eventgraphs, and User Ego Neighborhoods: Extracting So...Shalin Hai-Jew
This introduces methods for extracting and analyzing social network data from Twitter for hashtag conversations (and emergent events), event graphs, search networks, and user ego neighborhoods (using NodeXL). There will be direct demonstrations and discussions of how to analyze social network graphs. This information may be extended with human- and / or machine-based sentiment analysis.
Writing and Publishing about Applied Technologies in Tech Journals and BooksShalin Hai-Jew
This slideshow provides insights on how to write and publish about applied technologies in tech journals and books, including the following:
Getting started in tech publishing
Cost-benefit calculations
Parts to an article; parts to a chapter
Writing process
Collaborating
Publishing process
Acquiring readers (and citations)
Post-publishing
Next works
Hunting the hunter, can you tell if your phone’s being captured by a rogue cell phone tower/ IMSI catcher/ Stingray? Learn strategies to detect rogue cell phone towers and hear stories from adventures war walking Las Vegas during Defcon. Learn about IMSI catchers their capabilities, LTE to GSM downgrade attacks, and ways to protect yourself from these devices. Discover open source projects and other ways you can get involved to help make cellular technologies safer for users.
Video Link: https://www.youtube.com/watch?v=eivHO1OzF5E
LIWC-ing at Texts for Insights from Linguistic PatternsShalin Hai-Jew
Since the mid-1990s, researchers have been using the Linguistic Inquiry and Word Count (LIWC pronounced “luke”) software tool to explore various text corpora for hidden insights from linguistic patterns. The LIWC tool has evolved over the years. Simultaneously, research using computational text analysis has evolved and shed light on areas of deception, threat assessment, personality, predictive analytics, and other areas. This presentation will highlight some of the applications of LIWC in the research literature and showcase the tool on some original text sets.
Building a Digital Learning Object w/ Articulate Storyline 2Shalin Hai-Jew
The digital learning object (DLO) is still a common staple in online learning. One of the more sophisticated authoring tools to build DLOs is Articulate Storyline 2, which enables the integration of multimedia (including screen captures with Articulate Replay); the building of animations; branching, and other features. Its packaging allows a full range of SCORM and Tin Can API outputs and versioning in HTML 5. This presentation will introduce the software tool and some of its capabilities to provide a sense of where digital learning objects may be headed.
The aim of this presentation is to provide a brief overview of the SCRUM Agile Methodology, and to give organizations an idea of how SCRUM may affect the traditional development of requirements and deliverables.
Scaling GIS Data in Non-relational Data StoresMike Malone
As the amount of GIS data we need to keep track of increases, the amount of devices accessing it increases, and the amount of GIS writes increase, we’re finding that, much like real-time web applications, normal RDBMS’s are not well suited to scaling. This talk covers why GIS data is hard to scale in a normal RDBMS, what nonrelational stores exist out there, and some basic examples of how to do spatial queries within a nonrelational store.
iParanoid: an IMSI Catcher - Stingray Intrusion Detection SystemLuca Bongiorni
The goal is the research and development of Intrusion Detection System related with Cell Networks.
Mainly this App will check the status of some Cell Network variables (e.g. Cellid, LAC, A5 Encryption, etc.) subsequently update a local DB and check if the information about the cell networks around the users are valid or if there could be a risk (e.g. possible interception, possible impersonation, etc.).
BioSense 2.0: Public Health Surveillance Through Collaboration. Monday Biosecurity Meeting: Crowd-Sourcing for Outbreak and Agent Identification, The American Association for the Advancement of Science (AAAS) Center for Science, Technology, and Security Policy. Presented by Taha Kass-Hout, MD, MS on November 21, 2011, Noon-1:30pm, Abelson/Haskins Room (2nd Floor, AAAS, 1200 New York Avenue, NW, Washington, DC 20005)
Presenting precisionFDA for the first time at the Precision Medicine Coalition in Washington, DC on February 24, 2016
Any views or opinions expressed here do not necessarily represent the views of the FDA, HHS, or any other entity of the United States government. Furthermore, the use of any product names, trade names, images, or commercial sources is for identification purposes only, and does not imply endorsement or government sanction by the U.S. Department of Health and Human Services.
Riff: A Social Network and Collaborative Platform for Public Health Disease S...Taha Kass-Hout, MD, MS
A hybrid (event-based and indicator-based) platform designed to streamline the collaboration between domain experts and machine learning algorithms for detection, prediction and response to health-related events (such as disease outbreaks or pandemics). The platform helps synthesize health-related event indicators from a wide variety of information sources (structured and unstructured) into a consolidated picture for analysis, maintenance of “community-wide coherence”, and collaboration processes. The platform offers features to detect anomalies, visualize clusters of potential events, predict the rate and spread of a disease outbreak and provide decision makers with tools, methodologies and processes to investigate the event.
[FOSS4G Korea 2016] GeoHash를 이용한 지형도 변화탐지와 시계열 관리BJ Jang
연차별로 구축된 지형도를 PostGIS에 넣어 ST_GeoHash()함수를 이용해 지리적인 식별키를 생성하고 이를 이용해 각 객처별 변화를 탐지해 낸다. 이렇게 탐지한 변화정보를 이용해 지형도의 변화를 시계열적으로 구축하여, 원하는 시점의 자료를 조회하고, 변화내용을 분석하는 과정을 국토지리정보원의 실사례와 함께 설명한다.
Using Qualtrics to Create Automated Online TrainingsShalin Hai-Jew
When thinking about “transformational teaching and learning,” training would not be the first thing to come to mind.
The Qualtrics® research suite offers a number of design tools and features that enable the building of automated online trainings. There are the baseline features such as the ability to integrate multimedia, apply various question designs, enable accessibility features (like alt-texting), deliver a mobile experience, reach learners across distances, and provide basic security and data integrity features.
Other features actually make this tool phenomenally powerful. One is the ability to richly customize learning sequences—by learner profile, by performance (behavior), by selection, or a mix of factors. There is a feature that enables the scoring of learner responses and the ability to set a threshold for passing. This tool has a rich data analytics capability (including a light item analysis), including online analytics and even cross-tabulation analysis. A Qualtrics® API enables the recording of online assessment scores and learner behaviors, in an automated way to faculty / staff / student information systems.
Trainings are critical for effective workplace functioning and professional development. The same features in Qualtrics® that enable the effective building of automated trainings also enable the effective building of pre-learning modules or sequences for learners who need to refresh their skills for a new course. This digital slideshow introduces the use of Qualtrics® as a customizable training and pre-learning module tool.
Understanding Public Sentiment: Conducting a Related-Tags Content Network Ext...Shalin Hai-Jew
This presentation focuses on how to understand public sentiment through a related-tags content network analysis of public Flickr photos and videos. NodeXL is used to conduct data extractions and visualizations of user-tagged Flickr contents and the resulting “noisy” folksonomies. What mental connections may be made about particular issues based on analysis of text-annotated graphs?
Data Synchronization of Epi Info™ Using a Mesh4X Adapter: Presentation at the AMIA 2009 Annual Symposium-Demonstrations: Management of Populations.
Disclaimer: Any views or opinions expressed by the speaker do not necessarily represent the views of the CDC, HHS, or any other entity of the United States government. Furthermore, the use of any product names, trade names, images, or commercial sources is for identification purposes only, and does not imply endorsement or government sanction by the U.S. Department of Health and Human Services.
Hashtag Conversations,Eventgraphs, and User Ego Neighborhoods: Extracting So...Shalin Hai-Jew
This introduces methods for extracting and analyzing social network data from Twitter for hashtag conversations (and emergent events), event graphs, search networks, and user ego neighborhoods (using NodeXL). There will be direct demonstrations and discussions of how to analyze social network graphs. This information may be extended with human- and / or machine-based sentiment analysis.
Writing and Publishing about Applied Technologies in Tech Journals and BooksShalin Hai-Jew
This slideshow provides insights on how to write and publish about applied technologies in tech journals and books, including the following:
Getting started in tech publishing
Cost-benefit calculations
Parts to an article; parts to a chapter
Writing process
Collaborating
Publishing process
Acquiring readers (and citations)
Post-publishing
Next works
Hunting the hunter, can you tell if your phone’s being captured by a rogue cell phone tower/ IMSI catcher/ Stingray? Learn strategies to detect rogue cell phone towers and hear stories from adventures war walking Las Vegas during Defcon. Learn about IMSI catchers their capabilities, LTE to GSM downgrade attacks, and ways to protect yourself from these devices. Discover open source projects and other ways you can get involved to help make cellular technologies safer for users.
Video Link: https://www.youtube.com/watch?v=eivHO1OzF5E
LIWC-ing at Texts for Insights from Linguistic PatternsShalin Hai-Jew
Since the mid-1990s, researchers have been using the Linguistic Inquiry and Word Count (LIWC pronounced “luke”) software tool to explore various text corpora for hidden insights from linguistic patterns. The LIWC tool has evolved over the years. Simultaneously, research using computational text analysis has evolved and shed light on areas of deception, threat assessment, personality, predictive analytics, and other areas. This presentation will highlight some of the applications of LIWC in the research literature and showcase the tool on some original text sets.
Building a Digital Learning Object w/ Articulate Storyline 2Shalin Hai-Jew
The digital learning object (DLO) is still a common staple in online learning. One of the more sophisticated authoring tools to build DLOs is Articulate Storyline 2, which enables the integration of multimedia (including screen captures with Articulate Replay); the building of animations; branching, and other features. Its packaging allows a full range of SCORM and Tin Can API outputs and versioning in HTML 5. This presentation will introduce the software tool and some of its capabilities to provide a sense of where digital learning objects may be headed.
The aim of this presentation is to provide a brief overview of the SCRUM Agile Methodology, and to give organizations an idea of how SCRUM may affect the traditional development of requirements and deliverables.
Scaling GIS Data in Non-relational Data StoresMike Malone
As the amount of GIS data we need to keep track of increases, the amount of devices accessing it increases, and the amount of GIS writes increase, we’re finding that, much like real-time web applications, normal RDBMS’s are not well suited to scaling. This talk covers why GIS data is hard to scale in a normal RDBMS, what nonrelational stores exist out there, and some basic examples of how to do spatial queries within a nonrelational store.
iParanoid: an IMSI Catcher - Stingray Intrusion Detection SystemLuca Bongiorni
The goal is the research and development of Intrusion Detection System related with Cell Networks.
Mainly this App will check the status of some Cell Network variables (e.g. Cellid, LAC, A5 Encryption, etc.) subsequently update a local DB and check if the information about the cell networks around the users are valid or if there could be a risk (e.g. possible interception, possible impersonation, etc.).
Minnesota GIS/LIS The Geospatial Revolution Peter BattyPeter Batty
My keynote presentation at Minnesota GIS/LIS. Similar to my recent keynote at AGI GeoCommunity in the UK, but with some additional material (50 vs 30 minutes) and a few tweaks. Looks at trends in the geospatial industry in three areas: moving to the mainstream; a real time multimedia view of the world; and crowdsourcing.
The Geospatial Revolution - AGI GeoCommunity keynotePeter Batty
My keynote presentation given at the AGI GeoCommunity conference in Stratford upon Avon, UK, on September 23, 2009. Talking about directions in the geospatial industry.
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...i_scienceEU
Network of Excellence Internet Science Summer School. The theme of the summer school is "Internet Privacy and Identity, Trust and Reputation Mechanisms".
More information: http://www.internet-science.eu/
Introduction To Data Mining: Introduction - The evolution of database
system technology - Steps in knowledge discovery from database process
- Architecture of a data mining systems - Data mining on different kinds
of data - Different kinds of pattern - Technologies used - Applications -
Major issues in data mining - Classification of data mining systems - Data
mining task primitives - Integration of a data mining system with a
database or data warehouse system.
The Geospatial Revolution in CopenhagenPeter Batty
The keynote talk I gave at a seminar organized by the Danish National Mapping Agency in Copenhagen, in March 2010. Some common material with other "Geospatial Revolution" presentations I have given, and some new material too.
DRCOG: The Geospatial Revolution Peter BattyPeter Batty
Presentation at DRCOG in Denver on the Geospatial Revolution. Some additional material on data sharing compared to previous versions of the same talk (but a lot of common material)
NoSQL & Big Data Analytics: History, Hype, OpportunitiesVishy Poosala
Looking at NoSQL and Big Data Analytics as an evolution starting from Relational Databases, and go behind the hype. You can find more on this topic in my blog at: http://innovation-edge.blogspot.com/
Thanks to Gregory Piatetsky-Shapiro for the 2nd half of the slides.
During the rise and innovation of “big data,” the geospatial analytics landscape has grown and evolved. We are beyond just analyzing static maps. Geospatial data is streaming from devices, sensors, infrastructure systems, or social media, and our applications and use cases must dynamically scale to meet the increased demands.
Cloud can provide cost-effective storage and that ephemeral resource-burst needed for fast processing and low latency, all to monetize the immediate value of fresh geospatial data. Geospatial analytics require optimized spatial data types and algorithms to distill data to knowledge. Such processing, especially with strict latency requirements, has always been a challenge.
We propose an open source big data stack for geospatial analytics on Cloud based on Apache NiFi, Apache Spark and LocationTech GeoMesa. GeoMesa is a geospatial framework deployed in a modern big data platform that provides a scalable and low latency solution for indexing volumes of historical data and generating live views and streaming geospatial analytics.
Presentation and live demo performed at DataWorks 2018 Conference - San Jose: https://bit.ly/2xthAGD
Big Data [sorry] & Data Science: What Does a Data Scientist Do?Data Science London
What 'kind of things' does a data scientist do? What are the foundations and principles of data science? What is a Data Product? What does the data science process looks like? Learning from data: Data Modeling or Algorithmic Modeling? - talk by Carlos Somohano @ds_ldn at The Cloud and Big Data: HDInsight on Azure London 25/01/13
Information Extraction and Integration of Hard and Soft Information for D2D v...DataCards
"Information Extraction and Integration of Hard and Soft Information for D2D via
Controlled National Language,” Dr. Tien Pham, US Army Research Laboratory
1. Heatmaps & Geohash:
Integration of Multi-Source Geospatial Data
Abe Usher, CIO
abe@thehumangeo.com
703.955.1540
@abeusher
AKA Heatmaps are the Heat
INFORMATION INTO INSIGHT
2. Our Menu of Subtopics
A LITTLE HISTORY WHY HEATMAPS? GEOHASH? DATA GONE WILD
Big data requires new approaches. A new organizing construct for
Why geospatial data?
information analysis.
KITCHEN MODEL SPECIFIC EXAMPLES EASTER EGG
New ways to combine internal data Concrete take-aways. Treats for making it through another
with external new media for presentation.
maximum insight.
2
6. Geohash
Geohash is a coordinate
transformation
that facilitates combining two
variables (latitude and longitude)
into a single(text) variable
that represents a bounding-box
containing the point of interest.
6
8. A rose by any other name
“Discrete & continuous methods of kernel density
estimation”
Gaussian
Quartic
Exponential
Triangular
Uniform
Epanechnikov
8
9. A rose by any other name
“Discrete methods of kernel density estimation”
9
10. About
Big Data (Digital) Human Geography
Predictive models of social drift &
human behavior
Streaming media analytics
Micro-demographics
We’re hiring! info@thehumangeo.com
10
12. Why Heatmaps and Geohash?
Too much data
Trust in Internet data
12
13. Why Heatmaps and Geohash?
Too much data
Trust in Internet data
Heatmaps look cool
13
14. Why Heatmaps and Geohash?
Too much data
Trust in Internet data
Heatmaps look cool
Geohash helps quantify
data
14
15. Why Heatmaps and Geohash?
Visual summary
Too much data
Trust in Internet data
Heatmaps look cool
Geohash helps quantify
data
Quantitative methods
15
17. Trust and Internet Information
“Game knows
game, baby.”
Tracy Morrow aka “Ice T”
17
18. Trust and Internet Information
“If you have expert
knowledge, then
you are capable of
recognizing expert
knowledge.”
[paraphrased]
Tracy Morrow aka “Ice T”
18
19. Trust and Internet Information
Can we actually trust this Internet stuff?
19
23. Salami Slicing
Salami slicing: series of minor observations, resulting
in a larger observation that would be difficult to perform
http://en.wikipedia.org/wiki/Salami_slicing
23
24. Seven Layer GLT
1. OpenStreetMap data
2. Flickr
3. Panoramio
4. Geonames.org
5. Twitter
6. Wikimapia
7. 4Square
* Geospatial Lattice of Trusted Data
24
25. Seven Layer GLT
1. OpenStreetMap data
2. Flickr
3. Panoramio
4. Geonames.org
5. Twitter
6. Wikimapia
7. 4Square
Spatial Temporal User Finds From the Field (STUFFF)
25
26. Rule #1: Think in terms of aggregation
Twitter data
Panoramio Tourist photos
Classified data
Twitter geohash ez420 – coffee shop
Panoramio geohash ez420 – Starbucks
Classified geohash ez420 - Wifi
Trust through aggregation
26
27. Rule #1: Think in terms of aggregation
Twitter data
Twitter geohash ez420 – coffee shop
Panoramio geohash ez420 – Starbucks
Classified geohash ez420 - Wifi
Panoramio Tourist photos
Geohash creates simple string variables. Classified data
Matching strings = super easy
Matching similar coordinates = impossible
Trust through aggregation
27
28. Rule #1: Think in terms of aggregation
Twitter data
Panoramio Tourist photos
Use geohash to apply collaborative
filtering techniques to develop new Classified data
models of trust & data confidence.
Trust through aggregation
28
40. Heatmap: Recipe One
“OSM Style”
Get Python http://python.org
Get the sethoscope library
http://www.sethoscope.net/heatmap/
Get data
http://bit.ly/geotweet_sc
https://dev.twitter.com/docs/streaming-api/methods#locations
Command line:
heatmap.py -g portland.gpx -o output.png --height 800 --osm
40
42. Heatmap: Recipe One
Stitch it together in an MP4 movie!
Get the CLI app:
http://ffmpeg.org/
Command line:
heatmap.py -g portland.gpx -o output.png --height 800 –osm –a
ffmpeg -i frame-%05d.png OSM_is_awesome.mp4
42
47. Rule #3:
Beware of population effects
34
Absolute value
2,000
Normalized value =
Population estimate
72 52
16,000 25,000
22
2,000
47
48. Conclusion
1. Think in terms of aggregation
2. Selectively throw away precision
3. Beware of population effects
48
49. Contact Us
HumanGeo NY HumanGeo DC
1221 Avenue of the Americas 2500 Wilson Boulevard
Suite 4200 | New York, NY 10020 Suite 310 | Arlington, VA 22201
info@thehumangeo.com | 703.955.1540 | www.thehumangeo.com
49