Keynote at the VGI-Alive workshop at the AGILE 2018 conference in Lund (June 2018).
Keywords: Information geography; crowdsourcing; volunteered geographic information; geo-demographics; Twitter; Wikipedia
Authors:
Andrea Ballatore (Birkbeck, University of London)
Stefano De Sabbata (University of Leicester)
1. Based on an examination of 260 tools, Social360 is currently the only tool capable of adequately enriching geospatial intelligence by extracting relevant information from social media content.
2. Approximately 5% of current tools can monitor social networks in near real-time, geolocate social media information, and forecast events based on social media analysis.
3. Key individuals and organizations like the University of Missouri that could develop new tools for social media geospatial analysis in the next 3-5 years are identified.
Participatory GIS, Volunteered Geographic Information and Citizen Science - G...Muki Haklay
This document discusses participatory GIS (PPGIS), volunteered geographic information (VGI), and citizen science. It provides background on the emergence of PPGIS in the 1990s and describes how it has developed methodologies for public participation with GIS. It also discusses the rise of user-generated geographic content through VGI platforms. Finally, it introduces the concept of "extreme citizen science" where citizens participate in all aspects of the scientific process from defining problems to analyzing results.
This document summarizes a conference presentation on using crowdsourcing approaches like volunteered geographic information (VGI), citizen science (CS), and participatory mapping (PM) to engage the public in policymaking. It defines these approaches and provides examples. While governments have been reluctant to use crowdsourced data due to quality and legal concerns, the presenters argue that capitalizing on established crowdsourcing models and open data practices could help develop public policies to formally incorporate crowdsourcing into government decision-making and engagement. Recommendations are provided for overcoming barriers and assuaging government adoption of these approaches.
Experiences as a producer, consumer and observer of open dataProgCity
Peter Mooney, is an Environmental Protection Agency (EPA) funded Research Fellow at the Department of Computer Science, NUI Maynooth. He has been working with the EPA on making environmental data publicly accessibly for the last ten years.
Presentation was part of The 1st Seminar of the ERC Funded Programmable City Project based at NIRSA, NUI Maynooth, Republic of Ireland.
Volunteered Geographic Information (VGI) for Disaster ManagementEmir Hartato
This document summarizes a presentation on using volunteered geographic information (VGI) for disaster management in Jakarta, Indonesia. It discusses how VGI can help fill information needs during disasters. While VGI has been used for real-time flood mapping in Jakarta, issues around data credibility must be addressed. The presentation aims to determine an appropriate framework to improve VGI credibility for disaster management across all phases of the disaster management cycle. It uses recent floods in Jakarta as a case study.
Using Machine Learning to Analyse Radio Content in Uganda UN Global Pulse
The document describes a project that uses machine learning to analyze radio content in Uganda for development and humanitarian purposes. It details the development of a Radio Content Analysis Tool that can automatically analyze hundreds of hours of radio broadcasts daily and extract text to identify discussions on predefined topics. Several pilot studies were conducted using this tool to understand how radio data could provide insights on issues like refugee perceptions, disaster impacts, health services, and disease outbreaks. The document outlines the automated and human analysis processes used and discusses opportunities and challenges around using talk radio as a source of big data to inform development goals.
1. Based on an examination of 260 tools, Social360 is currently the only tool capable of adequately enriching geospatial intelligence by extracting relevant information from social media content.
2. Approximately 5% of current tools can monitor social networks in near real-time, geolocate social media information, and forecast events based on social media analysis.
3. Key individuals and organizations like the University of Missouri that could develop new tools for social media geospatial analysis in the next 3-5 years are identified.
Participatory GIS, Volunteered Geographic Information and Citizen Science - G...Muki Haklay
This document discusses participatory GIS (PPGIS), volunteered geographic information (VGI), and citizen science. It provides background on the emergence of PPGIS in the 1990s and describes how it has developed methodologies for public participation with GIS. It also discusses the rise of user-generated geographic content through VGI platforms. Finally, it introduces the concept of "extreme citizen science" where citizens participate in all aspects of the scientific process from defining problems to analyzing results.
This document summarizes a conference presentation on using crowdsourcing approaches like volunteered geographic information (VGI), citizen science (CS), and participatory mapping (PM) to engage the public in policymaking. It defines these approaches and provides examples. While governments have been reluctant to use crowdsourced data due to quality and legal concerns, the presenters argue that capitalizing on established crowdsourcing models and open data practices could help develop public policies to formally incorporate crowdsourcing into government decision-making and engagement. Recommendations are provided for overcoming barriers and assuaging government adoption of these approaches.
Experiences as a producer, consumer and observer of open dataProgCity
Peter Mooney, is an Environmental Protection Agency (EPA) funded Research Fellow at the Department of Computer Science, NUI Maynooth. He has been working with the EPA on making environmental data publicly accessibly for the last ten years.
Presentation was part of The 1st Seminar of the ERC Funded Programmable City Project based at NIRSA, NUI Maynooth, Republic of Ireland.
Volunteered Geographic Information (VGI) for Disaster ManagementEmir Hartato
This document summarizes a presentation on using volunteered geographic information (VGI) for disaster management in Jakarta, Indonesia. It discusses how VGI can help fill information needs during disasters. While VGI has been used for real-time flood mapping in Jakarta, issues around data credibility must be addressed. The presentation aims to determine an appropriate framework to improve VGI credibility for disaster management across all phases of the disaster management cycle. It uses recent floods in Jakarta as a case study.
Using Machine Learning to Analyse Radio Content in Uganda UN Global Pulse
The document describes a project that uses machine learning to analyze radio content in Uganda for development and humanitarian purposes. It details the development of a Radio Content Analysis Tool that can automatically analyze hundreds of hours of radio broadcasts daily and extract text to identify discussions on predefined topics. Several pilot studies were conducted using this tool to understand how radio data could provide insights on issues like refugee perceptions, disaster impacts, health services, and disease outbreaks. The document outlines the automated and human analysis processes used and discusses opportunities and challenges around using talk radio as a source of big data to inform development goals.
This document outlines Andrew Crooks' research using GIS and agent-based modeling to study various social and environmental problems. It provides examples of applications including modeling border security, disease spread, refugee camps, slum formation, and wildfire evacuations. New sources of crowdsourced data from social media and the internet are discussed as a way to supplement traditional data and obtain near real-time information to inform models. The integration of GIS and agent-based modeling is presented as a way to explore complex systems and human behaviors at fine spatial and temporal scales.
UN Global Pulse's 2016 annual report summarizes the organization's work to promote the use of big data for development and humanitarian purposes. In 2016, Global Pulse intensified efforts to leverage new data sources to support achieving the UN Sustainable Development Goals. It collaborated with UN agencies on 20 innovation projects using data from sources like social media, mobile phones, and satellite imagery. Global Pulse also worked to build an enabling environment for data innovation, strengthen partnerships, and accelerate adoption of ethical data use policies. The organization continued delivering capacity building and acting as a hub for stakeholders through its Pulse Labs in New York, Indonesia, and Uganda.
What's the status of the NSDI?
Cowen's address will provide his perspective on the current status of the National Spatial Data Infrastructure (NSDI). He will draw from his extensive experience with the National Research Council’s Mapping Science Committee, chairing the NRC study National Land Parcel Data: A Vision for the Future, a recent term as chair of the National Geospatial Advisory Committee, and his service as vice chairman of the Coalition of Geospatial Organizations (COGO) Report Card Committee on the NSDI. Through these activities he has observed and analyzed the Federal geospatial landscape for the thirty years since president Clinton issued Executive Order 12906, Coordinating Geographic Data Acquisition and Access: The National Spatial Data Infrastructure in 1994. He will comment on the changing role of various stakeholders in the collection, maintenance and sharing of geospatial data.
Leveraging Crowdsourced data for Agent-based modeling: Opportunities, Example...crooksAndrew
This document discusses leveraging crowdsourced data from sources like social media, photos, and reviews for agent-based modeling. It provides examples of using this data to model traffic patterns, neighborhood dynamics, disaster response, and wildfires. While this data provides new insights, there are also challenges to address regarding data collection and storage, validation across sources, and addressing biases. Overall, crowdsourced data represents new opportunities for understanding human behavior and perception of space over time through geosimulation models.
If you took a Geography course over 20 years ago, you might recall the subject involving little more than memorizing the locations of continents, countries, cities, as well as climate and cultural facts. In that time, many universities have expanded their geography programs by entering the world of Geographic Information Systems, or GIS for short. In the beginning GIS was an obscure field of specialized hardware, software, and cryptic keyboard commands that allowed a skilled professional to query data to get answers to geographic-based inquiries. Queries, such as the quantity of forested acres within an area, were the beginning of the geographicbased analysis revolution that has since unfolded. But today’s leading geography programs are teaching students more than just the where, what, who, and why of our world, but also bring to the table an interdisciplinary approach to solving today’s local, regional, national, and global problems. Many of these programs are not limited to just universities, now involving the K-12 space, tapping into young people’s minds to unleash innovative ideas in what is now an interdisciplinary field. To maintain a competitive advantage in today’s world, leading countries, companies, and research organizations are embracing these new capabilities and the talent that is available in the marketplace.
Data Innovation: Generating Climate Solutions EventUN Global Pulse
This high-level event will jump-start proceedings of the Sustainable Innovation Forum with discussions and examples of data-driven innovations to accelerate climate solutions and the Sustainable Development Agenda.
Lecture 8: Volunteered Geographic Information (VGI) for Disaster Emergency Re...ESD UNU-IAS
Lecture 8: Volunteered Geographic Information (VGI) for Disaster Emergency Response
Task Force Pendukung Data Spasial (TF-PDS)
Gempa Lombok (UGM)
2019 ProSPER.Net Young Researchers' School
5 March 2019
GSDI Liaison report on Earth Observation-related activities for the CEOS WGISSRemetey-Fülöpp Gábor
Introduction of EO-related activities in GSDI Association highlighting some relevant actions of its regional member EUROGI and national level member HUNAGI
Presenting a new, clear approach to defining neogeography and its various elements, understanding the stakeholders in VGI and researching how volunteered information may benefit users over and above traditional cartography.
From crowdsourced geographic information to participatory citizen science - e...Muki Haklay
Slides from presentation at Leicester Geography seminar March 2014, which is based on earlier discussion in a 'thinking and doing digital mapping' workshop in June 2013 in http://blog.digitalcartography.eu/2013/03/26/june-workshop-thinking-and-doing-digital-mapping/ as part of Charting the Digital project http://digitalcartography.eu/
The presentation discusses Volunteered Geographic Information (crowdsourced information) and Citizen Science, using the philosophy of technology of Albert Borgmann.
Current Disruptions in Media: Earthquakes or New Openings? Stanford as CatalystMartha Russell
Across the globe, new word-of-mouth messaging methods are emerging. Many of these involve new technologies. The strategic use of media has become a game changer for both local and global businesses. Traditional media platforms are outpaced by the speed of flash movements as they unfold. Technical discoveries outpace the scientific journals available to announce them. Journalists, entertainers, academics, scientists, and citizens are experimenting with new tools and platforms for content creation, consumption and curation.
When the news about Tahir Square, or Occupy Wall Street or, more recently the Brazilian protests, hit the headlines of newspapers and magazines, they were already outdated. Documentaries were equally incapable of tracking and fully describing these movements. Traditional narratives – and the technologies used to tell them - fall short of accurately portraying the ideas and behaviors that are emerging through new modes of communication. Information travels so fast, that news is no longer "new". Ubiquitous media disintermediates traditional business ecosystems. And every company must take on roles of a media company.
The world of digital content is experiencing an explosion of innovation in both creation and consumption of media. It may well have been consumer applications that ignited the transformation, but business, enterprise and government interests have joined the party. Across the entire innovation ecosystem of media, new technologies and new uses of it by people are creating a sea change in the way people participate and in the responses they expect, Streaming coverage, both amateur and professional – both business and community, is powered by cutting edge technology in combinations of smartphones, 4G, drone cameras and, even, Google Glass can report on events and movements, products and services. The new role of the Chief Digital Officer has emerged in many organizations - to help management bridge the changing roles usually played by Chief Information Officers, Chief Marketing Officers, and Chief Technology Officers.
Labs affiliated with mediaX at Stanford University study how people and information technology interact. We invite discovery collaborations on the future of content for business, education, and entertainment.
This document summarizes a meeting of the Land Use Systems Group regarding ongoing research into the nexus between digitization and landscape research. It provides an overview of the objectives to conduct a systematic literature review on this topic, identify research gaps, and suggest future directions. A preliminary literature review identified three relevant domains: the impact of digitization on landscapes, its interlinkage with landscape perceptions, and use of digitization for citizen science in landscape research. Next steps include a more detailed literature review and publishing a review paper on the current state of knowledge by March 2019. The research aims to better understand how digitization may influence landscape interactions and perceptions over time.
Presented by Prof. Dr. Darla Munroe, The Ohio State University, USA, on 10 November 2020 at "International workshop: Enhancing wetland management and sustainable development"
Geographical information system : GIS and Social Media Imran Ghaznavi
Geographical Information System (GIS) is a system for capturing, storing, analyzing and managing data which is spatially referenced to Earth. GIS allows users to correlate geographic and attribute information to better understand relationships and patterns. Key applications of GIS include urban planning, transportation, education, government, marketing and more. The rise of social media and location-based services has led to a convergence of GIS and social media, allowing analysis of social networks and interactions tied to physical locations over time. This new development enables deeper understanding of human behavior patterns across communities.
EEO/AGI-Scotland 2015: Citizen Science and GIScience - background and common ...Muki Haklay
These are slides from a talk at Edinburgh EEO/AGI-Scotland seminar. The talk explores how Geographic Information Science (GIScience) can contribute to citizen science, and what citizen science can contribute to GIScience.
Extreme Citizen Science: the socio-political potential of citizen scienceMuki Haklay
Slides from a talk at the International Congress for Conservation Biology / European Congress for Conservation Biology 2015 (Montpellier 2-6 August). The talk positioned citizen science within the wider context of production and use of environmental information, and emphasised the need to extend citizen science to a wider audience. It also demonstrated how technology can be used within a careful participatory process.
Free as in Puppies: Compensating for ICT Constraints in Citizen ScienceAndrea Wiggins
This document discusses how citizen science project organizers address issues related to participation and data quality when limited by resource constraints. It notes that while ideal information and communication technologies (ICT) could help with recruitment, retention, and data quality, free ICT options are often not sufficient due to lack of funding. The document presents three citizen science projects that have contrasting resource levels and explores how they creatively address these issues through alternative solutions like in-person outreach, paper data collection, and process optimization. It concludes by discussing implications for practice, policy, and CSCW research around leveraging complementary resources when full ICT capabilities are not possible.
This document outlines Andrew Crooks' research using GIS and agent-based modeling to study various social and environmental problems. It provides examples of applications including modeling border security, disease spread, refugee camps, slum formation, and wildfire evacuations. New sources of crowdsourced data from social media and the internet are discussed as a way to supplement traditional data and obtain near real-time information to inform models. The integration of GIS and agent-based modeling is presented as a way to explore complex systems and human behaviors at fine spatial and temporal scales.
UN Global Pulse's 2016 annual report summarizes the organization's work to promote the use of big data for development and humanitarian purposes. In 2016, Global Pulse intensified efforts to leverage new data sources to support achieving the UN Sustainable Development Goals. It collaborated with UN agencies on 20 innovation projects using data from sources like social media, mobile phones, and satellite imagery. Global Pulse also worked to build an enabling environment for data innovation, strengthen partnerships, and accelerate adoption of ethical data use policies. The organization continued delivering capacity building and acting as a hub for stakeholders through its Pulse Labs in New York, Indonesia, and Uganda.
What's the status of the NSDI?
Cowen's address will provide his perspective on the current status of the National Spatial Data Infrastructure (NSDI). He will draw from his extensive experience with the National Research Council’s Mapping Science Committee, chairing the NRC study National Land Parcel Data: A Vision for the Future, a recent term as chair of the National Geospatial Advisory Committee, and his service as vice chairman of the Coalition of Geospatial Organizations (COGO) Report Card Committee on the NSDI. Through these activities he has observed and analyzed the Federal geospatial landscape for the thirty years since president Clinton issued Executive Order 12906, Coordinating Geographic Data Acquisition and Access: The National Spatial Data Infrastructure in 1994. He will comment on the changing role of various stakeholders in the collection, maintenance and sharing of geospatial data.
Leveraging Crowdsourced data for Agent-based modeling: Opportunities, Example...crooksAndrew
This document discusses leveraging crowdsourced data from sources like social media, photos, and reviews for agent-based modeling. It provides examples of using this data to model traffic patterns, neighborhood dynamics, disaster response, and wildfires. While this data provides new insights, there are also challenges to address regarding data collection and storage, validation across sources, and addressing biases. Overall, crowdsourced data represents new opportunities for understanding human behavior and perception of space over time through geosimulation models.
If you took a Geography course over 20 years ago, you might recall the subject involving little more than memorizing the locations of continents, countries, cities, as well as climate and cultural facts. In that time, many universities have expanded their geography programs by entering the world of Geographic Information Systems, or GIS for short. In the beginning GIS was an obscure field of specialized hardware, software, and cryptic keyboard commands that allowed a skilled professional to query data to get answers to geographic-based inquiries. Queries, such as the quantity of forested acres within an area, were the beginning of the geographicbased analysis revolution that has since unfolded. But today’s leading geography programs are teaching students more than just the where, what, who, and why of our world, but also bring to the table an interdisciplinary approach to solving today’s local, regional, national, and global problems. Many of these programs are not limited to just universities, now involving the K-12 space, tapping into young people’s minds to unleash innovative ideas in what is now an interdisciplinary field. To maintain a competitive advantage in today’s world, leading countries, companies, and research organizations are embracing these new capabilities and the talent that is available in the marketplace.
Data Innovation: Generating Climate Solutions EventUN Global Pulse
This high-level event will jump-start proceedings of the Sustainable Innovation Forum with discussions and examples of data-driven innovations to accelerate climate solutions and the Sustainable Development Agenda.
Lecture 8: Volunteered Geographic Information (VGI) for Disaster Emergency Re...ESD UNU-IAS
Lecture 8: Volunteered Geographic Information (VGI) for Disaster Emergency Response
Task Force Pendukung Data Spasial (TF-PDS)
Gempa Lombok (UGM)
2019 ProSPER.Net Young Researchers' School
5 March 2019
GSDI Liaison report on Earth Observation-related activities for the CEOS WGISSRemetey-Fülöpp Gábor
Introduction of EO-related activities in GSDI Association highlighting some relevant actions of its regional member EUROGI and national level member HUNAGI
Presenting a new, clear approach to defining neogeography and its various elements, understanding the stakeholders in VGI and researching how volunteered information may benefit users over and above traditional cartography.
From crowdsourced geographic information to participatory citizen science - e...Muki Haklay
Slides from presentation at Leicester Geography seminar March 2014, which is based on earlier discussion in a 'thinking and doing digital mapping' workshop in June 2013 in http://blog.digitalcartography.eu/2013/03/26/june-workshop-thinking-and-doing-digital-mapping/ as part of Charting the Digital project http://digitalcartography.eu/
The presentation discusses Volunteered Geographic Information (crowdsourced information) and Citizen Science, using the philosophy of technology of Albert Borgmann.
Current Disruptions in Media: Earthquakes or New Openings? Stanford as CatalystMartha Russell
Across the globe, new word-of-mouth messaging methods are emerging. Many of these involve new technologies. The strategic use of media has become a game changer for both local and global businesses. Traditional media platforms are outpaced by the speed of flash movements as they unfold. Technical discoveries outpace the scientific journals available to announce them. Journalists, entertainers, academics, scientists, and citizens are experimenting with new tools and platforms for content creation, consumption and curation.
When the news about Tahir Square, or Occupy Wall Street or, more recently the Brazilian protests, hit the headlines of newspapers and magazines, they were already outdated. Documentaries were equally incapable of tracking and fully describing these movements. Traditional narratives – and the technologies used to tell them - fall short of accurately portraying the ideas and behaviors that are emerging through new modes of communication. Information travels so fast, that news is no longer "new". Ubiquitous media disintermediates traditional business ecosystems. And every company must take on roles of a media company.
The world of digital content is experiencing an explosion of innovation in both creation and consumption of media. It may well have been consumer applications that ignited the transformation, but business, enterprise and government interests have joined the party. Across the entire innovation ecosystem of media, new technologies and new uses of it by people are creating a sea change in the way people participate and in the responses they expect, Streaming coverage, both amateur and professional – both business and community, is powered by cutting edge technology in combinations of smartphones, 4G, drone cameras and, even, Google Glass can report on events and movements, products and services. The new role of the Chief Digital Officer has emerged in many organizations - to help management bridge the changing roles usually played by Chief Information Officers, Chief Marketing Officers, and Chief Technology Officers.
Labs affiliated with mediaX at Stanford University study how people and information technology interact. We invite discovery collaborations on the future of content for business, education, and entertainment.
This document summarizes a meeting of the Land Use Systems Group regarding ongoing research into the nexus between digitization and landscape research. It provides an overview of the objectives to conduct a systematic literature review on this topic, identify research gaps, and suggest future directions. A preliminary literature review identified three relevant domains: the impact of digitization on landscapes, its interlinkage with landscape perceptions, and use of digitization for citizen science in landscape research. Next steps include a more detailed literature review and publishing a review paper on the current state of knowledge by March 2019. The research aims to better understand how digitization may influence landscape interactions and perceptions over time.
Presented by Prof. Dr. Darla Munroe, The Ohio State University, USA, on 10 November 2020 at "International workshop: Enhancing wetland management and sustainable development"
Geographical information system : GIS and Social Media Imran Ghaznavi
Geographical Information System (GIS) is a system for capturing, storing, analyzing and managing data which is spatially referenced to Earth. GIS allows users to correlate geographic and attribute information to better understand relationships and patterns. Key applications of GIS include urban planning, transportation, education, government, marketing and more. The rise of social media and location-based services has led to a convergence of GIS and social media, allowing analysis of social networks and interactions tied to physical locations over time. This new development enables deeper understanding of human behavior patterns across communities.
EEO/AGI-Scotland 2015: Citizen Science and GIScience - background and common ...Muki Haklay
These are slides from a talk at Edinburgh EEO/AGI-Scotland seminar. The talk explores how Geographic Information Science (GIScience) can contribute to citizen science, and what citizen science can contribute to GIScience.
Extreme Citizen Science: the socio-political potential of citizen scienceMuki Haklay
Slides from a talk at the International Congress for Conservation Biology / European Congress for Conservation Biology 2015 (Montpellier 2-6 August). The talk positioned citizen science within the wider context of production and use of environmental information, and emphasised the need to extend citizen science to a wider audience. It also demonstrated how technology can be used within a careful participatory process.
Free as in Puppies: Compensating for ICT Constraints in Citizen ScienceAndrea Wiggins
This document discusses how citizen science project organizers address issues related to participation and data quality when limited by resource constraints. It notes that while ideal information and communication technologies (ICT) could help with recruitment, retention, and data quality, free ICT options are often not sufficient due to lack of funding. The document presents three citizen science projects that have contrasting resource levels and explores how they creatively address these issues through alternative solutions like in-person outreach, paper data collection, and process optimization. It concludes by discussing implications for practice, policy, and CSCW research around leveraging complementary resources when full ICT capabilities are not possible.
David Nicholas, Ciber: Audience Analysis and Modelling, the case of CIBER and...michellep
The document discusses CIBER's work using deep log analysis (DLA) to understand user behavior in digital environments. DLA transforms digital activity data into information on user satisfaction, impacts, and outcomes. Findings from DLA are used to inform system changes. Benefits of DLA include its ability to analyze massive amounts of direct and real-time user activity data without sampling. CIBER has used DLA in various research projects analyzing digital behavior in fields like health, publishing, and education.
Citizen Science in Open Science context: measuring & understanding impacts of...Muki Haklay
Citizen science has grown rapidly in recent decades due to societal and technological trends. It includes a wide range of activities across disciplines. While not all participants want deep engagement, citizen science can involve fully participatory research processes. It is gaining recognition from the public and policymakers. Evaluation of citizen science projects requires sensitivity, as rigid criteria may exclude some activities or newcomers. Overall, citizen science shows potential for increasing scientific literacy and knowledge while achieving important research goals.
From Telling Stories with Data to Telling Stories with Data Infrastructures: ...Liliana Bounegru
The document discusses reimagining data journalism through the lens of data infrastructures. It provides examples from digital methods research that investigate digital platforms and data creation. These include mapping right-wing groups in Europe using web analysis, examining counter-jihadist networks on Facebook, and analyzing climate change negotiations through transcripts and indicators. Examples from journalism that engage with data infrastructures include reverse engineering Netflix's film genres, mapping misinformation spread on Twitter, examining email targeting models, and making memory politics on social media visible. The document promotes accounting for socio-technical conditions of data and investigating how data infrastructures could be composed differently.
The previous AGI Foresight Report from 2010 accurately predicted several trends in the geospatial industry over the following five years, including the acceleration of technology change, the growing dominance of cloud-based solutions, and the rise of open source software due to budget constraints in the public sector. However, some predictions like greater market inroads by Chinese companies did not fully materialize. While the major players in 2010 like Google and Esri remained dominant, the report underestimated the growth of startups in areas like open source mapping. Overall, the report served as a useful benchmark, but also shows how difficult it is to fully anticipate the pace and direction of technological change.
COST Actions: ENERGIC, Mapping and the citizen sensor.Vyron
A presentation given during the COST Session in HAICTA 2013 (Cofru, Greece) about the aims and work of two COST Actions: ENERGIC (IC1203) and Mapping and the citizen sensor (TD1202). The presentation was put together by Cristina Capineri, Giles Foody and Vyron Antoniou.
Similar to Geographies of crowdsourced information and their implications (VGI-Alive Keynote), AGILE 2018, Lund (20)
Analysis insight about a Flyball dog competition team's performanceroli9797
Insight of my analysis about a Flyball dog competition team's last year performance. Find more: https://github.com/rolandnagy-ds/flyball_race_analysis/tree/main
State of Artificial intelligence Report 2023kuntobimo2016
Artificial intelligence (AI) is a multidisciplinary field of science and engineering whose goal is to create intelligent machines.
We believe that AI will be a force multiplier on technological progress in our increasingly digital, data-driven world. This is because everything around us today, ranging from culture to consumer products, is a product of intelligence.
The State of AI Report is now in its sixth year. Consider this report as a compilation of the most interesting things we’ve seen with a goal of triggering an informed conversation about the state of AI and its implication for the future.
We consider the following key dimensions in our report:
Research: Technology breakthroughs and their capabilities.
Industry: Areas of commercial application for AI and its business impact.
Politics: Regulation of AI, its economic implications and the evolving geopolitics of AI.
Safety: Identifying and mitigating catastrophic risks that highly-capable future AI systems could pose to us.
Predictions: What we believe will happen in the next 12 months and a 2022 performance review to keep us honest.
The Building Blocks of QuestDB, a Time Series Databasejavier ramirez
Talk Delivered at Valencia Codes Meetup 2024-06.
Traditionally, databases have treated timestamps just as another data type. However, when performing real-time analytics, timestamps should be first class citizens and we need rich time semantics to get the most out of our data. We also need to deal with ever growing datasets while keeping performant, which is as fun as it sounds.
It is no wonder time-series databases are now more popular than ever before. Join me in this session to learn about the internal architecture and building blocks of QuestDB, an open source time-series database designed for speed. We will also review a history of some of the changes we have gone over the past two years to deal with late and unordered data, non-blocking writes, read-replicas, or faster batch ingestion.
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Aggregage
This webinar will explore cutting-edge, less familiar but powerful experimentation methodologies which address well-known limitations of standard A/B Testing. Designed for data and product leaders, this session aims to inspire the embrace of innovative approaches and provide insights into the frontiers of experimentation!
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Round table discussion of vector databases, unstructured data, ai, big data, real-time, robots and Milvus.
A lively discussion with NJ Gen AI Meetup Lead, Prasad and Procure.FYI's Co-Found
Global Situational Awareness of A.I. and where its headedvikram sood
You can see the future first in San Francisco.
Over the past year, the talk of the town has shifted from $10 billion compute clusters to $100 billion clusters to trillion-dollar clusters. Every six months another zero is added to the boardroom plans. Behind the scenes, there’s a fierce scramble to secure every power contract still available for the rest of the decade, every voltage transformer that can possibly be procured. American big business is gearing up to pour trillions of dollars into a long-unseen mobilization of American industrial might. By the end of the decade, American electricity production will have grown tens of percent; from the shale fields of Pennsylvania to the solar farms of Nevada, hundreds of millions of GPUs will hum.
The AGI race has begun. We are building machines that can think and reason. By 2025/26, these machines will outpace college graduates. By the end of the decade, they will be smarter than you or I; we will have superintelligence, in the true sense of the word. Along the way, national security forces not seen in half a century will be un-leashed, and before long, The Project will be on. If we’re lucky, we’ll be in an all-out race with the CCP; if we’re unlucky, an all-out war.
Everyone is now talking about AI, but few have the faintest glimmer of what is about to hit them. Nvidia analysts still think 2024 might be close to the peak. Mainstream pundits are stuck on the wilful blindness of “it’s just predicting the next word”. They see only hype and business-as-usual; at most they entertain another internet-scale technological change.
Before long, the world will wake up. But right now, there are perhaps a few hundred people, most of them in San Francisco and the AI labs, that have situational awareness. Through whatever peculiar forces of fate, I have found myself amongst them. A few years ago, these people were derided as crazy—but they trusted the trendlines, which allowed them to correctly predict the AI advances of the past few years. Whether these people are also right about the next few years remains to be seen. But these are very smart people—the smartest people I have ever met—and they are the ones building this technology. Perhaps they will be an odd footnote in history, or perhaps they will go down in history like Szilard and Oppenheimer and Teller. If they are seeing the future even close to correctly, we are in for a wild ride.
Let me tell you what we see.
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfGetInData
Recently we have observed the rise of open-source Large Language Models (LLMs) that are community-driven or developed by the AI market leaders, such as Meta (Llama3), Databricks (DBRX) and Snowflake (Arctic). On the other hand, there is a growth in interest in specialized, carefully fine-tuned yet relatively small models that can efficiently assist programmers in day-to-day tasks. Finally, Retrieval-Augmented Generation (RAG) architectures have gained a lot of traction as the preferred approach for LLMs context and prompt augmentation for building conversational SQL data copilots, code copilots and chatbots.
In this presentation, we will show how we built upon these three concepts a robust Data Copilot that can help to democratize access to company data assets and boost performance of everyone working with data platforms.
Why do we need yet another (open-source ) Copilot?
How can we build one?
Architecture and evaluation
Learn SQL from basic queries to Advance queriesmanishkhaire30
Dive into the world of data analysis with our comprehensive guide on mastering SQL! This presentation offers a practical approach to learning SQL, focusing on real-world applications and hands-on practice. Whether you're a beginner or looking to sharpen your skills, this guide provides the tools you need to extract, analyze, and interpret data effectively.
Key Highlights:
Foundations of SQL: Understand the basics of SQL, including data retrieval, filtering, and aggregation.
Advanced Queries: Learn to craft complex queries to uncover deep insights from your data.
Data Trends and Patterns: Discover how to identify and interpret trends and patterns in your datasets.
Practical Examples: Follow step-by-step examples to apply SQL techniques in real-world scenarios.
Actionable Insights: Gain the skills to derive actionable insights that drive informed decision-making.
Join us on this journey to enhance your data analysis capabilities and unlock the full potential of SQL. Perfect for data enthusiasts, analysts, and anyone eager to harness the power of data!
#DataAnalysis #SQL #LearningSQL #DataInsights #DataScience #Analytics
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...sameer shah
"Join us for STATATHON, a dynamic 2-day event dedicated to exploring statistical knowledge and its real-world applications. From theory to practice, participants engage in intensive learning sessions, workshops, and challenges, fostering a deeper understanding of statistical methodologies and their significance in various fields."
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
Geographies of crowdsourced information and their implications (VGI-Alive Keynote), AGILE 2018, Lund
1. 1
Geographies of
crowdsourced information
and their implications
@a_ballatore
VGI-Alive, AGILE
June 2018, Lund, Sweden
Andrea Ballatore*
Department of Geography
Birkbeck, University of London
Stefano De Sabbata
School of Geography, Geology and the Env.,
University of Leicester
aballatore.space
2. 2
Outline
1. Where are we?
2. Core research questions
3. Paradigm limitations
4. Beyond the usual suspects
5. Case studies: Search behaviour
6. [Stefano will do the rest]
4. 4
The success of
crowdsourcing
"85% of the top global brands have used
crowdsourcing in the last ten years. […] According
to Gartner, 75% of the world’s high performing
enterprises will be using crowdsourcing by 2018."
(Deloitte, 2016)
It is when new, successful technologies withdraw
into the "woodwork of everyday banality" that
their effects become real and profound.
(Vincent Mosco, 2004)
https://ideascale.com
5. 5
The success of
crowdsourcing
"85% of the top global brands have used
crowdsourcing in the last ten years. […] According
to Gartner, 75% of the world’s high performing
enterprises will be using crowdsourcing by 2018."
(Deloitte, 2016)
It is when new, successful technologies withdraw
into the "woodwork of everyday banality" that
their effects become real and profound.
(Vincent Mosco, 2004)
https://ideascale.com
8. 8
Placing the crowds
• Crowdsourced
geographic information
(CGI)
• From experimental phase
to oligopoly (e.g.,
Google, Facebook)
• From cyberoptimism to
cyberpessimism
https://www.cagle.com
9. 9
Placing the crowds
• Crowdsourced
geographic information
(CGI)
• From experimental phase
to oligopoly (e.g.,
Google, Facebook)
• From cyberoptimism to
cyberpessimism
(2016)
11. 11
Core CGI research
1. Who are the contributors and why do they engage
in spatial information production, and what
incentives work or do not work? How do they
collaborate and organise? How do we include
marginalised communities?
(Budhathoki and Haythornthwaite 2013)
2. How can we calculate the quality and fitness for
purpose of crowdsourced data in a reliable,
preferably intrinsic way? (Goodchild and Li 2012)
3. What are the limitations of such models and what
are their spatial, epistemic and cultural biases?
(Dodge and Kitchin 2013)
12. 12
Core CGI research
1. Who are the contributors and why do they engage
in spatial information production, and what
incentives work or do not work? How do they
collaborate and organise? How do we include
marginalised communities?
(Budhathoki and Haythornthwaite 2013)
2. How can we calculate the quality and fitness for
purpose of crowdsourced data in a reliable,
preferably intrinsic way? (Goodchild and Li 2012)
3. What are the limitations of such models and what
are their spatial, epistemic and cultural biases?
(Dodge and Kitchin 2013)
13. 13
CGI for other domains
Natural sciences: biology,
climate science, Earth
sensing
Social sciences: urban planning,
transportation,
public health, economics,
human geography
Humanities: digital humanities,
history, cultural analytics
15. 15
• "Ignorance of crowds"
(Carr, 2007)
• Conditions for wisdom
• Menial work, no real
innovation/creativity
• Undermining paid work
• Variable quality
Limitations of the paradigm
2004
16. 16
• "Ignorance of crowds"
(Carr, 2007)
• Conditions for wisdom
• Menial work, no real
innovation/creativity
• Undermining paid work
• Variable quality
Limitations of the paradigm
2007
17. 17
• Large volumes of information
do not imply usefulness or
fitness for purpose
• We need representative
samples, not large samples
(e.g., random sample of
1,000 > 1M non-random)
Limitations of the paradigm
18. 18
Each CGI source is a particular viewpoint
and will return a different image of the
social and natural world.
suifaijohnmak.wordpress.com
21. 21
• Without centralised planning and protocols,
data quality remains uneven (coverage!)
• Wikipedia replaced Britannica, but
OpenStreetMap is not replacing Google
Maps
• CGI cannot replace established data
collection protocols and sources, but can
provide useful ancillary data
CGI strictures
22. 22
• Without centralised planning and protocols,
data quality remains uneven (coverage!)
• Wikipedia replaced Britannica, but
OpenStreetMap is not replacing Google
Maps
• CGI cannot replace established data
collection protocols and sources, but can
provide useful ancillary data
CGI strictures
23. 23
Reinventing wheels
• Some CGI replicates work
that has been done better
by professionals
• More useful to focus on
"missing" data:
bookscrounger.com
24. 24
Open data vs CGI
Authoritative datasets
are becoming
cheaper/free
28. 28
• Google Maps
• From 5M to 50M
contributors in
2017
• 700K new places
monthly
• Gamification
29. 29
• Hundreds of millions of users, billions
of reviews
• Measurable effects on spatial and
economic behaviour
• Sentiment about points of interest,
cities, and neighbourhoods
30. 30
• Micro-economic data (e.g. price of onions
in India, new shops in Ghana)
• For profit, contributors are paid
• Applications: International Development,
Government, Global Security, and Business
33. 33
Online visibility of CGI projects
• Search engines are the key entry point
to discover new information
• Feedback loop between Wikipedia and
Google Search to attract new
contributors
• Making CGI findable and consumable
for search engines and social media
• Study on CGI on Google Search (2018)
34. 34
Online visibility of CGI projects
• Search engines are the key entry point
to discover new information
• Feedback loop between Wikipedia and
Google Search to attract new
contributors
• Making CGI findable and consumable
for search engines and social media
• Study on CGI on Google Search (2018)
38. Andrea Ballatore
Department of Geography,
Birkbeck, University of London
Stefano De Sabbata*
School of Geography, Geology and the Env.,
University of Leicester
S c h o o l o f
G e o g r a p h y ,
G e o l o g y a n d t h e
E n v i r o n m e n t
Geographies of
crowdsourced information
and their implications
AGILE 2018 – Lund, Sweden
39. Case studies
• Operationalising quality in health studies
– with J Bright, S Lee, B Ganesh, DK Humphreys
– OpenStreetMap
– impact of availability on alcohol-related harm
• Analysis of global gazetteers
– with E Acheson and RS Purves
– GeoNames and Getty TGN
– patterns of coverage
• Urban Information Geographies
– with A Ballatore
– with N Tate
– quality and representativeness of VGI
40. Operationalising
VGI in health
studies
• with J Bright, S Lee, B
Ganesh, DK Humphreys
• OpenStreetMap
• impact of availability on
alcohol-related harm
• https://www.sciencedirect.com/scie
nce/article/pii/S1353829217305804
41. Quality indicator
Values Transformation
Num. of map features per square kilometre Log transformed, scaled
Average num. of edits per map feature Log transformed, scaled
Num. of users who made at least one edit Log transformed, scaled
Num. of features edited by more than one user Log transformed, scaled
Num. of days with at least one edit Log transformed, scaled
Days since the last edit in the area Inversed, scaled
Proportion of buildings with full address -1 if 0%, 0 if no buildings
• Intrinsic quality indicator based on Barron et al. (2014)
• Calculated as a linear combination of:
42. Replicating CRESH study
The overall conclusions are essentially identical
• significant positive correlation between density of
alcohol outlets and incidence of alcohol related harms
• an effect size which is comparable (though smaller)
The correlation
between OSM-based
estimates and CRESH
is stronger where the
quality index is higher
43. Analysis of global
gazetteers
• with E Acheson and RS
Purves
• GeoNames and
Getty TGN
• patterns of coverage
• https://www.sciencedirect.com/scie
nce/article/pii/S0198971516302496
44. Comparing GeoNames and TGN
Two distinct levels of coverages
• in “high coverage” countries, TGN same order of
magnitude as counts in GeoNames
• in “low coverage” countries, GeoNames has over 60 times
as much content as TGN
45. GeoNames and TGN in Great Britain
Gazetteer Features Pop place
GeoNames 54,701 16,475
Getty TGN 24,003 16,816
Ordnance Survey 50k 248,626
Ordnance Survey OpenNames 41,490
49. Findings
• Overall confirmed (some) hypothesis
– representation similar biases as participation
– Higher qualifications strongest factor
– Wealth (house prices) strong factor in both, more so for Wikipedia
– Twitter strongly influenced by perc. of ppl. aged 30-44 (positively) and
households with de-pendent children (negatively)
• However
– models account only for about 44–55% of variability
– need for more explanatory factors, e.g., tourism-related activities
• …or can these difference be used as indicators?
– ethnic composition is not a factor in the UK
• Twitter and Wikipedia similar but distinct geographies
– only representative of themselves
50. Comparing Twitter and Wikipedia
Number of tweets
(951 obs.)
~ Number of articles
Adj. R2 = 0.490
52. Conclusions
• More interaction between research on CGI
core and domain applications
– beyond technology, towards social domains
– beyond the usual suspects (OSM, etc)
• Access to data: sampling and scraping
• Systematic work on limitations is needed
– Urban Information Geographies
• Impact of GDPR on access and use of data
53. Thank you for your attention.
Any questions?
Stefano De Sabbata
School of Geography, Geology and the Env.,
University of Leicester
Andrea Ballatore
Department of Geography,
Birkbeck, University of London