6. Poster
Using R with Taiwan Government Open Data to Create a Tool for Monitoring the City's Age-Friendly Status
Ting Wei Lin (1), Wen Tsai Hsu (2), Zheng Wan Lin (3), Yu Wen Kao (4), Po Shang Yang (5), Chi Tse Teng (6)
1. System Genome and Biology Program, National Taiwan University, Taiwan
2. Department of Financial and Computational Mathematics, Providence University, Taiwan
3. Department of Information Management, Providence University, Taiwan
4. Department of Statistics and Informatics Science, Providence University, Taiwan
5. Department of Computer Science and Engineering, National Chung-Hsing University, Taiwan
6. Department of Computer Science and Information Engineering, Providence University, Taiwan
Abstract
As the aged population grows rapidly, making cities age-friendly is becoming a priority of government policy. Indexes that reflect a city's age-friendliness are needed for better policy-making and should also be readily accessible to citizens. The R language offers great flexibility in handling the diverse file formats of government data, and the data visualization and web application tools available in R make analysis results more understandable and interactive.
According to Global Age-friendly Cities: A Guide (WHO, 2005), eight aspects contribute to comfortable living for older people (outdoor spaces, transportation, housing, social participation, social respect, civic participation, communication, health and community support). We use Taiwan government open data to build normalized indexes and to visualize them geographically. Finally, we create a Shiny application that makes the results easy to explore. The result shows how government data can be used to turn the WHO guideline into a monitoring tool that supports the government's age-friendly policy practice.
Taiwan government open data
The international trend in the use of public data is to make government information transparent and easily accessible, which promotes citizen participation. In Taiwan, the government launched its open data policy in 2012 to join this trend and pursue greater government transparency. Calls for open government data from the local open data community also accelerated the Taiwan government's path to openness. Through cooperation between the government and numerous open data communities, Taiwan was ranked No. 1 in the 2015 Open Data Index by the Open Knowledge Foundation. The Taiwan government open data platform (data.gov.tw) now stores over 17,281 data sets from more than 66 central government and 26 local government departments. The data sets cover government spending, national statistics, procurement tenders, the national map, legislation, pollutant emissions, election results, the company register, the government budget, water quality and weather information, and they provide an enormous resource for citizen participation. Overall, the ultimate goal of making the government more transparent and open is to promote citizen participation and continued oversight of the government's practice.
Growing aged population in Taiwan
Exploring the Taiwan government open data shows that the growing aged population is an emerging and serious public issue. According to national statistics from the platform, the average share of the population over 65 years old in Taiwan increased from 11% in 2010 to 13% in 2015, which indicates an aged society. The share of the population over 65 years old ranges from 10% to 17% across Taiwan's districts. A National Development Council report showed that by 2060 Taiwan may have the second-highest aged population. Making policy that responds to this population change will be a priority and also a great challenge.
Age-friendly City
A rapidly aging population is not only a public problem for Taiwan but also a global public health issue. The WHO has made a great effort to address global aging and has established a series of campaigns promoting the concept of building age-friendly cities. According to the WHO age-friendly city report, "An age-friendly city is an inclusive and accessible community environment that optimizes opportunities for health, participation and security, in order that quality of life and dignity are ensured as people age." Eight aspects contribute to comfortable living for older people (outdoor spaces, transportation, housing, social participation, social respect, civic participation, communication, health and community support), and indicators that fit these aspects can be used to assess the age-friendly status of a city. With these concepts, we can take advantage of the rich Taiwan government open data to create an index that reflects the age-friendly status of the different districts in Taiwan.
Composition of the age-friendly index
Equity measures
✪ the income difference between families
✪ the difference between family structures
Age-friendly environment outcomes
Accessible Physical Environment
✪ neighbourhood walkability
✪ accessibility of public spaces and buildings
✪ accessibility of public transportation vehicles
✪ accessibility of public transportation stops
✪ affordability of housing
Inclusive Social Environment
✪ positive social attitude toward old people
✪ engagement in volunteer activity
✪ engagement in paid employment
✪ engagement in socio-cultural activity
✪ participation in local decision-making
✪ availability of information
✪ availability of health and social services
✪ economic security
Impact on wellbeing
✪ quality of life
Workflow
Taiwan government open data
Data gathering (readr, dplyr)
1. Data input
2. Data normalization: $x'_{ij} = (x_{ij} - \min(x_j)) / (\max(x_j) - \min(x_j))$
3. Data transformation: $1 - x'_{ij}$ for negatively correlated indicators
Data (maptools, rgeos, broom, tidyr)
4. Input shapefile (Taiwan area)
5. Subset the region
6. Smooth the boundary
7. Manually fix the centroids
Map (ggmap)
8. Use the Stamen map
Integration
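The normalization and transformation steps of the workflow can be sketched in base R as follows. This is a minimal illustration, not the poster's actual code: the function names, the district labels and the indicator values are all hypothetical, and mapping a constant indicator to 0 is an assumption made here to avoid division by zero.

```r
# Min-max normalization of one indicator column j across districts i:
# (x_ij - min(x_j)) / (max(x_j) - min(x_j)), giving values in [0, 1].
normalize_minmax <- function(x) {
  rng <- range(x, na.rm = TRUE)
  if (rng[1] == rng[2]) {
    # Assumption: a constant indicator cannot rank districts, so map it to 0.
    return(rep(0, length(x)))
  }
  (x - rng[1]) / (rng[2] - rng[1])
}

# For indicators where a higher raw value means a less age-friendly status,
# flip the normalized score so that 1 is always the favourable end.
invert_negative <- function(x_norm) {
  1 - x_norm
}

# Hypothetical example data: district names and values are made up.
districts <- data.frame(
  district = c("A", "B", "C"),
  walkability = c(2, 4, 6),            # higher is better
  housing_cost_ratio = c(10, 30, 20)   # higher is worse
)

districts$walkability_idx <- normalize_minmax(districts$walkability)
districts$housing_idx <- invert_negative(
  normalize_minmax(districts$housing_cost_ratio)
)

print(districts)
```

After both steps every indicator lies between 0 and 1 with the same orientation, which is what allows indicators measured in different units to be combined into one index.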
Project Github URL
References
1. WHO (2015). Measuring the age-friendliness of cities: A guide to using core indicators.
2. 2016 Transportation Annual Statistics, Ministry of Transportation and Communications, R.O.C.
3. 2013 Statistic of General Health and Welfare, Ministry of Health and Welfare.
4. 2016 Monthly Ratio of House Cost and Income, Department of Land Administration, M.O.I.
5. R Core Team (2016). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria.
6. Hadley Wickham and Romain Francois (2015). readr: Read Tabular Data. R package version 0.2.2.
7. Hadley Wickham and Romain Francois (2015). dplyr: A Grammar of Data Manipulation. R package version 0.4.3.
8. D. Kahle and H. Wickham. ggmap: Spatial Visualization with ggplot2. The R Journal, 5(1), 144-161.
9. Roger Bivand and Nicholas Lewin-Koh (2016). maptools: Tools for Reading and Handling Spatial Objects. R package version 0.8-39.
10. Roger Bivand and Colin Rundel (2016). rgeos: Interface to Geometry Engine - Open Source (GEOS). R package version 0.3-19.
11. Roger Bivand, Tim Keitt and Barry Rowlingson (2016). rgdal: Bindings for the Geospatial Data Abstraction Library. R package version 1.1-10.
12. H. Wickham. ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag New York, 2009.
Acknowledgement
1.DSP
2.Open Culture Foundation
[Figure panels: AgeFriendlyCity: Physical Environment; AgeFriendlyCity: Social Environment; AgeFriendlyCity: Equity; AgeFriendlyCity: Quality of Life]
10. conference
keynote
lightning talk
contributed talk
Poster talk
sponsor talk
18
Statistics Method
Performance
Kaleidoscope
Case Study
Bioinformatics
Big data
R & Other languages
Regression
5min
2 hours
15. useR!2016
Tutor
Sponsor: H2O data company
Develop: SparkR, R Markdown
Online Course Tutor: DataCamp Garrett
Statistics Book Author: Max Kuhn
3 hr per workshop
RStudio Server+AWS service
Github note
Notebook
counter
16. useR!2016
Keynote
Richard Becker Donald Knuth Deborah Nolan Hadley Wickham
Literature
Programming/TeX
R S
How to teach Data science
in Berkeley
Hadley“verse”
in R world
17. useR!2016
Keynote
Richard Becker
R S
Why has the S language stayed around for 40 years?
1.
2. S
3.
4.
5. unix
AT&T
R S
1970 1991
R
Along came Ross Ihaka & Robert Gentleman