Metadata Standards in CKAN for Biodiversity Pilot in NextGEOSS
1. Metadata Standards in CKAN for Biodiversity Pilot
Andrew Skidmore (a), Elnaz Neinavaz (a), Roshanak Darvish (a), Sander Mucher (b), Wouter Meijninger (b), Stephan Hennekes (b)
(a) Dept. of Natural Resources, Faculty of Geo-Information Science and Earth Observation (ITC), University of Twente, Enschede, The Netherlands
(b) Alterra, Wageningen UR, Wageningen, The Netherlands
5th March 2018
2. Pilot 6.2.1 Biodiversity
Subtask 6.2.1
Demonstrate the value of a European data hub for the creation of RS-EBVs (remote sensing-enabled essential biodiversity variables), leading to a GEO hub for EBVs that links the key policy/user network groups (GEOBON, CBD and IPBES) with the space agencies (via CEOS).
Pilot status with respect to integration with the NextGEOSS European Data hub & Platform
The NextGEOSS data hub will be populated with existing and new RS-enabled EBVs.
3. EBVs proposed by:
Skidmore, A. K., & Pettorelli, N. (2015). Agree on biodiversity metrics to track from space: ecologists and space agencies must forge a global monitoring strategy. Nature, 523(7561), 403-406.
Leaf area index
Ecosystem distribution
Inundation
Fragmentation heterogeneity
Species occurrence
Plant traits
Fire occurrence
Vegetation phenology
Land cover
Vegetation height
Identification of available RS-EBV products
Observation: satellite, in situ measurement
4. Identification of available RS-EBV products considering the resolution and scales
EBVs (extent, resolution):
Species occurrence: NL
Species distribution: Europe
Vegetation height: NL, point density 8 per m² for AHN-3
Land cover: Europe, 20-1000 m resolution; Global, 300 m resolution
Fire occurrence: Global, 250-1000 m resolution
Phenology: Global, 250-1000 m resolution
Leaf area index: Global, 300-1000 m resolution
Net primary productivity: Global, 500-1000 m resolution
Satellite: MODIS/Terra, MODIS/Aqua, SPOT-VGT, PROBA-V
Most available RS-EBVs are produced using MODIS and SPOT data.
[Figure: LAI, MODIS and SPOT]
5. NextGEOSS European Data hub and RS-EBVs
Raw data: Sentinel-2, high resolution data
Platform:
Pre-processing of the data: atmospheric correction, resampling
Applying algorithms to predict EBVs: univariate, multivariate, machine learning approaches
Visualize output in map
User
Study area in biodiversity pilot (NextGEOSS): The Netherlands; Bavarian Forest National Park (Germany)
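Two of the steps named on this slide, resampling and a univariate prediction, can be illustrated with a minimal pure-Python sketch. It is not the pilot's implementation: the block-mean resampling method, the linear model form, the function names and all numeric values below are assumptions chosen only for illustration.

```python
def resample_mean(grid, factor):
    """Downsample a 2D grid by averaging non-overlapping factor x factor blocks."""
    rows, cols = len(grid), len(grid[0])
    if rows % factor or cols % factor:
        raise ValueError("grid dimensions must be multiples of factor")
    out = []
    for r in range(0, rows, factor):
        out_row = []
        for c in range(0, cols, factor):
            block = [grid[r + i][c + j]
                     for i in range(factor) for j in range(factor)]
            out_row.append(sum(block) / len(block))
        out.append(out_row)
    return out


def fit_univariate(xs, ys):
    """Ordinary least squares fit of y = a + b * x; returns (a, b)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b = sxy / sxx
    a = mean_y - b * mean_x
    return a, b


def predict_map(grid, a, b):
    """Apply the fitted univariate model to every cell of a grid."""
    return [[a + b * value for value in row] for row in grid]


# Hypothetical 4x4 grid of a spectral index (not real Sentinel-2 data).
index_grid = [
    [0.2, 0.2, 0.6, 0.6],
    [0.2, 0.2, 0.6, 0.6],
    [0.4, 0.4, 0.8, 0.8],
    [0.4, 0.4, 0.8, 0.8],
]

# Hypothetical training pairs: index value vs. field-measured EBV value.
train_index = [0.2, 0.4, 0.6, 0.8]
train_ebv = [1.0, 2.0, 3.0, 4.0]

coarse = resample_mean(index_grid, 2)          # resampling step
a, b = fit_univariate(train_index, train_ebv)  # univariate model
ebv_map = predict_map(coarse, a, b)            # gridded prediction for mapping
```

The multivariate and machine learning approaches listed on the slide would replace `fit_univariate` and `predict_map` with a model that takes several predictors per cell; the resampling step would stay the same.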
6. Metadata Standard Schemas and EBVs
Element: Definition
Contributor: An entity responsible for making contributions to the resource
Coverage: The spatial or temporal topic of the resource, the spatial applicability of the resource, or the jurisdiction under which the resource is relevant
Creator: An entity primarily responsible for making the resource
Date: A point or period of time associated with an event in the lifecycle of the resource
Description: An account of the resource
Format: The file format, physical medium or dimensions of the resource
Identifier: An unambiguous reference to the resource within a given context
Language: A language of the resource
Publisher: An entity responsible for making the resource available
Relation: A related resource
Rights: Information about rights held in and over the resource
Source: A related resource from which the described resource is derived
Subject: The topic of the resource
Title: A name given to the resource
Type: The nature or genre of the resource
Input (raw data), with metadata
Modelling
Output (in map), with metadata
Metadata standards: Dublin Core, GeoDCAT, ISO 19115, …
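As a rough illustration of how the fifteen elements in the table above might travel with an EBV product into a CKAN catalogue, the sketch below builds a Dublin Core record as a plain dictionary, reports which elements are still empty, and maps the record onto a CKAN-style dataset dictionary. The mapping (title, description and identifier to the core fields, everything else to `extras` with a `dc:` prefix), the function names and the example record are assumptions for illustration, not the pilot's actual schema.

```python
# The fifteen Dublin Core elements listed in the table.
DUBLIN_CORE_ELEMENTS = (
    "contributor", "coverage", "creator", "date", "description",
    "format", "identifier", "language", "publisher", "relation",
    "rights", "source", "subject", "title", "type",
)


def missing_elements(record):
    """Return the Dublin Core elements that are absent or empty in record."""
    return [e for e in DUBLIN_CORE_ELEMENTS if not record.get(e)]


def to_ckan_dataset(record):
    """Map a Dublin Core record to a CKAN-style dataset dictionary.

    Assumed mapping: identifier -> name, title -> title,
    description -> notes; all other filled elements become key/value extras.
    """
    core = {"title", "description", "identifier"}
    extras = [
        {"key": "dc:" + element, "value": str(record[element])}
        for element in DUBLIN_CORE_ELEMENTS
        if element not in core and record.get(element)
    ]
    return {
        "name": record["identifier"],
        "title": record["title"],
        "notes": record["description"],
        "extras": extras,
    }


# Hypothetical record; every value here is a placeholder.
example = {
    "identifier": "lai-example-dataset",
    "title": "Leaf area index (illustrative record)",
    "description": "Placeholder description of an RS-EBV product.",
    "creator": "Example creator",
    "subject": "Leaf area index",
    "coverage": "The Netherlands",
    "format": "GeoTIFF",
    "type": "Dataset",
}

dataset = to_ckan_dataset(example)
gaps = missing_elements(example)
print(dataset["name"])
print(gaps)
```

Keeping the Dublin Core keys in `extras` leaves the core CKAN fields untouched, so the same record could later be re-expressed in GeoDCAT or ISO 19115 terms without changing how the dataset is listed.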