The iMarine initiative provides a data infrastructure aimed at facilitating open access, the sharing of data, collaborative analysis, processing and mining processing, as well as the dissemination of newly generated knowledge. The iMarine data infrastructure is developed to support decision making in high-level challenges that require policy decisions typical of the ecosystem approach. The iMarine offering can be articulated in six bundles. A “bundle” is a set of services and technologies grouped according to a family of related tasks for achieving a common objective. Bundles can be customized and/or enriched into flexible, purpose-built Virtual Research Environments (VRE). Virtual research environments offer flexible and secure web-based, community-centric platforms, so researchers can work together on common challenges. Each VRE in the infrastructure is tightly integrated with the underlying gCube enabling software, and can access and re-purpose data from other iMarine applications.
Presentation by Scott Taylor, President, Taylor Environmental Services, on the latest changes in environmental regulations for the asphalt industry in California. Taylor is co-chairman of the CalAPA Environmental Committee. Presentation delivered at the CalAPA Spring Asphalt Pavement Conference, held March 20-21, 2019 in Ontario, CA.
Caltrans Senior Engineer Jacquelyn Wong discusses the use of Environmental Product Declarations on Caltrans asphalt pavement projects. Presentation delivered at the CalAPA Spring Asphalt Pavement Conference, held March 20-21, 2019 in Ontario, CA.
Presentation by Scott Taylor, President, Taylor Environmental Services, on the latest changes in environmental regulations for the asphalt industry in California. Taylor is co-chairman of the CalAPA Environmental Committee. Presentation delivered at the CalAPA Spring Asphalt Pavement Conference, held March 20-21, 2019 in Ontario, CA.
Caltrans Senior Engineer Jacquelyn Wong discusses the use of Environmental Product Declarations on Caltrans asphalt pavement projects. Presentation delivered at the CalAPA Spring Asphalt Pavement Conference, held March 20-21, 2019 in Ontario, CA.
Integrating Heterogeneous and Distributed Information about Marine Species th...iMarine283644
On the 21st of November 2013, Yannis Tzitzikas, FORTH, presented the Integrating heterogeneous and distributed information about marine species through a top level ontology paper at the 7th Metadata and Semantic Research Conference in Thessaloniki, Greece.
A step into the future of iMarine: The iMarine Public-centred Partnership Bus...iMarine283644
Presentation by Marc Taconet - FAO-FI, Chief Fisheries Statistics and Information Branch (FIPS) & iMarine Board Chair, Patricio Bernal - IUCN High Seas Initiatives and Hervé Camount - Terradue, Program Manager on the sustainability plan of the iMarine initiative
Marine Knowledge Meeting, 11-12 Oct 2012, Brussels: All About iMarine iMarine283644
iMarine is empowering users in the marine community and beyond by providing a highly efficient e-Infrastructure to accelerate data discovery, exchange, and analysis, tools and platforms that facilitates scientific discovery. Funded by the European Commission's 7th Framework Programme, a number of iMarine services are already available through the iMarine Gateway supplying cross disciplinary data supporting experts in the field.
This is the device Klassmate which I have developed for classroom teaching. Which costs 70000 INR and is portable interactive whiteboard. with Microsoft Kinect sensor.
What does a Platform mean nowadays?
▪ A lever of Web and Cloud technologies
▪ A business model for value co-creation
▪ A framework to bring innovation to new or larger communities
Progetto INNO ed esempi di applicazioni nel campo della GEOMATICA - P.CauSardegna Ricerche
La presentazione del Progetto INNO a cura di Pierluigi Cau, in occasione dell'evento "Bonifiche ambientali e potenzialità delle imprese" che si è tenuto a Cagliari il 7 novembre 2014.
AWS re:Invent 2016: Earth on AWS—Next-Generation Open Data Platforms (STG203)Amazon Web Services
Making earth observation data available by using Amazon S3 is accelerating scientific discovery and enabling the creation of new products. Attend and learn how the scale and performance of Amazon S3 lets earth scientists, researchers, startups, and GIS professionals gather and analyze planetary-scale data without worrying about limitations of bandwidth, storage, memory, or processing power. Learn how AWS is being used to combine satellite imagery, social data, and telemetry data to produce new products and services. Learn also how Amazon S3 provides much more than storage, and how an open geospatial data lake on Amazon S3 can be used as the basis for planetary-scale applications built with Amazon EMR, Amazon API Gateway, and AWS Lambda. As part of this talk, AWS customer Digital Globe demonstrates how they use open data stored in S3 to distribute high-resolution satellite imagery to their customers around the world.
National Archives of Australia. AVAMS Project Achievements August 2014Rose Holley
An overview of the achievements of the AVAMS project at the National Archives of Australia. The project implemented an audiovisual collection management system and an audiovisual digital preservation system using Mediaflex.
Building on iMarine for fostering Innovation, Decision making, Governance and...Blue BRIDGE
BlueBRIDGE - Building Research environments fostering Innovation, Decision making, Governance and Education - is funded under H2020 and provides data services to scientists, researchers and data managers delivering a solid foundation for informed advice to competent authorities. A complete set of web-based data and computational resources will enable them to address key challenges related to the Blue Growth long term strategy with a strong focus on sustainable growth. BlueBRIDGE services will be built on top of the iMarine infrastructure (www.i-marine.eu) in order to capitalize on the previous investments made by the European Commission and as a first step towards their sustainability after the end of the project. www.bluebridge-vres.eu | @BlueBridgeVREs
Integrating Heterogeneous and Distributed Information about Marine Species th...iMarine283644
On the 21st of November 2013, Yannis Tzitzikas, FORTH, presented the Integrating heterogeneous and distributed information about marine species through a top level ontology paper at the 7th Metadata and Semantic Research Conference in Thessaloniki, Greece.
A step into the future of iMarine: The iMarine Public-centred Partnership Bus...iMarine283644
Presentation by Marc Taconet - FAO-FI, Chief Fisheries Statistics and Information Branch (FIPS) & iMarine Board Chair, Patricio Bernal - IUCN High Seas Initiatives and Hervé Camount - Terradue, Program Manager on the sustainability plan of the iMarine initiative
Marine Knowledge Meeting, 11-12 Oct 2012, Brussels: All About iMarine iMarine283644
iMarine is empowering users in the marine community and beyond by providing a highly efficient e-Infrastructure to accelerate data discovery, exchange, and analysis, tools and platforms that facilitates scientific discovery. Funded by the European Commission's 7th Framework Programme, a number of iMarine services are already available through the iMarine Gateway supplying cross disciplinary data supporting experts in the field.
This is the device Klassmate which I have developed for classroom teaching. Which costs 70000 INR and is portable interactive whiteboard. with Microsoft Kinect sensor.
What does a Platform mean nowadays?
▪ A lever of Web and Cloud technologies
▪ A business model for value co-creation
▪ A framework to bring innovation to new or larger communities
Progetto INNO ed esempi di applicazioni nel campo della GEOMATICA - P.CauSardegna Ricerche
La presentazione del Progetto INNO a cura di Pierluigi Cau, in occasione dell'evento "Bonifiche ambientali e potenzialità delle imprese" che si è tenuto a Cagliari il 7 novembre 2014.
AWS re:Invent 2016: Earth on AWS—Next-Generation Open Data Platforms (STG203)Amazon Web Services
Making earth observation data available by using Amazon S3 is accelerating scientific discovery and enabling the creation of new products. Attend and learn how the scale and performance of Amazon S3 lets earth scientists, researchers, startups, and GIS professionals gather and analyze planetary-scale data without worrying about limitations of bandwidth, storage, memory, or processing power. Learn how AWS is being used to combine satellite imagery, social data, and telemetry data to produce new products and services. Learn also how Amazon S3 provides much more than storage, and how an open geospatial data lake on Amazon S3 can be used as the basis for planetary-scale applications built with Amazon EMR, Amazon API Gateway, and AWS Lambda. As part of this talk, AWS customer Digital Globe demonstrates how they use open data stored in S3 to distribute high-resolution satellite imagery to their customers around the world.
National Archives of Australia. AVAMS Project Achievements August 2014Rose Holley
An overview of the achievements of the AVAMS project at the National Archives of Australia. The project implemented an audiovisual collection management system and an audiovisual digital preservation system using Mediaflex.
Building on iMarine for fostering Innovation, Decision making, Governance and...Blue BRIDGE
BlueBRIDGE - Building Research environments fostering Innovation, Decision making, Governance and Education - is funded under H2020 and provides data services to scientists, researchers and data managers delivering a solid foundation for informed advice to competent authorities. A complete set of web-based data and computational resources will enable them to address key challenges related to the Blue Growth long term strategy with a strong focus on sustainable growth. BlueBRIDGE services will be built on top of the iMarine infrastructure (www.i-marine.eu) in order to capitalize on the previous investments made by the European Commission and as a first step towards their sustainability after the end of the project. www.bluebridge-vres.eu | @BlueBridgeVREs
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, RomeCarole Goble
Workflow systems support the design, configuration and execution of repetitive, multi-step pipelines and analytics, well established in many disciplines, notably biology and chemistry, but less so in biodiversity and ecology. From an experimental perspective workflows are a means to handle the work of accessing an ecosystem of software and platforms, manage data and security, and handle errors. From a reporting perspective they are a means to accurately document methodology for reproducibility, comparison, exchange and reuse, and to trace the provenance of results for review, credit, workflow interoperability and impact analysis. Workflows operate in an evolving ecosystem and are assemblages of components in that ecosystem; their provenance trails are snapshots of intermediate and final results. Taking a lifecycle perspective, what are the challenges in workflow design and use with different stakeholders? What needs to be tackled in evolution, resilience, and preservation? And what are the “mitigate or adapt” strategies adopted by workflow systems in the face of changes in the ecosystem/environment, for example when tools are depreciated or datasets become inaccessible in the face of funding shortfalls?
Navigating the Marine Geophysical Data Life CycleVicki Ferrini
I gave this presentation at the University of New Hampshire's Center for Coastal and Ocean Mapping on April 18, 2014 describing the marine geophysical data life cycle and a variety of resources available to help investigators navigate the world of data management, as well as efforts focused on optimizing high-quality publicly available data.
A CMS based Geoportal targeted to manage information related to water resource management projects, powered with a full FOSS stack. A first application of the Geoportal is on the case study of Red Thai Binh River in Vietnam.
A FOSS approach to Integrated Water Resource Management. The case study of Re...Carolina Arias Muñoz
C.Arias, M.Brovelli, S.Corti,
M. Micotti, R. Soncini-Sessa and E. Weber
http://geomatica.como.polimi.it/workbooks/n12/FOSS4G-eu15_submission_100.pdf
http://www.slideshare.net/NRMPolimi/foss4-g2015-ariasmicotti
iMarine data e-infrastructure: Data access, harmonization, analysis, and mana...iMarine283644
On the 22 July 2014, OpenChannels.org and the EBM Tools Network, two of the premier sources of information about coastal and marine planning and management tools in the United States and internationally, hosted the iMarine webinar: iMarine Data e-Infrastructure Initiative for Fisheries Management and Conservation of Marine Living Resources.
The webinar focused on the presentation of the iMarine initiative and its powerful data e-infrastructures and services, followed by a presentation of a set of use cases related to Geospatial Analysis, Ecology, Biodiversity and Life History Traits. The presentations were given by Pasquale Pagano, CNR-ISTI and iMarine Technical Director and Gianpaolo Coro, CNR-ISTI. Watch the video of the webinar here https://www.youtube.com/watch?v=lgf30BPyBbk
In computational statistics, algorithms often have specialized implementations that address very specific problems. Every so often, these algorithms are applicable also to other problems than the original ones. Today, interest is growing towards modular and pluggable solutions that enable the repetition and validation of the experiments made by other scientists and allow the exploitation of those algorithms in other contexts. Furthermore, such procedures are requested to be remotely hosted and to “hide” the complexity of the calculations, managed by remote computational infrastructures behind the scenes. For such reasons, the usual solution of supplying modular software libraries containing implementations of algorithms is leaving the place to Web Services accessible through standard protocols and hosting such implementations. The protocols describing the computational capabilities of these Services are more and more elaborate, so that modular workflows can rely on them.
Part 1 - What is a data e-infrastructure?
Part 2 - Serving policy frameworks facing BIG challenges
Part 3 - The power of an e-Infrastructure - Synergies and efficiencies through Global collaboration communities
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
JMeter webinar - integration with InfluxDB and GrafanaRTTS
Watch this recorded webinar about real-time monitoring of application performance. See how to integrate Apache JMeter, the open-source leader in performance testing, with InfluxDB, the open-source time-series database, and Grafana, the open-source analytics and visualization application.
In this webinar, we will review the benefits of leveraging InfluxDB and Grafana when executing load tests and demonstrate how these tools are used to visualize performance metrics.
Length: 30 minutes
Session Overview
-------------------------------------------
During this webinar, we will cover the following topics while demonstrating the integrations of JMeter, InfluxDB and Grafana:
- What out-of-the-box solutions are available for real-time monitoring JMeter tests?
- What are the benefits of integrating InfluxDB and Grafana into the load testing stack?
- Which features are provided by Grafana?
- Demonstration of InfluxDB and Grafana using a practice web application
To view the webinar recording, go to:
https://www.rttsweb.com/jmeter-integration-webinar
PHP Frameworks: I want to break free (IPC Berlin 2024)Ralf Eggert
In this presentation, we examine the challenges and limitations of relying too heavily on PHP frameworks in web development. We discuss the history of PHP and its frameworks to understand how this dependence has evolved. The focus will be on providing concrete tips and strategies to reduce reliance on these frameworks, based on real-world examples and practical considerations. The goal is to equip developers with the skills and knowledge to create more flexible and future-proof web applications. We'll explore the importance of maintaining autonomy in a rapidly changing tech landscape and how to make informed decisions in PHP development.
This talk is aimed at encouraging a more independent approach to using PHP frameworks, moving towards a more flexible and future-proof approach to PHP development.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
State of ICS and IoT Cyber Threat Landscape Report 2024 previewPrayukth K V
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio, cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio also runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors, and newer malware including new variants and latent threats that are at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
Search and Society: Reimagining Information Access for Radical FuturesBhaskar Mitra
The field of Information retrieval (IR) is currently undergoing a transformative shift, at least partly due to the emerging applications of generative AI to information access. In this talk, we will deliberate on the sociotechnical implications of generative AI for information access. We will argue that there is both a critical necessity and an exciting opportunity for the IR community to re-center our research agendas on societal needs while dismantling the artificial separation between the work on fairness, accountability, transparency, and ethics in IR and the rest of IR research. Instead of adopting a reactionary strategy of trying to mitigate potential social harms from emerging technologies, the community should aim to proactively set the research agenda for the kinds of systems we should build inspired by diverse explicitly stated sociotechnical imaginaries. The sociotechnical imaginaries that underpin the design and development of information access technologies needs to be explicitly articulated, and we need to develop theories of change in context of these diverse perspectives. Our guiding future imaginaries must be informed by other academic fields, such as democratic theory and critical theory, and should be co-developed with social science scholars, legal scholars, civil rights and social justice activists, and artists, among others.
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 3. In this session, we will cover desktop automation along with UI automation.
Topics covered:
UI automation Introduction,
UI automation Sample
Desktop automation flow
Pradeep Chinnala, Senior Consultant Automation Developer @WonderBotz and UiPath MVP
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
Let's dive deeper into the world of ODC! Ricardo Alves (OutSystems) will join us to tell all about the new Data Fabric. After that, Sezen de Bruijn (OutSystems) will get into the details on how to best design a sturdy architecture within ODC.
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”.
All of this illustrated with link prediction over knowledge graphs, but the argument is general.
4. Application Bundles
Management and interpretation of biological and
ecological data in the environment
Complete full life-cycle data framework, from
observational data to aggregated data repositories
enriched with validation and analytical tools
Storage and interpretation of geospatial explicit
information, including WPS processing
Flexible sharing, storage, reporting, search and
retrieval, aggregation and projection facilities
iMarine Products and services delivery
A BUNDLE is
a set of
services and
technologie
s grouped
according to
a family of
related
tasks for ac
hieving a
common
objective
5. A fraction of the products and services belonging to BiolCube
PRODUCTS AND SERVICES
DEVELOPMENT PROGRESS REPORT
iMarine Products and services delivery
6. Species Data Discovery
Search for multiple species
Search across several data providers
Search for all occurrences of a set of species and their synonyms
Search occurrences for all species belonging a taxon group
iMarine Products and services delivery
7. Species Data Discovery
Search in GBIF all the occurrences about 'sarda sarda' and its synonyms found in WoRMS
• SEARCH BY SN 'sarda sarda' EXPAND WITH WoRMS IN GBIF RETURN Occurrence
Search in CoL all the Taxa about 'sarda sarda' and its synonyms found in WoRMS
• SEARCH BY SN 'sarda sarda' EXPAND WITH WoRMS IN CoL RETURN TAXON
Search all occurrences for the species commonly recognized as 'shark' in WoRMS and their
synonyms as recognized by CoL. Accept only the results with coordinate less or equals to
(15.12, 16.12).
• SEARCH BY CN 'shark' RESOLVE WITH WoRMS EXPAND WITH CoL WHERE coordinate <= 15.12, 16.12 RETURN
Occurrence
Search in OBIS all the occurrences for 'sarda sarda' and 'Carcharodon carcharias' expanded
with synonyms from WoRMS and CoL. Accept only the results with an event date between
2000 and 2005.
• SEARCH BY SN 'sarda sarda', 'Carcharodon carcharias' EXPAND WITH WoRMS, CoL IN OBIS WHERE eventDate >=
'2000' AND eventDate <= '2005' RETURN Occurrence
iMarine Products and services delivery
8. Occurrence Points
Occurrence Data from GBIF
Occurrence Data from Obis
∩
ᴜ
-
Intersection
Union
Difference
DD
Duplicates Deletion
A
B
x,y
x,y
Records
Event Date
Event Date
Modif Date
Modif Date
Similarity
Author
Species Scientific Name
Author
Species Scientific Name
iMarine Products and services delivery
9. Similarity between habitats
Habitat Representativeness Score:
1.
Measures the similarity between the environmental features of two areas
2. Assesses the quality of models and environmental features
Latimeria chalumnae
HRS=10.5
Habitat
Representativeness
Score
iMarine Products and services delivery
10. BiOnym
Raw Input String.
E.g. Gadus morua Lineus 1758
Reference
Source
(ASFIS)
Preprocessing
And
Parsing
Reference
Source
(FISHBASE)
Reference
Source
(Other in
DwC-A)
Reference
Source
(OBIS)
Taxon
Matcher 1
Taxon
Matcher 2
A flexible workflow approach to
taxon name matching
Accounts for:
• Variations in the spelling and
interpretation of taxonomic
names
• Combination of data from
different sources
• Harmonization and reconciliation
of Taxa names
Taxon
Matcher n
PostProcessing
Correct Transcriptions:
E.g. Gadus morhua (Linnaeus, 1758)
iMarine Products and services delivery
11. Trendylyzer - Scope
• Fill some knowledge gaps on marine
species
• Account for sampling biases
• Define trends for common species
We focus on the OBIS database
Is the Fulmar losing its common
species status among the
seabirds?
Herring recovered after the fish ban
Can we recognize big changes in
species presence?
Plankton regime shift
iMarine Products and services delivery
14. Trendylyzer – Observation ranks on Marine Ecoregions of the World
iMarine Products and services delivery
15. Length-Weight Relationships
Objective:
Calculate the a and b parameters for several
species.
Requirements:
Account for...
• Many studies about a single species
• Single study
• Use existing studies to inform new studies
bluewatermag.com.au
Solution:
Combine existing knowledge with new data by
means of Bayesian methods.
Approach:
Collaborative development with the
‘stakeholder’
Integration of R Scripts
Usage of Cloud computing for R Scripts
iMarine Products and services delivery
16. LWR - Performance
The porting to the D4Science Statistical Manager allowed to run the
scripts in distributed fashion
The original time of the scientist’s procedure was 20 days
After the optimization on our R development machines the time of
the sequential run was reduced to 10 days
The timing on the Statistical Manager was of 11 hours!
Time reduction of 95.4%
The script has been run periodically and currently solves LWR for
37 234 species
iMarine Products and services delivery
17. A fraction of the products and services belonging to StatsCube
PRODUCTS AND SERVICES
DEVELOPMENT PROGRESS REPORT
iMarine Products and services delivery
18. Tabular Data Manager
Complete new application for the management
of data workflow. It allows to *manage* *flow of
data* and to create report out of the
management activities.
• flow of data: dataset compliant with a template
that are generated and updated in chunks.
• manage: import, store, transform, validate,
access, analyze, visualize, and export.
iMarine Products and services delivery
19. Tabular Data Manager: Templates
• A table template defines:
– Table definition
– Columns definition
– A set of table transformations
– A set of validation procedures
• Can be applied to any dataset
• Can be modified and shared among people
iMarine Products and services delivery
20. Tabular Data Manager: Menu
Ribbon style menu
Buttons behavior depends
on current document
Alt messages on
mouseover
iMarine Products and services delivery
23. 330 Cores Currently Allocated
Infrastructure: Computing as Service
Hadoop
• MapReduce
Statistical
Manager
• Analysis/clustering/modeling
R clusters
• Windows and Linux
I-MARINE EXTENDED BOARD
23
24. A fraction of the products and services belonging to GeosCube
PRODUCTS AND SERVICES
DEVELOPMENT PROGRESS REPORT
iMarine Products and services delivery
25. Rasterization
A polygonal map is
transformed into a raster
map or into a point map
iMarine Products and services delivery
29. Environmental Enrichment: Approach
• (Oozie)workflow to optimize the processing chain:
– Extract occurrences for the Carcharodon carcharias (White
Shark) for a given time of interest
– Apply the dbscan algorithm (R implementation) to identify
geospatial clusters
– Create bounding boxes around the clusters
– Use the bounding boxes as queryables for the WCS request
– Apply BEAM Pixel Extraction (same algorithm as BioOracle
environmental enrichment service)
– Create the time series
– Visualize the time series
iMarine Products and services delivery
31. SPREAD
• Interactive investigation process for statisticians &
scientists to confront data from different domains
(e.g. Statistics vs. GIS data) and batch process of data
reallocations hypothesis
DATA IMPORT / CURATION
Estimates dataset
by EEZ – high seas
Catch dataset
by FAO area
FAO Areas
GIS DATA DISCOVERY,
SEARCHING & SHARING
Available
Target Areas
DATA SELECTION
(e.g. Filter)
Geographic intersection
FAO Areas / EEZs – Highs seas
REALLOCATION
Species
distributions
iMarine Products and services delivery
32. Legacy Processes (IRD)
• iX Catches per Species: per Ocean / Area, per
Fishing Gear type, per Month / Year, and kernel
density for biodiversity / ecological datasets
(IRD+OBIS+GBIF)
20°N
10°N
0
10°S
20°S
30°S
30°E
50°E
70°E
90°E
110°E
iMarine Products and services delivery
33. A fraction of the products and services belonging to ConnectCube
PRODUCTS AND SERVICES
DEVELOPMENT PROGRESS REPORT
iMarine Products and services delivery
34. MarineTLO
Version 3.0.0
Version 2.0.0
–
–
–
–
–
–
–
–
Species
Scientific Name of Species
FAO Species Code
IRD Species Code
WoRMS Species Code
Predators and Prey
Competitors
Biological Classification of Species
(e.g. WoRMS)
–
–
–
–
–
–
–
–
–
–
–
–
–
–
MarineTLO Version 2.0.0
Water Areas
Species connected to Water Areas
Countries
Countries connected to Water Aras
Species connected to Countries
Ecosystems
Ecosystems connected to Countries
Species connected to Ecosystems
Exclusive Economical Zones
Fishing Gears
Fishing Vessels
More species and more Predators
Common Names of Species
iMarine Products and services delivery
34
35. Requirements as Competency Queries
#Query For a scientific name of a species (e.g. Thunnus Albacares or Poromitra Crassiceps),
find/give me
Q1
the biological environments (e.g. ecosystems) in which the species has been introduced and more
general descriptive information of it (such as the country)
Q2
its common names and their complementary info (e.g. languages and countries where they are
used)
Q3
Q4
Q5
Q6
the water areas and their FAO codes in which the species is native
the countries in which the species lives
the water areas and the FAO portioning code associated with a country
the presentation w.r.t Country, Ecosystem, Water Area and Exclusive Economical Zone (of the
water area)
Q7
the projection w.r.t. Ecosystem and Competitor, providing for each competitor the identification
information (e.g. several codes provided by different organizations)
Q8
a map w.r.t. Country and Predator, providing for each predator both the identification information
and the biological classification
Q9
who discovered it, in which year, the biological classification, the identification information, the
common names - providing for each common name the language, the countries where it is used
in.
iMarine Products and services delivery
35
36. The MarineTLO-based warehouse Evolution
RDF
Triple Store
TLOMarine
FLOD2TLOm
apping
ECOSCOPE2TLO
mapping
WoRMS2TLO
mapping
DBpediaS2TLO
mapping
FB2TLO
mapping
FLOD
ECOSCOPE
WoRMS
DBpedia
Fishbase
Copy
FLOD
By FAO
Copy
ECOSCOPE
By IRD
Copy
WoRMS
(part)
Generated by SPD
&TLO wrapper
Copy
DBpedia
(part)
By DBpedia
SPARQL Endpoint
iMarine Products and services delivery
Copy
Fishbase
(part)
By Fishbase
RDMS
37. Warehouse V3
Concepts
Ecoscope
FLOD
WoRMS DBpedia Fishbase
Species
Scientific Names
Authorships
Common Names
Predators
Ecosystems
Countries
Water Areas
Vessels
Gears
EEZ
iMarine Products and services delivery
38. TLO warehouse V2 vs V3
V2 Contains information about 19,000 distinct marine species
Source
Species
Number
DBpedia
FLOD
14,291
FLOD
Common Species (size of intersections)
10,849
WoRMS
3,046
Ecoscope
731
56
768
FLOD
1124
Ecoscope
DBpedia
WoRMS
73
277
WoRMS
768
53
V3 contains information about 37,000 distinct marine species
Source
Common Species (size of intersections)
Species
Number
DBpedia
14,291
FLOD
FLOD
WoRMS
Ecoscope
Fishbase
10,849
WoRMS
1124
Ecoscope
277
FishBase
31,277
DBpedia
FLOD
3,046
731
56
9833
768
73
6141
53
1288
WoRMS
Ecoscope
iMarine Products and services delivery
53
39. A tiny fraction of the products and services belonging to BiolCube
PRODUCTS AND SERVICES CATALOGUE
AT PROJECT CONCLUSION
iMarine Products and services delivery
40. Trendylyzer – Definition of Common Species
Grey = not a common species in 1990
Trends for common
species can be indicators
of ecological changes
A formal definition of
common species is not
trivial
A definition based on
occurrences distribution
gives interesting, result
but is affected by sampling
biases
iMarine Products and services delivery
41. Trendylyzer – Definition of Common Species
We are searching for a more formal definition of C.S., which accounts
for the biases in the database …
We defined a commonness score function
The terms influencing the Commonness of a species are given a weight
using pattern recognition models
For each species:
1. Nr of observations
2. Nr of individuals per observation
3. Nr of observations per dataset
4. Nr of datasets
5. Nr of geographical cells
6. Temporal frequency of the observations
Normalizing => relative commonness.
Create score or rank by taxonomic group
We are assessing the
performances on the
indications by FishBase and
IUCN on some benchmark
species
iMarine Products and services delivery
42. Trendylyzer - Performance
A preliminary definition of CS was done using
1. Nr of observations per dataset in one year
2. Nr of datasets containing the species in one year
On a ‘trustable’ benchmark with 255 species the correctness of the
classification with respect to an expert classification was 99.21%!
The complex approximating function including also time and
geographical extent gave 80% of agreement with respect to an expert
classification on an ‘wild’ benchmark (80 species)
The results are very promising!
iMarine Products and services delivery
43. A tiny fraction of the products and services belonging to StatsCube
PRODUCTS AND SERVICES CATALOGUE
AT PROJECT CONCLUSION
iMarine Products and services delivery