d:swarm - A Library Data Management Platform Based on a Linked Open Data Appr...Jens Mittelbach
D:SWARM is a graphical web-based ETL modelling tool that serves to import data from heterogeneous sources with different formats, to map input to output schemata and design transformation workflows, to load transformed data into property graph database. It is developed in a collaborative project by SLUB Dresden (www.slub-dresden.de) and Avantgarde Labs GmbH (www.avantgarde-labs.de) features additional functionalities like exporting of data models as RDF and sharing mappings and transformation workflows.
d:swarm - A Library Data Management Platform Based on a Linked Open Data Appr...Jens Mittelbach
D:SWARM is a graphical web-based ETL modelling tool that serves to import data from heterogeneous sources with different formats, to map input to output schemata and design transformation workflows, to load transformed data into property graph database. It is developed in a collaborative project by SLUB Dresden (www.slub-dresden.de) and Avantgarde Labs GmbH (www.avantgarde-labs.de) features additional functionalities like exporting of data models as RDF and sharing mappings and transformation workflows.
The presentation was given at the SOCM'16 workshop at the WWW16 conference. It corresponds to the research study titled "Observlets: Empowering Analytical Observations on Web Observatory".
Data challenge accepted - an Overview of Data Science Practices and Competenc...Alina Stoicescu
In today’s competitive research environment, the need for librarians to be knowledgeable about all things digital is growing. Data-savvy librarians are able to better assist their patrons with the resources they need for their research, as well as extract useful insights from library data.
Data science as a discipline aims to provide solutions for managing the steeply growing amount of data in the world. Due to their educational background and inquisitive approach to information and knowledge, librarians are well-positioned to use data science in their work. Yet how prepared are they to work with data science? Areas discussed within this presentation are data science competencies, data librarianship as a profession and the three roles of data librarianship.
Leveraging SKOS to trace the overhaul of the STW Thesaurus for EconomicsJoachim Neubert
"What's new?" and "What has changed" are questions users of Knowledge Organization Systems (KOS), such as thesauri or classifications, ask when a new version is published. Much more so, when a thesaurus existing since the 1990s has been completely revised, subject area for subject area. After five intermediately published versions in as many consecutive years, STW Thesaurus for Economics has been re-launched recently in version 9.0. In total, 777 descriptors have been added; more than a thousand (of about 6,000) have been deprecated, in their vast majority merged into others. More subtle changes include modified preferred labels, or merges and splits of existing concepts. We here describe how these changes were tracked, making use of the published SKOS (Miles & Bechhofer, 2009) files of the versions, loading them into named graphs of a SPARQL endpoint and executing queries on them. An ontology supporting version and delta description and query formulation is introduced. High-level visualizations of aggregated change data and drill-downs to the actual concepts are presented. We finish with an outlook to the skos-history project, which generalizes and extends the methodology to different knowledge organization systems.
Discovery layer decisions, configurations and strategiesRay Schwartz
What are discovery layers
What brought about this topic and the five libraries chosen
How did they implement
How have they assessed
What modifications were made
Conclusions
BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...Big Data Value Association
In the Internet of Everything, huge volumes of multimedia data are generated at very high rates by heterogeneous sources in various formats, such as sensors readings, process logs, structured data from RDBMS, etc. The need of the hour is setting up efficient data pipelines that can compute advanced analytics models on data and use results to customize services, predict future needs or detect anomalies. This Webinar explores the TOREADOR conversational, service-based approach to the easy design of efficient and reusable analytics pipelines to be automatically deployed on a variety of cloud-based execution platforms.
Connexity: Reinventing the Networking Experience UpdatedPCMAHQ
Session: Connexity: Reinventing the Networking Experience Updated
Presented by: Sarah Michel CSP, VP of Professional Connexity, Velvet Chainsaw Consulting
Date and time: Tuesday, June 25, 10:00am
pcma.org/educon
The presentation was given at the SOCM'16 workshop at the WWW16 conference. It corresponds to the research study titled "Observlets: Empowering Analytical Observations on Web Observatory".
Data challenge accepted - an Overview of Data Science Practices and Competenc...Alina Stoicescu
In today’s competitive research environment, the need for librarians to be knowledgeable about all things digital is growing. Data-savvy librarians are able to better assist their patrons with the resources they need for their research, as well as extract useful insights from library data.
Data science as a discipline aims to provide solutions for managing the steeply growing amount of data in the world. Due to their educational background and inquisitive approach to information and knowledge, librarians are well-positioned to use data science in their work. Yet how prepared are they to work with data science? Areas discussed within this presentation are data science competencies, data librarianship as a profession and the three roles of data librarianship.
Leveraging SKOS to trace the overhaul of the STW Thesaurus for EconomicsJoachim Neubert
"What's new?" and "What has changed" are questions users of Knowledge Organization Systems (KOS), such as thesauri or classifications, ask when a new version is published. Much more so, when a thesaurus existing since the 1990s has been completely revised, subject area for subject area. After five intermediately published versions in as many consecutive years, STW Thesaurus for Economics has been re-launched recently in version 9.0. In total, 777 descriptors have been added; more than a thousand (of about 6,000) have been deprecated, in their vast majority merged into others. More subtle changes include modified preferred labels, or merges and splits of existing concepts. We here describe how these changes were tracked, making use of the published SKOS (Miles & Bechhofer, 2009) files of the versions, loading them into named graphs of a SPARQL endpoint and executing queries on them. An ontology supporting version and delta description and query formulation is introduced. High-level visualizations of aggregated change data and drill-downs to the actual concepts are presented. We finish with an outlook to the skos-history project, which generalizes and extends the methodology to different knowledge organization systems.
Discovery layer decisions, configurations and strategiesRay Schwartz
What are discovery layers
What brought about this topic and the five libraries chosen
How did they implement
How have they assessed
What modifications were made
Conclusions
BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...Big Data Value Association
In the Internet of Everything, huge volumes of multimedia data are generated at very high rates by heterogeneous sources in various formats, such as sensors readings, process logs, structured data from RDBMS, etc. The need of the hour is setting up efficient data pipelines that can compute advanced analytics models on data and use results to customize services, predict future needs or detect anomalies. This Webinar explores the TOREADOR conversational, service-based approach to the easy design of efficient and reusable analytics pipelines to be automatically deployed on a variety of cloud-based execution platforms.
Connexity: Reinventing the Networking Experience UpdatedPCMAHQ
Session: Connexity: Reinventing the Networking Experience Updated
Presented by: Sarah Michel CSP, VP of Professional Connexity, Velvet Chainsaw Consulting
Date and time: Tuesday, June 25, 10:00am
pcma.org/educon
Experimental transformation of ABS data into Data Cube Vocabulary (DCV) form...Alistair Hamilton
Presentation by Al Hamilton and Cody Johnson to Canberra Semantic Web Meetup Group on why producers of official statistics are interested in semantic web community (including Linked Open Data) and outlining experimental work by Cody Johnson on transforming selected Population Census data released by the ABS in SDMX-ML to RDF Data Cube Vocabulary format.
Internet Infrastructures for Big Data (Verisign's Distinguished Speaker Series)eXascale Infolab
Internet Infrastructures for Big Data
Talk given at Verisign's Distinguished Speaker Series, 2014
Prof. Philippe Cudre-Mauroux
eXascale Infolab
http://exascale.info/
BigData conference - Introduction to stream processingNicolas Fränkel
While “software is eating the world”, those who are able to best manage the huge mass of data will emerge out on the top.
The batch processing model has been faithfully serving us for decades. However, it might have reached the end of its usefulness for all but some very specific use-cases. As the pace of businesses increases, most of the time, decision makers prefer slightly wrong data sooner, than 100% accurate data later. Stream processing – or data streaming – exactly matches this usage: instead of managing the entire bulk of data, manage pieces of them as soon as they become available.
In this talk, Nicolas will define the context in which the old batch processing model was born, the reasons that are behind the new stream processing one, how they compare, what are their pros and cons, and a list of existing technologies implementing the latter with their most prominent characteristics. He’ll conclude by describing in detail one possible use-case of data streaming that is not possible with batches: display in (near) real-time all trains in Switzerland and their position on a map. He’ll go through the all the requirements and the design. Finally, using an OpenData endpoint and the Hazelcast platform, he’ll try to impress attendees with a working demo implementation of it.
HPC and Precision Medicine: A New Framework for Alzheimer's and Parkinson'sinside-BigData.com
In this deck from the HPC User Forum in Tucson, Joe Lombardo from UNLV presents: HPC and Precision Medicine - A New Framework for Alzheimer's and Parkinson's.
"The University of Nevada, Las Vegas and the Cleveland Clinic Lou Ruvo Center for Brain Health have been awarded an $11 million federal grant from the National Institutes of Health and National Institute of General Medical Sciences to advance the understanding of Alzheimer's and Parkinson's diseases. In this session, we will present how UNLV's National Supercomputing Institute plays a critical role in this research by fusing brain imaging, neuropsychological and behavioral studies along with the diagnostic exome sequencing models to increase our knowledge of dementia-related and age-associated degenerative disorders."
Watch the video: https://wp.me/p3RLHQ-iws
Learn more: https://www.unlv.edu/news/release/unlv-receives-nih-grant-alzheimers-disease-research
and
http://hpcuserforum.com
Sign up for our insideHPC Newsletter: http://insidehpc.com/newsletter
The seminar is about Data warehousing, in here we are gonna discuss about what is data warehousing, comparison b/w database and data warehouse, different data warehouse models.about Data mart, and disadvantages of data warehousing.
Big data serving: Processing and inference at scale in real timeItai Yaffe
Jon Bratseth (VP Architect) @ Verizon Media:
The big data world has mature technologies for offline analysis and learning from data, but have lacked options for making data-driven decisions in real time.
When it is sufficient to consider a single data point model servers such as TensorFlow serving can be used but in many cases you want to consider many data points to make decisions.
This is a difficult engineering problem combining state, distributed algorithms and low latency, but solving it often makes it possible to create far superior solutions when applying machine learning.
This talk will explain why this is a hard problem, show the advantages of solving it, and introduce the open source Vespa.ai platform which is used to implement such solutions in some of the largest scale problems in the world including the world's third largest ad serving system.
From Millennium ERMS to Proquest 360 Resource ManagerRindra Ramli
An overview of the recommendation study and subsequent implementation of a new electronic resources management system ERMS in an international graduate research university in the Middle East. It described the project timeline, deliverables, challenges as well as lessons learnt.
Devclub.lv - Introduction to stream processingNicolas Fränkel
While “software is eating the world”, those who are able to best manage the huge mass of data will emerge out on the top.
The batch processing model has been faithfully serving us for decades. However, it might have reached the end of its usefulness for all but some very specific use-cases. As the pace of businesses increases, most of the time, decision-makers prefer slightly wrong data sooner, than 100% accurate data later. Stream processing – or data streaming – exactly matches this usage: instead of managing the entire bulk of data, manage pieces of them as soon as they become available.
Scanner Data
In these slides the author presents the issues and challenges related to dealing with datasets of big size such as those involved in the Scanner Data project at Istat. He illustrates IT architecture backing the testing phase of the project, currently in place, and the ideas for the production architecture. The motivations behind the design are explained as well as the solutions introduced as part of a larger scope approach to the modernization of tools and techniques used for data storage and processing in Istat, envisioning the future challenges posed by the adoption of Big Data and Data Science in NSIs.
http://www.istat.it/en/archive/168897
http://www.istat.it/it/archivio/168890
This webcast introduces AE-EHR Clinical Exchange Document (CED) capabilities as well as an approach to exchange an unsolicited CED within the ConnectR interface engine.
(Big) Data Processing for Next Generation Business Value. Presented at the Leaders Buildings Leaders Conference, held at Union College on April 3, 2015.
https://www.ucollege.edu/academics/business-and-computer-science/leaders-building-leaders
This presentation is a semi-technical overview of big data and related use-cases, the Apache Hadoop software stack, and some example data-science / analysis models.
Accelerating Delivery of Data Products - The EBSCO WayMongoDB
EBSCO Information Services (EBSCO) is the leading provider of electronic journals, magazines, eBooks, audioBooks, and online research content for libraries, including hundreds of research databases, historical archives, point-of-care medical reference, and corporate learning tools serving millions of end users at tens of thousands of institutions worldwide. The EBSCO platform is a widely used platform serving the needs of researchers at all levels in academic institutions, schools, public libraries, hospitals, medical institutions, corporations and government institutions. Data is our business, and delivering new products quickly is our competitive advantage. We build hundreds of data products and accelerating the analysis, transformation of new datasets translates to revenue and competitiveness. And since our data is so varied, using MognoDB to store data flexibly and JSON Studio to analyze this data allows us to deliver products to market faster. In this session we will describe this process that helped us expedite delivery of new datasets, and give real examples of how data is used, analyzed and processed.
ADV Slides: Trends in Streaming Analytics and Message-oriented MiddlewareDATAVERSITY
Streaming and real-time data has high business value, but that value can rapidly decay if not processed quickly. If the value of the data is not realized in a certain window of time, its value is lost and the decision or action that was needed as a result never occurs. Streaming data – whether from sensors, devices, applications, or events – needs special attention because a sudden price change, a critical threshold met, a sensor reading changing rapidly, or a blip in a log file can all be of immense value, but only if the alert is in time.
Similar to Coherance in dissemination- Msis 2007 (20)
Essentials of Automations: Optimizing FME Workflows with ParametersSafe Software
Are you looking to streamline your workflows and boost your projects’ efficiency? Do you find yourself searching for ways to add flexibility and control over your FME workflows? If so, you’re in the right place.
Join us for an insightful dive into the world of FME parameters, a critical element in optimizing workflow efficiency. This webinar marks the beginning of our three-part “Essentials of Automation” series. This first webinar is designed to equip you with the knowledge and skills to utilize parameters effectively: enhancing the flexibility, maintainability, and user control of your FME projects.
Here’s what you’ll gain:
- Essentials of FME Parameters: Understand the pivotal role of parameters, including Reader/Writer, Transformer, User, and FME Flow categories. Discover how they are the key to unlocking automation and optimization within your workflows.
- Practical Applications in FME Form: Delve into key user parameter types including choice, connections, and file URLs. Allow users to control how a workflow runs, making your workflows more reusable. Learn to import values and deliver the best user experience for your workflows while enhancing accuracy.
- Optimization Strategies in FME Flow: Explore the creation and strategic deployment of parameters in FME Flow, including the use of deployment and geometry parameters, to maximize workflow efficiency.
- Pro Tips for Success: Gain insights on parameterizing connections and leveraging new features like Conditional Visibility for clarity and simplicity.
We’ll wrap up with a glimpse into future webinars, followed by a Q&A session to address your specific questions surrounding this topic.
Don’t miss this opportunity to elevate your FME expertise and drive your projects to new heights of efficiency.
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
UiPath Test Automation using UiPath Test Suite series, part 4DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 4. In this session, we will cover Test Manager overview along with SAP heatmap.
The UiPath Test Manager overview with SAP heatmap webinar offers a concise yet comprehensive exploration of the role of a Test Manager within SAP environments, coupled with the utilization of heatmaps for effective testing strategies.
Participants will gain insights into the responsibilities, challenges, and best practices associated with test management in SAP projects. Additionally, the webinar delves into the significance of heatmaps as a visual aid for identifying testing priorities, areas of risk, and resource allocation within SAP landscapes. Through this session, attendees can expect to enhance their understanding of test management principles while learning practical approaches to optimize testing processes in SAP environments using heatmap visualization techniques
What will you get from this session?
1. Insights into SAP testing best practices
2. Heatmap utilization for testing
3. Optimization of testing processes
4. Demo
Topics covered:
Execution from the test manager
Orchestrator execution result
Defect reporting
SAP heatmap example with demo
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
Connector Corner: Automate dynamic content and events by pushing a buttonDianaGray10
Here is something new! In our next Connector Corner webinar, we will demonstrate how you can use a single workflow to:
Create a campaign using Mailchimp with merge tags/fields
Send an interactive Slack channel message (using buttons)
Have the message received by managers and peers along with a test email for review
But there’s more:
In a second workflow supporting the same use case, you’ll see:
Your campaign sent to target colleagues for approval
If the “Approve” button is clicked, a Jira/Zendesk ticket is created for the marketing design team
But—if the “Reject” button is pushed, colleagues will be alerted via Slack message
Join us to learn more about this new, human-in-the-loop capability, brought to you by Integration Service connectors.
And...
Speakers:
Akshay Agnihotri, Product Manager
Charlie Greenberg, Host
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Generating a custom Ruby SDK for your web service or Rails API using Smithyg2nightmarescribd
Have you ever wanted a Ruby client API to communicate with your web service? Smithy is a protocol-agnostic language for defining services and SDKs. Smithy Ruby is an implementation of Smithy that generates a Ruby SDK using a Smithy model. In this talk, we will explore Smithy and Smithy Ruby to learn how to generate custom feature-rich SDKs that can communicate with any web service, such as a Rails JSON API.
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
State of ICS and IoT Cyber Threat Landscape Report 2024 previewPrayukth K V
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio, cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio also runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors, and newer malware including new variants and latent threats that are at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
Accelerate your Kubernetes clusters with Varnish CachingThijs Feryn
A presentation about the usage and availability of Varnish on Kubernetes. This talk explores the capabilities of Varnish caching and shows how to use the Varnish Helm chart to deploy it to Kubernetes.
This presentation was delivered at K8SUG Singapore. See https://feryn.eu/presentations/accelerate-your-kubernetes-clusters-with-varnish-caching-k8sug-singapore-28-2024 for more details.
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...UiPathCommunity
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Coherance in dissemination- Msis 2007
1. Coherent dissemination throughout
the web
MSIS
Management of Statistical Information Systems
Geneva 8-10 May 2007
Annegrete Wulff
Statistics Denmark
awu@dst.dk
2. 2
Centralized vs decentralized
• Input: centralized and decentralized
• Centralized: loading program, global
variables, coordination of code lists,
classifications, title
• Decentralized: data updates, loading,
code lists, specific variable, contact, foot
notes, English translations
• Output: centralized
• one output system
• coordination of structure and look-and-
feel
3. 3
Dissemination principles
• Electronic over paper
• StatBank is the place for all official statistics
• StatBank is the source for all publications
• Simultaneous releases in all media 9:30:00 am
• StatBank is online available & free-of-charge for everyone
• Dissemination should address well-defined
– target groups
– types of usage
• …jet still use the same source (data and metadata)
4. 4
What is www.StatBank.dk
• 1,500 large multi dimensional tables (cubes)
• Online available & free-of-charge for everyone
• Covers all subjects
• Cross cutting metadata
• Data storage in an Oracle database
• User interface in ASP and JAVA script
• Variety of download formats
• Saved queries and data shoots
• Presentations in tables, time series, graphs, maps
• Complete English version
5. 5
Output
• Formats: Excel, PC-
AXIS, SAS, comma
separated, xml, time
series,…
• Maps
• Graphs
• Links to
documentation and
contacts
6. 6
The Public
-
www.dst.dk
SumDatabase
Cleaned micro data
Statistical registers
-StatBank
Denmark
Print
Binding
dst.d
k
Charged statistics
and analysis
Annonymos micro data
for
Researchers
Subject matter division, Dissemination, IT-Centre
Aggregation
to macro data
Publication pdf
-StatHost 4
-StatHost 3
-StatHost 2
-StatHost 1
International organisations
7. 7
Alert on updates
• RSS on latest updates
• Saved queries – accessing StatBank.dk
• Datashoot
• Excel web queries
• XML queries and web service
8. 8
XML query
• Registreret XML-bruger i Statistikbanken
• Vi giver adgang til en profil i Statistikbanken,
som kan indeholde et ubegrænset antal XML-
forespørgsler, der altid giver de nyeste tal. Der
gives ydermere adgang til en Web Service, som
kan levere resultatet som færdig HTML. Vi
garanterer at der adviseres om eventuelle
ændringer i XML-formatet. Denne løsning giver
mulighed for advisering via e-post
(Datashooting) når der er nye tal, samt for
automatisk opdatering af tabellerne i Excel.
Kunden kan selv tilføje, ændre og slette de
gemte forespørgsler efter behov.
10. 11
1,500 matrices in Danish and English
2 million retrievals
HTML table on screen
Downloads
of a file
77 % only on screen
6 % maps, 17 % graphs
23% downloads. Of these:
86 % in Excel
9 % in PC-AXIS
5% in other formats
11. 12
Global data…..global metadata
• Reference metadata (declaration of content)
– source
– quality
– accessability
– methodologies
– contacts
– release info
• Definitions of concepts ( project start 2007)
• Stored once – to be used all over