Scaling collaborative data science with Globus and JupyterIan Foster
The Globus service simplifies the utilization of large and distributed data on the Jupyter platform. Ian Foster explains how to use Globus and Jupyter to seamlessly access notebooks using existing institutional credentials, connect notebooks with data residing on disparate storage systems, and make data securely available to business partners and research collaborators.
It will contain knowledge and usefulness of colwiz tool which is used for research management tool. One can mange their research papers and also can search from using it.
Science as a Service: How On-Demand Computing can Accelerate DiscoveryIan Foster
My talk at ScienceCloud 2013 in NYC. Thanks to the organizers for the invitation to talk.
A bit of new material relative to previous talks posted, e.g., on Globus Genomics.
Scaling collaborative data science with Globus and JupyterIan Foster
The Globus service simplifies the utilization of large and distributed data on the Jupyter platform. Ian Foster explains how to use Globus and Jupyter to seamlessly access notebooks using existing institutional credentials, connect notebooks with data residing on disparate storage systems, and make data securely available to business partners and research collaborators.
It will contain knowledge and usefulness of colwiz tool which is used for research management tool. One can mange their research papers and also can search from using it.
Science as a Service: How On-Demand Computing can Accelerate DiscoveryIan Foster
My talk at ScienceCloud 2013 in NYC. Thanks to the organizers for the invitation to talk.
A bit of new material relative to previous talks posted, e.g., on Globus Genomics.
Slides accompanying a day-long introduction to AtoM and Archivematica, presented by Dan Gillean and Justin Simpson at the UK National Archives as part of an AIM25 and Higher Education Archive Programme Network Meeting, December 2, 2016.
Research Data (and Software) Management at Imperial: (Everything you need to ...Sarah Anna Stewart
A presentation on research data management tools, workflows and best practices at Imperial College London with a focus on software management. Presented at the 2017 session of the HPC Summer School (Dept. of Computing).
“Filling the digital preservation gap”an update from the Jisc Research Data ...Jenny Mitcham
Presentation given to the Hydra Preservation Interest Group by Jenny Mitcham on the Jisc Research Data Spring project "Filling the Digital Preservation Gap"
A collaborative approach to "filling the digital preservation gap" for Resear...Jenny Mitcham
A presentation given by Jenny Mitcham at the Northern Collaboration Conference on 10th September 2015 at Leeds. It describes work underway in the "Filling the Digital Preservation Gap" project using Archivematica to preserve research data
https://bigscience.huggingface.co/
EN: Presentation of the BigScience project: a research initiative launched by HuggingFace and aiming to build a large language model (inspired by OpenAI and GPTx) over multiple languages and a very large processing cluster. The participants plan to investigate the dataset and the model from all angles: bias, social impact, capabilities, limitations, ethics, potential improvements, specific domain performances, carbon impact, general AI/cognitive research landscape.
FR : Présentation du projet Bigscience : un projet de recherche ouvert lancé par HuggingFace et qui a pour objectif de contruire un modèle de langue (ie un peu comme openAI et GPT-3) mais en explorant les problèmes liés au jeux de données et au modèle selon les angles des biais cognitifs, de l'impact social et environemental, des limites éthiques, des possibles gain de performance et de l'impact général de ce type d'approche lorsque le but n'est pas seulement "d'avoir un plus gros modèle".
Project update: A collaborative approach to "filling the digital preservation...Jenny Mitcham
A presentation given by Julie Allinson at the UK Archivematica group meeting on 6th November 2015 in Leeds. It describes work underway in the "Filling the Digital Preservation Gap" project using Archivematica to preserve research data
The Wellcome Trust is examining the possibility of a cloud platform for the storage and delivery of digitised artefacts. This platform is intended for the Trust's own use as well as others. A version of this presentation with embedded notes and video can be viewed on Google docs: http://bit.ly/1GRKqN4 or PowerPoint online: http://bit.ly/1CwGsrE
War stories from building the Global Patent Search Network, and why Data folks need to think more about UX and Discovery, and UX folks need to think more about Data.
This presentation was provided by Jake Zarnegar of Silverchair, during the NFAIS Forethought event "Artificial Intelligence #2 – Processes for Media Analysis and Extraction" The webinar was held on May 20, 2020.
Collaborations in the Extreme: The rise of open code development in the scie...Kelle Cruz
Video: https://www.simonsfoundation.org/event/collaborations-in-the-extreme-the-rise-of-open-code-development-in-the-scientific-community/
The internet is changing the scientific landscape by fostering international, interdisciplinary and collaborative software development. More than ever before, software is a crucial component of any scientific result. The ability to easily share code is reshaping expectations about reproducibility -- a fundamental tenet of the scientific process. In this lecture, Kelle Cruz will briefly provide the backstory of how these shifts have come about, describe some of the most impactful open source projects, and discuss efforts currently underway aimed at ensuring these community-led projects are sustainable and receive support.
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...Artefactual Systems - AtoM
These slides accompanied a June 4th, 2016 presentation made by Dan Gillean of Artefactual Systems at the Association of Canadian Archivists' 2016 Conference in Montreal, QC, Canada.
This presentation aims to examine several existing or emerging computing paradigms, with specific examples, to imagine how they might inform next-generation archival systems to support digital preservation, description, and access. Topics covered include:
- Distributed Version Control and git
- P2P architectures and the BitTorrent protocol
- Linked Open Data and RDF
- Blockchain technology
The session is part of an attempt by the ACA to create interactive "working sessions" at its conferences. Accompanying notes can be found at: http://bit.ly/tech-Proche
Participants were also asked to use the Twitter hashtag of #techProche for online interaction during the session.
Sharepoint for Nonprofits: Introduction501 Commons
A Walkthrough of Real World Deployment
Now that you are up and running on Office365, are you wondering if SharePoint can help your organization better collaborate? Is email your org's primary means of sharing files? Not quite sure what SharePoint is? Wondering what it takes to build a useful SharePoint site? Hoping to decommission your local file server or Dropbox shared accounts? This presentation will walk through the business and technical steps taken at Habitat for Humanity SKC to deploy SharePoint in a mid-sized nonprofit.
Topics to be covered:
SharePoint functional overview. What is it and what can it do for me? Brief comparison with similar products.
Structuring sites and subsites.
Security considerations. Internal and external sharing. How do I control and monitor access?
Document libraries. Custom views. Using folders vs. search.
Data preservation.
Syncing files locally. Limits and tradeoffs.
Reaching nirvana – any document on any device, anywhere in a secure environment?
How to involve your team in design and deployment. How to manage a deployment.
Project overview at Habitat for Humanity. What worked? What didn't work?
Hidden costs? Training? Internet upgrade? Storage fees? Local PC upgrades?
Benefit from the experience of a recent deployment, and make a more informed decision about whether SharePoint is a good fit for your organizational needs.
About the presenter:
Kevin Phaup is an independent software consultant who has advised dozens of local non-profits in Seattle and Portland over the years primarily as a volunteer. He works closely with Social Venture Partners. He enjoys providing targeted technical and business advice, and hands-on work building successful IT solutions.
Slides accompanying a day-long introduction to AtoM and Archivematica, presented by Dan Gillean and Justin Simpson at the UK National Archives as part of an AIM25 and Higher Education Archive Programme Network Meeting, December 2, 2016.
Research Data (and Software) Management at Imperial: (Everything you need to ...Sarah Anna Stewart
A presentation on research data management tools, workflows and best practices at Imperial College London with a focus on software management. Presented at the 2017 session of the HPC Summer School (Dept. of Computing).
“Filling the digital preservation gap”an update from the Jisc Research Data ...Jenny Mitcham
Presentation given to the Hydra Preservation Interest Group by Jenny Mitcham on the Jisc Research Data Spring project "Filling the Digital Preservation Gap"
A collaborative approach to "filling the digital preservation gap" for Resear...Jenny Mitcham
A presentation given by Jenny Mitcham at the Northern Collaboration Conference on 10th September 2015 at Leeds. It describes work underway in the "Filling the Digital Preservation Gap" project using Archivematica to preserve research data
https://bigscience.huggingface.co/
EN: Presentation of the BigScience project: a research initiative launched by HuggingFace and aiming to build a large language model (inspired by OpenAI and GPTx) over multiple languages and a very large processing cluster. The participants plan to investigate the dataset and the model from all angles: bias, social impact, capabilities, limitations, ethics, potential improvements, specific domain performances, carbon impact, general AI/cognitive research landscape.
FR : Présentation du projet Bigscience : un projet de recherche ouvert lancé par HuggingFace et qui a pour objectif de contruire un modèle de langue (ie un peu comme openAI et GPT-3) mais en explorant les problèmes liés au jeux de données et au modèle selon les angles des biais cognitifs, de l'impact social et environemental, des limites éthiques, des possibles gain de performance et de l'impact général de ce type d'approche lorsque le but n'est pas seulement "d'avoir un plus gros modèle".
Project update: A collaborative approach to "filling the digital preservation...Jenny Mitcham
A presentation given by Julie Allinson at the UK Archivematica group meeting on 6th November 2015 in Leeds. It describes work underway in the "Filling the Digital Preservation Gap" project using Archivematica to preserve research data
The Wellcome Trust is examining the possibility of a cloud platform for the storage and delivery of digitised artefacts. This platform is intended for the Trust's own use as well as others. A version of this presentation with embedded notes and video can be viewed on Google docs: http://bit.ly/1GRKqN4 or PowerPoint online: http://bit.ly/1CwGsrE
War stories from building the Global Patent Search Network, and why Data folks need to think more about UX and Discovery, and UX folks need to think more about Data.
This presentation was provided by Jake Zarnegar of Silverchair, during the NFAIS Forethought event "Artificial Intelligence #2 – Processes for Media Analysis and Extraction" The webinar was held on May 20, 2020.
Collaborations in the Extreme: The rise of open code development in the scie...Kelle Cruz
Video: https://www.simonsfoundation.org/event/collaborations-in-the-extreme-the-rise-of-open-code-development-in-the-scientific-community/
The internet is changing the scientific landscape by fostering international, interdisciplinary and collaborative software development. More than ever before, software is a crucial component of any scientific result. The ability to easily share code is reshaping expectations about reproducibility -- a fundamental tenet of the scientific process. In this lecture, Kelle Cruz will briefly provide the backstory of how these shifts have come about, describe some of the most impactful open source projects, and discuss efforts currently underway aimed at ensuring these community-led projects are sustainable and receive support.
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...Artefactual Systems - AtoM
These slides accompanied a June 4th, 2016 presentation made by Dan Gillean of Artefactual Systems at the Association of Canadian Archivists' 2016 Conference in Montreal, QC, Canada.
This presentation aims to examine several existing or emerging computing paradigms, with specific examples, to imagine how they might inform next-generation archival systems to support digital preservation, description, and access. Topics covered include:
- Distributed Version Control and git
- P2P architectures and the BitTorrent protocol
- Linked Open Data and RDF
- Blockchain technology
The session is part of an attempt by the ACA to create interactive "working sessions" at its conferences. Accompanying notes can be found at: http://bit.ly/tech-Proche
Participants were also asked to use the Twitter hashtag of #techProche for online interaction during the session.
Sharepoint for Nonprofits: Introduction501 Commons
A Walkthrough of Real World Deployment
Now that you are up and running on Office365, are you wondering if SharePoint can help your organization better collaborate? Is email your org's primary means of sharing files? Not quite sure what SharePoint is? Wondering what it takes to build a useful SharePoint site? Hoping to decommission your local file server or Dropbox shared accounts? This presentation will walk through the business and technical steps taken at Habitat for Humanity SKC to deploy SharePoint in a mid-sized nonprofit.
Topics to be covered:
SharePoint functional overview. What is it and what can it do for me? Brief comparison with similar products.
Structuring sites and subsites.
Security considerations. Internal and external sharing. How do I control and monitor access?
Document libraries. Custom views. Using folders vs. search.
Data preservation.
Syncing files locally. Limits and tradeoffs.
Reaching nirvana – any document on any device, anywhere in a secure environment?
How to involve your team in design and deployment. How to manage a deployment.
Project overview at Habitat for Humanity. What worked? What didn't work?
Hidden costs? Training? Internet upgrade? Storage fees? Local PC upgrades?
Benefit from the experience of a recent deployment, and make a more informed decision about whether SharePoint is a good fit for your organizational needs.
About the presenter:
Kevin Phaup is an independent software consultant who has advised dozens of local non-profits in Seattle and Portland over the years primarily as a volunteer. He works closely with Social Venture Partners. He enjoys providing targeted technical and business advice, and hands-on work building successful IT solutions.
Similar to "A Toolkit for Digital Research" - CNI 2013 (20)
"Impact of front-end architecture on development cost", Viktor TurskyiFwdays
I have heard many times that architecture is not important for the front-end. Also, many times I have seen how developers implement features on the front-end just following the standard rules for a framework and think that this is enough to successfully launch the project, and then the project fails. How to prevent this and what approach to choose? I have launched dozens of complex projects and during the talk we will analyze which approaches have worked for me and which have not.
State of ICS and IoT Cyber Threat Landscape Report 2024 previewPrayukth K V
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio, cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio also runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors, and newer malware including new variants and latent threats that are at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
Search and Society: Reimagining Information Access for Radical FuturesBhaskar Mitra
The field of Information retrieval (IR) is currently undergoing a transformative shift, at least partly due to the emerging applications of generative AI to information access. In this talk, we will deliberate on the sociotechnical implications of generative AI for information access. We will argue that there is both a critical necessity and an exciting opportunity for the IR community to re-center our research agendas on societal needs while dismantling the artificial separation between the work on fairness, accountability, transparency, and ethics in IR and the rest of IR research. Instead of adopting a reactionary strategy of trying to mitigate potential social harms from emerging technologies, the community should aim to proactively set the research agenda for the kinds of systems we should build inspired by diverse explicitly stated sociotechnical imaginaries. The sociotechnical imaginaries that underpin the design and development of information access technologies needs to be explicitly articulated, and we need to develop theories of change in context of these diverse perspectives. Our guiding future imaginaries must be informed by other academic fields, such as democratic theory and critical theory, and should be co-developed with social science scholars, legal scholars, civil rights and social justice activists, and artists, among others.
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Tobias Schneck
As AI technology is pushing into IT I was wondering myself, as an “infrastructure container kubernetes guy”, how get this fancy AI technology get managed from an infrastructure operational view? Is it possible to apply our lovely cloud native principals as well? What benefit’s both technologies could bring to each other?
Let me take this questions and provide you a short journey through existing deployment models and use cases for AI software. On practical examples, we discuss what cloud/on-premise strategy we may need for applying it to our own infrastructure to get it to work from an enterprise perspective. I want to give an overview about infrastructure requirements and technologies, what could be beneficial or limiting your AI use cases in an enterprise environment. An interactive Demo will give you some insides, what approaches I got already working for real.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
Essentials of Automations: Optimizing FME Workflows with ParametersSafe Software
Are you looking to streamline your workflows and boost your projects’ efficiency? Do you find yourself searching for ways to add flexibility and control over your FME workflows? If so, you’re in the right place.
Join us for an insightful dive into the world of FME parameters, a critical element in optimizing workflow efficiency. This webinar marks the beginning of our three-part “Essentials of Automation” series. This first webinar is designed to equip you with the knowledge and skills to utilize parameters effectively: enhancing the flexibility, maintainability, and user control of your FME projects.
Here’s what you’ll gain:
- Essentials of FME Parameters: Understand the pivotal role of parameters, including Reader/Writer, Transformer, User, and FME Flow categories. Discover how they are the key to unlocking automation and optimization within your workflows.
- Practical Applications in FME Form: Delve into key user parameter types including choice, connections, and file URLs. Allow users to control how a workflow runs, making your workflows more reusable. Learn to import values and deliver the best user experience for your workflows while enhancing accuracy.
- Optimization Strategies in FME Flow: Explore the creation and strategic deployment of parameters in FME Flow, including the use of deployment and geometry parameters, to maximize workflow efficiency.
- Pro Tips for Success: Gain insights on parameterizing connections and leveraging new features like Conditional Visibility for clarity and simplicity.
We’ll wrap up with a glimpse into future webinars, followed by a Q&A session to address your specific questions surrounding this topic.
Don’t miss this opportunity to elevate your FME expertise and drive your projects to new heights of efficiency.
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”.
All of this illustrated with link prediction over knowledge graphs, but the argument is general.
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...UiPathCommunity
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
JMeter webinar - integration with InfluxDB and GrafanaRTTS
Watch this recorded webinar about real-time monitoring of application performance. See how to integrate Apache JMeter, the open-source leader in performance testing, with InfluxDB, the open-source time-series database, and Grafana, the open-source analytics and visualization application.
In this webinar, we will review the benefits of leveraging InfluxDB and Grafana when executing load tests and demonstrate how these tools are used to visualize performance metrics.
Length: 30 minutes
Session Overview
-------------------------------------------
During this webinar, we will cover the following topics while demonstrating the integrations of JMeter, InfluxDB and Grafana:
- What out-of-the-box solutions are available for real-time monitoring JMeter tests?
- What are the benefits of integrating InfluxDB and Grafana into the load testing stack?
- Which features are provided by Grafana?
- Demonstration of InfluxDB and Grafana using a practice web application
To view the webinar recording, go to:
https://www.rttsweb.com/jmeter-integration-webinar
Let's dive deeper into the world of ODC! Ricardo Alves (OutSystems) will join us to tell all about the new Data Fabric. After that, Sezen de Bruijn (OutSystems) will get into the details on how to best design a sturdy architecture within ODC.
13. how much made available?
30,000,000 Gb/all*
9 Gb/PhD
3 Gb/year
* roughly
14. A cloud based research data
management system where you can:
Manage your research Make your research
outputs privately and outputs citable, sharable,
securely discoverable
20. Backed up in multiple
institutions around the world
DOIs provided by DataCite at
the California Digital Library
Adhere to ethics of academic
publishing, as per guidelines
ORCID launch partner, files to
be pushed to author profiles
All content hosted on AWS with
triple file storage, fast load times
and unbeatable uptime
21. formats accepted / visualised
Some of the formats we visualise
• csv
• f4v
• svg
• tsv
• m4a
• tif
• xls
• doc
• tiff
• xlsx
• docx
• gif
• ptt
• png
• ppt
• ods
• jpg
• pptx
• aac
• jpeg
• nex
• 3g2
• txt
• odt
• WebM
• bmp
• rtf
• mpeg4
• djvu
• pps
• 3gpp
• eps
• seq
• mov
• fa
• xml
• avi
• faa
• vaxml
• mpeg
• fasta
• tnt
• wmv
• ffn
• sxw
• flv
• frn
• zip
• mp3
• fna
• tar
• mp4
• pdf
• ...& more
22. desktop uploader
Desktop Uploader
the figshare desktop uploader allows
for quick and easy upload of your
research outputs, straight allowsyour
The figshare desktop uploader from
for quick and easy upload of your
desktop.outputs, straight from your many
research users can upload as
desktop. You can upload oncefiles atthe
files as they want at many and
once and the uploader supports
uploader supports resumable uploads
resumable uploads. This means if your
(so you’re safe if your connection
internet connection drops, you don t
need to start the drops).again.
uploads
allAll files areprivate space at private
files uploaded into your first, where
spacecan choose whether to make
you on figshare, where you can
chose whether to make them public or
manage themthem public.
privately.
24. figshare for Institutions
• Large amounts of secure private storage
space and unlimited public space.
• Detailed metrics on publicly available data.
• Ability to push research to any internal
repository.
• Subject categorisation per department.
• Collaborative spaces.
• Create your own institutional repo:
institution.figshare.com
• All data is citable, visualisable, embeddable
and trackable.
• Can be used securely by any number of
users.
35. altmetrics
Alternative metrics
Alternative to only looking at citations (complement
to, not replacement for bibliometrics)
Manifesto (by Priem, Neylon, Groth, Taborelli) at
altmetrics.org - we didn’t coin the phrase
37. right now...
“Altmetrics” generally refers to tracking online
attention (esp. social media) data to try and get
an idea of the wider impact of scholarly
research.
43. data is made available through
an API
http://api.altmetric.com/v1/doi/xxx
http://api.altmetric.com/v1/citations/1w?cited_in=linkedin
http://api.altmetric.com/v1/citations/1w?issns=1098-4275
44. the business model question ...
use by institutional repositories is free. Explorer accounts also free for
academic librarians to use however they wish.
we charge commercial publishers for more detailed data and sell site
license access to the Explorer app.