Presentation at Open Repositories 2014 on crowd sourcing of transcription of medieval calendars via IIIF Image and Presentation APIs, plus REST, Open Annotation and JSON-LD.
Olaf Janssen on the collaboration between European national libraries during ...Olaf Janssen
In this presentation Olaf Janssen - project & account manager at The European Library - talks about 18 years of cooperation between the national libraries of Europe.
He introduces national libraries in a European context and presents the history and benefits of their collaboration.
He outlines the development of The European Library (TEL) from the Gabriel project and the TELproject, and discusses how the TEL-ME-MOR project contributes to the growth of TEL.
I held this talk as an invited speaker during the NLB Conference "Celebrating Knowledge, The Power and Potential" on 15-11-2005, Singapore
Presentation on the American Archive of Public Broadcasting at the 2015 Society of American Archivists conference in Cleveland, Ohio. AAPB staff presented on the history of the project, website development, metadata, Online Reading Room, value to scholars and researchers, and digital preservation. Panelists included Karen Cariani, AAPB Director at WGBH, Casey Davis, AAPB Project Manager at WGBH, Alan Gevinson, AAPB Director at the Library of Congress, and James Snyder, Senior Systems Administrator at the Library of Congress.
Word Occurrence Based Extraction of Work Contributors from Statements of Resp...The European Library
This paper addresses the identification of all contributors of an intellectual work, when they are recorded in bibliographic data but in unstructured form. National bibliographies are very reliable on representing the first author of a work, but frequently, secondary contributors are represented in the statements of responsibility that are transcribed by the cataloguer from the book into the bibliographic records. The identification of work contributors mentioned in statements of responsibility is a typical motivation for the application of information extraction techniques. This paper presents an approach developed for the specific application scenario of the ARROW rights infrastructure being deployed in several European countries to assist in the determination of the copyright status of works that may not be under public domain. Our approach performed reliably in most languages and bibliographic datasets of at least one million records, achieving precision and recall above 0.97 on five of the six evaluated datasets. We conclude that the approach can be reliably applied to other national bibliographies and languages.
Olaf Janssen on the collaboration between European national libraries during ...Olaf Janssen
In this presentation Olaf Janssen - project & account manager at The European Library - talks about 18 years of cooperation between the national libraries of Europe.
He introduces national libraries in a European context and presents the history and benefits of their collaboration.
He outlines the development of The European Library (TEL) from the Gabriel project and the TELproject, and discusses how the TEL-ME-MOR project contributes to the growth of TEL.
I held this talk as an invited speaker during the NLB Conference "Celebrating Knowledge, The Power and Potential" on 15-11-2005, Singapore
Presentation on the American Archive of Public Broadcasting at the 2015 Society of American Archivists conference in Cleveland, Ohio. AAPB staff presented on the history of the project, website development, metadata, Online Reading Room, value to scholars and researchers, and digital preservation. Panelists included Karen Cariani, AAPB Director at WGBH, Casey Davis, AAPB Project Manager at WGBH, Alan Gevinson, AAPB Director at the Library of Congress, and James Snyder, Senior Systems Administrator at the Library of Congress.
Word Occurrence Based Extraction of Work Contributors from Statements of Resp...The European Library
This paper addresses the identification of all contributors of an intellectual work, when they are recorded in bibliographic data but in unstructured form. National bibliographies are very reliable on representing the first author of a work, but frequently, secondary contributors are represented in the statements of responsibility that are transcribed by the cataloguer from the book into the bibliographic records. The identification of work contributors mentioned in statements of responsibility is a typical motivation for the application of information extraction techniques. This paper presents an approach developed for the specific application scenario of the ARROW rights infrastructure being deployed in several European countries to assist in the determination of the copyright status of works that may not be under public domain. Our approach performed reliably in most languages and bibliographic datasets of at least one million records, achieving precision and recall above 0.97 on five of the six evaluated datasets. We conclude that the approach can be reliably applied to other national bibliographies and languages.
2014 Census of Open Access Repositories in Germany, Austria and SwitzerlandPaul Vierkant
The proposal presents the “2014 Census of Open Access Repositories in Germany, Austria and Switzerland” (2014 Census) conducted in the cause of a seminar at Berlin School of Library and Information Science (BSLIS) at Humboldt-Universität zu Berlin. It succeeds the 2012 Census of Open Access repositories in Germany. The 2014 Census gives insights into the development of Open Access repositories and current trends in repository design. By extending the research design it also indicates unexplored aspects of Open Access repositories. With the paper the authors will present the overall 2014 Census results exclusively.
Library collections and the emerging scholarly recordlisld
A high level review of collection trends followed by a summary of recent work on the evolving scholarly record.
Presented at the OCLC Research Library Partnership meeting at the University of Melbourne, 2 December 2015.
The Abnormal Hieratic Global Portal aims to:
- Bring together published texts, i.e. transcriptions, transliterations and translations
- Teaching the study of Abnormal Hieratic with papyri
- Discuss and annotate texts
- Create a name book and dictionary to help new papyri be deciphered
By Ben Companjen, 27th June 2019
Islandora Webinar: Highlighting CUHK Chinese Digital CollectionsErin Tripp
The webinar will feature a presentation and Q&A session with Jeff Liu, Digital Services Librarian and Louisa Lam, Head, Research Support and Digital Initiatives at the CUHK Library.
The CUHK Library has curated a collection of over five million digital objects in the past 20 years. It features Chinese literature, culture, arts, politics, society and religion. Until recently, the collection was stored in a broad range of different systems, complicating the discovery of these precious digital assets.
In 2015, librarians at CUHK embarked on a project to find a permanent, single platform for digital content. Objectives of the project included enhanced discoverability, multi-language support (Chinese, Japanese & Korean) and custom development capability to modify display and viewing features that would showcase Chinese literature in its true form.
Islandora met all the functional requirements and more, including support for digital humanities projects and access to a user-driven open source software community.
The CUHK library was also attracted to the vendor services and support available through discoverygarden. We provided advice, support and custom development assistance; contributing to the launch of the digital repository every step of the way.
The repository (http://repository.lib.cuhk.edu.hk) officially launched in February 2016, making the CUHK Library digital initiatives pioneers in Hong Kong.
DanteSources is a focused digital library endowed with web services that allow visualizing information on Dante Alighieri’s primary sources in form of charts and tables. The visualized charts can be exported in various well-known formats like PDF and JPEG, but the data can be also exported in CSV format, to lend them to further analyses. DanteSources makes information about Dante’s primary sources available in digital format for the first time. Having the information about primary sources dispersed on paper books makes it difficult to
systematically overview how the cultural background of Dante evolved in time. On the other hand, the automatic visualization of data allows understanding the development of Dante’s cultural background in comparison with the different phases of his biography.
Advocating Open Access: Before, during and after HEFCENick Sheppard
Since “self-archiving” of research outputs was first mooted in the mid-1990s, initiatives towards “green” Open Access (OA) across the sector have met with generally limited success and coverage in institutional and subject repositories is generally cited at around 20-30%. However, since the Finch report in 2012 combined with OA policies from RCUK, also in 2012, and HEFCE the following year, there is little doubt that a tipping point of awareness has been reached. This session will aim to contextualise the HEFCE policy in the broader history of Open Access and present a case study of a non-research intensive University and how the repository manager has sought to liaise with academic support services in order to facilitate knowledge exchange across the University. - See more at: http://www.cilip.org.uk/events/open-access-advocacy#sthash.9YqReHt0.dpuf
Blowing Up the Book: Design Challenges for Online Cultural Heritage InformationDesign for Context
Lesley Humphreys and Neal Johnson
Presentation at User Focus, the UXPA DC Chapter conference, Washington, D.C. – October 17, 2014
In 2009, The Getty Foundation issued a challenge to eight art museums: translate a scholarly catalog rich with cultural heritage data, images, and interpretation into an online resource that strives to reach beyond the limitations of print in creative and useful ways. "Systematic catalogs" are a time-honored and resource-intensive method of publishing the core art historical data and art research associated with the cultural objects that these institutions hold in public trust. And so the Online Scholarly Catalog Initiative (OSCI) was begun with the goal of "blowing up the book"... not to eliminate or destroy, but rather to expand the utility and audience of these important resources.
In the spring of 2014, the National Gallery of Art in Washington, DC launched their OSCI publication of its Dutch Paintings of the 17th Century systematic catalog. Through thoughtful and elegant design, the Gallery translated the printed catalog into a sustainable digital publication that extends the audience, usefulness, and sustainability of the information contained within. This talk presents a case study on incorporating academic research information from a traditional printed publication into the public web site for the National Gallery of Art. It also offers lessons that can be applied to other information sites.
Ready to "level up" your digital humanities (DH) game? DH offers theological librarians new opportunities to collaborate with their communities. Drawing on our experience with a graduate seminar in DH at Vanderbilt Divinity School, we discuss how to equip librarians to foster digital scholarship in areas such as digital textual editions, geospatial apps, open access publishing, and network analyses. Discover how DH transforms faculty and librarian relations from a service model to a partnership model.
A walk through of the Linked Art data model, API and community processes. Presented originally at the Rijksmuseum for the 5th Linked Art face to face meeting. Linked Art is a linked open usable data specification created by the community to describe artwork, museum objects, and related bibliographic and archival content.
LUX - Cross Collections Cultural Heritage at YaleRobert Sanderson
A brief presentation based on the CNI talk for the Linked Data for Libraries Discovery affinity group about LUX, Linked Open Usable Data and our discovery processes based on graphs rather than documents.
More Related Content
Similar to Open Repositories 2014: Crowdsourced Transcription via IIIF
2014 Census of Open Access Repositories in Germany, Austria and SwitzerlandPaul Vierkant
The proposal presents the “2014 Census of Open Access Repositories in Germany, Austria and Switzerland” (2014 Census) conducted in the cause of a seminar at Berlin School of Library and Information Science (BSLIS) at Humboldt-Universität zu Berlin. It succeeds the 2012 Census of Open Access repositories in Germany. The 2014 Census gives insights into the development of Open Access repositories and current trends in repository design. By extending the research design it also indicates unexplored aspects of Open Access repositories. With the paper the authors will present the overall 2014 Census results exclusively.
Library collections and the emerging scholarly recordlisld
A high level review of collection trends followed by a summary of recent work on the evolving scholarly record.
Presented at the OCLC Research Library Partnership meeting at the University of Melbourne, 2 December 2015.
The Abnormal Hieratic Global Portal aims to:
- Bring together published texts, i.e. transcriptions, transliterations and translations
- Teaching the study of Abnormal Hieratic with papyri
- Discuss and annotate texts
- Create a name book and dictionary to help new papyri be deciphered
By Ben Companjen, 27th June 2019
Islandora Webinar: Highlighting CUHK Chinese Digital CollectionsErin Tripp
The webinar will feature a presentation and Q&A session with Jeff Liu, Digital Services Librarian and Louisa Lam, Head, Research Support and Digital Initiatives at the CUHK Library.
The CUHK Library has curated a collection of over five million digital objects in the past 20 years. It features Chinese literature, culture, arts, politics, society and religion. Until recently, the collection was stored in a broad range of different systems, complicating the discovery of these precious digital assets.
In 2015, librarians at CUHK embarked on a project to find a permanent, single platform for digital content. Objectives of the project included enhanced discoverability, multi-language support (Chinese, Japanese & Korean) and custom development capability to modify display and viewing features that would showcase Chinese literature in its true form.
Islandora met all the functional requirements and more, including support for digital humanities projects and access to a user-driven open source software community.
The CUHK library was also attracted to the vendor services and support available through discoverygarden. We provided advice, support and custom development assistance; contributing to the launch of the digital repository every step of the way.
The repository (http://repository.lib.cuhk.edu.hk) officially launched in February 2016, making the CUHK Library digital initiatives pioneers in Hong Kong.
DanteSources is a focused digital library endowed with web services that allow visualizing information on Dante Alighieri’s primary sources in form of charts and tables. The visualized charts can be exported in various well-known formats like PDF and JPEG, but the data can be also exported in CSV format, to lend them to further analyses. DanteSources makes information about Dante’s primary sources available in digital format for the first time. Having the information about primary sources dispersed on paper books makes it difficult to
systematically overview how the cultural background of Dante evolved in time. On the other hand, the automatic visualization of data allows understanding the development of Dante’s cultural background in comparison with the different phases of his biography.
Advocating Open Access: Before, during and after HEFCENick Sheppard
Since “self-archiving” of research outputs was first mooted in the mid-1990s, initiatives towards “green” Open Access (OA) across the sector have met with generally limited success and coverage in institutional and subject repositories is generally cited at around 20-30%. However, since the Finch report in 2012 combined with OA policies from RCUK, also in 2012, and HEFCE the following year, there is little doubt that a tipping point of awareness has been reached. This session will aim to contextualise the HEFCE policy in the broader history of Open Access and present a case study of a non-research intensive University and how the repository manager has sought to liaise with academic support services in order to facilitate knowledge exchange across the University. - See more at: http://www.cilip.org.uk/events/open-access-advocacy#sthash.9YqReHt0.dpuf
Blowing Up the Book: Design Challenges for Online Cultural Heritage InformationDesign for Context
Lesley Humphreys and Neal Johnson
Presentation at User Focus, the UXPA DC Chapter conference, Washington, D.C. – October 17, 2014
In 2009, The Getty Foundation issued a challenge to eight art museums: translate a scholarly catalog rich with cultural heritage data, images, and interpretation into an online resource that strives to reach beyond the limitations of print in creative and useful ways. "Systematic catalogs" are a time-honored and resource-intensive method of publishing the core art historical data and art research associated with the cultural objects that these institutions hold in public trust. And so the Online Scholarly Catalog Initiative (OSCI) was begun with the goal of "blowing up the book"... not to eliminate or destroy, but rather to expand the utility and audience of these important resources.
In the spring of 2014, the National Gallery of Art in Washington, DC launched their OSCI publication of its Dutch Paintings of the 17th Century systematic catalog. Through thoughtful and elegant design, the Gallery translated the printed catalog into a sustainable digital publication that extends the audience, usefulness, and sustainability of the information contained within. This talk presents a case study on incorporating academic research information from a traditional printed publication into the public web site for the National Gallery of Art. It also offers lessons that can be applied to other information sites.
Ready to "level up" your digital humanities (DH) game? DH offers theological librarians new opportunities to collaborate with their communities. Drawing on our experience with a graduate seminar in DH at Vanderbilt Divinity School, we discuss how to equip librarians to foster digital scholarship in areas such as digital textual editions, geospatial apps, open access publishing, and network analyses. Discover how DH transforms faculty and librarian relations from a service model to a partnership model.
Similar to Open Repositories 2014: Crowdsourced Transcription via IIIF (20)
A walk through of the Linked Art data model, API and community processes. Presented originally at the Rijksmuseum for the 5th Linked Art face to face meeting. Linked Art is a linked open usable data specification created by the community to describe artwork, museum objects, and related bibliographic and archival content.
LUX - Cross Collections Cultural Heritage at YaleRobert Sanderson
A brief presentation based on the CNI talk for the Linked Data for Libraries Discovery affinity group about LUX, Linked Open Usable Data and our discovery processes based on graphs rather than documents.
An introduction to Linked Open Usable Data (LOUD) through the lens of a zooming paradigm, and thoughts on how such a paradigm can help to address some grand challenges of LOUD, including search granularity, trust and reconciliation. Presented to the IDLab / Knowledge at Web Scale department of the University of Ghent in Feb '23
Data is our Product: Thoughts on LOD SustainabilityRobert Sanderson
Invited keynote presentation for the LINCS Project, June 23rd 2022 at the University of Guelph, Canada. It describes thoughts on a framework for sustainability of linked open usable data products in the cultural heritage domain.
A Perspective on Wikidata: Ecosystems, Trust, and UsabilityRobert Sanderson
Brief and skeptical presentation about wikidata and its potential for use and abuse in the cultural heritage data ecosystem, presented at the PCC/LDAC forum on wikidata, November 12th, 2021.
Linked Art: Sustainable Cultural Knowledge through Linked Open Usable DataRobert Sanderson
An introduction to Linked Art - why we need it, what it is, and how it works. A great starting point if you're interested in linked open usable data in cultural heritage, especially art museums.
Illusions of Grandeur: Trust and Belief in Cultural Heritage Linked Open DataRobert Sanderson
What is the notion of trust, when it comes to publishing linked open data in the cultural heritage sector? This presentation discusses some aspects with relation to three primary questions: How do we trust what was said, trust that the institution said it, and trust what it means?
Invited seminar for UIUC's IS 575 class on metadata in theory and practice, about structural metadata practice in RDF/LOD. Touches on OAI-ORE, PCDM, Annotation, IIIF and Linked Art. Challenges explored are graph boundaries, APIs and context specific metadata.
Sanderson CNI 2020 Keynote - Cultural Heritage Research Data EcosystemRobert Sanderson
There have been, and continue to be, many initiatives to address the social, technological, financial and policy-based challenges that throw up roadblocks towards achieving this vision. However, it is hard to tell whether we are making progress, or whether we are eternally waiting for the hyperloop that will never come. If we are to ever be able to answer research questions that require a broad, international corpus of cultural data, then we need an ecosystem that can be characterized with 5 “C”s: Collaborative, Consistent, Connected, Correct and Contextualized. Each of these has implications for the sustainability, innovation, usability, timeliness and ethical considerations that must be addressed in a coherent and holistic manner. As with autonomous vehicles, technology (and perhaps even machine “intelligence”) is a necessary but insufficient component.
In this presentation, I will frame and motivate this grand challenge and propose where we can build connections between the academy, the cultural heritage sector, and industry. The discussion will explore the issues, and highlight some of the successful endeavors and more approachable opportunities where, together, progress can be made.
Tiers of Abstraction and Audience in Cultural Heritage Data ModelingRobert Sanderson
A walk through of a framework based around the distinctions between Abstraction, Implementation and Audience for considering the value and utility of data modeling patterns and paradigms in cultural heritage information systems. In particular, a focus on CIDOC-CRM, BibFrame, RiC-CM/RiC-O, EDM, and IIIF, with the intent to demonstrate best practices and anti-patterns in modeling.
Presentation about usability of linked data, following LODLAM 2020 at the Getty. Discusses JSON-LD 1.1, IIIF, Linked Art, in the context of the design principles for building usable APIs on top of semantically accurate models, and domain specific vocabularies.
In particular a focus on the different abstraction layers between conceptual model, ontology, vocabulary, and application profile and the various uses of the data.
Standards and Communities: Connected People, Consistent Data, Usable Applicat...Robert Sanderson
Keynote presentation at JCDL 2019 at UIUC, on the interaction between standards (development and usage) and communities. Looking at Linked Open Data, digital library protocols, and evaluation of standards practices.
Euromed2018 Keynote: Usability over Completeness, Community over CommitteeRobert Sanderson
Discussion of cultural heritage issues around usability and prioritization with completeness, and focus on bringing together communities rather than small and transient committees. Focus on Linked Open Usable Data, Annotations, JSON-LD, IIIF and Linked.Art.
Background for linked open data at the J Paul Getty Trust, followed by a summary of Linked Open Usable Data, and an initial walkthrough of the https://linked.art/ model.
Linked Open Data is great for recommendations about publishing data, but we need five more stars for the consumer -- How can it be both complete and usable? Design principles for Linked Open Usable Data.
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfPeter Spielvogel
Building better applications for business users with SAP Fiori.
• What is SAP Fiori and why it matters to you
• How a better user experience drives measurable business benefits
• How to get started with SAP Fiori today
• How SAP Fiori elements accelerates application development
• How SAP Build Code includes SAP Fiori tools and other generative artificial intelligence capabilities
• How SAP Fiori paves the way for using AI in SAP apps
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...UiPathCommunity
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
Elevating Tactical DDD Patterns Through Object CalisthenicsDorra BARTAGUIZ
After immersing yourself in the blue book and its red counterpart, attending DDD-focused conferences, and applying tactical patterns, you're left with a crucial question: How do I ensure my design is effective? Tactical patterns within Domain-Driven Design (DDD) serve as guiding principles for creating clear and manageable domain models. However, achieving success with these patterns requires additional guidance. Interestingly, we've observed that a set of constraints initially designed for training purposes remarkably aligns with effective pattern implementation, offering a more ‘mechanical’ approach. Let's explore together how Object Calisthenics can elevate the design of your tactical DDD patterns, offering concrete help for those venturing into DDD for the first time!
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionAggregage
Join Maher Hanafi, VP of Engineering at Betterworks, in this new session where he'll share a practical framework to transform Gen AI prototypes into impactful products! He'll delve into the complexities of data collection and management, model selection and optimization, and ensuring security, scalability, and responsible use.
The Metaverse and AI: how can decision-makers harness the Metaverse for their...Jen Stirrup
The Metaverse is popularized in science fiction, and now it is becoming closer to being a part of our daily lives through the use of social media and shopping companies. How can businesses survive in a world where Artificial Intelligence is becoming the present as well as the future of technology, and how does the Metaverse fit into business strategy when futurist ideas are developing into reality at accelerated rates? How do we do this when our data isn't up to scratch? How can we move towards success with our data so we are set up for the Metaverse when it arrives?
How can you help your company evolve, adapt, and succeed using Artificial Intelligence and the Metaverse to stay ahead of the competition? What are the potential issues, complications, and benefits that these technologies could bring to us and our organizations? In this session, Jen Stirrup will explain how to start thinking about these technologies as an organisation.
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofsAlex Pruden
This paper presents Reef, a system for generating publicly verifiable succinct non-interactive zero-knowledge proofs that a committed document matches or does not match a regular expression. We describe applications such as proving the strength of passwords, the provenance of email despite redactions, the validity of oblivious DNS queries, and the existence of mutations in DNA. Reef supports the Perl Compatible Regular Expression syntax, including wildcards, alternation, ranges, capture groups, Kleene star, negations, and lookarounds. Reef introduces a new type of automata, Skipping Alternating Finite Automata (SAFA), that skips irrelevant parts of a document when producing proofs without undermining soundness, and instantiates SAFA with a lookup argument. Our experimental evaluation confirms that Reef can generate proofs for documents with 32M characters; the proofs are small and cheap to verify (under a second).
Paper: https://eprint.iacr.org/2023/1886
Pushing the limits of ePRTC: 100ns holdover for 100 daysAdtran
At WSTS 2024, Alon Stern explored the topic of parametric holdover and explained how recent research findings can be implemented in real-world PNT networks to achieve 100 nanoseconds of accuracy for up to 100 days.
PHP Frameworks: I want to break free (IPC Berlin 2024)Ralf Eggert
In this presentation, we examine the challenges and limitations of relying too heavily on PHP frameworks in web development. We discuss the history of PHP and its frameworks to understand how this dependence has evolved. The focus will be on providing concrete tips and strategies to reduce reliance on these frameworks, based on real-world examples and practical considerations. The goal is to equip developers with the skills and knowledge to create more flexible and future-proof web applications. We'll explore the importance of maintaining autonomy in a rapidly changing tech landscape and how to make informed decisions in PHP development.
This talk is aimed at encouraging a more independent approach to using PHP frameworks, moving towards a more flexible and future-proof approach to PHP development.
Removing Uninteresting Bytes in Software FuzzingAftab Hussain
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux libraries -- Libxml's xmllint, a tool for parsing xml documents, and Binutil's readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format). Our preliminary results show that AFL+DIAR does not only discover new paths more quickly but also achieves higher coverage overall. This work thus showcases how starting with lean and optimized seeds can lead to faster, more comprehensive fuzzing campaigns -- and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Accelerate your Kubernetes clusters with Varnish CachingThijs Feryn
A presentation about the usage and availability of Varnish on Kubernetes. This talk explores the capabilities of Varnish caching and shows how to use the Varnish Helm chart to deploy it to Kubernetes.
This presentation was delivered at K8SUG Singapore. See https://feryn.eu/presentations/accelerate-your-kubernetes-clusters-with-varnish-caching-k8sug-singapore-28-2024 for more details.
Climate Impact of Software Testing at Nordic Testing DaysKari Kakkonen
My slides at Nordic Testing Days 6.6.2024
Climate impact / sustainability of software testing discussed on the talk. ICT and testing must carry their part of global responsibility to help with the climat warming. We can minimize the carbon footprint but we can also have a carbon handprint, a positive impact on the climate. Quality characteristics can be added with sustainability, and then measured continuously. Test environments can be used less, and in smaller scale and on demand. Test techniques can be used in optimizing or minimizing number of tests. Test automation can be used to speed up testing.
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
Leading Change strategies and insights for effective change management pdf 1.pdf
Open Repositories 2014: Crowdsourced Transcription via IIIF
1. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
1
Distributed Repositories of Medieval Calendars
and Crowd-Sourcing of Transcription
Rob Sanderson
azaroth42@gmail.com
azaroth@stanford.edu
t: @azaroth42
Stanford University
Ben Albritton, Stanford University
Doug Emery, University of Pennsylvania
Will Noel, University of Pennsylvania
Dot Porter, University of Pennsylvania
http://iiif.io/
This research was primarily funded by the Andrew W. Mellon Foundation
2. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
2
Image Repositories
• Increase in digitization
• Particularly precious,
fragile, beautiful objects
• Medieval Manuscripts
• Digitized images online
• Increasingly Open
• At high resolution
• Easy to capture an image
• Very hard to capture the text
http://gallica.bnf.fr/ark:/12148/btv1b8449691v/
3. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
3
Calendars
• Ubiquitous in liturgical books
• e.g. Books of Hours
• Structured and often tabular:
Date, Day, Saint / Event
• Content varies slightly
• Variation details give us
information about the
provenance of the object
• Much easier to transcribe
• Good pilot project!
http://www.e-codices.unifr.ch/en/bge/lat0033
4. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
4
Collaborative Crowd Sourcing?
• Meeting at U. Penn including
content providers and
scholars
• Plan:
• Collect transcriptions
together
• Analyze similarities
between manuscripts for
patterns of provenance
• Manuscripts and images
distributed: need a community
to collect sufficient data
http://brbl-dl.library.yale.edu/vufind/Record/3446275
5. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
5
Micro Repository Rant: TEI
• Most transcribing done in TEI
• Terrible for this use case:
• Single XML file
• Single author
• Single location
• Hard to link to images
• Tries to describe too much
• Impossible to use once
created
• Creating TEI is good for:
http://www.thedigitalwalters.org/Data/WaltersManuscripts/html/W41/
6. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
6
Micro Repository Rant: TEI
• Most transcribing done in TEI
• Terrible for this use case:
• Single XML file
• Single author
• Single location
• Hard to link to images
• Tries to describe too much
• Impossible to use once
created
• Creating TEI is good for:
• The academic exercise of
creating TEI
http://www.thedigitalwalters.org/Data/WaltersManuscripts/html/W41/
7. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
7
Requirements
• Distributed image content
• Consistent, rich API
• Selection of regions
• Base, not displayed size
• Alignment of text with region
• Distributed creation
• Distributed curation
• Multiple texts per region
• Styling of the text
• Some semantics
http://oculus-dev.lib.harvard.edu/manifests/view/drs:5981093
8. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
8
1. Images: BNF next to Yale
9. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
9
Open Technology: IIIF Image API
Base URL: {scheme}://{host}{/prefix}/{identifier}!
Image Resource:
{base}/{region}/{size}/{rotation}/{quality}.{format}!
!
http://iiif.io/api/image/1.1/
10. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
10
(Part of the) IIIF Community
• ARTstor
• Bibliothèque Nationale de
France
• Bodleian Libraries, Oxford
University
• British Library
• C2MRF
• Cambridge University
• Cornell University
• DPLA
• Europeana
• e-codices
• Harvard University
• Johns Hopkins University
• National Library of Denmark
• National Library of Poland
• National Library of New Zealand
• National Library of Norway
• National Library of Wales
• Princeton University
• Stanford University
• Wellcome Trust
• UK National Archives
• Yale University
11. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
11
2. Crowdsourced Box Drawing
12. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
12
2. Crowdsourced Box Drawing
13. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
13
2. Crowdsourced Box Drawing
14. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
14
2. Crowdsourced Box Drawing
15. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
15
2. Crowdsourced Box Drawing
16. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
16
2. Crowdsourced Box Drawing
17. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
17
2. Crowdsourced Box Drawing
18. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
18
2. Crowdsourced Box Drawing
19. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
19
2. Crowdsourced Box Drawing
20. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
20
Open Technologies
• Mirador
• IIIF Community developed viewer
• Stanford, Harvard, Yale, [LANL]
• Zooming via Open SeaDragon
• Princeton, and OSD committers
• JCrop
• JQuery plugin for drawing little boxes
• MongoDB
• Store information via REST interface
• W3C Media Fragment image segments
• Trivially converted to IIIF Image API requests
21. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
21
Open Technologies
22. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
22
Open Technologies
23. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
23
Open Technologies
24. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
24
Open Technologies
25. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
25
Open Technologies
26. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
26
Open Technologies
27. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
27
Open Technology
• Line/Column inspiration from TPEN (IIIF compliant)
• Transcription tool developed at St. Louis
• http://t-pen.org/TPEN/
• Line detection flakey, no internal columns
28. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
28
Open Technologies
• Inspiration from TPEN (IIIF compliant)
• Transcription tool developed at St. Louis
• http://t-pen.org/TPEN/
• Line detection flakey, no internal columns
29. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
29
Open Technologies
• Inspiration from TPEN (IIIF compliant)
• Transcription tool developed at St. Louis
• http://t-pen.org/TPEN/
• Line detection flakey, no internal columns
30. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
30
Boring (but Open) Metadata
• Metadata collection to drive the analysis
• Stored along with the segments
• Defaults are normally correct
• Custom extension, not intended for general purpose use
• Convenient to do inline
• Other metadata could be added
• Could be done in a different workflow
31. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
31
Metadata
32. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
32
Metadata
33. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
33
Metadata
34. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
34
Metadata
35. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
35
Metadata
36. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
36
...
37. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
37
Metadata
38. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
38
Open Technology: IIIF Presentation API
Text/Image Linking is a subset of a larger challenge:
• Non-Text / Image Linking
• Dynamic Images
• No Image to link to
• Multiple Images
• Parts of Images
• Parts of larger texts
• Distributed images, texts and links
Need an indirection layer:
• Solution: align text and image with an abstract Canvas
http://iiif.io/api/presentation/1.0/
39. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
39
Open Technology: IIIF Presentation API
40. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
40
Open Technology: IIIF Presentation API
http://iiif.io/api/presentation/1.0/
41. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
41
Open Technology: IIIF Presentation API
http://iiif.io/api/presentation/1.0/
42. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
42
Linked Data People...
If you do not want
to know the score,
look away now!
43. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
43
Linked Data People...
{ "it's" : "just JSON" }
44. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
44
Web Developers...
If you do not want
to know the score,
look away now!
45. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
45
Web Developers...
<_:it's>
<_:all>
<_:Linked_Data>;
46. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
46
Micro Repository Rant 2: RDF Serialization
“RDF/XML was the Semantic Web’s 3 Mile Island incident”
-- Manu Sporny, http://manu.sporny.org/2012/nuclear-rdf/
Or … RDF – Not in my back yard!
• Serializing a graph is, admittedly, hard
• RDF/XML is terrible, and too many others
• Web currently uses JSON as convenient transfer syntax
• JSON-LD allows transfer of RDF in syntax that does not require full
RDF stack, just a JSON implementation
• … as available in every web browser
• Rob's Conclusion: Require JSON-LD
• http://json-ld.org/
47. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
47
JSON-LD Context Magic
{ // Canvas resource!
"@context":"http://iiif.io/api/presentation/2/context.json",!
!
!
@context provides mapping for JSON keys into RDF.
!
"sc":"http://www.shared-canvas.org/ns/",!
"oa":"http://www.w3.org/ns/oa#",!
"service":{!
"@type":"@id", !
"@id":"sioc_svcs:has_service"},!
"height":{!
"@type":"xsd:integer", !
"@id":"exif:height"},!
"sequences":{!
"@type":"@id",!
"@id":"sc:hasSequences",!
"@container":"@list"} !
48. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
48
Open Technologies: REST
• Experimental IIIF REST specification
• http://iiif.io/api/annex/rest/
• For both Presentation and Image
• Trivial Python/WSGI handler
• Processes @context and generates identities
• Stores in MongoDB (but API is agnostic)
• Follows IIIF Presentation and Open Annotation
• http://www.w3.org/community/openannotation/
• Returns the correct JSON-LD
• Doesn't fully handle image upload yet
49. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
49
The Future is Now
• IIIF Image API 2.0
• Request for Comment period open!
• http://iiif.io/api/image/2.0/
• IIIF Presentation API 2.0
• Ditto!
• http://iiif.io/api/presentation/2.0/
Please give us feedback: iiif-discuss@googlegroups.com
• Ongoing work with U.Penn to make a more robust system
50. Distributed Repositories and Crowd-Sourcing Transcription
Open Repositories 2014, Helsinki, Finland, 11th of June 2014
50
Thank You
Rob Sanderson
azaroth42@gmail.com
azaroth@stanford.edu
t: @azaroth42
Stanford University
http://iiif.io/
iiif-discuss@googlegroups.com