- The document discusses best practices for visualizing research findings using visualization tools and publicly available tools for creating and deploying humanities visualizations.
- It considers Edward Tufte's teachings on data visualization and provides a guided tutorial on using the Exhibit framework to create interactive web pages for presenting linked data visually.
- Examples are given of visualizing a dataset of Nobel Prize winners using Exhibit to create faceted browsing, searching, sorting, and different views including a table, timeline, and map.
The promise of Linked Data is not that it is another way of aggregating data. For too long have library data been trapped within data-silos only accessible through obscure protocols. Why is access to library data still an issue? Letting everyone access and link to library data lets anyone build the next killer app. LIBRIS, the Swedish Union Catalogue is, since the beginning of this year, available as Linked Data. We discuss how and why. -- Martin Malmsten & Anders Söderbäck, National Library of Sweden
Presented by Michele C. Weigle, June 4, 2015
Columbia University Web Archiving Collaboration: New Tools and Models
Work by Yasmin AlNoamany, Michele C. Weigle, and Michael L. Nelson
The promise of Linked Data is not that it is another way of aggregating data. For too long have library data been trapped within data-silos only accessible through obscure protocols. Why is access to library data still an issue? Letting everyone access and link to library data lets anyone build the next killer app. LIBRIS, the Swedish Union Catalogue is, since the beginning of this year, available as Linked Data. We discuss how and why. -- Martin Malmsten & Anders Söderbäck, National Library of Sweden
Presented by Michele C. Weigle, June 4, 2015
Columbia University Web Archiving Collaboration: New Tools and Models
Work by Yasmin AlNoamany, Michele C. Weigle, and Michael L. Nelson
"Scholarly Communication: Deconstruct and Decentralize" was presented at the Fall 2017 Meeting of the Coalition for Networked Information. It explores working towards a Scholarly Commons by applying decentralized web ideas to scholarly communication.
2015 FOSS4G Track: Spatiotemporal Interface Development: Using Web Technologi...GIS in the Rockies
Communicating patterns in large spatiotemporal datasets can be a challenging and complicated task. While a variety of tools are available to visualize these data on the desktop, web technologies offer a unique opportunity to create interfaces that allow users to interactively explore complex multivariate datasets. In this presentation we outline a process for building spatiotemporal visualizations on the web. From data processing through architecture and development we describe technologies and languages you should know to get off the ground and walk through other important considerations in developing intuitive data exploration interfaces.
Presentation for a workshop about persistent identifiers organized by the Royal Library of The Netherlands and DANS. Highlights the non-trivial commitments required of all parties involved in persistent identifier systems to actually keep links based on persistent identifiers ... err ... persistent.
Linked Data: turning the web into a context graphLeigh Dodds
A presentation I gave at Strataconf 2012. I reviewed the concepts of Linked Data and argued that while the approach has come from the semantic web community, there are interesting parallels with efforts from Facebook and Schema.org. Linked Data provides a way for us to create resolvable identifiers + discover useful data by just using the web infrastructure more effectively.
IFLA LIDASIG Open Session 2017: Introduction to Linked DataLars G. Svensson
At the IFLA Linked Data Special Interest Group open session in Wroclaw we briefly introduced the mission of the SIG and then went on to a brief introduction to what linked data is and why that topic is important to libraries.
The presentation was held jointly by Astrid Verheusen (general introduction to the SIG) and Lars G. Svensson (introduction to Linked Data)
Presentation about reference rot given at the Complexity Science Hub in Vienna, November 2021.
Links to web resources frequently break (link rot), and linked content can change at unpredictable rates (content drift). These dynamics of the Web are detrimental when references to web resources provide evidence or supporting information.
This presentation will report on research that assessed the extent of these problems for links to web resources in scholarly literature, by using three vast corpora of publications and a range of public web archives. It will also describe the Robust Link approach that offers a proactive, uniform, and machine-actionable way to combat link rot and content drift. Finally, it will introduce the Robustify web service and API that was devised to generate links that remain functional over time, paying special attention to challenges related to deploying infrastructure that is required to be long lasting.
Looks at hyperlinks from the perspective of a managed collection of resources for which link persistence/integrity is considered a quality of service concern. Distinguishes between links into other managed collections and to the web at large. Considers link rot and content drift.
Archive Assisted Archival Fixity Verification FrameworkSawood Alam
The number of public and private web archives has increased, and we implicitly trust content delivered by these archives. Fixity is checked to ensure an archived resource has remained unaltered since the time it was captured. Some web archives do not allow users to access fixity information and, more importantly, even if fixity information is available, it is provided by the same archive from which the archived resources are requested. In this research, we propose two approaches, namely Atomic and Block, to establish and check fixity of archived resources.
A Web-scale Study of the Adoption and Evolution of the schema.org Vocabulary ...Robert Meusel
Promoted by major search engines, schema.org has become a widely adopted standard for marking up structured data in HTML web pages. In this paper, we use a series of largescale Web crawls to analyze the evolution and adoption of schema.org over time. The availability of data from dierent points in time for both the schema and the websites deploying data allows for a new kind of empirical analysis of standards adoption, which has not been possible before. To conduct our analysis, we compare dierent versions of the schema.org vocabulary to the data that was deployed on hundreds of thousands of Web pages at dierent points in time. We measure both top-down adoption (i.e., the extent to which changes in the schema are adopted by data providers) as well as bottom-up evolution (i.e., the extent to which the actually deployed data drives changes in the schema). Our empirical analysis shows that both processes can be observed.
The Power of Sharing Linked Data - ELAG 2014 WorkshopRichard Wallis
Presentation to set the scene and stimulate discussion in the Workshop "The Power of Sharing Linked Data" at ELAG 2014 - Bath University, UK June 10/11 2014
This is a very basic workshop to introduce novice users to Omeka with an eye towards providing hands-on experience to decide whether it can serve their own research needs.
"Scholarly Communication: Deconstruct and Decentralize" was presented at the Fall 2017 Meeting of the Coalition for Networked Information. It explores working towards a Scholarly Commons by applying decentralized web ideas to scholarly communication.
2015 FOSS4G Track: Spatiotemporal Interface Development: Using Web Technologi...GIS in the Rockies
Communicating patterns in large spatiotemporal datasets can be a challenging and complicated task. While a variety of tools are available to visualize these data on the desktop, web technologies offer a unique opportunity to create interfaces that allow users to interactively explore complex multivariate datasets. In this presentation we outline a process for building spatiotemporal visualizations on the web. From data processing through architecture and development we describe technologies and languages you should know to get off the ground and walk through other important considerations in developing intuitive data exploration interfaces.
Presentation for a workshop about persistent identifiers organized by the Royal Library of The Netherlands and DANS. Highlights the non-trivial commitments required of all parties involved in persistent identifier systems to actually keep links based on persistent identifiers ... err ... persistent.
Linked Data: turning the web into a context graphLeigh Dodds
A presentation I gave at Strataconf 2012. I reviewed the concepts of Linked Data and argued that while the approach has come from the semantic web community, there are interesting parallels with efforts from Facebook and Schema.org. Linked Data provides a way for us to create resolvable identifiers + discover useful data by just using the web infrastructure more effectively.
IFLA LIDASIG Open Session 2017: Introduction to Linked DataLars G. Svensson
At the IFLA Linked Data Special Interest Group open session in Wroclaw we briefly introduced the mission of the SIG and then went on to a brief introduction to what linked data is and why that topic is important to libraries.
The presentation was held jointly by Astrid Verheusen (general introduction to the SIG) and Lars G. Svensson (introduction to Linked Data)
Presentation about reference rot given at the Complexity Science Hub in Vienna, November 2021.
Links to web resources frequently break (link rot), and linked content can change at unpredictable rates (content drift). These dynamics of the Web are detrimental when references to web resources provide evidence or supporting information.
This presentation will report on research that assessed the extent of these problems for links to web resources in scholarly literature, by using three vast corpora of publications and a range of public web archives. It will also describe the Robust Link approach that offers a proactive, uniform, and machine-actionable way to combat link rot and content drift. Finally, it will introduce the Robustify web service and API that was devised to generate links that remain functional over time, paying special attention to challenges related to deploying infrastructure that is required to be long lasting.
Looks at hyperlinks from the perspective of a managed collection of resources for which link persistence/integrity is considered a quality of service concern. Distinguishes between links into other managed collections and to the web at large. Considers link rot and content drift.
Archive Assisted Archival Fixity Verification FrameworkSawood Alam
The number of public and private web archives has increased, and we implicitly trust content delivered by these archives. Fixity is checked to ensure an archived resource has remained unaltered since the time it was captured. Some web archives do not allow users to access fixity information and, more importantly, even if fixity information is available, it is provided by the same archive from which the archived resources are requested. In this research, we propose two approaches, namely Atomic and Block, to establish and check fixity of archived resources.
A Web-scale Study of the Adoption and Evolution of the schema.org Vocabulary ...Robert Meusel
Promoted by major search engines, schema.org has become a widely adopted standard for marking up structured data in HTML web pages. In this paper, we use a series of largescale Web crawls to analyze the evolution and adoption of schema.org over time. The availability of data from dierent points in time for both the schema and the websites deploying data allows for a new kind of empirical analysis of standards adoption, which has not been possible before. To conduct our analysis, we compare dierent versions of the schema.org vocabulary to the data that was deployed on hundreds of thousands of Web pages at dierent points in time. We measure both top-down adoption (i.e., the extent to which changes in the schema are adopted by data providers) as well as bottom-up evolution (i.e., the extent to which the actually deployed data drives changes in the schema). Our empirical analysis shows that both processes can be observed.
The Power of Sharing Linked Data - ELAG 2014 WorkshopRichard Wallis
Presentation to set the scene and stimulate discussion in the Workshop "The Power of Sharing Linked Data" at ELAG 2014 - Bath University, UK June 10/11 2014
This is a very basic workshop to introduce novice users to Omeka with an eye towards providing hands-on experience to decide whether it can serve their own research needs.
In this talk we present the term polyglot persistence, give a brief introduction to the world of NoSQL database and point out the benefits and costs of polyglot persistence. Thereafter we present the idea of a multi-model database that reduces the costs for polyglot persistence but keeps its benefits. Next up we present ArangoDB as a Multi-Model database
This slide deck has been prepared for a workshop on Linked Data Publishing and Semantic Processing using the Redlink platform (http://redlink.co). The workshop delivered at the Department of Information Engineering, Computer Science and Mathematics at Università degli Studi dell'Aquila aimed at providing a general understanding of Semantic Web Technologies and how these can be used in real world use cases such as Salzburgerland Tourismus.
A brief introduction has been also included on MICO (Media in Context) a European Union part-funded research project to provide cross-media analysis solutions for online multimedia producers.
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data21Style
MuseoTorino, is the first italian project using Web 3.0 tecnologies. NOSQL-GraphDB (Neo4J), RDFa, Linked Open Data.
MuseoTorino is a 21style (www.21-style.com) project for the municipality of Torino, Italy.
These slides come from CodeMotion, the best Italian conference for developers and IT entusiast !
JSONpedia - Facilitating consumption of MediaWiki contentMichele Mostarda
JSONpedia is a Java library and a REST service meant to access content of MediaWiki pages as JSON. Mainly designed to produce Machine Learning corpuses, it allows to perform advanced queries over Wikipedia dumps using Elasticsearch indexing and faceting capabilities and MongoDB map/reduce support.
NISO Virtual Conference: BIBFRAME & Real World Applications of Linked Bibliographic Data
http://www.niso.org/news/events/2016/virtual_conference/jun15_virtualconf/
June 15, 2016
Opening Keynote: Landscape and Current Status of BIBFRAME and Related Initiatives
Philly Code Camp 2013 Mark Kromer Big Data with SQL ServerMark Kromer
These are my slides from May 2013 Philly Code Camp at Penn State Abington. I will post the samples, code and scripts on my blog here following the event this Saturday: http://www.kromerbigdata.com
Digital Tools, Trends and Methodologies in the Humanities and Social SciencesShawn Day
This interactive seminar will explore trends and initiatives in the digital community of practice in the humanities and the social sciences. Participants will come away with a appreciation of from where the field has emerged and how it interacts with traditional disciplines. This seminar will be of interest to those in traditional disciplines as well as the wider academy as digital humanities is both collaborative and multidisciplinary in practise. It is intended to form a broad and easy introduction to the practise of digital humanities and will appeal especially to new scholar who is open to the potential to combine their traditional scholarship with digital tools and methodologies. It is *introductory* in nature.
Requirements Engineering for the HumanitiesShawn Day
This workshop explores how requirements engineering can be employed by digital and non-digital humanities scholars (and others) to conceptualise and communicate a research project.
requirementsEngineeringAs the field of digital humanities has evolved, one of the biggest challenges has been getting the marrying technical expertise with humanities scholarly practice to successfully deliver sustainable and sound digital projects. At its core this is a communications exercise. However, to communicate effectively demands an ability to effectively translate, define and find clarity in your own mind.
Google Tools for Digital Humanities ScholarsShawn Day
In this seminar we have introduced many lesser known, but potentially even more useful tools to scholars such as the particularly powerful Google Fusion Tables and Google Trends to the simple but powerful Google Keep among others. This just scrapes the surface with a series of tools that evolve everyday and with new tools emerging and other fading away after contributing to our scholarly imagination.
Mapping your data can help to provide new insights on your research findings. However, many scholars are put off by the steep learning curve demanded by Geographic Information Systems (GIS) such as ArcGIS from ESRI. New and simple tools have become available that offer sophisticated output without extensive training. In fact, tools such as Google Maps, Google Earth, Open Street Map among others can offer immediate returns in a matter of hours where tasks in the past required, weeks, months and even years of training.
A brief introduction to Metadata, it’s value and how it can be leveraged in Omeka as a digital narrative tool; and to evaluate what digital narrative tools - such as Omeka or others - may be of use in sharing your research – and telling your story.
Dr. Sean Tan, Head of Data Science, Changi Airport Group
Discover how Changi Airport Group (CAG) leverages graph technologies and generative AI to revolutionize their search capabilities. This session delves into the unique search needs of CAG’s diverse passengers and customers, showcasing how graph data structures enhance the accuracy and relevance of AI-generated search results, mitigating the risk of “hallucinations” and improving the overall customer journey.
Maruthi Prithivirajan, Head of ASEAN & IN Solution Architecture, Neo4j
Get an inside look at the latest Neo4j innovations that enable relationship-driven intelligence at scale. Learn more about the newest cloud integrations and product enhancements that make Neo4j an essential choice for developers building apps with interconnected data and generative AI.
Pushing the limits of ePRTC: 100ns holdover for 100 daysAdtran
At WSTS 2024, Alon Stern explored the topic of parametric holdover and explained how recent research findings can be implemented in real-world PNT networks to achieve 100 nanoseconds of accuracy for up to 100 days.
UiPath Test Automation using UiPath Test Suite series, part 5DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 5. In this session, we will cover CI/CD with devops.
Topics covered:
CI/CD with in UiPath
End-to-end overview of CI/CD pipeline with Azure devops
Speaker:
Lyndsey Byblow, Test Suite Sales Engineer @ UiPath, Inc.
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!SOFTTECHHUB
As the digital landscape continually evolves, operating systems play a critical role in shaping user experiences and productivity. The launch of Nitrux Linux 3.5.0 marks a significant milestone, offering a robust alternative to traditional systems such as Windows 11. This article delves into the essence of Nitrux Linux 3.5.0, exploring its unique features, advantages, and how it stands as a compelling choice for both casual users and tech enthusiasts.
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
Sudheer Mechineni, Head of Application Frameworks, Standard Chartered Bank
Discover how Standard Chartered Bank harnessed the power of Neo4j to transform complex data access challenges into a dynamic, scalable graph database solution. This keynote will cover their journey from initial adoption to deploying a fully automated, enterprise-grade causal cluster, highlighting key strategies for modelling organisational changes and ensuring robust disaster recovery. Learn how these innovations have not only enhanced Standard Chartered Bank’s data infrastructure but also positioned them as pioneers in the banking sector’s adoption of graph technology.
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024Neo4j
Neha Bajwa, Vice President of Product Marketing, Neo4j
Join us as we explore breakthrough innovations enabled by interconnected data and AI. Discover firsthand how organizations use relationships in data to uncover contextual insights and solve our most pressing challenges – from optimizing supply chains, detecting fraud, and improving customer experiences to accelerating drug discoveries.
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIVladimir Iglovikov, Ph.D.
Presented by Vladimir Iglovikov:
- https://www.linkedin.com/in/iglovikov/
- https://x.com/viglovikov
- https://www.instagram.com/ternaus/
This presentation delves into the journey of Albumentations.ai, a highly successful open-source library for data augmentation.
Created out of a necessity for superior performance in Kaggle competitions, Albumentations has grown to become a widely used tool among data scientists and machine learning practitioners.
This case study covers various aspects, including:
People: The contributors and community that have supported Albumentations.
Metrics: The success indicators such as downloads, daily active users, GitHub stars, and financial contributions.
Challenges: The hurdles in monetizing open-source projects and measuring user engagement.
Development Practices: Best practices for creating, maintaining, and scaling open-source libraries, including code hygiene, CI/CD, and fast iteration.
Community Building: Strategies for making adoption easy, iterating quickly, and fostering a vibrant, engaged community.
Marketing: Both online and offline marketing tactics, focusing on real, impactful interactions and collaborations.
Mental Health: Maintaining balance and not feeling pressured by user demands.
Key insights include the importance of automation, making the adoption process seamless, and leveraging offline interactions for marketing. The presentation also emphasizes the need for continuous small improvements and building a friendly, inclusive community that contributes to the project's growth.
Vladimir Iglovikov brings his extensive experience as a Kaggle Grandmaster, ex-Staff ML Engineer at Lyft, sharing valuable lessons and practical advice for anyone looking to enhance the adoption of their open-source projects.
Explore more about Albumentations and join the community at:
GitHub: https://github.com/albumentations-team/albumentations
Website: https://albumentations.ai/
LinkedIn: https://www.linkedin.com/company/100504475
Twitter: https://x.com/albumentations
Communications Mining Series - Zero to Hero - Session 1DianaGray10
This session provides introduction to UiPath Communication Mining, importance and platform overview. You will acquire a good understand of the phases in Communication Mining as we go over the platform with you. Topics covered:
• Communication Mining Overview
• Why is it important?
• How can it help today’s business and the benefits
• Phases in Communication Mining
• Demo on Platform overview
• Q/A
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionAggregage
Join Maher Hanafi, VP of Engineering at Betterworks, in this new session where he'll share a practical framework to transform Gen AI prototypes into impactful products! He'll delve into the complexities of data collection and management, model selection and optimization, and ensuring security, scalability, and responsible use.
2. Objectives
‣ Consider best practices in sharing research findings
using visualisation tools;
‣ Identify and judge between publicly available tools to
create and deploy humanities visualisation research
products;
‣ Consider data visualisation as part of a larger research
discussion.
4. "Digitization makes the most traditional forms of humanistic
scholarship more necessary, not less.
But the differences mean that we need to reinvent, not
reaffirm, the way we engage with the humanities."
5. So, why do we turn to visualisation for
presentation?
‣ Open Up Large Datasets
‣ Increase density of observable data
‣ Reduce Complexity
‣ Aestheticise Data
‣ Illustrate an Interpretation
‣ Make an Argument
6. Who is Edward Tufte andWhat Does He Teach Us?
‣ Show the Data
‣ Provoke Thought about the Subject at
Hand
‣ Avoid Distorting the Data
‣ Present Many Numbers in a Small
Space
‣ Make Large Datasets Coherent
‣ Encourage Eyes to Compare Data
‣ Reveal Data at Several Levels of Detail
‣ Serve a Reasonably Clear Purpose
‣ Be Closely Integrated with Statistical /Verbal Descriptions of the
Dataset
10. Background
‣ To understand with a quick and dirty tour whether
Exhibit might be of use in your research programmes
!
‣ Exhibit was developed at MIT to provide a lightweight
framework for the presentation, searching and faceted
browsing of digital collections.
‣ Exhibit lets you easily create web pages with advanced
text search and filtering functionalities, with interactive
maps, timelines, and other visualisations
14. Who is SIMILE?
‣ Semantic Interoperability of Metadata and Information in
unLike Environments
‣ MIT Project - 2003-2011
‣ MIT Library,W3C, Mellon-Funded
‣ A collection of tools to enhance inter-operatibility
between digital objects
‣ Led to the SIMILEWidget Community
15. Others
‣ Appalachian (LogIns)
‣ Fresnel (RDF Library)
‣ HTTPTracer (Traffic Sniffer)
‣ jsTEX (TEX Render for Firefox)
‣ Zotz (publish from Zotero to Exhibit)
‣ Potluck (mixes Data from multiple Exhibits)
17. Citeline
‣ Exhibit custom tweaked between
CSail and MIT Libraries
‣ Dedicated to rendering interactive bibliographies
‣ Wizard to take from bibtex to a full Exhibit
‣ Connectivity to Zotero (http://zotero.org)
18. Gadget
‣ An XML Inspector
‣ Display XML context
in a graphical and
browsable manner
‣ Open Source
‣ Free
‣ Approachable
19. Longwell
‣ An RDF Data Browser
‣ Customisable
‣ Graphical
‣ Comes from the Flamenco project that pioneered the
concept of faceted browsing:
"allow users to move through large information spaces in a flexible
manner without feeling lost"
"use of metadata is integrated with free-text search, allowing the user
to follow links, then add search terms, then follow more links, without
interrupting the interaction flow"
20. Piggybank
‣ Collect information from theWeb
‣ Save information for future use
‣ Tag information with keywords
‣ Search & Browse collected information
‣ Retrieve saved information
‣ Share information you have collected
‣ Install screen scrapers - with SIMILE Solvent
!
‣ Similar to Evernote
21. RDFizers
‣ Tools to convert to RDF
‣ RDF for interoperability - Linked Open Data
‣ Context + Content
‣ JPEG -> RDF
‣ MARC/MODS -> RDF
‣ OAI-PMH -> RDF
‣ EMail -> RDF
‣ BibTEX -> RDF
‣ RAW -> RDF
‣ Flat -> RDF
25. Semantic Bank
‣ Ties Longwell <——> Piggybank
‣ Create a Library of Linked Contextual
Information for Use in Collaborative
Environments
‣ Publish feed as RDF
30. Why Exhibit?
‣ Simple
‣ Javascipt - Approachable - Example Based
‣ Modular
‣ Standards Based
‣ Doesn’t Require Server Technology
‣ Browser Based
‣ Allows focus on content not on the technology
31. Why
‣ Free, no cost
‣ Easy to use
‣ No programming skills required
‣ Open source platform
‣ Get involved, share your expertise, write code or add a demo
‣ Scalable - Staged mode scales to hundreds of thousands of items
‣ Lightweight publishing framework for building interactive web pages of
linked data
‣ Supports search (Scripted mode), faceted navigation, interactive displays
‣ Easy to reconfigure and extend
‣ Supports customised data display
32. Setting the Stage
‣ What DoYou need to Make the Magic Happen?
‣ A Text Editor - NotePad or TextWrangler
‣ AWeb Browser - Firefox?
‣ A Data Manipulation Tool - Excel, GoogleDocs?
!
‣ A Dataset
‣ An Open Mind
‣ A Few Hours
‣ Willingness to Play
33. Preparing your data for use in Exhibit
‣ Input Formats
‣ Exhibit JSON
‣ Google Spreadsheet
‣ Generic JSONP
‣ From Babel
‣ BibTex
‣ Excel
‣ Exhibit Page
‣ JPEG
‣ RDF/XML
‣ Tab-SeparatedValues
‣ Output Formats
‣ Exhibit JSON
‣ RDF/XML
‣ Semantic MediaWiki
‣ Tab-SeparatedValues
‣ BibTex
34. StructuringYour Data
‣ Rows and Columns
‣ A Row is an object in the collection
‣ A Column is a piece of metadata
!
‣ The Header is the First Row
‣ Let’s See an example
37. Data Files
‣ An Array of Items
‣ Each Item a record
‣ Each items has properties
‣ Each property has a value
‣ Propeties surrounded by "" quotes
!
‣ Each Item muct have two properties:
‣ Label
‣ Type
41. Exhibit in a Nutshell
Data
json file
Description
html file
Browsable/
Searchable/Visual
Website
'the Exhibit'
42. What Exhibit Does (Programatically)
‣ A web page is loaded
‣ The web page pulls in more code (the Exhibit framework)
‣ A lightweight database is created (within the browser)
‣ The Exhibit Object is created
‣ It extracts from the HTML the user interface
‣ It loads the data into memory
‣ It ten populates the database
‣ It waits for user interaction
49. A Sidenote on Interchange
‣ That Little Orange Button
!
‣ A Lot of Power
‣ Regardless of how you provide
data —> Exhibit will export in
a variety of forms
58. To Take Stock
‣ We have taken a datafile and created a website that
displays that data;
‣ We have added means for the user to search, sort and
filter the data;
‣ We have added a new view to that website so that a user
can choose different means to view the data;
‣ We have started to style the textual presentation.
!
‣ Let’s take a quick look at our data before we go further
68. Geospatial Considerations
‣ Can choose from Google versus OpenStreetMap
‣ Multiple Location for each item
‣ Getting the long lat data you need - geocoding
‣ Lenses Apply to the Bubble displayed
69. What Else CanYou Add?
Views
‣ Bar Charts
‣ Line Charts
‣ Calendars
‣ Scatter Plot
‣ PivotTables
‣ Timeplots
!
!
Facets
‣ Lists
‣ Numeric Range
‣ Text Search
‣ Tag Cloud
‣ Slider
‣ Image
‣ Heirarchical
70. Exhibit in a Nutshell
‣ Pros
‣ Simple
‣ Lightweight
‣ No server required
‣ A host of visualisations
‣ Embeddable in other
systems - ExhibitPress
!
!
‣ Cons
‣ Limited Scalability
‣ Some cross-browser
issues
‣ Restrictions on Look and
Feel
‣ Extensive customisation
means getting into code
71. Making Exhibit Choices
‣ There is a Stable Proven Choice - Exhibit 2.2
‣ There is an all new more standards-compliant Exhibit 3
‣ Exhibit 3 comes in Two Flavours
‣ Exhibit Scripted - Like Exhibit 2 with streamlining and some
visual improvements
‣ Exhibit Stages - Server Based, Robust, Scalable and the Future
!
‣ Exhibit 2.2 offers all whizzy features
‣ Exhibit 3 is faster but not fully ported (if you need maps?)
‣ Exhibit 3 Staged is a new, powerful, professional tool
72. Where to Go Next
!
‣ The ExhibitWiki and GitHub Pages
!
‣ http://www.simile-widgets.org
73. for Next Lecture (18 March):
Sharing
Please take a look at:
!
"The digital humanities is not about building,
it’s about sharing"