SlideShare a Scribd company logo
Disclaimer: I may gloss over or poorly represent tangential responsibilities in this message to
serve the purpose of verbosely illustrating data object creation and transport. json documents
are pulled from extant samples and may not mirror precisely what will be created or be adjusted
for current standards. This is not intended as a specification or work document.
These are things that currently have to happen for a user to move from finding an image to
transcription practice:
1. Host images at a stable URI (or URL)
2. Honor requests for image resizing, manipulation, cropping, etc. at a predictable URI
pattern (IIIF)
3. Store all of the metadata associated with the imaged manuscripts and their folios
4. Query selected metadata (facets) for sets of matches among these collections
5. Provide endpoints for T-PEN to query for specific or sets of collections, manuscripts,
and folios (manifests and canvases)
1-4 happen on the paleography site (within Islandora I believe) but 5 will have to be negotiated.
Any data that is accessible in T-PEN will either need to be stored in T-PEN or resolved through
use of the transcription project identifier (specifically, a SharedCanvas Manifest URI).
Collections, Manuscripts, and Folios (at least) should all have a URI which resolves to a
descriptive JSON-LD document.
Let's take it in pieces:
sc:Manifest
{
"@context" : "http://www.shared-canvas.org/ns/context.json",
"@id" : "http://t-pen.org/Example+Simple/manifest.json",
"@type" : "sc:Manifest",
"label" : "Example Simple",
"metadata" : "http://paleoberry.org/Example+Simple/metadata",
"sequences" : [ {
"@id" : "http://t-pen.org/Tradamus+Simple/sequence/normal",
"@type" : "sc:Sequence",
"label" : "Current Page Order",
"canvases": [
"http://t-pen.org/Example+Simple/canvas/100r",
"http://t-pen.org/Example+Simple/canvas/100v",
"http://t-pen.org/Example+Simple/canvas/101r"
]
}
At its core, a manifest is just a sequence of canvases. Each new arrangement is a unique
canvas. In fact, the IIIF manifest standard suggests "There are no semantics conveyed by
[metadata] information, and clients should not use it for discovery or other purposes." In T-PEN,
each project has its own sc:manifest, even if several people are working on the same
manuscript. That said, there should be some URI a person or machine could dereference to see
a JSON-LD file of Newberry's curated arrangement of images with all available associated
metadata. However, the metadata field can itself be a URI, as it is just key-value pairs, meaning
if Iter made available something like newberry.org/IIIF/SHELFMARK/metadata.json and
returned JSON, any authorized user could make a legitimate and good sc:Manifest, including T-
PEN or any Open Annotation Store.
Bottom line: Official transcription projects will have manuscript metadata in Islandora, but the
project sc:Manifest can be resolved via T-PEN and will include a reference to a metadata URI
(spec).
sc:Canvas
{
"@id" : "http://t-pen.org/Example+Simple/canvas/100r",
"@type" : "sc:Canvas",
"label" : "100r",
"height" : 1000,
"width" : 667,
"images" : [ {
"@type" : "oa:Annotation",
"motivation" : "sc:painting",
"resource" : {
"@id" : "http://paleoberry.org/iiif/Example+Simple/res/100r.jpg",
"@type" : "dctypes:Image",
"format" : "image/jpeg",
"height" : 2365,
"width" : 1579
} ]
"otherContent" : [ {
"@id":"http://t-pen.org/Example+Simple/lines/100r",
"@type":"sc:AnnotationList",
"resources":[
"http://t-pen.org/Example+Simple/line/101083792",
"http://t-pen.org/Example+Simple/line/101083842",
"http://t-pen.org/Example+Simple/line/101083841" ...
]
} ]
}
There does exist a sc:Canvas URI in the authoritative sc:Manifest, but as soon as a new project
is created in T-PEN, it is copied so as not to pollute the original with annotations. Most of the
metadata, if not all, is already covered in the sc:Manifest, so the label is the most important
thing it carries. In fact, even the image resource does not have to be included and can also be
just a URI (IIIF). This canvas is the target of all new annotations (image and transcription) and is
stored and resolved through T-PEN.
Bottom line: The canvases and manifests are linked through @id:URI. The canvas, in this case,
has the image dereferenced, but it can be anything that results in an image (likely a IIIF URI). A
‘service’ property can carry additional instructions if special access is needed.
sc:Annotation
{
“@id" : "http://t-pen.org/Tradamus+Simple/line/101083792",
"@type" : "oa:Annotation",
"motivation" : "tr:transcribing",
"resource" : {
"@type" : "cnt:ContentAsText",
"cnt:chars" : "Infesto Trinitatis"
},
"on" : "http://t-pen.org/Tradamus+Simple/canvas/100r#xywh=148,60,409,18"
}
The region, as it is rectangular, is abbreviated in the ‘on’ as “#xywh=” per the standard. The
content of the transcription is a ‘resource’ that is ‘cnt:chars’ literal string here, but can be
anything in the standard (XML, HTML, JSON, MEI, OGG). In T-PEN, for simplicity it is always a
straight UTF-8 string with escapes for ” to avoid breaking JSON. Any authorized user can use
the ‘@id’ to resolve this annotation as a JSON file and climb up to the canvas as well, if desired.
Bottom line: This is CDH territory, but there are things needed to get this far that may not be
instantly available.
Accession
When the paleography site wants to include a new manuscript for transcribing, it will need to
create one of three links:
1. On the fly project creation
○ The use case for this is only to create the official transcriptions or for a user to
start work on an untranscribed, but discoverable manuscript.
○ Send T-PEN a new sc:Manifest JSON object request that includes a label
(unless this is generated from metadata), a metadata URI, and an ordered list of
labelled images.
○ In a perfect world, this would be a skeletal, but otherwise well-formed sc:Manifest
object so that T-PEN can simply extend it and return it when further URIs are
minted.
○ The user is then passed into T-PEN with this new Manifest displayed as their
new project.
2. Copy a project for transcription practice
○ Most users will follow this path to get into transcription.
○ Send T-PEN an existing sc:Manifest URI (such as the official transcription) with a
copy request.
○ T-PEN creates a new manifest for the users project with the same sequence,
labels, images, and lines, but without the transcription data.
○ Nothing is passed back to the paleography site.
○ The user starts a new project in T-PEN, related to the original in case they want
to check their transcription work.
3. Work on a public project
○ Newberry can decide how heavily this is encouraged, but it is easy to do and
may be a good way to crowd-source some transcriptions or line annotations.
○ Send T-PEN an existing sc:Manifest URI (such as the official transcription).
○ The user is authenticated and begins to transcribe on the public project as
permissions allow. (This will not break the official transcription.)
Bottom line: If T-PEN is the gatekeeper for the new sc:Manifest URIs, it will have to let Islandora
know what it has minted (or agree to a strict convention) and Islandora will have to mint a URI
for the manuscript metadata and construct something very manifest-esque to request its
creation. If Islandora stores the whole manifest, T-PEN will have to let it know what URIs have
been created for every new sc:Canvas in the sequence. There is no good case for canvases or
annotations being stored outside of T-PEN.

More Related Content

Similar to Verbose explanations about data for transcription

Ironmq slides
Ironmq slidesIronmq slides
Ironmq slides
Mikko Koivunalho
 
Biblissima et IIIF (MAE)
Biblissima et IIIF (MAE)Biblissima et IIIF (MAE)
Biblissima et IIIF (MAE)
Equipex Biblissima
 
All about elasticsearch language clients
All about elasticsearch language clientsAll about elasticsearch language clients
All about elasticsearch language clients
Enterprise Search Warsaw Meetup
 
The Glory of Rest
The Glory of RestThe Glory of Rest
The Glory of Rest
Sławomir Chrobak
 
From Open Annotations to W3C Web Annotations (and the impact on IIIF Present...
From Open Annotations to W3C Web Annotations (and the impact on IIIF Present...From Open Annotations to W3C Web Annotations (and the impact on IIIF Present...
From Open Annotations to W3C Web Annotations (and the impact on IIIF Present...
Simeon Warner
 
For project
For projectFor project
For project
jesalnmistry
 
Blaise_UK_109_Max Kleiner_image2textAPI.pdf
Blaise_UK_109_Max Kleiner_image2textAPI.pdfBlaise_UK_109_Max Kleiner_image2textAPI.pdf
Blaise_UK_109_Max Kleiner_image2textAPI.pdf
breitschbreitsch
 
Big Data to SMART Data : Process Scenario
Big Data to SMART Data : Process ScenarioBig Data to SMART Data : Process Scenario
Big Data to SMART Data : Process Scenario
CHAKER ALLAOUI
 
AAT LOD Microthesauri
AAT LOD MicrothesauriAAT LOD Microthesauri
AAT LOD Microthesauri
Marcia Zeng
 
FIWARE Global Summit - The Way Towards Interoperability between Web Of Things...
FIWARE Global Summit - The Way Towards Interoperability between Web Of Things...FIWARE Global Summit - The Way Towards Interoperability between Web Of Things...
FIWARE Global Summit - The Way Towards Interoperability between Web Of Things...
FIWARE
 
The nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologiesThe nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologies
Tony Hammond
 
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin CoreOpen for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
Andy Powell
 
Graphql
GraphqlGraphql
Semantic technologies for the Internet of Things
Semantic technologies for the Internet of Things Semantic technologies for the Internet of Things
Semantic technologies for the Internet of Things
PayamBarnaghi
 
How to Find a Needle in the Haystack
How to Find a Needle in the HaystackHow to Find a Needle in the Haystack
How to Find a Needle in the Haystack
Adrian Stevenson
 
Standardizing arrays -- Microsoft Presentation
Standardizing arrays -- Microsoft PresentationStandardizing arrays -- Microsoft Presentation
Standardizing arrays -- Microsoft Presentation
Travis Oliphant
 
20181215 introduction to graph databases
20181215   introduction to graph databases20181215   introduction to graph databases
20181215 introduction to graph databases
Timothy Findlay
 
Towards Interoperability between W3C Web of Things and NGSI-LD
Towards Interoperability between W3C Web of Things and NGSI-LDTowards Interoperability between W3C Web of Things and NGSI-LD
Towards Interoperability between W3C Web of Things and NGSI-LD
José Manuel Cantera Fonseca
 
Eclipse Con Europe 2014 How to use DAWN Science Project
Eclipse Con Europe 2014 How to use DAWN Science ProjectEclipse Con Europe 2014 How to use DAWN Science Project
Eclipse Con Europe 2014 How to use DAWN Science Project
Matthew Gerring
 
Sysml 2019 demo_paper
Sysml 2019 demo_paperSysml 2019 demo_paper
Sysml 2019 demo_paper
strange_loop
 

Similar to Verbose explanations about data for transcription (20)

Ironmq slides
Ironmq slidesIronmq slides
Ironmq slides
 
Biblissima et IIIF (MAE)
Biblissima et IIIF (MAE)Biblissima et IIIF (MAE)
Biblissima et IIIF (MAE)
 
All about elasticsearch language clients
All about elasticsearch language clientsAll about elasticsearch language clients
All about elasticsearch language clients
 
The Glory of Rest
The Glory of RestThe Glory of Rest
The Glory of Rest
 
From Open Annotations to W3C Web Annotations (and the impact on IIIF Present...
From Open Annotations to W3C Web Annotations (and the impact on IIIF Present...From Open Annotations to W3C Web Annotations (and the impact on IIIF Present...
From Open Annotations to W3C Web Annotations (and the impact on IIIF Present...
 
For project
For projectFor project
For project
 
Blaise_UK_109_Max Kleiner_image2textAPI.pdf
Blaise_UK_109_Max Kleiner_image2textAPI.pdfBlaise_UK_109_Max Kleiner_image2textAPI.pdf
Blaise_UK_109_Max Kleiner_image2textAPI.pdf
 
Big Data to SMART Data : Process Scenario
Big Data to SMART Data : Process ScenarioBig Data to SMART Data : Process Scenario
Big Data to SMART Data : Process Scenario
 
AAT LOD Microthesauri
AAT LOD MicrothesauriAAT LOD Microthesauri
AAT LOD Microthesauri
 
FIWARE Global Summit - The Way Towards Interoperability between Web Of Things...
FIWARE Global Summit - The Way Towards Interoperability between Web Of Things...FIWARE Global Summit - The Way Towards Interoperability between Web Of Things...
FIWARE Global Summit - The Way Towards Interoperability between Web Of Things...
 
The nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologiesThe nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologies
 
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin CoreOpen for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
 
Graphql
GraphqlGraphql
Graphql
 
Semantic technologies for the Internet of Things
Semantic technologies for the Internet of Things Semantic technologies for the Internet of Things
Semantic technologies for the Internet of Things
 
How to Find a Needle in the Haystack
How to Find a Needle in the HaystackHow to Find a Needle in the Haystack
How to Find a Needle in the Haystack
 
Standardizing arrays -- Microsoft Presentation
Standardizing arrays -- Microsoft PresentationStandardizing arrays -- Microsoft Presentation
Standardizing arrays -- Microsoft Presentation
 
20181215 introduction to graph databases
20181215   introduction to graph databases20181215   introduction to graph databases
20181215 introduction to graph databases
 
Towards Interoperability between W3C Web of Things and NGSI-LD
Towards Interoperability between W3C Web of Things and NGSI-LDTowards Interoperability between W3C Web of Things and NGSI-LD
Towards Interoperability between W3C Web of Things and NGSI-LD
 
Eclipse Con Europe 2014 How to use DAWN Science Project
Eclipse Con Europe 2014 How to use DAWN Science ProjectEclipse Con Europe 2014 How to use DAWN Science Project
Eclipse Con Europe 2014 How to use DAWN Science Project
 
Sysml 2019 demo_paper
Sysml 2019 demo_paperSysml 2019 demo_paper
Sysml 2019 demo_paper
 

Recently uploaded

Creative Web Design Company in Singapore
Creative Web Design Company in SingaporeCreative Web Design Company in Singapore
Creative Web Design Company in Singapore
techboxsqauremedia
 
Recruiting in the Digital Age: A Social Media Masterclass
Recruiting in the Digital Age: A Social Media MasterclassRecruiting in the Digital Age: A Social Media Masterclass
Recruiting in the Digital Age: A Social Media Masterclass
LuanWise
 
The Influence of Marketing Strategy and Market Competition on Business Perfor...
The Influence of Marketing Strategy and Market Competition on Business Perfor...The Influence of Marketing Strategy and Market Competition on Business Perfor...
The Influence of Marketing Strategy and Market Competition on Business Perfor...
Adam Smith
 
Authentically Social by Corey Perlman - EO Puerto Rico
Authentically Social by Corey Perlman - EO Puerto RicoAuthentically Social by Corey Perlman - EO Puerto Rico
Authentically Social by Corey Perlman - EO Puerto Rico
Corey Perlman, Social Media Speaker and Consultant
 
Part 2 Deep Dive: Navigating the 2024 Slowdown
Part 2 Deep Dive: Navigating the 2024 SlowdownPart 2 Deep Dive: Navigating the 2024 Slowdown
Part 2 Deep Dive: Navigating the 2024 Slowdown
jeffkluth1
 
Tata Group Dials Taiwan for Its Chipmaking Ambition in Gujarat’s Dholera
Tata Group Dials Taiwan for Its Chipmaking Ambition in Gujarat’s DholeraTata Group Dials Taiwan for Its Chipmaking Ambition in Gujarat’s Dholera
Tata Group Dials Taiwan for Its Chipmaking Ambition in Gujarat’s Dholera
Avirahi City Dholera
 
buy old yahoo accounts buy yahoo accounts
buy old yahoo accounts buy yahoo accountsbuy old yahoo accounts buy yahoo accounts
buy old yahoo accounts buy yahoo accounts
Susan Laney
 
Dpboss Matka Guessing Satta Matta Matka Kalyan Chart Satta Matka
Dpboss Matka Guessing Satta Matta Matka Kalyan Chart Satta MatkaDpboss Matka Guessing Satta Matta Matka Kalyan Chart Satta Matka
Dpboss Matka Guessing Satta Matta Matka Kalyan Chart Satta Matka
➒➌➎➏➑➐➋➑➐➐Dpboss Matka Guessing Satta Matka Kalyan Chart Indian Matka
 
Organizational Change Leadership Agile Tour Geneve 2024
Organizational Change Leadership Agile Tour Geneve 2024Organizational Change Leadership Agile Tour Geneve 2024
Organizational Change Leadership Agile Tour Geneve 2024
Kirill Klimov
 
3 Simple Steps To Buy Verified Payoneer Account In 2024
3 Simple Steps To Buy Verified Payoneer Account In 20243 Simple Steps To Buy Verified Payoneer Account In 2024
3 Simple Steps To Buy Verified Payoneer Account In 2024
SEOSMMEARTH
 
Understanding User Needs and Satisfying Them
Understanding User Needs and Satisfying ThemUnderstanding User Needs and Satisfying Them
Understanding User Needs and Satisfying Them
Aggregage
 
Zodiac Signs and Food Preferences_ What Your Sign Says About Your Taste
Zodiac Signs and Food Preferences_ What Your Sign Says About Your TasteZodiac Signs and Food Preferences_ What Your Sign Says About Your Taste
Zodiac Signs and Food Preferences_ What Your Sign Says About Your Taste
my Pandit
 
Lundin Gold Corporate Presentation - June 2024
Lundin Gold Corporate Presentation - June 2024Lundin Gold Corporate Presentation - June 2024
Lundin Gold Corporate Presentation - June 2024
Adnet Communications
 
Anny Serafina Love - Letter of Recommendation by Kellen Harkins, MS.
Anny Serafina Love - Letter of Recommendation by Kellen Harkins, MS.Anny Serafina Love - Letter of Recommendation by Kellen Harkins, MS.
Anny Serafina Love - Letter of Recommendation by Kellen Harkins, MS.
AnnySerafinaLove
 
Structural Design Process: Step-by-Step Guide for Buildings
Structural Design Process: Step-by-Step Guide for BuildingsStructural Design Process: Step-by-Step Guide for Buildings
Structural Design Process: Step-by-Step Guide for Buildings
Chandresh Chudasama
 
Authentically Social Presented by Corey Perlman
Authentically Social Presented by Corey PerlmanAuthentically Social Presented by Corey Perlman
Authentically Social Presented by Corey Perlman
Corey Perlman, Social Media Speaker and Consultant
 
-- June 2024 is National Volunteer Month --
-- June 2024 is National Volunteer Month ---- June 2024 is National Volunteer Month --
-- June 2024 is National Volunteer Month --
NZSG
 
Company Valuation webinar series - Tuesday, 4 June 2024
Company Valuation webinar series - Tuesday, 4 June 2024Company Valuation webinar series - Tuesday, 4 June 2024
Company Valuation webinar series - Tuesday, 4 June 2024
FelixPerez547899
 
BeMetals Investor Presentation_June 1, 2024.pdf
BeMetals Investor Presentation_June 1, 2024.pdfBeMetals Investor Presentation_June 1, 2024.pdf
BeMetals Investor Presentation_June 1, 2024.pdf
DerekIwanaka1
 
Call 8867766396 Satta Matka Dpboss Matka Guessing Satta batta Matka 420 Satta...
Call 8867766396 Satta Matka Dpboss Matka Guessing Satta batta Matka 420 Satta...Call 8867766396 Satta Matka Dpboss Matka Guessing Satta batta Matka 420 Satta...
Call 8867766396 Satta Matka Dpboss Matka Guessing Satta batta Matka 420 Satta...
bosssp10
 

Recently uploaded (20)

Creative Web Design Company in Singapore
Creative Web Design Company in SingaporeCreative Web Design Company in Singapore
Creative Web Design Company in Singapore
 
Recruiting in the Digital Age: A Social Media Masterclass
Recruiting in the Digital Age: A Social Media MasterclassRecruiting in the Digital Age: A Social Media Masterclass
Recruiting in the Digital Age: A Social Media Masterclass
 
The Influence of Marketing Strategy and Market Competition on Business Perfor...
The Influence of Marketing Strategy and Market Competition on Business Perfor...The Influence of Marketing Strategy and Market Competition on Business Perfor...
The Influence of Marketing Strategy and Market Competition on Business Perfor...
 
Authentically Social by Corey Perlman - EO Puerto Rico
Authentically Social by Corey Perlman - EO Puerto RicoAuthentically Social by Corey Perlman - EO Puerto Rico
Authentically Social by Corey Perlman - EO Puerto Rico
 
Part 2 Deep Dive: Navigating the 2024 Slowdown
Part 2 Deep Dive: Navigating the 2024 SlowdownPart 2 Deep Dive: Navigating the 2024 Slowdown
Part 2 Deep Dive: Navigating the 2024 Slowdown
 
Tata Group Dials Taiwan for Its Chipmaking Ambition in Gujarat’s Dholera
Tata Group Dials Taiwan for Its Chipmaking Ambition in Gujarat’s DholeraTata Group Dials Taiwan for Its Chipmaking Ambition in Gujarat’s Dholera
Tata Group Dials Taiwan for Its Chipmaking Ambition in Gujarat’s Dholera
 
buy old yahoo accounts buy yahoo accounts
buy old yahoo accounts buy yahoo accountsbuy old yahoo accounts buy yahoo accounts
buy old yahoo accounts buy yahoo accounts
 
Dpboss Matka Guessing Satta Matta Matka Kalyan Chart Satta Matka
Dpboss Matka Guessing Satta Matta Matka Kalyan Chart Satta MatkaDpboss Matka Guessing Satta Matta Matka Kalyan Chart Satta Matka
Dpboss Matka Guessing Satta Matta Matka Kalyan Chart Satta Matka
 
Organizational Change Leadership Agile Tour Geneve 2024
Organizational Change Leadership Agile Tour Geneve 2024Organizational Change Leadership Agile Tour Geneve 2024
Organizational Change Leadership Agile Tour Geneve 2024
 
3 Simple Steps To Buy Verified Payoneer Account In 2024
3 Simple Steps To Buy Verified Payoneer Account In 20243 Simple Steps To Buy Verified Payoneer Account In 2024
3 Simple Steps To Buy Verified Payoneer Account In 2024
 
Understanding User Needs and Satisfying Them
Understanding User Needs and Satisfying ThemUnderstanding User Needs and Satisfying Them
Understanding User Needs and Satisfying Them
 
Zodiac Signs and Food Preferences_ What Your Sign Says About Your Taste
Zodiac Signs and Food Preferences_ What Your Sign Says About Your TasteZodiac Signs and Food Preferences_ What Your Sign Says About Your Taste
Zodiac Signs and Food Preferences_ What Your Sign Says About Your Taste
 
Lundin Gold Corporate Presentation - June 2024
Lundin Gold Corporate Presentation - June 2024Lundin Gold Corporate Presentation - June 2024
Lundin Gold Corporate Presentation - June 2024
 
Anny Serafina Love - Letter of Recommendation by Kellen Harkins, MS.
Anny Serafina Love - Letter of Recommendation by Kellen Harkins, MS.Anny Serafina Love - Letter of Recommendation by Kellen Harkins, MS.
Anny Serafina Love - Letter of Recommendation by Kellen Harkins, MS.
 
Structural Design Process: Step-by-Step Guide for Buildings
Structural Design Process: Step-by-Step Guide for BuildingsStructural Design Process: Step-by-Step Guide for Buildings
Structural Design Process: Step-by-Step Guide for Buildings
 
Authentically Social Presented by Corey Perlman
Authentically Social Presented by Corey PerlmanAuthentically Social Presented by Corey Perlman
Authentically Social Presented by Corey Perlman
 
-- June 2024 is National Volunteer Month --
-- June 2024 is National Volunteer Month ---- June 2024 is National Volunteer Month --
-- June 2024 is National Volunteer Month --
 
Company Valuation webinar series - Tuesday, 4 June 2024
Company Valuation webinar series - Tuesday, 4 June 2024Company Valuation webinar series - Tuesday, 4 June 2024
Company Valuation webinar series - Tuesday, 4 June 2024
 
BeMetals Investor Presentation_June 1, 2024.pdf
BeMetals Investor Presentation_June 1, 2024.pdfBeMetals Investor Presentation_June 1, 2024.pdf
BeMetals Investor Presentation_June 1, 2024.pdf
 
Call 8867766396 Satta Matka Dpboss Matka Guessing Satta batta Matka 420 Satta...
Call 8867766396 Satta Matka Dpboss Matka Guessing Satta batta Matka 420 Satta...Call 8867766396 Satta Matka Dpboss Matka Guessing Satta batta Matka 420 Satta...
Call 8867766396 Satta Matka Dpboss Matka Guessing Satta batta Matka 420 Satta...
 

Verbose explanations about data for transcription

  • 1. Disclaimer: I may gloss over or poorly represent tangential responsibilities in this message to serve the purpose of verbosely illustrating data object creation and transport. json documents are pulled from extant samples and may not mirror precisely what will be created or be adjusted for current standards. This is not intended as a specification or work document. These are things that currently have to happen for a user to move from finding an image to transcription practice: 1. Host images at a stable URI (or URL) 2. Honor requests for image resizing, manipulation, cropping, etc. at a predictable URI pattern (IIIF) 3. Store all of the metadata associated with the imaged manuscripts and their folios 4. Query selected metadata (facets) for sets of matches among these collections 5. Provide endpoints for T-PEN to query for specific or sets of collections, manuscripts, and folios (manifests and canvases) 1-4 happen on the paleography site (within Islandora I believe) but 5 will have to be negotiated. Any data that is accessible in T-PEN will either need to be stored in T-PEN or resolved through use of the transcription project identifier (specifically, a SharedCanvas Manifest URI). Collections, Manuscripts, and Folios (at least) should all have a URI which resolves to a descriptive JSON-LD document. Let's take it in pieces: sc:Manifest { "@context" : "http://www.shared-canvas.org/ns/context.json", "@id" : "http://t-pen.org/Example+Simple/manifest.json", "@type" : "sc:Manifest", "label" : "Example Simple", "metadata" : "http://paleoberry.org/Example+Simple/metadata", "sequences" : [ { "@id" : "http://t-pen.org/Tradamus+Simple/sequence/normal", "@type" : "sc:Sequence", "label" : "Current Page Order", "canvases": [ "http://t-pen.org/Example+Simple/canvas/100r", "http://t-pen.org/Example+Simple/canvas/100v", "http://t-pen.org/Example+Simple/canvas/101r" ] } At its core, a manifest is just a sequence of canvases. Each new arrangement is a unique canvas. In fact, the IIIF manifest standard suggests "There are no semantics conveyed by [metadata] information, and clients should not use it for discovery or other purposes." In T-PEN, each project has its own sc:manifest, even if several people are working on the same
  • 2. manuscript. That said, there should be some URI a person or machine could dereference to see a JSON-LD file of Newberry's curated arrangement of images with all available associated metadata. However, the metadata field can itself be a URI, as it is just key-value pairs, meaning if Iter made available something like newberry.org/IIIF/SHELFMARK/metadata.json and returned JSON, any authorized user could make a legitimate and good sc:Manifest, including T- PEN or any Open Annotation Store. Bottom line: Official transcription projects will have manuscript metadata in Islandora, but the project sc:Manifest can be resolved via T-PEN and will include a reference to a metadata URI (spec). sc:Canvas { "@id" : "http://t-pen.org/Example+Simple/canvas/100r", "@type" : "sc:Canvas", "label" : "100r", "height" : 1000, "width" : 667, "images" : [ { "@type" : "oa:Annotation", "motivation" : "sc:painting", "resource" : { "@id" : "http://paleoberry.org/iiif/Example+Simple/res/100r.jpg", "@type" : "dctypes:Image", "format" : "image/jpeg", "height" : 2365, "width" : 1579 } ] "otherContent" : [ { "@id":"http://t-pen.org/Example+Simple/lines/100r", "@type":"sc:AnnotationList", "resources":[ "http://t-pen.org/Example+Simple/line/101083792", "http://t-pen.org/Example+Simple/line/101083842", "http://t-pen.org/Example+Simple/line/101083841" ... ] } ] } There does exist a sc:Canvas URI in the authoritative sc:Manifest, but as soon as a new project is created in T-PEN, it is copied so as not to pollute the original with annotations. Most of the metadata, if not all, is already covered in the sc:Manifest, so the label is the most important thing it carries. In fact, even the image resource does not have to be included and can also be
  • 3. just a URI (IIIF). This canvas is the target of all new annotations (image and transcription) and is stored and resolved through T-PEN. Bottom line: The canvases and manifests are linked through @id:URI. The canvas, in this case, has the image dereferenced, but it can be anything that results in an image (likely a IIIF URI). A ‘service’ property can carry additional instructions if special access is needed. sc:Annotation { “@id" : "http://t-pen.org/Tradamus+Simple/line/101083792", "@type" : "oa:Annotation", "motivation" : "tr:transcribing", "resource" : { "@type" : "cnt:ContentAsText", "cnt:chars" : "Infesto Trinitatis" }, "on" : "http://t-pen.org/Tradamus+Simple/canvas/100r#xywh=148,60,409,18" } The region, as it is rectangular, is abbreviated in the ‘on’ as “#xywh=” per the standard. The content of the transcription is a ‘resource’ that is ‘cnt:chars’ literal string here, but can be anything in the standard (XML, HTML, JSON, MEI, OGG). In T-PEN, for simplicity it is always a straight UTF-8 string with escapes for ” to avoid breaking JSON. Any authorized user can use the ‘@id’ to resolve this annotation as a JSON file and climb up to the canvas as well, if desired. Bottom line: This is CDH territory, but there are things needed to get this far that may not be instantly available. Accession When the paleography site wants to include a new manuscript for transcribing, it will need to create one of three links: 1. On the fly project creation ○ The use case for this is only to create the official transcriptions or for a user to start work on an untranscribed, but discoverable manuscript. ○ Send T-PEN a new sc:Manifest JSON object request that includes a label (unless this is generated from metadata), a metadata URI, and an ordered list of labelled images. ○ In a perfect world, this would be a skeletal, but otherwise well-formed sc:Manifest object so that T-PEN can simply extend it and return it when further URIs are minted. ○ The user is then passed into T-PEN with this new Manifest displayed as their new project. 2. Copy a project for transcription practice ○ Most users will follow this path to get into transcription.
  • 4. ○ Send T-PEN an existing sc:Manifest URI (such as the official transcription) with a copy request. ○ T-PEN creates a new manifest for the users project with the same sequence, labels, images, and lines, but without the transcription data. ○ Nothing is passed back to the paleography site. ○ The user starts a new project in T-PEN, related to the original in case they want to check their transcription work. 3. Work on a public project ○ Newberry can decide how heavily this is encouraged, but it is easy to do and may be a good way to crowd-source some transcriptions or line annotations. ○ Send T-PEN an existing sc:Manifest URI (such as the official transcription). ○ The user is authenticated and begins to transcribe on the public project as permissions allow. (This will not break the official transcription.) Bottom line: If T-PEN is the gatekeeper for the new sc:Manifest URIs, it will have to let Islandora know what it has minted (or agree to a strict convention) and Islandora will have to mint a URI for the manuscript metadata and construct something very manifest-esque to request its creation. If Islandora stores the whole manifest, T-PEN will have to let it know what URIs have been created for every new sc:Canvas in the sequence. There is no good case for canvases or annotations being stored outside of T-PEN.