SlideShare a Scribd company logo
1 of 4
Disclaimer: I may gloss over or poorly represent tangential responsibilities in this message to
serve the purpose of verbosely illustrating data object creation and transport. json documents
are pulled from extant samples and may not mirror precisely what will be created or be adjusted
for current standards. This is not intended as a specification or work document.
These are things that currently have to happen for a user to move from finding an image to
transcription practice:
1. Host images at a stable URI (or URL)
2. Honor requests for image resizing, manipulation, cropping, etc. at a predictable URI
pattern (IIIF)
3. Store all of the metadata associated with the imaged manuscripts and their folios
4. Query selected metadata (facets) for sets of matches among these collections
5. Provide endpoints for T-PEN to query for specific or sets of collections, manuscripts,
and folios (manifests and canvases)
1-4 happen on the paleography site (within Islandora I believe) but 5 will have to be negotiated.
Any data that is accessible in T-PEN will either need to be stored in T-PEN or resolved through
use of the transcription project identifier (specifically, a SharedCanvas Manifest URI).
Collections, Manuscripts, and Folios (at least) should all have a URI which resolves to a
descriptive JSON-LD document.
Let's take it in pieces:
sc:Manifest
{
"@context" : "http://www.shared-canvas.org/ns/context.json",
"@id" : "http://t-pen.org/Example+Simple/manifest.json",
"@type" : "sc:Manifest",
"label" : "Example Simple",
"metadata" : "http://paleoberry.org/Example+Simple/metadata",
"sequences" : [ {
"@id" : "http://t-pen.org/Tradamus+Simple/sequence/normal",
"@type" : "sc:Sequence",
"label" : "Current Page Order",
"canvases": [
"http://t-pen.org/Example+Simple/canvas/100r",
"http://t-pen.org/Example+Simple/canvas/100v",
"http://t-pen.org/Example+Simple/canvas/101r"
]
}
At its core, a manifest is just a sequence of canvases. Each new arrangement is a unique
canvas. In fact, the IIIF manifest standard suggests "There are no semantics conveyed by
[metadata] information, and clients should not use it for discovery or other purposes." In T-PEN,
each project has its own sc:manifest, even if several people are working on the same
manuscript. That said, there should be some URI a person or machine could dereference to see
a JSON-LD file of Newberry's curated arrangement of images with all available associated
metadata. However, the metadata field can itself be a URI, as it is just key-value pairs, meaning
if Iter made available something like newberry.org/IIIF/SHELFMARK/metadata.json and
returned JSON, any authorized user could make a legitimate and good sc:Manifest, including T-
PEN or any Open Annotation Store.
Bottom line: Official transcription projects will have manuscript metadata in Islandora, but the
project sc:Manifest can be resolved via T-PEN and will include a reference to a metadata URI
(spec).
sc:Canvas
{
"@id" : "http://t-pen.org/Example+Simple/canvas/100r",
"@type" : "sc:Canvas",
"label" : "100r",
"height" : 1000,
"width" : 667,
"images" : [ {
"@type" : "oa:Annotation",
"motivation" : "sc:painting",
"resource" : {
"@id" : "http://paleoberry.org/iiif/Example+Simple/res/100r.jpg",
"@type" : "dctypes:Image",
"format" : "image/jpeg",
"height" : 2365,
"width" : 1579
} ]
"otherContent" : [ {
"@id":"http://t-pen.org/Example+Simple/lines/100r",
"@type":"sc:AnnotationList",
"resources":[
"http://t-pen.org/Example+Simple/line/101083792",
"http://t-pen.org/Example+Simple/line/101083842",
"http://t-pen.org/Example+Simple/line/101083841" ...
]
} ]
}
There does exist a sc:Canvas URI in the authoritative sc:Manifest, but as soon as a new project
is created in T-PEN, it is copied so as not to pollute the original with annotations. Most of the
metadata, if not all, is already covered in the sc:Manifest, so the label is the most important
thing it carries. In fact, even the image resource does not have to be included and can also be
just a URI (IIIF). This canvas is the target of all new annotations (image and transcription) and is
stored and resolved through T-PEN.
Bottom line: The canvases and manifests are linked through @id:URI. The canvas, in this case,
has the image dereferenced, but it can be anything that results in an image (likely a IIIF URI). A
‘service’ property can carry additional instructions if special access is needed.
sc:Annotation
{
“@id" : "http://t-pen.org/Tradamus+Simple/line/101083792",
"@type" : "oa:Annotation",
"motivation" : "tr:transcribing",
"resource" : {
"@type" : "cnt:ContentAsText",
"cnt:chars" : "Infesto Trinitatis"
},
"on" : "http://t-pen.org/Tradamus+Simple/canvas/100r#xywh=148,60,409,18"
}
The region, as it is rectangular, is abbreviated in the ‘on’ as “#xywh=” per the standard. The
content of the transcription is a ‘resource’ that is ‘cnt:chars’ literal string here, but can be
anything in the standard (XML, HTML, JSON, MEI, OGG). In T-PEN, for simplicity it is always a
straight UTF-8 string with escapes for ” to avoid breaking JSON. Any authorized user can use
the ‘@id’ to resolve this annotation as a JSON file and climb up to the canvas as well, if desired.
Bottom line: This is CDH territory, but there are things needed to get this far that may not be
instantly available.
Accession
When the paleography site wants to include a new manuscript for transcribing, it will need to
create one of three links:
1. On the fly project creation
○ The use case for this is only to create the official transcriptions or for a user to
start work on an untranscribed, but discoverable manuscript.
○ Send T-PEN a new sc:Manifest JSON object request that includes a label
(unless this is generated from metadata), a metadata URI, and an ordered list of
labelled images.
○ In a perfect world, this would be a skeletal, but otherwise well-formed sc:Manifest
object so that T-PEN can simply extend it and return it when further URIs are
minted.
○ The user is then passed into T-PEN with this new Manifest displayed as their
new project.
2. Copy a project for transcription practice
○ Most users will follow this path to get into transcription.
○ Send T-PEN an existing sc:Manifest URI (such as the official transcription) with a
copy request.
○ T-PEN creates a new manifest for the users project with the same sequence,
labels, images, and lines, but without the transcription data.
○ Nothing is passed back to the paleography site.
○ The user starts a new project in T-PEN, related to the original in case they want
to check their transcription work.
3. Work on a public project
○ Newberry can decide how heavily this is encouraged, but it is easy to do and
may be a good way to crowd-source some transcriptions or line annotations.
○ Send T-PEN an existing sc:Manifest URI (such as the official transcription).
○ The user is authenticated and begins to transcribe on the public project as
permissions allow. (This will not break the official transcription.)
Bottom line: If T-PEN is the gatekeeper for the new sc:Manifest URIs, it will have to let Islandora
know what it has minted (or agree to a strict convention) and Islandora will have to mint a URI
for the manuscript metadata and construct something very manifest-esque to request its
creation. If Islandora stores the whole manifest, T-PEN will have to let it know what URIs have
been created for every new sc:Canvas in the sequence. There is no good case for canvases or
annotations being stored outside of T-PEN.

More Related Content

Similar to Verbose explanations about data for transcription

From Open Annotations to W3C Web Annotations (and the impact on IIIF Present...
From Open Annotations to W3C Web Annotations (and the impact on IIIF Present...From Open Annotations to W3C Web Annotations (and the impact on IIIF Present...
From Open Annotations to W3C Web Annotations (and the impact on IIIF Present...Simeon Warner
 
Blaise_UK_109_Max Kleiner_image2textAPI.pdf
Blaise_UK_109_Max Kleiner_image2textAPI.pdfBlaise_UK_109_Max Kleiner_image2textAPI.pdf
Blaise_UK_109_Max Kleiner_image2textAPI.pdfbreitschbreitsch
 
Big Data to SMART Data : Process Scenario
Big Data to SMART Data : Process ScenarioBig Data to SMART Data : Process Scenario
Big Data to SMART Data : Process ScenarioCHAKER ALLAOUI
 
AAT LOD Microthesauri
AAT LOD MicrothesauriAAT LOD Microthesauri
AAT LOD MicrothesauriMarcia Zeng
 
FIWARE Global Summit - The Way Towards Interoperability between Web Of Things...
FIWARE Global Summit - The Way Towards Interoperability between Web Of Things...FIWARE Global Summit - The Way Towards Interoperability between Web Of Things...
FIWARE Global Summit - The Way Towards Interoperability between Web Of Things...FIWARE
 
The nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologiesThe nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologiesTony Hammond
 
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin CoreOpen for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin CoreAndy Powell
 
Semantic technologies for the Internet of Things
Semantic technologies for the Internet of Things Semantic technologies for the Internet of Things
Semantic technologies for the Internet of Things PayamBarnaghi
 
How to Find a Needle in the Haystack
How to Find a Needle in the HaystackHow to Find a Needle in the Haystack
How to Find a Needle in the HaystackAdrian Stevenson
 
Standardizing arrays -- Microsoft Presentation
Standardizing arrays -- Microsoft PresentationStandardizing arrays -- Microsoft Presentation
Standardizing arrays -- Microsoft PresentationTravis Oliphant
 
20181215 introduction to graph databases
20181215   introduction to graph databases20181215   introduction to graph databases
20181215 introduction to graph databasesTimothy Findlay
 
Towards Interoperability between W3C Web of Things and NGSI-LD
Towards Interoperability between W3C Web of Things and NGSI-LDTowards Interoperability between W3C Web of Things and NGSI-LD
Towards Interoperability between W3C Web of Things and NGSI-LDJosé Manuel Cantera Fonseca
 
Eclipse Con Europe 2014 How to use DAWN Science Project
Eclipse Con Europe 2014 How to use DAWN Science ProjectEclipse Con Europe 2014 How to use DAWN Science Project
Eclipse Con Europe 2014 How to use DAWN Science ProjectMatthew Gerring
 
Sysml 2019 demo_paper
Sysml 2019 demo_paperSysml 2019 demo_paper
Sysml 2019 demo_paperstrange_loop
 

Similar to Verbose explanations about data for transcription (20)

Ironmq slides
Ironmq slidesIronmq slides
Ironmq slides
 
Biblissima et IIIF (MAE)
Biblissima et IIIF (MAE)Biblissima et IIIF (MAE)
Biblissima et IIIF (MAE)
 
All about elasticsearch language clients
All about elasticsearch language clientsAll about elasticsearch language clients
All about elasticsearch language clients
 
The Glory of Rest
The Glory of RestThe Glory of Rest
The Glory of Rest
 
From Open Annotations to W3C Web Annotations (and the impact on IIIF Present...
From Open Annotations to W3C Web Annotations (and the impact on IIIF Present...From Open Annotations to W3C Web Annotations (and the impact on IIIF Present...
From Open Annotations to W3C Web Annotations (and the impact on IIIF Present...
 
For project
For projectFor project
For project
 
Blaise_UK_109_Max Kleiner_image2textAPI.pdf
Blaise_UK_109_Max Kleiner_image2textAPI.pdfBlaise_UK_109_Max Kleiner_image2textAPI.pdf
Blaise_UK_109_Max Kleiner_image2textAPI.pdf
 
Big Data to SMART Data : Process Scenario
Big Data to SMART Data : Process ScenarioBig Data to SMART Data : Process Scenario
Big Data to SMART Data : Process Scenario
 
AAT LOD Microthesauri
AAT LOD MicrothesauriAAT LOD Microthesauri
AAT LOD Microthesauri
 
FIWARE Global Summit - The Way Towards Interoperability between Web Of Things...
FIWARE Global Summit - The Way Towards Interoperability between Web Of Things...FIWARE Global Summit - The Way Towards Interoperability between Web Of Things...
FIWARE Global Summit - The Way Towards Interoperability between Web Of Things...
 
The nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologiesThe nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologies
 
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin CoreOpen for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
 
Graphql
GraphqlGraphql
Graphql
 
Semantic technologies for the Internet of Things
Semantic technologies for the Internet of Things Semantic technologies for the Internet of Things
Semantic technologies for the Internet of Things
 
How to Find a Needle in the Haystack
How to Find a Needle in the HaystackHow to Find a Needle in the Haystack
How to Find a Needle in the Haystack
 
Standardizing arrays -- Microsoft Presentation
Standardizing arrays -- Microsoft PresentationStandardizing arrays -- Microsoft Presentation
Standardizing arrays -- Microsoft Presentation
 
20181215 introduction to graph databases
20181215   introduction to graph databases20181215   introduction to graph databases
20181215 introduction to graph databases
 
Towards Interoperability between W3C Web of Things and NGSI-LD
Towards Interoperability between W3C Web of Things and NGSI-LDTowards Interoperability between W3C Web of Things and NGSI-LD
Towards Interoperability between W3C Web of Things and NGSI-LD
 
Eclipse Con Europe 2014 How to use DAWN Science Project
Eclipse Con Europe 2014 How to use DAWN Science ProjectEclipse Con Europe 2014 How to use DAWN Science Project
Eclipse Con Europe 2014 How to use DAWN Science Project
 
Sysml 2019 demo_paper
Sysml 2019 demo_paperSysml 2019 demo_paper
Sysml 2019 demo_paper
 

Recently uploaded

QSM Chap 10 Service Culture in Tourism and Hospitality Industry.pptx
QSM Chap 10 Service Culture in Tourism and Hospitality Industry.pptxQSM Chap 10 Service Culture in Tourism and Hospitality Industry.pptx
QSM Chap 10 Service Culture in Tourism and Hospitality Industry.pptxDitasDelaCruz
 
Falcon Invoice Discounting: Unlock Your Business Potential
Falcon Invoice Discounting: Unlock Your Business PotentialFalcon Invoice Discounting: Unlock Your Business Potential
Falcon Invoice Discounting: Unlock Your Business PotentialFalcon investment
 
Challenges and Opportunities: A Qualitative Study on Tax Compliance in Pakistan
Challenges and Opportunities: A Qualitative Study on Tax Compliance in PakistanChallenges and Opportunities: A Qualitative Study on Tax Compliance in Pakistan
Challenges and Opportunities: A Qualitative Study on Tax Compliance in Pakistanvineshkumarsajnani12
 
Marel Q1 2024 Investor Presentation from May 8, 2024
Marel Q1 2024 Investor Presentation from May 8, 2024Marel Q1 2024 Investor Presentation from May 8, 2024
Marel Q1 2024 Investor Presentation from May 8, 2024Marel
 
Organizational Transformation Lead with Culture
Organizational Transformation Lead with CultureOrganizational Transformation Lead with Culture
Organizational Transformation Lead with CultureSeta Wicaksana
 
GUWAHATI 💋 Call Girl 9827461493 Call Girls in Escort service book now
GUWAHATI 💋 Call Girl 9827461493 Call Girls in  Escort service book nowGUWAHATI 💋 Call Girl 9827461493 Call Girls in  Escort service book now
GUWAHATI 💋 Call Girl 9827461493 Call Girls in Escort service book nowkapoorjyoti4444
 
Uneak White's Personal Brand Exploration Presentation
Uneak White's Personal Brand Exploration PresentationUneak White's Personal Brand Exploration Presentation
Uneak White's Personal Brand Exploration Presentationuneakwhite
 
Berhampur 70918*19311 CALL GIRLS IN ESCORT SERVICE WE ARE PROVIDING
Berhampur 70918*19311 CALL GIRLS IN ESCORT SERVICE WE ARE PROVIDINGBerhampur 70918*19311 CALL GIRLS IN ESCORT SERVICE WE ARE PROVIDING
Berhampur 70918*19311 CALL GIRLS IN ESCORT SERVICE WE ARE PROVIDINGpr788182
 
Quick Doctor In Kuwait +2773`7758`557 Kuwait Doha Qatar Dubai Abu Dhabi Sharj...
Quick Doctor In Kuwait +2773`7758`557 Kuwait Doha Qatar Dubai Abu Dhabi Sharj...Quick Doctor In Kuwait +2773`7758`557 Kuwait Doha Qatar Dubai Abu Dhabi Sharj...
Quick Doctor In Kuwait +2773`7758`557 Kuwait Doha Qatar Dubai Abu Dhabi Sharj...daisycvs
 
Al Mizhar Dubai Escorts +971561403006 Escorts Service In Al Mizhar
Al Mizhar Dubai Escorts +971561403006 Escorts Service In Al MizharAl Mizhar Dubai Escorts +971561403006 Escorts Service In Al Mizhar
Al Mizhar Dubai Escorts +971561403006 Escorts Service In Al Mizharallensay1
 
Pre Engineered Building Manufacturers Hyderabad.pptx
Pre Engineered  Building Manufacturers Hyderabad.pptxPre Engineered  Building Manufacturers Hyderabad.pptx
Pre Engineered Building Manufacturers Hyderabad.pptxRoofing Contractor
 
Nashik Call Girl Just Call 7091819311 Top Class Call Girl Service Available
Nashik Call Girl Just Call 7091819311 Top Class Call Girl Service AvailableNashik Call Girl Just Call 7091819311 Top Class Call Girl Service Available
Nashik Call Girl Just Call 7091819311 Top Class Call Girl Service Availablepr788182
 
Getting Real with AI - Columbus DAW - May 2024 - Nick Woo from AlignAI
Getting Real with AI - Columbus DAW - May 2024 - Nick Woo from AlignAIGetting Real with AI - Columbus DAW - May 2024 - Nick Woo from AlignAI
Getting Real with AI - Columbus DAW - May 2024 - Nick Woo from AlignAITim Wilson
 
Jual Obat Aborsi ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan Cytotec
Jual Obat Aborsi ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan CytotecJual Obat Aborsi ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan Cytotec
Jual Obat Aborsi ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan CytotecZurliaSoop
 
How to Get Started in Social Media for Art League City
How to Get Started in Social Media for Art League CityHow to Get Started in Social Media for Art League City
How to Get Started in Social Media for Art League CityEric T. Tung
 
joint cost.pptx COST ACCOUNTING Sixteenth Edition ...
joint cost.pptx  COST ACCOUNTING  Sixteenth Edition                          ...joint cost.pptx  COST ACCOUNTING  Sixteenth Edition                          ...
joint cost.pptx COST ACCOUNTING Sixteenth Edition ...NadhimTaha
 
CROSS CULTURAL NEGOTIATION BY PANMISEM NS
CROSS CULTURAL NEGOTIATION BY PANMISEM NSCROSS CULTURAL NEGOTIATION BY PANMISEM NS
CROSS CULTURAL NEGOTIATION BY PANMISEM NSpanmisemningshen123
 
Horngren’s Cost Accounting A Managerial Emphasis, Canadian 9th edition soluti...
Horngren’s Cost Accounting A Managerial Emphasis, Canadian 9th edition soluti...Horngren’s Cost Accounting A Managerial Emphasis, Canadian 9th edition soluti...
Horngren’s Cost Accounting A Managerial Emphasis, Canadian 9th edition soluti...ssuserf63bd7
 
Ooty Call Gril 80022//12248 Only For Sex And High Profile Best Gril Sex Avail...
Ooty Call Gril 80022//12248 Only For Sex And High Profile Best Gril Sex Avail...Ooty Call Gril 80022//12248 Only For Sex And High Profile Best Gril Sex Avail...
Ooty Call Gril 80022//12248 Only For Sex And High Profile Best Gril Sex Avail...pujan9679
 
New 2024 Cannabis Edibles Investor Pitch Deck Template
New 2024 Cannabis Edibles Investor Pitch Deck TemplateNew 2024 Cannabis Edibles Investor Pitch Deck Template
New 2024 Cannabis Edibles Investor Pitch Deck TemplateCannaBusinessPlans
 

Recently uploaded (20)

QSM Chap 10 Service Culture in Tourism and Hospitality Industry.pptx
QSM Chap 10 Service Culture in Tourism and Hospitality Industry.pptxQSM Chap 10 Service Culture in Tourism and Hospitality Industry.pptx
QSM Chap 10 Service Culture in Tourism and Hospitality Industry.pptx
 
Falcon Invoice Discounting: Unlock Your Business Potential
Falcon Invoice Discounting: Unlock Your Business PotentialFalcon Invoice Discounting: Unlock Your Business Potential
Falcon Invoice Discounting: Unlock Your Business Potential
 
Challenges and Opportunities: A Qualitative Study on Tax Compliance in Pakistan
Challenges and Opportunities: A Qualitative Study on Tax Compliance in PakistanChallenges and Opportunities: A Qualitative Study on Tax Compliance in Pakistan
Challenges and Opportunities: A Qualitative Study on Tax Compliance in Pakistan
 
Marel Q1 2024 Investor Presentation from May 8, 2024
Marel Q1 2024 Investor Presentation from May 8, 2024Marel Q1 2024 Investor Presentation from May 8, 2024
Marel Q1 2024 Investor Presentation from May 8, 2024
 
Organizational Transformation Lead with Culture
Organizational Transformation Lead with CultureOrganizational Transformation Lead with Culture
Organizational Transformation Lead with Culture
 
GUWAHATI 💋 Call Girl 9827461493 Call Girls in Escort service book now
GUWAHATI 💋 Call Girl 9827461493 Call Girls in  Escort service book nowGUWAHATI 💋 Call Girl 9827461493 Call Girls in  Escort service book now
GUWAHATI 💋 Call Girl 9827461493 Call Girls in Escort service book now
 
Uneak White's Personal Brand Exploration Presentation
Uneak White's Personal Brand Exploration PresentationUneak White's Personal Brand Exploration Presentation
Uneak White's Personal Brand Exploration Presentation
 
Berhampur 70918*19311 CALL GIRLS IN ESCORT SERVICE WE ARE PROVIDING
Berhampur 70918*19311 CALL GIRLS IN ESCORT SERVICE WE ARE PROVIDINGBerhampur 70918*19311 CALL GIRLS IN ESCORT SERVICE WE ARE PROVIDING
Berhampur 70918*19311 CALL GIRLS IN ESCORT SERVICE WE ARE PROVIDING
 
Quick Doctor In Kuwait +2773`7758`557 Kuwait Doha Qatar Dubai Abu Dhabi Sharj...
Quick Doctor In Kuwait +2773`7758`557 Kuwait Doha Qatar Dubai Abu Dhabi Sharj...Quick Doctor In Kuwait +2773`7758`557 Kuwait Doha Qatar Dubai Abu Dhabi Sharj...
Quick Doctor In Kuwait +2773`7758`557 Kuwait Doha Qatar Dubai Abu Dhabi Sharj...
 
Al Mizhar Dubai Escorts +971561403006 Escorts Service In Al Mizhar
Al Mizhar Dubai Escorts +971561403006 Escorts Service In Al MizharAl Mizhar Dubai Escorts +971561403006 Escorts Service In Al Mizhar
Al Mizhar Dubai Escorts +971561403006 Escorts Service In Al Mizhar
 
Pre Engineered Building Manufacturers Hyderabad.pptx
Pre Engineered  Building Manufacturers Hyderabad.pptxPre Engineered  Building Manufacturers Hyderabad.pptx
Pre Engineered Building Manufacturers Hyderabad.pptx
 
Nashik Call Girl Just Call 7091819311 Top Class Call Girl Service Available
Nashik Call Girl Just Call 7091819311 Top Class Call Girl Service AvailableNashik Call Girl Just Call 7091819311 Top Class Call Girl Service Available
Nashik Call Girl Just Call 7091819311 Top Class Call Girl Service Available
 
Getting Real with AI - Columbus DAW - May 2024 - Nick Woo from AlignAI
Getting Real with AI - Columbus DAW - May 2024 - Nick Woo from AlignAIGetting Real with AI - Columbus DAW - May 2024 - Nick Woo from AlignAI
Getting Real with AI - Columbus DAW - May 2024 - Nick Woo from AlignAI
 
Jual Obat Aborsi ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan Cytotec
Jual Obat Aborsi ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan CytotecJual Obat Aborsi ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan Cytotec
Jual Obat Aborsi ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan Cytotec
 
How to Get Started in Social Media for Art League City
How to Get Started in Social Media for Art League CityHow to Get Started in Social Media for Art League City
How to Get Started in Social Media for Art League City
 
joint cost.pptx COST ACCOUNTING Sixteenth Edition ...
joint cost.pptx  COST ACCOUNTING  Sixteenth Edition                          ...joint cost.pptx  COST ACCOUNTING  Sixteenth Edition                          ...
joint cost.pptx COST ACCOUNTING Sixteenth Edition ...
 
CROSS CULTURAL NEGOTIATION BY PANMISEM NS
CROSS CULTURAL NEGOTIATION BY PANMISEM NSCROSS CULTURAL NEGOTIATION BY PANMISEM NS
CROSS CULTURAL NEGOTIATION BY PANMISEM NS
 
Horngren’s Cost Accounting A Managerial Emphasis, Canadian 9th edition soluti...
Horngren’s Cost Accounting A Managerial Emphasis, Canadian 9th edition soluti...Horngren’s Cost Accounting A Managerial Emphasis, Canadian 9th edition soluti...
Horngren’s Cost Accounting A Managerial Emphasis, Canadian 9th edition soluti...
 
Ooty Call Gril 80022//12248 Only For Sex And High Profile Best Gril Sex Avail...
Ooty Call Gril 80022//12248 Only For Sex And High Profile Best Gril Sex Avail...Ooty Call Gril 80022//12248 Only For Sex And High Profile Best Gril Sex Avail...
Ooty Call Gril 80022//12248 Only For Sex And High Profile Best Gril Sex Avail...
 
New 2024 Cannabis Edibles Investor Pitch Deck Template
New 2024 Cannabis Edibles Investor Pitch Deck TemplateNew 2024 Cannabis Edibles Investor Pitch Deck Template
New 2024 Cannabis Edibles Investor Pitch Deck Template
 

Verbose explanations about data for transcription

  • 1. Disclaimer: I may gloss over or poorly represent tangential responsibilities in this message to serve the purpose of verbosely illustrating data object creation and transport. json documents are pulled from extant samples and may not mirror precisely what will be created or be adjusted for current standards. This is not intended as a specification or work document. These are things that currently have to happen for a user to move from finding an image to transcription practice: 1. Host images at a stable URI (or URL) 2. Honor requests for image resizing, manipulation, cropping, etc. at a predictable URI pattern (IIIF) 3. Store all of the metadata associated with the imaged manuscripts and their folios 4. Query selected metadata (facets) for sets of matches among these collections 5. Provide endpoints for T-PEN to query for specific or sets of collections, manuscripts, and folios (manifests and canvases) 1-4 happen on the paleography site (within Islandora I believe) but 5 will have to be negotiated. Any data that is accessible in T-PEN will either need to be stored in T-PEN or resolved through use of the transcription project identifier (specifically, a SharedCanvas Manifest URI). Collections, Manuscripts, and Folios (at least) should all have a URI which resolves to a descriptive JSON-LD document. Let's take it in pieces: sc:Manifest { "@context" : "http://www.shared-canvas.org/ns/context.json", "@id" : "http://t-pen.org/Example+Simple/manifest.json", "@type" : "sc:Manifest", "label" : "Example Simple", "metadata" : "http://paleoberry.org/Example+Simple/metadata", "sequences" : [ { "@id" : "http://t-pen.org/Tradamus+Simple/sequence/normal", "@type" : "sc:Sequence", "label" : "Current Page Order", "canvases": [ "http://t-pen.org/Example+Simple/canvas/100r", "http://t-pen.org/Example+Simple/canvas/100v", "http://t-pen.org/Example+Simple/canvas/101r" ] } At its core, a manifest is just a sequence of canvases. Each new arrangement is a unique canvas. In fact, the IIIF manifest standard suggests "There are no semantics conveyed by [metadata] information, and clients should not use it for discovery or other purposes." In T-PEN, each project has its own sc:manifest, even if several people are working on the same
  • 2. manuscript. That said, there should be some URI a person or machine could dereference to see a JSON-LD file of Newberry's curated arrangement of images with all available associated metadata. However, the metadata field can itself be a URI, as it is just key-value pairs, meaning if Iter made available something like newberry.org/IIIF/SHELFMARK/metadata.json and returned JSON, any authorized user could make a legitimate and good sc:Manifest, including T- PEN or any Open Annotation Store. Bottom line: Official transcription projects will have manuscript metadata in Islandora, but the project sc:Manifest can be resolved via T-PEN and will include a reference to a metadata URI (spec). sc:Canvas { "@id" : "http://t-pen.org/Example+Simple/canvas/100r", "@type" : "sc:Canvas", "label" : "100r", "height" : 1000, "width" : 667, "images" : [ { "@type" : "oa:Annotation", "motivation" : "sc:painting", "resource" : { "@id" : "http://paleoberry.org/iiif/Example+Simple/res/100r.jpg", "@type" : "dctypes:Image", "format" : "image/jpeg", "height" : 2365, "width" : 1579 } ] "otherContent" : [ { "@id":"http://t-pen.org/Example+Simple/lines/100r", "@type":"sc:AnnotationList", "resources":[ "http://t-pen.org/Example+Simple/line/101083792", "http://t-pen.org/Example+Simple/line/101083842", "http://t-pen.org/Example+Simple/line/101083841" ... ] } ] } There does exist a sc:Canvas URI in the authoritative sc:Manifest, but as soon as a new project is created in T-PEN, it is copied so as not to pollute the original with annotations. Most of the metadata, if not all, is already covered in the sc:Manifest, so the label is the most important thing it carries. In fact, even the image resource does not have to be included and can also be
  • 3. just a URI (IIIF). This canvas is the target of all new annotations (image and transcription) and is stored and resolved through T-PEN. Bottom line: The canvases and manifests are linked through @id:URI. The canvas, in this case, has the image dereferenced, but it can be anything that results in an image (likely a IIIF URI). A ‘service’ property can carry additional instructions if special access is needed. sc:Annotation { “@id" : "http://t-pen.org/Tradamus+Simple/line/101083792", "@type" : "oa:Annotation", "motivation" : "tr:transcribing", "resource" : { "@type" : "cnt:ContentAsText", "cnt:chars" : "Infesto Trinitatis" }, "on" : "http://t-pen.org/Tradamus+Simple/canvas/100r#xywh=148,60,409,18" } The region, as it is rectangular, is abbreviated in the ‘on’ as “#xywh=” per the standard. The content of the transcription is a ‘resource’ that is ‘cnt:chars’ literal string here, but can be anything in the standard (XML, HTML, JSON, MEI, OGG). In T-PEN, for simplicity it is always a straight UTF-8 string with escapes for ” to avoid breaking JSON. Any authorized user can use the ‘@id’ to resolve this annotation as a JSON file and climb up to the canvas as well, if desired. Bottom line: This is CDH territory, but there are things needed to get this far that may not be instantly available. Accession When the paleography site wants to include a new manuscript for transcribing, it will need to create one of three links: 1. On the fly project creation ○ The use case for this is only to create the official transcriptions or for a user to start work on an untranscribed, but discoverable manuscript. ○ Send T-PEN a new sc:Manifest JSON object request that includes a label (unless this is generated from metadata), a metadata URI, and an ordered list of labelled images. ○ In a perfect world, this would be a skeletal, but otherwise well-formed sc:Manifest object so that T-PEN can simply extend it and return it when further URIs are minted. ○ The user is then passed into T-PEN with this new Manifest displayed as their new project. 2. Copy a project for transcription practice ○ Most users will follow this path to get into transcription.
  • 4. ○ Send T-PEN an existing sc:Manifest URI (such as the official transcription) with a copy request. ○ T-PEN creates a new manifest for the users project with the same sequence, labels, images, and lines, but without the transcription data. ○ Nothing is passed back to the paleography site. ○ The user starts a new project in T-PEN, related to the original in case they want to check their transcription work. 3. Work on a public project ○ Newberry can decide how heavily this is encouraged, but it is easy to do and may be a good way to crowd-source some transcriptions or line annotations. ○ Send T-PEN an existing sc:Manifest URI (such as the official transcription). ○ The user is authenticated and begins to transcribe on the public project as permissions allow. (This will not break the official transcription.) Bottom line: If T-PEN is the gatekeeper for the new sc:Manifest URIs, it will have to let Islandora know what it has minted (or agree to a strict convention) and Islandora will have to mint a URI for the manuscript metadata and construct something very manifest-esque to request its creation. If Islandora stores the whole manifest, T-PEN will have to let it know what URIs have been created for every new sc:Canvas in the sequence. There is no good case for canvases or annotations being stored outside of T-PEN.