SlideShare a Scribd company logo
1 of 49
Download to read offline
Data Archiving and Networked Services!
SHEBANQ!
Dirk Roorda - researcher @ DANS,TLA!
System for HEBrew Text: ANnotations
for Queries and Markup!
TEI pre-conference workshop: Query!
Roma – 2013-10-01!
Overview
1.  Context: text, data, research in Hebrew
Bible
2.  MdF database model, MQL query
language
3.  Sharing the research process
4.  CLARIN-NL project: SHEBANQ
5.  Towards new tools
1 (of 5) Context
Text, data and research in the Hebrew Bible
VU Amsterdam
Eep Talstra Centre for Bible and Computer
text + linguistic features => database
database + research questions => publications
4!
2 (of 5) MdF and MQL
•  MdF database model
•  MQL query language
Monad Object Feature
1977-now: Eep Talstra et al. ECA, WIVU.
Print reference (Google Books)
1988-1994 Crist-Jan Doedens: Text
Databases – One Database Model and
Several Retrieval Languages (google
books reference)
2004: Ulrik Petersen. Emdros - a text
database engine for analyzed or
annotated text. COLING
word objects
standard
edition
text
monads
(atomic chunks
of text)
lexeme_utf8= ‫ר‬‫א‬‫ׁש‬‫י‬‫ת‬
old_lexeme_utf8= ‫ר‬‫א‬‫ׁש‬‫י‬‫ת‬
vocalized_lexeme_utf8= ‫ֵר‬‫א‬‫ׁש‬ִ‫י‬‫ת‬
surface_consonants_utf8= ‫ר‬‫א‬‫ׁש‬‫י‬‫ת‬
graphical_lexeme_utf8= ‫ֵר‬‫א‬‫ׁש‬ִ֖‫י‬
‫בּ‬ְ‫ר‬ֵ‫א‬‫שׁ‬ִ֖‫י‬‫ת‬‫בּ‬ָ‫ר‬ָ֣‫א‬‫א‬ֱ.‫ה‬ִ֑‫י‬‫ם‬‫א‬ֵ֥‫ת‬‫ה‬ַ‫שּׁ‬ָ‫מ‬ַ֖‫י‬ִ‫ם‬‫ְו‬‫א‬ֵ֥‫ת‬‫ה‬ָ‫א‬ָֽ‫ר‬ֶ‫ץ‬‫׃‬
1234567891011
23456789101112
84383
59559
34680
7763777638
40770
7 .. 511 .. 9
11 .. 5
11 .. 5
11 .. 1
11 .. 1
clause_atom_number=1
clause_atom_relation=0
clause_atom_relation_daughter_tense=unknown
clause_atom_relation_kind=No_relation
clause_atom_relation_mother_tense=unknown
clause_atom_relation_preposition_class=none
clause_atom_type=xQtl
indentation=0
phrase objects
Monad-Object-Feature
subphrase objects
phrase_atom
objects
clause_atom
objects
sentence objects
MQL query language
topographic, i.e:
query expression =~= query results w.r.t.
•  sequence
•  embedding
Example
SELECT ALL OBJECTS!
WHERE!
[Clause!
[Phrase!
[Word FOCUS !
" " "part_of_speech = verb AND !
" " "lexeme = "FJM["]!
]!
..!
[Phrase FOCUS!
" "phrase_function = Objc OR!
" "phrase_function = IrpO!
]!
..!
[Phrase FOCUS!
" "phrase_function = Objc OR!
" "phrase_function = IrpO!
]!
]!
3 (of 5) Sharing
Problem: how to share (intermediate)
results of analysis
Solution: saving queries as annotations
Lock - in
scholarly-bibles.com!
Stuttgart Electronic Study Bible
⇒  massive dissemination
But
⇒ not the right dynamics for
tool development
Leiden: international workshop
biblical scholarship
Desiderata:
new tool development
text transmission (variants)
linguistic analysis (features)
even combined!
a short history: 2012
leiden lorentz!
Hebrew Text in the Archive
urn:nbn:nl:ui:13-ikjj-ek!
Hebrew Text in the Archive
urn:nbn:nl:ui:13-ikjj-ek!
how can the
people annotate
our work?!
Research Data Cycle
Research Data Cycle
Text transmission,
tradition, editorial
processes
Free University,
theology faculty,
server department,
WIVU project
!
NWO projects!NWO projects
religious
communities
theol.
scholars
theol.
scholars
enlightened lay
people
scholarly-
ibles.com!
Research Data Cycle
Text transmission,
tradition, editorial
processes
Free University,
theology faculty,
server department,
WIVU project
!
NWO projects!NWO projects
religious
communities
theol.
scholars
theol.
scholars
CLARIN
SHEBANQ
linguists
Wider public:
Annotation,
Query Saving,
via Linked Data
dig. hum
comp. hum
enlightened lay
people
scholarly-
ibles.com!
Research Data
Archiving
DANS
3 (of 5) Sharing (c’t’d)
Solution: Queries As Annotations
queries-as-annotations
model! query! example!
body! query instruction!
SELECT ALL OBJECTS WHERE [Word
FOCUS part_of_speech = verb AND
lexeme = "‫!]"שים‬
targets!
query results in
context!
‫ׁר‬ֶ‫ש‬ֲ‫א‬ ֙‫ן‬ֶ‫ב‬ֶ֨‫א‬ ָ‫ה‬ ‫ֶת‬‫א‬ ‫֤ח‬ַּ‫ק‬ִּ‫י‬ ַ‫ו‬ ‫ֶר‬‫ק‬ֹּ֗‫ב‬ ַּ‫ב‬ ‫֜ב‬ֹ‫ק‬ֲ‫ע‬ַ‫י‬ ‫֨ם‬ֵּ‫כ‬ְׁ‫ש‬ַּ‫י‬ ַ‫ו‬
‫ֶן‬‫מ‬ֶׁ֖‫ש‬ ‫֥ק‬ֹ‫צ‬ִּ‫י‬ ַ‫ו‬ ‫֑ה‬ָ‫ב‬ֵּ‫צ‬ַ‫מ‬ ּ‫ה‬ָ֖‫ת‬ֹ‫א‬ ‫ׂם‬ֶ‫ש‬ָּ֥‫י‬ ַ‫ו‬ ‫֔יו‬ָ‫ת‬ֹׁ‫ש‬ֲ‫א‬ַֽ‫ר‬ְ‫מ‬ ‫֣ם‬ָׂ‫ש‬

ּ‫ה‬ָֽׁ‫ש‬‫ֹא‬‫ר‬ ‫ַל‬‫ע‬
annotation! published query! qu123 (just an identifier)!
metadata!
researcher, date
created, date last
run, research
question!
Janet Dyk 2004-02-16 2012-01-27
Can the verb ‫ים‬ִׂ‫ש‬ have a double
object? - article in Foundations
for Syriac Lexicography!
OpenAnnotation openannotation.org!
provenance
motivation
demonstrator datanetworkservice.nl/qaa!
demonstrator datanetworkservice.nl/qaa!
demonstrator datanetworkservice.nl/qaa!
demonstrator datanetworkservice.nl/qaa!
demonstrator
demonstrator
demonstrator
demonstrator
still missing:
saving queries
not semantic-web-enabled
sustainability
4 (of 5) Project
CLARIN-NL: SHEBANQ:
(A) Curation
(B) Demonstrator
SHEBANQ
System for Hebrew Text:
ANnotations for Queries
CLARIN-NL project
data curation: LAF
demonstrator: query saver
#!/etc bc
s/g$/q/!
Linguistic Annotation Framework
ISO 24612:2012
Nancy Ide, Laurent Romary
feature definitions
feature definitions
TEI ISO-FS schema
dcr:datcat on <fDecl> versus <f>
26,225,966 <f>s!
!
2.5 GB redundant
attribute
material !
!
5 (of 5) Project
CLARIN-NL: SHEBANQ: (B) Demonstrator
select all objects where
[clause
[phrase phrase_function = Objc
[word FOCUS tense = infinitive_absolute]
]
]
Execute
Query executed
Passage
‫ּב‬ְ‫ֵר‬‫א‬‫ׁש‬ִ֖‫י‬‫ת‬‫ּב‬ָ‫ָר‬֣‫א‬‫א‬ֱ‫ֹל‬‫ה‬ִ֑‫י‬‫ם‬‫א‬ֵ֥‫ת‬‫ה‬ַ‫ּׁש‬ָ‫מ‬ַ֖‫י‬ִ‫ם‬‫ו‬ְ‫א‬ֵ֥‫ת‬
‫ה‬ָ‫א‬ָֽ‫ֶר‬‫ץ‬‫׃‬
‫ו‬ַ‫ּי‬ֹ֥‫א‬‫מ‬ֶ‫ר‬‫ח‬ִ‫ז‬ְ‫ִק‬‫ּי‬ָ֖‫ה‬‫ּו‬‫מ‬ָ֣‫ה‬‫א‬ֹ֑‫ו‬‫ת‬‫ּכ‬ִ֥‫י‬‫א‬ֶ‫ע‬ֱ‫ל‬ֶ֖‫ה‬‫ּב‬ֵ֥‫י‬‫ת‬
‫י‬ְ‫ה‬‫ו‬ָֽ‫ה‬‫׃‬
Controls
‫ו‬ַ‫ּי‬ֹ֥‫א‬‫מ‬ֶ‫ר‬‫ח‬ִ‫ז‬ְ‫ִק‬‫ּי‬ָ֖‫ה‬‫ּו‬‫מ‬ָ֣‫ה‬‫א‬ֹ֑‫ו‬‫ת‬‫ּכ‬ִ֥‫י‬‫א‬ֶ‫ע‬ֱ‫ל‬ֶ֖‫ה‬‫ּב‬ֵ֥‫י‬‫ת‬
‫י‬ְ‫ה‬‫ו‬ָֽ‫ה‬‫׃‬
Gen 1:1
2Chron 3:4
Gen 1:1
‫ּב‬ְ‫ֵר‬‫א‬‫ׁש‬ִ֖‫י‬‫ת‬‫ּב‬ָ‫ָר‬֣‫א‬‫א‬ֱ‫ֹל‬‫ה‬ִ֑‫י‬‫ם‬‫א‬ֵ֥‫ת‬‫ה‬ַ‫ּׁש‬ָ‫מ‬ַ֖‫י‬ִ‫ם‬‫ו‬ְ‫א‬ֵ֥‫ת‬
‫ה‬ָ‫א‬ָֽ‫ֶר‬‫ץ‬‫׃‬
‫ו‬ַ‫ּי‬ֹ֥‫א‬‫מ‬ֶ‫ר‬‫ח‬ִ‫ז‬ְ‫ִק‬‫ּי‬ָ֖‫ה‬‫ּו‬‫מ‬ָ֣‫ה‬‫א‬ֹ֑‫ו‬‫ת‬‫ּכ‬ִ֥‫י‬‫א‬ֶ‫ע‬ֱ‫ל‬ֶ֖‫ה‬‫ּב‬ֵ֥‫י‬‫ת‬
‫י‬ְ‫ה‬‫ו‬ָֽ‫ה‬‫׃‬
Text
1Sam 12:4
Ex 23:2
Query results
Prev 2 3 65 ... 2241 Next21 313 results
Executing query ...
view in context
Save this query
Researcher Oliver Glanz
Date created 2013-08-25
Date last run 2013-08-25
Project Data and Tradition
Institute VU/Eep Talstra Centre for Bible and Computing
Reason irregular valency of ‫ּב‬ָ‫ָר‬֣‫א‬
Comments
needs to be combined with query on ‫א‬ֱ‫ֹל‬‫ה‬ִ֑‫י‬‫ם‬
Save PublishCancel
Name valency ‫ּב‬ָ‫ָר‬֣‫א‬
Edit Query
Passage
‫ּב‬ְ‫ֵר‬‫א‬‫ׁש‬ִ֖‫י‬‫ת‬‫ּב‬ָ‫ָר‬֣‫א‬‫א‬ֱ‫ֹל‬‫ה‬ִ֑‫י‬‫ם‬‫א‬ֵ֥‫ת‬‫ה‬ַ‫ּׁש‬ָ‫מ‬ַ֖‫י‬ִ‫ם‬‫ו‬ְ‫א‬ֵ֥‫ת‬
‫ה‬ָ‫א‬ָֽ‫ֶר‬‫ץ‬‫׃‬
‫ו‬ַ‫ּי‬ֹ֥‫א‬‫מ‬ֶ‫ר‬‫ח‬ִ‫ז‬ְ‫ִק‬‫ּי‬ָ֖‫ה‬‫ּו‬‫מ‬ָ֣‫ה‬‫א‬ֹ֑‫ו‬‫ת‬‫ּכ‬ִ֥‫י‬‫א‬ֶ‫ע‬ֱ‫ל‬ֶ֖‫ה‬‫ּב‬ֵ֥‫י‬‫ת‬
‫י‬ְ‫ה‬‫ו‬ָֽ‫ה‬‫׃‬
Controls
‫ו‬ַ‫ּי‬ֹ֥‫א‬‫מ‬ֶ‫ר‬‫ח‬ִ‫ז‬ְ‫ִק‬‫ּי‬ָ֖‫ה‬‫ּו‬‫מ‬ָ֣‫ה‬‫א‬ֹ֑‫ו‬‫ת‬‫ּכ‬ִ֥‫י‬‫א‬ֶ‫ע‬ֱ‫ל‬ֶ֖‫ה‬‫ּב‬ֵ֥‫י‬‫ת‬
‫י‬ְ‫ה‬‫ו‬ָֽ‫ה‬‫׃‬
Gen 1:1
2Chron 3:4
Gen 1:1
‫ּב‬ְ‫ֵר‬‫א‬‫ׁש‬ִ֖‫י‬‫ת‬‫ּב‬ָ‫ָר‬֣‫א‬‫א‬ֱ‫ֹל‬‫ה‬ִ֑‫י‬‫ם‬‫א‬ֵ֥‫ת‬‫ה‬ַ‫ּׁש‬ָ‫מ‬ַ֖‫י‬ִ‫ם‬‫ו‬ְ‫א‬ֵ֥‫ת‬
‫ה‬ָ‫א‬ָֽ‫ֶר‬‫ץ‬‫׃‬
‫ו‬ַ‫ּי‬ֹ֥‫א‬‫מ‬ֶ‫ר‬‫ח‬ִ‫ז‬ְ‫ִק‬‫ּי‬ָ֖‫ה‬‫ּו‬‫מ‬ָ֣‫ה‬‫א‬ֹ֑‫ו‬‫ת‬‫ּכ‬ִ֥‫י‬‫א‬ֶ‫ע‬ֱ‫ל‬ֶ֖‫ה‬‫ּב‬ֵ֥‫י‬‫ת‬
‫י‬ְ‫ה‬‫ו‬ָֽ‫ה‬‫׃‬
Text
1Sam 12:4
Ex 23:2
Saved Query Results
Prev 2 3 65 ... 2241 Next21 313 results
view in context
Information on this query
Researcher Oliver Glanz
Date created 2013-08-25
Date last run 2013-08-25
Project
Institute
Reason
Comments
Name
Query Info
select all objects where
[clause
[phrase phrase_function = Objc
[word FOCUS tense = infinitive_absolute]
]
]
MQL query text Persistent Identifier urn:nbn:nl:ui:13-scpm-ji
http://www.persistent-identifier.nl/?identifier=urn...
valency ‫ּב‬ָ‫ָר‬֣‫א‬
Data and Tradition
VU/Eep Talstra Centre for Bible and Computing
irregular valency of ‫ּב‬ָ‫ָר‬֣‫א‬
needs to be combined with query on ‫א‬ֱ‫ֹל‬‫ה‬ִ֑‫י‬‫ם‬
datanetworkservice.nl/qaa!
SHEBANQ: implementing Q-a-A
5 (of 5) Towards new tools
•  LAF tools
•  or generic graph algorithms
•  Emdros tools
•  or generic database technology
•  Linked Data tools
•  or generic SPARQL queries
Side conditions
•  development close to the researchers
•  preferably in their own institutions
•  decent performance
•  within the scale of a laptop
•  usable to researchers
•  that is: non-programmers
•  persistence in mind
•  new results will be archived and re-
enter the data cycle
thank you
dirk.roorda@dans.knaw.nl
slideshare.net/dirkroorda/
s/g$/q/!
#!/etc bc
Eep Talstra Centre for Bible and Computer!

More Related Content

What's hot

SSSW2015 Data Workflow Tutorial
SSSW2015 Data Workflow TutorialSSSW2015 Data Workflow Tutorial
SSSW2015 Data Workflow TutorialSSSW
 
The end of the scientific paper as we know it (in 4 easy steps)
The end of the scientific paper as we know it (in 4 easy steps)The end of the scientific paper as we know it (in 4 easy steps)
The end of the scientific paper as we know it (in 4 easy steps)Frank van Harmelen
 
The end of the scientific paper as we know it (or not...)
The end of the scientific paper as we know it (or not...)The end of the scientific paper as we know it (or not...)
The end of the scientific paper as we know it (or not...)Frank van Harmelen
 
Linked Data for Libraries: Experiments between Cornell, Harvard and Stanford
Linked Data for Libraries: Experiments between Cornell, Harvard and StanfordLinked Data for Libraries: Experiments between Cornell, Harvard and Stanford
Linked Data for Libraries: Experiments between Cornell, Harvard and StanfordSimeon Warner
 
Data and Donuts: Data organization
Data and Donuts: Data organizationData and Donuts: Data organization
Data and Donuts: Data organizationC. Tobin Magle
 
2009 0807 Lod Gmod
2009 0807 Lod Gmod2009 0807 Lod Gmod
2009 0807 Lod GmodJun Zhao
 
Unstructured Or: How I Learned to Stop Worrying and Love the xml, Presented...
Unstructured   Or: How I Learned to Stop Worrying and Love the xml, Presented...Unstructured   Or: How I Learned to Stop Worrying and Love the xml, Presented...
Unstructured Or: How I Learned to Stop Worrying and Love the xml, Presented...Lucidworks (Archived)
 
Linked APIs for Life Sciences Tutorial at SWAT4LS 3011
Linked APIs for Life Sciences Tutorial at SWAT4LS 3011Linked APIs for Life Sciences Tutorial at SWAT4LS 3011
Linked APIs for Life Sciences Tutorial at SWAT4LS 3011sspeiser
 
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...DuraSpace
 
Multilingual presentation ifla 2013 08-19
Multilingual presentation ifla 2013 08-19Multilingual presentation ifla 2013 08-19
Multilingual presentation ifla 2013 08-19Janifer Gatenby
 
Datat and donuts: how to write a data management plan
Datat and donuts: how to write a data management planDatat and donuts: how to write a data management plan
Datat and donuts: how to write a data management planC. Tobin Magle
 
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...Marko Rodriguez
 
Maximising (Re)Usability of Library metadata using Linked Data
Maximising (Re)Usability of Library metadata using Linked Data Maximising (Re)Usability of Library metadata using Linked Data
Maximising (Re)Usability of Library metadata using Linked Data Asuncion Gomez-Perez
 

What's hot (17)

4-Managing CrossRef DOIs
4-Managing CrossRef DOIs4-Managing CrossRef DOIs
4-Managing CrossRef DOIs
 
SSSW2015 Data Workflow Tutorial
SSSW2015 Data Workflow TutorialSSSW2015 Data Workflow Tutorial
SSSW2015 Data Workflow Tutorial
 
The end of the scientific paper as we know it (in 4 easy steps)
The end of the scientific paper as we know it (in 4 easy steps)The end of the scientific paper as we know it (in 4 easy steps)
The end of the scientific paper as we know it (in 4 easy steps)
 
The end of the scientific paper as we know it (or not...)
The end of the scientific paper as we know it (or not...)The end of the scientific paper as we know it (or not...)
The end of the scientific paper as we know it (or not...)
 
Clark - Metadata is the Message
Clark - Metadata is the MessageClark - Metadata is the Message
Clark - Metadata is the Message
 
Linked Data for Libraries: Experiments between Cornell, Harvard and Stanford
Linked Data for Libraries: Experiments between Cornell, Harvard and StanfordLinked Data for Libraries: Experiments between Cornell, Harvard and Stanford
Linked Data for Libraries: Experiments between Cornell, Harvard and Stanford
 
Data and Donuts: Data organization
Data and Donuts: Data organizationData and Donuts: Data organization
Data and Donuts: Data organization
 
2009 0807 Lod Gmod
2009 0807 Lod Gmod2009 0807 Lod Gmod
2009 0807 Lod Gmod
 
Unstructured Or: How I Learned to Stop Worrying and Love the xml, Presented...
Unstructured   Or: How I Learned to Stop Worrying and Love the xml, Presented...Unstructured   Or: How I Learned to Stop Worrying and Love the xml, Presented...
Unstructured Or: How I Learned to Stop Worrying and Love the xml, Presented...
 
Linked APIs for Life Sciences Tutorial at SWAT4LS 3011
Linked APIs for Life Sciences Tutorial at SWAT4LS 3011Linked APIs for Life Sciences Tutorial at SWAT4LS 3011
Linked APIs for Life Sciences Tutorial at SWAT4LS 3011
 
Royal society of chemistry activities to develop a data repository for chemis...
Royal society of chemistry activities to develop a data repository for chemis...Royal society of chemistry activities to develop a data repository for chemis...
Royal society of chemistry activities to develop a data repository for chemis...
 
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
 
Multilingual presentation ifla 2013 08-19
Multilingual presentation ifla 2013 08-19Multilingual presentation ifla 2013 08-19
Multilingual presentation ifla 2013 08-19
 
Open innovation contributions from RSC resulting from the Open Phacts project
Open innovation contributions from RSC resulting from the Open Phacts projectOpen innovation contributions from RSC resulting from the Open Phacts project
Open innovation contributions from RSC resulting from the Open Phacts project
 
Datat and donuts: how to write a data management plan
Datat and donuts: how to write a data management planDatat and donuts: how to write a data management plan
Datat and donuts: how to write a data management plan
 
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
 
Maximising (Re)Usability of Library metadata using Linked Data
Maximising (Re)Usability of Library metadata using Linked Data Maximising (Re)Usability of Library metadata using Linked Data
Maximising (Re)Usability of Library metadata using Linked Data
 

Similar to Shebanq roma-2013-10-01

Data management for researchers
Data management for researchersData management for researchers
Data management for researchersDirk Roorda
 
Data Designed for Discovery
Data Designed for DiscoveryData Designed for Discovery
Data Designed for DiscoveryOCLC
 
The web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedThe web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedSören Auer
 
An Open Context for Archaeology
An Open Context for ArchaeologyAn Open Context for Archaeology
An Open Context for Archaeologyguest756e05
 
The PhD Abstracts Collections in FLAX: Academic English with the Open Access ...
The PhD Abstracts Collections in FLAX: Academic English with the Open Access ...The PhD Abstracts Collections in FLAX: Academic English with the Open Access ...
The PhD Abstracts Collections in FLAX: Academic English with the Open Access ...Alannah Fitzgerald
 
VRA_2015_CatalogingRoundup_Seneff
VRA_2015_CatalogingRoundup_SeneffVRA_2015_CatalogingRoundup_Seneff
VRA_2015_CatalogingRoundup_SeneffHeather Seneff
 
JISC repositories and preservation programme: Plenary presentation 2009
JISC repositories and preservation programme: Plenary presentation 2009JISC repositories and preservation programme: Plenary presentation 2009
JISC repositories and preservation programme: Plenary presentation 2009Kevin Ashley
 
Uk discovery-jisc-project-showcase
Uk discovery-jisc-project-showcaseUk discovery-jisc-project-showcase
Uk discovery-jisc-project-showcaseRDTF-Discovery
 
Digital library literature nabi hasan and mukhtiar singh at ICDL-2013
Digital library literature nabi hasan and mukhtiar singh at ICDL-2013Digital library literature nabi hasan and mukhtiar singh at ICDL-2013
Digital library literature nabi hasan and mukhtiar singh at ICDL-2013Nabi Hasan
 
Chaos&Order: Using visualization as a means to
 explore large heritage collec...
Chaos&Order: Using visualization as a means to
 explore large heritage collec...Chaos&Order: Using visualization as a means to
 explore large heritage collec...
Chaos&Order: Using visualization as a means to
 explore large heritage collec...TimelessFuture
 
Academic English With The Electronic Theses Online Service (EThOS) At The Bri...
Academic English With The Electronic Theses Online Service (EThOS) At The Bri...Academic English With The Electronic Theses Online Service (EThOS) At The Bri...
Academic English With The Electronic Theses Online Service (EThOS) At The Bri...Martha Brown
 
Beyond the catalogue : BibFrame, Linked Data and Ending the Invisible Library
Beyond the catalogue : BibFrame, Linked Data and Ending the 	Invisible LibraryBeyond the catalogue : BibFrame, Linked Data and Ending the 	Invisible Library
Beyond the catalogue : BibFrame, Linked Data and Ending the Invisible LibraryKsenija Mincic Obradovic
 
Linked data presentation for libraries (COMO)
Linked data presentation for libraries (COMO)Linked data presentation for libraries (COMO)
Linked data presentation for libraries (COMO)robin fay
 
CNI fall 2009 enhanced publications john_doove-SURFfoundation
CNI fall 2009 enhanced publications john_doove-SURFfoundationCNI fall 2009 enhanced publications john_doove-SURFfoundation
CNI fall 2009 enhanced publications john_doove-SURFfoundationJohn Doove
 
Class 5-introto dl
Class 5-introto dlClass 5-introto dl
Class 5-introto dlmadhuvardhan
 
Class 5-introto dl
Class 5-introto dlClass 5-introto dl
Class 5-introto dlmadhuvardhan
 
Cambridge university library ess update for ucs
Cambridge university library  ess update for ucsCambridge university library  ess update for ucs
Cambridge university library ess update for ucsEdmund Chamberlain
 

Similar to Shebanq roma-2013-10-01 (20)

Shebanq gniezno
Shebanq gnieznoShebanq gniezno
Shebanq gniezno
 
Data management for researchers
Data management for researchersData management for researchers
Data management for researchers
 
Data Designed for Discovery
Data Designed for DiscoveryData Designed for Discovery
Data Designed for Discovery
 
Resources, resources, resources: the three rs of the Web
Resources, resources, resources: the three rs of the WebResources, resources, resources: the three rs of the Web
Resources, resources, resources: the three rs of the Web
 
The web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedThe web of interlinked data and knowledge stripped
The web of interlinked data and knowledge stripped
 
An Open Context for Archaeology
An Open Context for ArchaeologyAn Open Context for Archaeology
An Open Context for Archaeology
 
Saving Queries
Saving QueriesSaving Queries
Saving Queries
 
The PhD Abstracts Collections in FLAX: Academic English with the Open Access ...
The PhD Abstracts Collections in FLAX: Academic English with the Open Access ...The PhD Abstracts Collections in FLAX: Academic English with the Open Access ...
The PhD Abstracts Collections in FLAX: Academic English with the Open Access ...
 
VRA_2015_CatalogingRoundup_Seneff
VRA_2015_CatalogingRoundup_SeneffVRA_2015_CatalogingRoundup_Seneff
VRA_2015_CatalogingRoundup_Seneff
 
JISC repositories and preservation programme: Plenary presentation 2009
JISC repositories and preservation programme: Plenary presentation 2009JISC repositories and preservation programme: Plenary presentation 2009
JISC repositories and preservation programme: Plenary presentation 2009
 
Uk discovery-jisc-project-showcase
Uk discovery-jisc-project-showcaseUk discovery-jisc-project-showcase
Uk discovery-jisc-project-showcase
 
Digital library literature nabi hasan and mukhtiar singh at ICDL-2013
Digital library literature nabi hasan and mukhtiar singh at ICDL-2013Digital library literature nabi hasan and mukhtiar singh at ICDL-2013
Digital library literature nabi hasan and mukhtiar singh at ICDL-2013
 
Chaos&Order: Using visualization as a means to
 explore large heritage collec...
Chaos&Order: Using visualization as a means to
 explore large heritage collec...Chaos&Order: Using visualization as a means to
 explore large heritage collec...
Chaos&Order: Using visualization as a means to
 explore large heritage collec...
 
Academic English With The Electronic Theses Online Service (EThOS) At The Bri...
Academic English With The Electronic Theses Online Service (EThOS) At The Bri...Academic English With The Electronic Theses Online Service (EThOS) At The Bri...
Academic English With The Electronic Theses Online Service (EThOS) At The Bri...
 
Beyond the catalogue : BibFrame, Linked Data and Ending the Invisible Library
Beyond the catalogue : BibFrame, Linked Data and Ending the 	Invisible LibraryBeyond the catalogue : BibFrame, Linked Data and Ending the 	Invisible Library
Beyond the catalogue : BibFrame, Linked Data and Ending the Invisible Library
 
Linked data presentation for libraries (COMO)
Linked data presentation for libraries (COMO)Linked data presentation for libraries (COMO)
Linked data presentation for libraries (COMO)
 
CNI fall 2009 enhanced publications john_doove-SURFfoundation
CNI fall 2009 enhanced publications john_doove-SURFfoundationCNI fall 2009 enhanced publications john_doove-SURFfoundation
CNI fall 2009 enhanced publications john_doove-SURFfoundation
 
Class 5-introto dl
Class 5-introto dlClass 5-introto dl
Class 5-introto dl
 
Class 5-introto dl
Class 5-introto dlClass 5-introto dl
Class 5-introto dl
 
Cambridge university library ess update for ucs
Cambridge university library  ess update for ucsCambridge university library  ess update for ucs
Cambridge university library ess update for ucs
 

More from Dirk Roorda

General Missives
General MissivesGeneral Missives
General MissivesDirk Roorda
 
Text Display (when it gets tricky)
Text Display (when it gets tricky)Text Display (when it gets tricky)
Text Display (when it gets tricky)Dirk Roorda
 
Quran and Text-Fabric
Quran and Text-FabricQuran and Text-Fabric
Quran and Text-FabricDirk Roorda
 
Ancient corpora analysis
Ancient corpora analysisAncient corpora analysis
Ancient corpora analysisDirk Roorda
 
Verbal Valency in Hebrew Verbs
Verbal Valency in Hebrew VerbsVerbal Valency in Hebrew Verbs
Verbal Valency in Hebrew VerbsDirk Roorda
 
Annotating the Hebrew Bible
Annotating the Hebrew BibleAnnotating the Hebrew Bible
Annotating the Hebrew BibleDirk Roorda
 
20151111 utrecht ver theolbibliothecarissen
20151111 utrecht ver theolbibliothecarissen20151111 utrecht ver theolbibliothecarissen
20151111 utrecht ver theolbibliothecarissenDirk Roorda
 
Text as Data: processing the Hebrew Bible
Text as Data: processing the Hebrew BibleText as Data: processing the Hebrew Bible
Text as Data: processing the Hebrew BibleDirk Roorda
 
Datamanagement for Research: A Case Study
Datamanagement for Research: A Case StudyDatamanagement for Research: A Case Study
Datamanagement for Research: A Case StudyDirk Roorda
 
Datamanagement for Research: A Case Study
Datamanagement for Research: A Case StudyDatamanagement for Research: A Case Study
Datamanagement for Research: A Case StudyDirk Roorda
 
Hebrew Bible as Data: Laboratory, Sharing, Lessons
Hebrew Bible as Data: Laboratory, Sharing, LessonsHebrew Bible as Data: Laboratory, Sharing, Lessons
Hebrew Bible as Data: Laboratory, Sharing, LessonsDirk Roorda
 
Laf fabric-dh benelux2014
Laf fabric-dh benelux2014Laf fabric-dh benelux2014
Laf fabric-dh benelux2014Dirk Roorda
 
Data Analysis in the Hebrew Bible
Data Analysis in the Hebrew BibleData Analysis in the Hebrew Bible
Data Analysis in the Hebrew BibleDirk Roorda
 

More from Dirk Roorda (20)

TF-FAIR.pdf
TF-FAIR.pdfTF-FAIR.pdf
TF-FAIR.pdf
 
Textpy
TextpyTextpy
Textpy
 
General Missives
General MissivesGeneral Missives
General Missives
 
Text Display (when it gets tricky)
Text Display (when it gets tricky)Text Display (when it gets tricky)
Text Display (when it gets tricky)
 
Tf in-context
Tf in-contextTf in-context
Tf in-context
 
Quran and Text-Fabric
Quran and Text-FabricQuran and Text-Fabric
Quran and Text-Fabric
 
Ancient corpora analysis
Ancient corpora analysisAncient corpora analysis
Ancient corpora analysis
 
Qdf2tf
Qdf2tfQdf2tf
Qdf2tf
 
Text fabric
Text fabricText fabric
Text fabric
 
Verbal Valency in Hebrew Verbs
Verbal Valency in Hebrew VerbsVerbal Valency in Hebrew Verbs
Verbal Valency in Hebrew Verbs
 
Annotating the Hebrew Bible
Annotating the Hebrew BibleAnnotating the Hebrew Bible
Annotating the Hebrew Bible
 
20151111 utrecht ver theolbibliothecarissen
20151111 utrecht ver theolbibliothecarissen20151111 utrecht ver theolbibliothecarissen
20151111 utrecht ver theolbibliothecarissen
 
Text as Data: processing the Hebrew Bible
Text as Data: processing the Hebrew BibleText as Data: processing the Hebrew Bible
Text as Data: processing the Hebrew Bible
 
Datamanagement for Research: A Case Study
Datamanagement for Research: A Case StudyDatamanagement for Research: A Case Study
Datamanagement for Research: A Case Study
 
Award
AwardAward
Award
 
Datamanagement for Research: A Case Study
Datamanagement for Research: A Case StudyDatamanagement for Research: A Case Study
Datamanagement for Research: A Case Study
 
Hebrew Bible as Data: Laboratory, Sharing, Lessons
Hebrew Bible as Data: Laboratory, Sharing, LessonsHebrew Bible as Data: Laboratory, Sharing, Lessons
Hebrew Bible as Data: Laboratory, Sharing, Lessons
 
Laf fabric-dh benelux2014
Laf fabric-dh benelux2014Laf fabric-dh benelux2014
Laf fabric-dh benelux2014
 
Data Analysis in the Hebrew Bible
Data Analysis in the Hebrew BibleData Analysis in the Hebrew Bible
Data Analysis in the Hebrew Bible
 
LAF Fabric
LAF FabricLAF Fabric
LAF Fabric
 

Recently uploaded

TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????blackmambaettijean
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 

Recently uploaded (20)

TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 

Shebanq roma-2013-10-01

  • 1. Data Archiving and Networked Services! SHEBANQ! Dirk Roorda - researcher @ DANS,TLA! System for HEBrew Text: ANnotations for Queries and Markup! TEI pre-conference workshop: Query! Roma – 2013-10-01!
  • 2. Overview 1.  Context: text, data, research in Hebrew Bible 2.  MdF database model, MQL query language 3.  Sharing the research process 4.  CLARIN-NL project: SHEBANQ 5.  Towards new tools
  • 3. 1 (of 5) Context Text, data and research in the Hebrew Bible
  • 4. VU Amsterdam Eep Talstra Centre for Bible and Computer text + linguistic features => database database + research questions => publications 4!
  • 5. 2 (of 5) MdF and MQL •  MdF database model •  MQL query language
  • 6. Monad Object Feature 1977-now: Eep Talstra et al. ECA, WIVU. Print reference (Google Books) 1988-1994 Crist-Jan Doedens: Text Databases – One Database Model and Several Retrieval Languages (google books reference) 2004: Ulrik Petersen. Emdros - a text database engine for analyzed or annotated text. COLING
  • 7. word objects standard edition text monads (atomic chunks of text) lexeme_utf8= ‫ר‬‫א‬‫ׁש‬‫י‬‫ת‬ old_lexeme_utf8= ‫ר‬‫א‬‫ׁש‬‫י‬‫ת‬ vocalized_lexeme_utf8= ‫ֵר‬‫א‬‫ׁש‬ִ‫י‬‫ת‬ surface_consonants_utf8= ‫ר‬‫א‬‫ׁש‬‫י‬‫ת‬ graphical_lexeme_utf8= ‫ֵר‬‫א‬‫ׁש‬ִ֖‫י‬ ‫בּ‬ְ‫ר‬ֵ‫א‬‫שׁ‬ִ֖‫י‬‫ת‬‫בּ‬ָ‫ר‬ָ֣‫א‬‫א‬ֱ.‫ה‬ִ֑‫י‬‫ם‬‫א‬ֵ֥‫ת‬‫ה‬ַ‫שּׁ‬ָ‫מ‬ַ֖‫י‬ִ‫ם‬‫ְו‬‫א‬ֵ֥‫ת‬‫ה‬ָ‫א‬ָֽ‫ר‬ֶ‫ץ‬‫׃‬ 1234567891011 23456789101112 84383 59559 34680 7763777638 40770 7 .. 511 .. 9 11 .. 5 11 .. 5 11 .. 1 11 .. 1 clause_atom_number=1 clause_atom_relation=0 clause_atom_relation_daughter_tense=unknown clause_atom_relation_kind=No_relation clause_atom_relation_mother_tense=unknown clause_atom_relation_preposition_class=none clause_atom_type=xQtl indentation=0 phrase objects Monad-Object-Feature subphrase objects phrase_atom objects clause_atom objects sentence objects
  • 8. MQL query language topographic, i.e: query expression =~= query results w.r.t. •  sequence •  embedding
  • 9. Example SELECT ALL OBJECTS! WHERE! [Clause! [Phrase! [Word FOCUS ! " " "part_of_speech = verb AND ! " " "lexeme = "FJM["]! ]! ..! [Phrase FOCUS! " "phrase_function = Objc OR! " "phrase_function = IrpO! ]! ..! [Phrase FOCUS! " "phrase_function = Objc OR! " "phrase_function = IrpO! ]! ]!
  • 10. 3 (of 5) Sharing Problem: how to share (intermediate) results of analysis Solution: saving queries as annotations
  • 11. Lock - in scholarly-bibles.com! Stuttgart Electronic Study Bible ⇒  massive dissemination But ⇒ not the right dynamics for tool development
  • 12. Leiden: international workshop biblical scholarship Desiderata: new tool development text transmission (variants) linguistic analysis (features) even combined! a short history: 2012 leiden lorentz!
  • 13. Hebrew Text in the Archive urn:nbn:nl:ui:13-ikjj-ek!
  • 14. Hebrew Text in the Archive urn:nbn:nl:ui:13-ikjj-ek! how can the people annotate our work?!
  • 16. Research Data Cycle Text transmission, tradition, editorial processes Free University, theology faculty, server department, WIVU project ! NWO projects!NWO projects religious communities theol. scholars theol. scholars enlightened lay people scholarly- ibles.com!
  • 17. Research Data Cycle Text transmission, tradition, editorial processes Free University, theology faculty, server department, WIVU project ! NWO projects!NWO projects religious communities theol. scholars theol. scholars CLARIN SHEBANQ linguists Wider public: Annotation, Query Saving, via Linked Data dig. hum comp. hum enlightened lay people scholarly- ibles.com! Research Data Archiving DANS
  • 18. 3 (of 5) Sharing (c’t’d) Solution: Queries As Annotations
  • 19. queries-as-annotations model! query! example! body! query instruction! SELECT ALL OBJECTS WHERE [Word FOCUS part_of_speech = verb AND lexeme = "‫!]"שים‬ targets! query results in context! ‫ׁר‬ֶ‫ש‬ֲ‫א‬ ֙‫ן‬ֶ‫ב‬ֶ֨‫א‬ ָ‫ה‬ ‫ֶת‬‫א‬ ‫֤ח‬ַּ‫ק‬ִּ‫י‬ ַ‫ו‬ ‫ֶר‬‫ק‬ֹּ֗‫ב‬ ַּ‫ב‬ ‫֜ב‬ֹ‫ק‬ֲ‫ע‬ַ‫י‬ ‫֨ם‬ֵּ‫כ‬ְׁ‫ש‬ַּ‫י‬ ַ‫ו‬ ‫ֶן‬‫מ‬ֶׁ֖‫ש‬ ‫֥ק‬ֹ‫צ‬ִּ‫י‬ ַ‫ו‬ ‫֑ה‬ָ‫ב‬ֵּ‫צ‬ַ‫מ‬ ּ‫ה‬ָ֖‫ת‬ֹ‫א‬ ‫ׂם‬ֶ‫ש‬ָּ֥‫י‬ ַ‫ו‬ ‫֔יו‬ָ‫ת‬ֹׁ‫ש‬ֲ‫א‬ַֽ‫ר‬ְ‫מ‬ ‫֣ם‬ָׂ‫ש‬ ּ‫ה‬ָֽׁ‫ש‬‫ֹא‬‫ר‬ ‫ַל‬‫ע‬ annotation! published query! qu123 (just an identifier)! metadata! researcher, date created, date last run, research question! Janet Dyk 2004-02-16 2012-01-27 Can the verb ‫ים‬ִׂ‫ש‬ have a double object? - article in Foundations for Syriac Lexicography!
  • 30. demonstrator still missing: saving queries not semantic-web-enabled sustainability
  • 31. 4 (of 5) Project CLARIN-NL: SHEBANQ: (A) Curation (B) Demonstrator
  • 32. SHEBANQ System for Hebrew Text: ANnotations for Queries CLARIN-NL project data curation: LAF demonstrator: query saver #!/etc bc s/g$/q/!
  • 33. Linguistic Annotation Framework ISO 24612:2012 Nancy Ide, Laurent Romary
  • 34.
  • 35.
  • 36.
  • 37.
  • 41. dcr:datcat on <fDecl> versus <f> 26,225,966 <f>s! ! 2.5 GB redundant attribute material ! !
  • 42. 5 (of 5) Project CLARIN-NL: SHEBANQ: (B) Demonstrator
  • 43. select all objects where [clause [phrase phrase_function = Objc [word FOCUS tense = infinitive_absolute] ] ] Execute Query executed Passage ‫ּב‬ְ‫ֵר‬‫א‬‫ׁש‬ִ֖‫י‬‫ת‬‫ּב‬ָ‫ָר‬֣‫א‬‫א‬ֱ‫ֹל‬‫ה‬ִ֑‫י‬‫ם‬‫א‬ֵ֥‫ת‬‫ה‬ַ‫ּׁש‬ָ‫מ‬ַ֖‫י‬ִ‫ם‬‫ו‬ְ‫א‬ֵ֥‫ת‬ ‫ה‬ָ‫א‬ָֽ‫ֶר‬‫ץ‬‫׃‬ ‫ו‬ַ‫ּי‬ֹ֥‫א‬‫מ‬ֶ‫ר‬‫ח‬ִ‫ז‬ְ‫ִק‬‫ּי‬ָ֖‫ה‬‫ּו‬‫מ‬ָ֣‫ה‬‫א‬ֹ֑‫ו‬‫ת‬‫ּכ‬ִ֥‫י‬‫א‬ֶ‫ע‬ֱ‫ל‬ֶ֖‫ה‬‫ּב‬ֵ֥‫י‬‫ת‬ ‫י‬ְ‫ה‬‫ו‬ָֽ‫ה‬‫׃‬ Controls ‫ו‬ַ‫ּי‬ֹ֥‫א‬‫מ‬ֶ‫ר‬‫ח‬ִ‫ז‬ְ‫ִק‬‫ּי‬ָ֖‫ה‬‫ּו‬‫מ‬ָ֣‫ה‬‫א‬ֹ֑‫ו‬‫ת‬‫ּכ‬ִ֥‫י‬‫א‬ֶ‫ע‬ֱ‫ל‬ֶ֖‫ה‬‫ּב‬ֵ֥‫י‬‫ת‬ ‫י‬ְ‫ה‬‫ו‬ָֽ‫ה‬‫׃‬ Gen 1:1 2Chron 3:4 Gen 1:1 ‫ּב‬ְ‫ֵר‬‫א‬‫ׁש‬ִ֖‫י‬‫ת‬‫ּב‬ָ‫ָר‬֣‫א‬‫א‬ֱ‫ֹל‬‫ה‬ִ֑‫י‬‫ם‬‫א‬ֵ֥‫ת‬‫ה‬ַ‫ּׁש‬ָ‫מ‬ַ֖‫י‬ִ‫ם‬‫ו‬ְ‫א‬ֵ֥‫ת‬ ‫ה‬ָ‫א‬ָֽ‫ֶר‬‫ץ‬‫׃‬ ‫ו‬ַ‫ּי‬ֹ֥‫א‬‫מ‬ֶ‫ר‬‫ח‬ִ‫ז‬ְ‫ִק‬‫ּי‬ָ֖‫ה‬‫ּו‬‫מ‬ָ֣‫ה‬‫א‬ֹ֑‫ו‬‫ת‬‫ּכ‬ִ֥‫י‬‫א‬ֶ‫ע‬ֱ‫ל‬ֶ֖‫ה‬‫ּב‬ֵ֥‫י‬‫ת‬ ‫י‬ְ‫ה‬‫ו‬ָֽ‫ה‬‫׃‬ Text 1Sam 12:4 Ex 23:2 Query results Prev 2 3 65 ... 2241 Next21 313 results Executing query ... view in context Save this query Researcher Oliver Glanz Date created 2013-08-25 Date last run 2013-08-25 Project Data and Tradition Institute VU/Eep Talstra Centre for Bible and Computing Reason irregular valency of ‫ּב‬ָ‫ָר‬֣‫א‬ Comments needs to be combined with query on ‫א‬ֱ‫ֹל‬‫ה‬ִ֑‫י‬‫ם‬ Save PublishCancel Name valency ‫ּב‬ָ‫ָר‬֣‫א‬ Edit Query
  • 44. Passage ‫ּב‬ְ‫ֵר‬‫א‬‫ׁש‬ִ֖‫י‬‫ת‬‫ּב‬ָ‫ָר‬֣‫א‬‫א‬ֱ‫ֹל‬‫ה‬ִ֑‫י‬‫ם‬‫א‬ֵ֥‫ת‬‫ה‬ַ‫ּׁש‬ָ‫מ‬ַ֖‫י‬ִ‫ם‬‫ו‬ְ‫א‬ֵ֥‫ת‬ ‫ה‬ָ‫א‬ָֽ‫ֶר‬‫ץ‬‫׃‬ ‫ו‬ַ‫ּי‬ֹ֥‫א‬‫מ‬ֶ‫ר‬‫ח‬ִ‫ז‬ְ‫ִק‬‫ּי‬ָ֖‫ה‬‫ּו‬‫מ‬ָ֣‫ה‬‫א‬ֹ֑‫ו‬‫ת‬‫ּכ‬ִ֥‫י‬‫א‬ֶ‫ע‬ֱ‫ל‬ֶ֖‫ה‬‫ּב‬ֵ֥‫י‬‫ת‬ ‫י‬ְ‫ה‬‫ו‬ָֽ‫ה‬‫׃‬ Controls ‫ו‬ַ‫ּי‬ֹ֥‫א‬‫מ‬ֶ‫ר‬‫ח‬ִ‫ז‬ְ‫ִק‬‫ּי‬ָ֖‫ה‬‫ּו‬‫מ‬ָ֣‫ה‬‫א‬ֹ֑‫ו‬‫ת‬‫ּכ‬ִ֥‫י‬‫א‬ֶ‫ע‬ֱ‫ל‬ֶ֖‫ה‬‫ּב‬ֵ֥‫י‬‫ת‬ ‫י‬ְ‫ה‬‫ו‬ָֽ‫ה‬‫׃‬ Gen 1:1 2Chron 3:4 Gen 1:1 ‫ּב‬ְ‫ֵר‬‫א‬‫ׁש‬ִ֖‫י‬‫ת‬‫ּב‬ָ‫ָר‬֣‫א‬‫א‬ֱ‫ֹל‬‫ה‬ִ֑‫י‬‫ם‬‫א‬ֵ֥‫ת‬‫ה‬ַ‫ּׁש‬ָ‫מ‬ַ֖‫י‬ִ‫ם‬‫ו‬ְ‫א‬ֵ֥‫ת‬ ‫ה‬ָ‫א‬ָֽ‫ֶר‬‫ץ‬‫׃‬ ‫ו‬ַ‫ּי‬ֹ֥‫א‬‫מ‬ֶ‫ר‬‫ח‬ִ‫ז‬ְ‫ִק‬‫ּי‬ָ֖‫ה‬‫ּו‬‫מ‬ָ֣‫ה‬‫א‬ֹ֑‫ו‬‫ת‬‫ּכ‬ִ֥‫י‬‫א‬ֶ‫ע‬ֱ‫ל‬ֶ֖‫ה‬‫ּב‬ֵ֥‫י‬‫ת‬ ‫י‬ְ‫ה‬‫ו‬ָֽ‫ה‬‫׃‬ Text 1Sam 12:4 Ex 23:2 Saved Query Results Prev 2 3 65 ... 2241 Next21 313 results view in context Information on this query Researcher Oliver Glanz Date created 2013-08-25 Date last run 2013-08-25 Project Institute Reason Comments Name Query Info select all objects where [clause [phrase phrase_function = Objc [word FOCUS tense = infinitive_absolute] ] ] MQL query text Persistent Identifier urn:nbn:nl:ui:13-scpm-ji http://www.persistent-identifier.nl/?identifier=urn... valency ‫ּב‬ָ‫ָר‬֣‫א‬ Data and Tradition VU/Eep Talstra Centre for Bible and Computing irregular valency of ‫ּב‬ָ‫ָר‬֣‫א‬ needs to be combined with query on ‫א‬ֱ‫ֹל‬‫ה‬ִ֑‫י‬‫ם‬
  • 47. 5 (of 5) Towards new tools •  LAF tools •  or generic graph algorithms •  Emdros tools •  or generic database technology •  Linked Data tools •  or generic SPARQL queries
  • 48. Side conditions •  development close to the researchers •  preferably in their own institutions •  decent performance •  within the scale of a laptop •  usable to researchers •  that is: non-programmers •  persistence in mind •  new results will be archived and re- enter the data cycle