SlideShare a Scribd company logo
Data Analysis for Ancient Corpora
applied to the Quran
Dirk Roorda
and
Cornelis van Lit
Filosofie en Religiewetenschap, Utrecht, 2019-03-28
0
50
100
150
200
250
conj nmpr subs adjv prep art
Parts of Speech after Atnach in ETCBC Phrase
A. reasons
B. a solution
C. toy example of a TF datasource
D. ministudy: rings and sentiments
C'. an easter egg
B'. new ways
A'. new horizons
• researchers in control of their own
data
• researchers empowered to fully
harness the data available to them
• researchers encouraged to DIY
computing
A. reasons
B. a solution
C. toy example of a TF datasource
D. ministudy: rings and sentiments
C'. an easter egg
B'. new ways
A'. new horizons
Data model
• Graph model: words, phrases, etc. are “nodes,” relationships
between them are edges.
• Graphs model complex data structures better than other
methods (e.g. XML).
• All stored in easy-to-understand, plain-text files. No messy
XML, SQL, etc.
• ... and we call him Text-Fabric (TF)
Data structure of TF - the IKEA spirit
node
order! order!
stacks of components
uniquely identified
words
phrases
chapters
verses
A. reasons
B. a solution
C. toy example of a TF datasource
D. ministudy: rings and sentiments
C'. an easter egg
B'. new ways
A'. new horizons
# Consider Phlebas
$ author=Iain M. Banks
## 1
Everything about us,
everything around us,
everything we know [and can know of] is composed ultimately of
patterns of nothing;
that’s the bottom line, the final truth.
So where we find we have any control over those patterns,
why not make the most elegant ones, the most enjoyable and good
ones,
in our own terms?
## 2
Besides,
it left the humans in the Culture free to take care of the things that
really mattered in life,
such as [sports, games, romance,] studying dead languages,
barbarian societies and impossible problems,
and climbing high mountains without the aid of a safety harness.
@node
@compiler=Dirk Roorda
@description=the letters of a word
@name=Culture quotes from Iain
Banks
@source=Good Reads
@url=https://www.goodreads.com/
work/quotes/14366-consider-phlebas
@valueType=str
@writtenBy=Text-Fabric
@dateWritten=2019-01-30T22:20:19Z
Everything
about
us
everything
around
us
everything
we
know
and
can
know
of
is
composed
ultimately
of
patterns
of
nothing
that’s
the
bottom
line
the
final
truth
So letters
@node
@compiler=Dirk Roorda
@description=the punctuation after
a word
@name=Culture quotes from Iain
Banks
@source=Good Reads
@url=https://www.goodreads.com/
work/quotes/14366-consider-phlebas
@valueType=str
@writtenBy=Text-Fabric
@dateWritten=2019-01-30T22:20:19Z
3 ,
6 ,
20 ;
24 ,
27 .
38 ,
45 ,
51 ,
55 ?
,
75 ,
78 ,
,
,
83 ,
88 ,
99 .
punc
banks/tf/
author.tf
gap.tf
letters.tf
number.tf
oslots.tf
otext.tf
otype.tf
punc.tf
terminator.tf
title.tf
TF dataset
otype
@node
@compiler=Dirk Roorda
@name=Culture quotes from Iain Banks
@source=Good Reads
@url=https://www.goodreads.com/work/quotes/14366-consider-phlebas
@valueType=str
@writtenBy=Text-Fabric
@dateWritten=2019-01-30T22:20:19Z
1-99 word
100 book
101-102 chapter
103-114 line
115-117 sentence
oslots
@edge
@compiler=Dirk Roorda
@name=Culture quotes from Iain Banks
@source=Good Reads
@url=https://www.goodreads.com/work/quotes/14366-consider-phlebas
@valueType=str
@writtenBy=Text-Fabric
@dateWritten=2019-01-30T22:20:19Z
100 1-99
1-55
56-99
1-3
4-6
7-9,14-20
21-27
28-38
39-51
52-55
56
57-75
76-77,81-83
84-88
89-99
1-27
28-55
56-99
1-99 word
100 book
101-102 chapter
103-114 line
115-117 sentence
## 1
Everything about us,
everything around us,
everything we know [and can know of] is composed ultimately of patterns of
nothing;
that’s the bottom line, the final truth.
So where we find we have any control over those patterns,
why not make the most elegant ones, the most enjoyable and good ones,
in our own terms?
## 2
Besides,
it left the humans in the Culture free to take care of the things that really
mattered in life,
such as [sports, games, romance,] studying dead languages,
barbarian societies and impossible problems,
and climbing high mountains without the aid of a safety harness.
otext
@config
@compiler=Dirk Roorda
@fmt:text-orig-full={letters}{punc}
@name=Culture quotes from Iain Banks
@sectionFeatures=title,number
@sectionTypes=book,chapter
@source=Good Reads
@url=https://www.goodreads.com/work/quotes/14366-consider-phlebas
@writtenBy=Text-Fabric
@dateWritten=2019-01-30T22:20:19Z
A. reasons
B. a solution
C. toy example of a TF datasource
D. ministudy: rings and sentiments
C'. an easter egg
B'. new ways
A'. new horizons
A. reasons
B. a solution
C. toy example of a TF datasource
D. ministudy: rings and sentiments
C'. an easter egg
B'. new ways
A'. new horizons
https://github.com/ETCBC/lingo/tree/master/easter/tf/c
https://github.com/ETCBC/lingo/tree/master/easter/tf/c
A. reasons
B. a solution
C. toy example of a TF datasource
D. ministudy: rings and sentiments
C'. an easter egg
B'. new ways
A'. new horizons
CTBA|CTBA#CTBA#CTB###0#0#0#3#1#0#2#0#0#2#0#0#2#0#0#0#0#0 D;L;DOTH|;L;DOT#;L;DOTA#;LD#D#H#0#0#0#3#1#0#3#0#0#2#0#0#2#1#1#3#0#0
D;WOE|;WOE#;WOE#;WOE#D##0#0#0#0#0#0#0#0#0#1#0#0#2#0#0#0#0#0 MW;KA|MW;KA#MW;KA#MWK###0#1#0#3#1#0#2#0#0#0#0#2#0#0#0#0#0#0 BRH|
BR#BRA#BR##H#0#0#0#3#1#0#2#0#0#2#0#0#2#1#1#3#0#0 DDO;D|DO;D#DO;D#DO;D#D##0#0#0#0#0#0#0#0#0#1#0#0#2#0#0#0#0#0 BRH|
BR#BRA#BR##H#0#0#0#3#1#0#2#0#0#2#0#0#2#1#1#3#0#0 DABRHM|ABRHM#ABRHM#ABRHM#D##0#0#0#0#0#0#0#0#0#1#0#0#2#0#0#0#0#0
ABRHM|ABRHM#ABRHM#ABRHM###0#0#0#0#0#0#0#0#0#1#0#0#2#0#0#0#0#0 AOLD|AOLD#;LD#;LD###0#5#1#0#1#3#2#0#0#0#0#0#0#0#0#0#0#0 LA;SKX|
A;SKX#A;SKX#A;SKX#L##0#0#0#0#0#0#0#0#0#1#0#0#2#0#0#0#0#0 A;SKX|A;SKX#A;SKX#A;SKX###0#0#0#0#0#0#0#0#0#1#0#0#2#0#0#0#0#0 AOLD|
Syriac NT (Sedra database)
DEUT33,02 >C- >;71C 1.000 >;71C- >C-
DEUT33,02 DT D.@73T 1.000 D.@73T DT
DEUT33,09 BNW B.@N@73JW 1.000 B.@N@73W BNW
EST 01,16 MWMKN M:MW.K@81N 1.000 M:WM.K@81N MWMKN
EST 03,04 B- K.:- 1.000 B.:- B-
EST 03,04 >MRM >@M:R@70M 1.000 >@M:R@70M >MRM
Hebrew Ketiv-Qere (ETCBC)
&P005381 = MSVO 3, 70
#atf: lang qpc
@tablet
@obverse
@column 1
1.a. 2(N14) , SZE~a SAL TUR3~a NUN~a
1.b. 3(N19) , |GISZ.TE|
2. 1(N14) , NAR NUN~a SIG7
3. 2(N04)# , PIRIG~b1 SIG7 URI3~a NUN~a
@column 2
1. 3(N04) , |GISZ.TE| GAR |SZU2.((HI+1(N57))+(HI+1(N57)))| GI4~a
2. , GU7 AZ SI4~f
@reverse
@column 1
1. 3(N14) , SZE~a
2. 3(N19) 5(N04) ,
3. , GU7
@column 2
1. , AZ SI4~f
Cuneiform Uruk (CDLI)
(1:1:1:1) bi P PREFIX|bi+
(1:1:1:2) somi N STEM|POS:N|LEM:{som|ROOT:smw|M|GEN
(1:1:2:1) {ll~ahi PN STEM|POS:PN|LEM:{ll~ah|ROOT:Alh|GEN
(1:1:3:1) {l DET PREFIX|Al+
(1:1:3:2) r~aHoma`ni ADJ STEM|POS:ADJ|LEM:r~aHoma`n|ROOT:rHm|MS|GEN
(1:1:4:1) {l DET PREFIX|Al+
(1:1:4:2) r~aHiymi ADJ STEM|POS:ADJ|LEM:r~aHiym|ROOT:rHm|MS|GEN
(1:2:1:1) {lo DET PREFIX|Al+
(1:2:1:2) Hamodu N STEM|POS:N|LEM:Hamod|ROOT:Hmd|M|NOM
Arabic Quran (Tanzil)
Source data of a corpus
TEI, Markdown, ASCII, Database
Conversion to TF
TF does more than half of the work
Quran
https://nbviewer.jupyter.org/github/annotation/tutorials/blob/master/quran/start.ipynb
Computing - Python - Jupyter notebooks
https://nbviewer.jupyter.org/github/annotation/tutorials/blob/master/bhsa/start.ipynb
BHSA
https://nbviewer.jupyter.org/github/annotation/tutorials/blob/master/syrnt/start.ipynb
Syriac NT
Old Babylon'
https://nbviewer.jupyter.org/github/annotation/tutorials/blob/master/oldbabylonian/start.ipynb
Quran
https://nbviewer.jupyter.org/github/annotation/tutorials/blob/master/quran/search.ipynb
Computing - more power!
Quran
https://nbviewer.jupyter.org/github/annotation/tutorials/blob/master/quran/search.ipynb
https://nbviewer.jupyter.org/github/annotation/tutorials/blob/master/bhsa/searchFromMQL.ipynb
BHSA
Syriac NT
https://nbviewer.jupyter.org/github/annotation/tutorials/blob/master/syrnt/search.ipynb
Old Babylon'
https://nbviewer.jupyter.org/github/Nino-cunei/oldbabylonian/blob/master/analysis/ummama.ipynb
Uruk
https://nbviewer.jupyter.org/github/annotation/tutorials/blob/master/uruk/search.ipynb
QuranPower to you! (without the programming)
This is Text-Fabric 7.5.3
Using TF app quran commit f6543c213dad36050de3e90373af237e9a4f0bc1
in /Users/dirk/text-fabric-data/__apps__/quran
Cleaning up remnant processes, if any ...
Loading data for quran. Please wait ...
Using TF app quran commit f6543c213dad36050de3e90373af237e9a4f0bc1
in /Users/dirk/text-fabric-data/__apps__/quran
Setting up TF kernel for quran q-ran/exercises/mining/tf
Using q-ran/quran/tf - 0.3 rv0.4 in /Users/dirk/text-fabric-data
Using q-ran/exercises/mining/tf - 0.3 rv0.1 in /Users/dirk/text-fabric-data
TF setup done.
Using TF app quran commit f6543c213dad36050de3e90373af237e9a4f0bc1
in /Users/dirk/text-fabric-data/__apps__/quran
onDocker=False
* Running on http://localhost:8105/ (Press CTRL+C to quit)
Opening quran in browser
Listening at port 18985
127.0.0.1 - - [27/Mar/2019 15:03:55] "GET / HTTP/1.1" 200 -
127.0.0.1 - - [27/Mar/2019 15:03:55] "POST /passage HTTP/1.1" 200 -
127.0.0.1 - - [27/Mar/2019 15:03:55] "POST /sections HTTP/1.1" 200 -
127.0.0.1 - - [27/Mar/2019 15:03:55] "POST /tuples HTTP/1.1" 200 -
127.0.0.1 - - [27/Mar/2019 15:03:56] "POST /query HTTP/1.1" 200 -
127.0.0.1 - - [27/Mar/2019 15:03:56] "POST / HTTP/1.1" 200 -
127.0.0.1 - - [27/Mar/2019 15:03:56] "POST /passage HTTP/1.1" 200 -
127.0.0.1 - - [27/Mar/2019 15:03:56] "POST /sections HTTP/1.1" 200 -
127.0.0.1 - - [27/Mar/2019 15:03:56] "POST /tuples HTTP/1.1" 200 -
127.0.0.1 - - [27/Mar/2019 15:03:56] "POST /query HTTP/1.1" 200 -
dirk:~ > text-fabric quran --mod=q-ran/exercises/mining/tf
Quran
A. reasons
B. a solution
C. toy example of a TF datasource
D. ministudy: rings and sentiments
C'. an easter egg
B'. new ways
A'. new horizons
Sharing and re-using data
Text-Fabric has been developed by a DANS-employee
as a consequence:
Data export is built in ✅
Provenance tracking is built in ✅
Redistribution of newly created data is built in ✅
sharing #1: GitHub & NBviewer
work done in a Jupyter Notebook inside a GitHub repository
is very sharable
sharing #2: Export from TF-browser
sharing #3: Zenodo
sharing #4: Create new features
https://nbviewer.jupyter.org/github/annotation/tutorials/blob/master/quran/share.ipynb
• etcbc/valence/tf : the results of the verbal valence work of Janet Dyk in the
SYNVAR project;

• etcbc/lingo/heads/tf : head words for phrases, work done by Cody Kingham;

• ch-jensen/Semantic-mapping-of-participants/actor/tf : participant analysis in
progress by Christian Høygaard-Jensen;

• cmerwich/bh-reference-system/tf: participant analysis in progress by
Christiaan Erwich;

• nino-cunei/oldbabylonian/parallels/tf: similar lines by Dirk Roorda

• q-ran/quran/parallels/tf: similar lines by Dirk Roorda

• q-ran/exercises/mining/tf: sentiments (crude) by Dirk Roorda

• you/quran/sentiments/tf: sentiments (refined) by You

• cvlit/quran/semantics/tf: semantic fields by cvlit
The Text-Fabric Ethos
• Open source tool for corpus annotation and analysis.
• Corpus data in a repository, with standard license, as free as
possible
• Researchers: step out of your technological comfort zones and
pave the way for the ones after you
• Find computational inspiration across disciplines
Open Science Rocks
thank you
Dirk Roorda dirk.roorda@dans.knaw.nl

More Related Content

Similar to Quran and Text-Fabric

Convolutional Neural Networks and Natural Language Processing
Convolutional Neural Networks and Natural Language ProcessingConvolutional Neural Networks and Natural Language Processing
Convolutional Neural Networks and Natural Language Processing
Thomas Delteil
 
6 things to expect when you are visualizing
6 things to expect when you are visualizing6 things to expect when you are visualizing
6 things to expect when you are visualizing
Krist Wongsuphasawat
 
Using islandora to build digital collections - 2016.01.29 OLA 2016
Using islandora to build digital collections - 2016.01.29 OLA 2016Using islandora to build digital collections - 2016.01.29 OLA 2016
Using islandora to build digital collections - 2016.01.29 OLA 2016
KellliBee
 
Reflection on the Data Science Profession in NYC
Reflection on the Data Science Profession in NYCReflection on the Data Science Profession in NYC
Reflection on the Data Science Profession in NYC
Work-Bench
 
TopicModelingNLPHandsOnML
TopicModelingNLPHandsOnMLTopicModelingNLPHandsOnML
TopicModelingNLPHandsOnML
Samir Aryamane
 
2020 Enterprise IT Outlook
2020 Enterprise IT Outlook2020 Enterprise IT Outlook
2020 Enterprise IT Outlook
Raymond Gao
 
Data Science with Spark
Data Science with SparkData Science with Spark
Data Science with Spark
Krishna Sankar
 
The Potential of Metadata - Geoweb 2010
The Potential of Metadata - Geoweb 2010The Potential of Metadata - Geoweb 2010
The Potential of Metadata - Geoweb 2010
Arnulf Christl
 
UBC STAT545 2014 Cm001 intro to-course
UBC STAT545 2014 Cm001 intro to-courseUBC STAT545 2014 Cm001 intro to-course
UBC STAT545 2014 Cm001 intro to-course
Jennifer Bryan
 
RDA: Are We There Yet? Carterette Webinar S
RDA: Are We There Yet? Carterette Webinar SRDA: Are We There Yet? Carterette Webinar S
RDA: Are We There Yet? Carterette Webinar S
Emily Nimsakont
 
Consuming Linked Data SemTech2010
Consuming Linked Data SemTech2010Consuming Linked Data SemTech2010
Consuming Linked Data SemTech2010
Juan Sequeda
 
AI Deeplearning Programming
AI Deeplearning ProgrammingAI Deeplearning Programming
AI Deeplearning Programming
PaulSombat
 
Data science with Perl & Raku
Data science with Perl & RakuData science with Perl & Raku
Data science with Perl & Raku
Sören Laird Sörries
 
Annotating with RDFa
Annotating with RDFaAnnotating with RDFa
Annotating with RDFa
giurca
 
BBC Programmes and Music on the Linking Open Data Cloud
BBC Programmes and Music on the Linking Open Data CloudBBC Programmes and Music on the Linking Open Data Cloud
BBC Programmes and Music on the Linking Open Data Cloud
Patrick Sinclair
 
LWC Datatable LDV, Christian Knapp & Christian Menzinger
LWC Datatable LDV, Christian Knapp  & Christian MenzingerLWC Datatable LDV, Christian Knapp  & Christian Menzinger
LWC Datatable LDV, Christian Knapp & Christian Menzinger
CzechDreamin
 
DRUG - RDSTK Talk
DRUG - RDSTK TalkDRUG - RDSTK Talk
DRUG - RDSTK Talk
rtelmore
 
Choosing the Right Database
Choosing the Right DatabaseChoosing the Right Database
Choosing the Right Database
David Simons
 
[系列活動] 資料探勘速遊
[系列活動] 資料探勘速遊[系列活動] 資料探勘速遊
[系列活動] 資料探勘速遊
台灣資料科學年會
 
Tapping the Data Deluge with R
Tapping the Data Deluge with RTapping the Data Deluge with R
Tapping the Data Deluge with R
Jeffrey Breen
 

Similar to Quran and Text-Fabric (20)

Convolutional Neural Networks and Natural Language Processing
Convolutional Neural Networks and Natural Language ProcessingConvolutional Neural Networks and Natural Language Processing
Convolutional Neural Networks and Natural Language Processing
 
6 things to expect when you are visualizing
6 things to expect when you are visualizing6 things to expect when you are visualizing
6 things to expect when you are visualizing
 
Using islandora to build digital collections - 2016.01.29 OLA 2016
Using islandora to build digital collections - 2016.01.29 OLA 2016Using islandora to build digital collections - 2016.01.29 OLA 2016
Using islandora to build digital collections - 2016.01.29 OLA 2016
 
Reflection on the Data Science Profession in NYC
Reflection on the Data Science Profession in NYCReflection on the Data Science Profession in NYC
Reflection on the Data Science Profession in NYC
 
TopicModelingNLPHandsOnML
TopicModelingNLPHandsOnMLTopicModelingNLPHandsOnML
TopicModelingNLPHandsOnML
 
2020 Enterprise IT Outlook
2020 Enterprise IT Outlook2020 Enterprise IT Outlook
2020 Enterprise IT Outlook
 
Data Science with Spark
Data Science with SparkData Science with Spark
Data Science with Spark
 
The Potential of Metadata - Geoweb 2010
The Potential of Metadata - Geoweb 2010The Potential of Metadata - Geoweb 2010
The Potential of Metadata - Geoweb 2010
 
UBC STAT545 2014 Cm001 intro to-course
UBC STAT545 2014 Cm001 intro to-courseUBC STAT545 2014 Cm001 intro to-course
UBC STAT545 2014 Cm001 intro to-course
 
RDA: Are We There Yet? Carterette Webinar S
RDA: Are We There Yet? Carterette Webinar SRDA: Are We There Yet? Carterette Webinar S
RDA: Are We There Yet? Carterette Webinar S
 
Consuming Linked Data SemTech2010
Consuming Linked Data SemTech2010Consuming Linked Data SemTech2010
Consuming Linked Data SemTech2010
 
AI Deeplearning Programming
AI Deeplearning ProgrammingAI Deeplearning Programming
AI Deeplearning Programming
 
Data science with Perl & Raku
Data science with Perl & RakuData science with Perl & Raku
Data science with Perl & Raku
 
Annotating with RDFa
Annotating with RDFaAnnotating with RDFa
Annotating with RDFa
 
BBC Programmes and Music on the Linking Open Data Cloud
BBC Programmes and Music on the Linking Open Data CloudBBC Programmes and Music on the Linking Open Data Cloud
BBC Programmes and Music on the Linking Open Data Cloud
 
LWC Datatable LDV, Christian Knapp & Christian Menzinger
LWC Datatable LDV, Christian Knapp  & Christian MenzingerLWC Datatable LDV, Christian Knapp  & Christian Menzinger
LWC Datatable LDV, Christian Knapp & Christian Menzinger
 
DRUG - RDSTK Talk
DRUG - RDSTK TalkDRUG - RDSTK Talk
DRUG - RDSTK Talk
 
Choosing the Right Database
Choosing the Right DatabaseChoosing the Right Database
Choosing the Right Database
 
[系列活動] 資料探勘速遊
[系列活動] 資料探勘速遊[系列活動] 資料探勘速遊
[系列活動] 資料探勘速遊
 
Tapping the Data Deluge with R
Tapping the Data Deluge with RTapping the Data Deluge with R
Tapping the Data Deluge with R
 

More from Dirk Roorda

TF-FAIR.pdf
TF-FAIR.pdfTF-FAIR.pdf
TF-FAIR.pdf
Dirk Roorda
 
Textpy
TextpyTextpy
Textpy
Dirk Roorda
 
General Missives
General MissivesGeneral Missives
General Missives
Dirk Roorda
 
Text Display (when it gets tricky)
Text Display (when it gets tricky)Text Display (when it gets tricky)
Text Display (when it gets tricky)
Dirk Roorda
 
Tf in-context
Tf in-contextTf in-context
Tf in-context
Dirk Roorda
 
Qdf2tf
Qdf2tfQdf2tf
Qdf2tf
Dirk Roorda
 
Text fabric
Text fabricText fabric
Text fabric
Dirk Roorda
 
Verbal Valency in Hebrew Verbs
Verbal Valency in Hebrew VerbsVerbal Valency in Hebrew Verbs
Verbal Valency in Hebrew Verbs
Dirk Roorda
 
Data management for researchers
Data management for researchersData management for researchers
Data management for researchers
Dirk Roorda
 
Annotating the Hebrew Bible
Annotating the Hebrew BibleAnnotating the Hebrew Bible
Annotating the Hebrew Bible
Dirk Roorda
 
20151111 utrecht ver theolbibliothecarissen
20151111 utrecht ver theolbibliothecarissen20151111 utrecht ver theolbibliothecarissen
20151111 utrecht ver theolbibliothecarissen
Dirk Roorda
 
Text as Data: processing the Hebrew Bible
Text as Data: processing the Hebrew BibleText as Data: processing the Hebrew Bible
Text as Data: processing the Hebrew Bible
Dirk Roorda
 
Datamanagement for Research: A Case Study
Datamanagement for Research: A Case StudyDatamanagement for Research: A Case Study
Datamanagement for Research: A Case Study
Dirk Roorda
 
Award
AwardAward
Datamanagement for Research: A Case Study
Datamanagement for Research: A Case StudyDatamanagement for Research: A Case Study
Datamanagement for Research: A Case Study
Dirk Roorda
 
Hebrew Bible as Data: Laboratory, Sharing, Lessons
Hebrew Bible as Data: Laboratory, Sharing, LessonsHebrew Bible as Data: Laboratory, Sharing, Lessons
Hebrew Bible as Data: Laboratory, Sharing, Lessons
Dirk Roorda
 
Laf fabric-dh benelux2014
Laf fabric-dh benelux2014Laf fabric-dh benelux2014
Laf fabric-dh benelux2014
Dirk Roorda
 
Data Analysis in the Hebrew Bible
Data Analysis in the Hebrew BibleData Analysis in the Hebrew Bible
Data Analysis in the Hebrew Bible
Dirk Roorda
 
LAF Fabric
LAF FabricLAF Fabric
LAF Fabric
Dirk Roorda
 
Auto ingest demo-werklunch 2013-11-05
Auto ingest demo-werklunch 2013-11-05Auto ingest demo-werklunch 2013-11-05
Auto ingest demo-werklunch 2013-11-05
Dirk Roorda
 

More from Dirk Roorda (20)

TF-FAIR.pdf
TF-FAIR.pdfTF-FAIR.pdf
TF-FAIR.pdf
 
Textpy
TextpyTextpy
Textpy
 
General Missives
General MissivesGeneral Missives
General Missives
 
Text Display (when it gets tricky)
Text Display (when it gets tricky)Text Display (when it gets tricky)
Text Display (when it gets tricky)
 
Tf in-context
Tf in-contextTf in-context
Tf in-context
 
Qdf2tf
Qdf2tfQdf2tf
Qdf2tf
 
Text fabric
Text fabricText fabric
Text fabric
 
Verbal Valency in Hebrew Verbs
Verbal Valency in Hebrew VerbsVerbal Valency in Hebrew Verbs
Verbal Valency in Hebrew Verbs
 
Data management for researchers
Data management for researchersData management for researchers
Data management for researchers
 
Annotating the Hebrew Bible
Annotating the Hebrew BibleAnnotating the Hebrew Bible
Annotating the Hebrew Bible
 
20151111 utrecht ver theolbibliothecarissen
20151111 utrecht ver theolbibliothecarissen20151111 utrecht ver theolbibliothecarissen
20151111 utrecht ver theolbibliothecarissen
 
Text as Data: processing the Hebrew Bible
Text as Data: processing the Hebrew BibleText as Data: processing the Hebrew Bible
Text as Data: processing the Hebrew Bible
 
Datamanagement for Research: A Case Study
Datamanagement for Research: A Case StudyDatamanagement for Research: A Case Study
Datamanagement for Research: A Case Study
 
Award
AwardAward
Award
 
Datamanagement for Research: A Case Study
Datamanagement for Research: A Case StudyDatamanagement for Research: A Case Study
Datamanagement for Research: A Case Study
 
Hebrew Bible as Data: Laboratory, Sharing, Lessons
Hebrew Bible as Data: Laboratory, Sharing, LessonsHebrew Bible as Data: Laboratory, Sharing, Lessons
Hebrew Bible as Data: Laboratory, Sharing, Lessons
 
Laf fabric-dh benelux2014
Laf fabric-dh benelux2014Laf fabric-dh benelux2014
Laf fabric-dh benelux2014
 
Data Analysis in the Hebrew Bible
Data Analysis in the Hebrew BibleData Analysis in the Hebrew Bible
Data Analysis in the Hebrew Bible
 
LAF Fabric
LAF FabricLAF Fabric
LAF Fabric
 
Auto ingest demo-werklunch 2013-11-05
Auto ingest demo-werklunch 2013-11-05Auto ingest demo-werklunch 2013-11-05
Auto ingest demo-werklunch 2013-11-05
 

Recently uploaded

Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
TechSoup
 
Skimbleshanks-The-Railway-Cat by T S Eliot
Skimbleshanks-The-Railway-Cat by T S EliotSkimbleshanks-The-Railway-Cat by T S Eliot
Skimbleshanks-The-Railway-Cat by T S Eliot
nitinpv4ai
 
How to Setup Default Value for a Field in Odoo 17
How to Setup Default Value for a Field in Odoo 17How to Setup Default Value for a Field in Odoo 17
How to Setup Default Value for a Field in Odoo 17
Celine George
 
CIS 4200-02 Group 1 Final Project Report (1).pdf
CIS 4200-02 Group 1 Final Project Report (1).pdfCIS 4200-02 Group 1 Final Project Report (1).pdf
CIS 4200-02 Group 1 Final Project Report (1).pdf
blueshagoo1
 
220711130088 Sumi Basak Virtual University EPC 3.pptx
220711130088 Sumi Basak Virtual University EPC 3.pptx220711130088 Sumi Basak Virtual University EPC 3.pptx
220711130088 Sumi Basak Virtual University EPC 3.pptx
Kalna College
 
Contiguity Of Various Message Forms - Rupam Chandra.pptx
Contiguity Of Various Message Forms - Rupam Chandra.pptxContiguity Of Various Message Forms - Rupam Chandra.pptx
Contiguity Of Various Message Forms - Rupam Chandra.pptx
Kalna College
 
NIPER 2024 MEMORY BASED QUESTIONS.ANSWERS TO NIPER 2024 QUESTIONS.NIPER JEE 2...
NIPER 2024 MEMORY BASED QUESTIONS.ANSWERS TO NIPER 2024 QUESTIONS.NIPER JEE 2...NIPER 2024 MEMORY BASED QUESTIONS.ANSWERS TO NIPER 2024 QUESTIONS.NIPER JEE 2...
NIPER 2024 MEMORY BASED QUESTIONS.ANSWERS TO NIPER 2024 QUESTIONS.NIPER JEE 2...
Payaamvohra1
 
HYPERTENSION - SLIDE SHARE PRESENTATION.
HYPERTENSION - SLIDE SHARE PRESENTATION.HYPERTENSION - SLIDE SHARE PRESENTATION.
HYPERTENSION - SLIDE SHARE PRESENTATION.
deepaannamalai16
 
Temple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation resultsTemple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation results
Krassimira Luka
 
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...
indexPub
 
SWOT analysis in the project Keeping the Memory @live.pptx
SWOT analysis in the project Keeping the Memory @live.pptxSWOT analysis in the project Keeping the Memory @live.pptx
SWOT analysis in the project Keeping the Memory @live.pptx
zuzanka
 
How to Manage Reception Report in Odoo 17
How to Manage Reception Report in Odoo 17How to Manage Reception Report in Odoo 17
How to Manage Reception Report in Odoo 17
Celine George
 
Level 3 NCEA - NZ: A Nation In the Making 1872 - 1900 SML.ppt
Level 3 NCEA - NZ: A  Nation In the Making 1872 - 1900 SML.pptLevel 3 NCEA - NZ: A  Nation In the Making 1872 - 1900 SML.ppt
Level 3 NCEA - NZ: A Nation In the Making 1872 - 1900 SML.ppt
Henry Hollis
 
Observational Learning
Observational Learning Observational Learning
Observational Learning
sanamushtaq922
 
Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...
Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...
Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...
EduSkills OECD
 
BPSC-105 important questions for june term end exam
BPSC-105 important questions for june term end examBPSC-105 important questions for june term end exam
BPSC-105 important questions for june term end exam
sonukumargpnirsadhan
 
Standardized tool for Intelligence test.
Standardized tool for Intelligence test.Standardized tool for Intelligence test.
Standardized tool for Intelligence test.
deepaannamalai16
 
Wound healing PPT
Wound healing PPTWound healing PPT
Wound healing PPT
Jyoti Chand
 
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skillsspot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
haiqairshad
 
MDP on air pollution of class 8 year 2024-2025
MDP on air pollution of class 8 year 2024-2025MDP on air pollution of class 8 year 2024-2025
MDP on air pollution of class 8 year 2024-2025
khuleseema60
 

Recently uploaded (20)

Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
 
Skimbleshanks-The-Railway-Cat by T S Eliot
Skimbleshanks-The-Railway-Cat by T S EliotSkimbleshanks-The-Railway-Cat by T S Eliot
Skimbleshanks-The-Railway-Cat by T S Eliot
 
How to Setup Default Value for a Field in Odoo 17
How to Setup Default Value for a Field in Odoo 17How to Setup Default Value for a Field in Odoo 17
How to Setup Default Value for a Field in Odoo 17
 
CIS 4200-02 Group 1 Final Project Report (1).pdf
CIS 4200-02 Group 1 Final Project Report (1).pdfCIS 4200-02 Group 1 Final Project Report (1).pdf
CIS 4200-02 Group 1 Final Project Report (1).pdf
 
220711130088 Sumi Basak Virtual University EPC 3.pptx
220711130088 Sumi Basak Virtual University EPC 3.pptx220711130088 Sumi Basak Virtual University EPC 3.pptx
220711130088 Sumi Basak Virtual University EPC 3.pptx
 
Contiguity Of Various Message Forms - Rupam Chandra.pptx
Contiguity Of Various Message Forms - Rupam Chandra.pptxContiguity Of Various Message Forms - Rupam Chandra.pptx
Contiguity Of Various Message Forms - Rupam Chandra.pptx
 
NIPER 2024 MEMORY BASED QUESTIONS.ANSWERS TO NIPER 2024 QUESTIONS.NIPER JEE 2...
NIPER 2024 MEMORY BASED QUESTIONS.ANSWERS TO NIPER 2024 QUESTIONS.NIPER JEE 2...NIPER 2024 MEMORY BASED QUESTIONS.ANSWERS TO NIPER 2024 QUESTIONS.NIPER JEE 2...
NIPER 2024 MEMORY BASED QUESTIONS.ANSWERS TO NIPER 2024 QUESTIONS.NIPER JEE 2...
 
HYPERTENSION - SLIDE SHARE PRESENTATION.
HYPERTENSION - SLIDE SHARE PRESENTATION.HYPERTENSION - SLIDE SHARE PRESENTATION.
HYPERTENSION - SLIDE SHARE PRESENTATION.
 
Temple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation resultsTemple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation results
 
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...
 
SWOT analysis in the project Keeping the Memory @live.pptx
SWOT analysis in the project Keeping the Memory @live.pptxSWOT analysis in the project Keeping the Memory @live.pptx
SWOT analysis in the project Keeping the Memory @live.pptx
 
How to Manage Reception Report in Odoo 17
How to Manage Reception Report in Odoo 17How to Manage Reception Report in Odoo 17
How to Manage Reception Report in Odoo 17
 
Level 3 NCEA - NZ: A Nation In the Making 1872 - 1900 SML.ppt
Level 3 NCEA - NZ: A  Nation In the Making 1872 - 1900 SML.pptLevel 3 NCEA - NZ: A  Nation In the Making 1872 - 1900 SML.ppt
Level 3 NCEA - NZ: A Nation In the Making 1872 - 1900 SML.ppt
 
Observational Learning
Observational Learning Observational Learning
Observational Learning
 
Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...
Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...
Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...
 
BPSC-105 important questions for june term end exam
BPSC-105 important questions for june term end examBPSC-105 important questions for june term end exam
BPSC-105 important questions for june term end exam
 
Standardized tool for Intelligence test.
Standardized tool for Intelligence test.Standardized tool for Intelligence test.
Standardized tool for Intelligence test.
 
Wound healing PPT
Wound healing PPTWound healing PPT
Wound healing PPT
 
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skillsspot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
 
MDP on air pollution of class 8 year 2024-2025
MDP on air pollution of class 8 year 2024-2025MDP on air pollution of class 8 year 2024-2025
MDP on air pollution of class 8 year 2024-2025
 

Quran and Text-Fabric

  • 1. Data Analysis for Ancient Corpora applied to the Quran Dirk Roorda and Cornelis van Lit Filosofie en Religiewetenschap, Utrecht, 2019-03-28 0 50 100 150 200 250 conj nmpr subs adjv prep art Parts of Speech after Atnach in ETCBC Phrase
  • 2. A. reasons B. a solution C. toy example of a TF datasource D. ministudy: rings and sentiments C'. an easter egg B'. new ways A'. new horizons
  • 3.
  • 4. • researchers in control of their own data • researchers empowered to fully harness the data available to them • researchers encouraged to DIY computing
  • 5. A. reasons B. a solution C. toy example of a TF datasource D. ministudy: rings and sentiments C'. an easter egg B'. new ways A'. new horizons
  • 6. Data model • Graph model: words, phrases, etc. are “nodes,” relationships between them are edges. • Graphs model complex data structures better than other methods (e.g. XML). • All stored in easy-to-understand, plain-text files. No messy XML, SQL, etc. • ... and we call him Text-Fabric (TF)
  • 7. Data structure of TF - the IKEA spirit node order! order! stacks of components uniquely identified words phrases chapters verses
  • 8. A. reasons B. a solution C. toy example of a TF datasource D. ministudy: rings and sentiments C'. an easter egg B'. new ways A'. new horizons
  • 9. # Consider Phlebas $ author=Iain M. Banks ## 1 Everything about us, everything around us, everything we know [and can know of] is composed ultimately of patterns of nothing; that’s the bottom line, the final truth. So where we find we have any control over those patterns, why not make the most elegant ones, the most enjoyable and good ones, in our own terms? ## 2 Besides, it left the humans in the Culture free to take care of the things that really mattered in life, such as [sports, games, romance,] studying dead languages, barbarian societies and impossible problems, and climbing high mountains without the aid of a safety harness.
  • 10. @node @compiler=Dirk Roorda @description=the letters of a word @name=Culture quotes from Iain Banks @source=Good Reads @url=https://www.goodreads.com/ work/quotes/14366-consider-phlebas @valueType=str @writtenBy=Text-Fabric @dateWritten=2019-01-30T22:20:19Z Everything about us everything around us everything we know and can know of is composed ultimately of patterns of nothing that’s the bottom line the final truth So letters @node @compiler=Dirk Roorda @description=the punctuation after a word @name=Culture quotes from Iain Banks @source=Good Reads @url=https://www.goodreads.com/ work/quotes/14366-consider-phlebas @valueType=str @writtenBy=Text-Fabric @dateWritten=2019-01-30T22:20:19Z 3 , 6 , 20 ; 24 , 27 . 38 , 45 , 51 , 55 ? , 75 , 78 , , , 83 , 88 , 99 . punc banks/tf/ author.tf gap.tf letters.tf number.tf oslots.tf otext.tf otype.tf punc.tf terminator.tf title.tf TF dataset
  • 11. otype @node @compiler=Dirk Roorda @name=Culture quotes from Iain Banks @source=Good Reads @url=https://www.goodreads.com/work/quotes/14366-consider-phlebas @valueType=str @writtenBy=Text-Fabric @dateWritten=2019-01-30T22:20:19Z 1-99 word 100 book 101-102 chapter 103-114 line 115-117 sentence
  • 12. oslots @edge @compiler=Dirk Roorda @name=Culture quotes from Iain Banks @source=Good Reads @url=https://www.goodreads.com/work/quotes/14366-consider-phlebas @valueType=str @writtenBy=Text-Fabric @dateWritten=2019-01-30T22:20:19Z 100 1-99 1-55 56-99 1-3 4-6 7-9,14-20 21-27 28-38 39-51 52-55 56 57-75 76-77,81-83 84-88 89-99 1-27 28-55 56-99 1-99 word 100 book 101-102 chapter 103-114 line 115-117 sentence ## 1 Everything about us, everything around us, everything we know [and can know of] is composed ultimately of patterns of nothing; that’s the bottom line, the final truth. So where we find we have any control over those patterns, why not make the most elegant ones, the most enjoyable and good ones, in our own terms? ## 2 Besides, it left the humans in the Culture free to take care of the things that really mattered in life, such as [sports, games, romance,] studying dead languages, barbarian societies and impossible problems, and climbing high mountains without the aid of a safety harness.
  • 13. otext @config @compiler=Dirk Roorda @fmt:text-orig-full={letters}{punc} @name=Culture quotes from Iain Banks @sectionFeatures=title,number @sectionTypes=book,chapter @source=Good Reads @url=https://www.goodreads.com/work/quotes/14366-consider-phlebas @writtenBy=Text-Fabric @dateWritten=2019-01-30T22:20:19Z
  • 14. A. reasons B. a solution C. toy example of a TF datasource D. ministudy: rings and sentiments C'. an easter egg B'. new ways A'. new horizons
  • 15. A. reasons B. a solution C. toy example of a TF datasource D. ministudy: rings and sentiments C'. an easter egg B'. new ways A'. new horizons
  • 18. A. reasons B. a solution C. toy example of a TF datasource D. ministudy: rings and sentiments C'. an easter egg B'. new ways A'. new horizons
  • 19. CTBA|CTBA#CTBA#CTB###0#0#0#3#1#0#2#0#0#2#0#0#2#0#0#0#0#0 D;L;DOTH|;L;DOT#;L;DOTA#;LD#D#H#0#0#0#3#1#0#3#0#0#2#0#0#2#1#1#3#0#0 D;WOE|;WOE#;WOE#;WOE#D##0#0#0#0#0#0#0#0#0#1#0#0#2#0#0#0#0#0 MW;KA|MW;KA#MW;KA#MWK###0#1#0#3#1#0#2#0#0#0#0#2#0#0#0#0#0#0 BRH| BR#BRA#BR##H#0#0#0#3#1#0#2#0#0#2#0#0#2#1#1#3#0#0 DDO;D|DO;D#DO;D#DO;D#D##0#0#0#0#0#0#0#0#0#1#0#0#2#0#0#0#0#0 BRH| BR#BRA#BR##H#0#0#0#3#1#0#2#0#0#2#0#0#2#1#1#3#0#0 DABRHM|ABRHM#ABRHM#ABRHM#D##0#0#0#0#0#0#0#0#0#1#0#0#2#0#0#0#0#0 ABRHM|ABRHM#ABRHM#ABRHM###0#0#0#0#0#0#0#0#0#1#0#0#2#0#0#0#0#0 AOLD|AOLD#;LD#;LD###0#5#1#0#1#3#2#0#0#0#0#0#0#0#0#0#0#0 LA;SKX| A;SKX#A;SKX#A;SKX#L##0#0#0#0#0#0#0#0#0#1#0#0#2#0#0#0#0#0 A;SKX|A;SKX#A;SKX#A;SKX###0#0#0#0#0#0#0#0#0#1#0#0#2#0#0#0#0#0 AOLD| Syriac NT (Sedra database) DEUT33,02 >C- >;71C 1.000 >;71C- >C- DEUT33,02 DT D.@73T 1.000 D.@73T DT DEUT33,09 BNW B.@N@73JW 1.000 B.@N@73W BNW EST 01,16 MWMKN M:MW.K@81N 1.000 M:WM.K@81N MWMKN EST 03,04 B- K.:- 1.000 B.:- B- EST 03,04 >MRM >@M:R@70M 1.000 >@M:R@70M >MRM Hebrew Ketiv-Qere (ETCBC) &P005381 = MSVO 3, 70 #atf: lang qpc @tablet @obverse @column 1 1.a. 2(N14) , SZE~a SAL TUR3~a NUN~a 1.b. 3(N19) , |GISZ.TE| 2. 1(N14) , NAR NUN~a SIG7 3. 2(N04)# , PIRIG~b1 SIG7 URI3~a NUN~a @column 2 1. 3(N04) , |GISZ.TE| GAR |SZU2.((HI+1(N57))+(HI+1(N57)))| GI4~a 2. , GU7 AZ SI4~f @reverse @column 1 1. 3(N14) , SZE~a 2. 3(N19) 5(N04) , 3. , GU7 @column 2 1. , AZ SI4~f Cuneiform Uruk (CDLI) (1:1:1:1) bi P PREFIX|bi+ (1:1:1:2) somi N STEM|POS:N|LEM:{som|ROOT:smw|M|GEN (1:1:2:1) {ll~ahi PN STEM|POS:PN|LEM:{ll~ah|ROOT:Alh|GEN (1:1:3:1) {l DET PREFIX|Al+ (1:1:3:2) r~aHoma`ni ADJ STEM|POS:ADJ|LEM:r~aHoma`n|ROOT:rHm|MS|GEN (1:1:4:1) {l DET PREFIX|Al+ (1:1:4:2) r~aHiymi ADJ STEM|POS:ADJ|LEM:r~aHiym|ROOT:rHm|MS|GEN (1:2:1:1) {lo DET PREFIX|Al+ (1:2:1:2) Hamodu N STEM|POS:N|LEM:Hamod|ROOT:Hmd|M|NOM Arabic Quran (Tanzil) Source data of a corpus TEI, Markdown, ASCII, Database
  • 20. Conversion to TF TF does more than half of the work
  • 22. Computing - Python - Jupyter notebooks https://nbviewer.jupyter.org/github/annotation/tutorials/blob/master/bhsa/start.ipynb BHSA
  • 31. QuranPower to you! (without the programming) This is Text-Fabric 7.5.3 Using TF app quran commit f6543c213dad36050de3e90373af237e9a4f0bc1 in /Users/dirk/text-fabric-data/__apps__/quran Cleaning up remnant processes, if any ... Loading data for quran. Please wait ... Using TF app quran commit f6543c213dad36050de3e90373af237e9a4f0bc1 in /Users/dirk/text-fabric-data/__apps__/quran Setting up TF kernel for quran q-ran/exercises/mining/tf Using q-ran/quran/tf - 0.3 rv0.4 in /Users/dirk/text-fabric-data Using q-ran/exercises/mining/tf - 0.3 rv0.1 in /Users/dirk/text-fabric-data TF setup done. Using TF app quran commit f6543c213dad36050de3e90373af237e9a4f0bc1 in /Users/dirk/text-fabric-data/__apps__/quran onDocker=False * Running on http://localhost:8105/ (Press CTRL+C to quit) Opening quran in browser Listening at port 18985 127.0.0.1 - - [27/Mar/2019 15:03:55] "GET / HTTP/1.1" 200 - 127.0.0.1 - - [27/Mar/2019 15:03:55] "POST /passage HTTP/1.1" 200 - 127.0.0.1 - - [27/Mar/2019 15:03:55] "POST /sections HTTP/1.1" 200 - 127.0.0.1 - - [27/Mar/2019 15:03:55] "POST /tuples HTTP/1.1" 200 - 127.0.0.1 - - [27/Mar/2019 15:03:56] "POST /query HTTP/1.1" 200 - 127.0.0.1 - - [27/Mar/2019 15:03:56] "POST / HTTP/1.1" 200 - 127.0.0.1 - - [27/Mar/2019 15:03:56] "POST /passage HTTP/1.1" 200 - 127.0.0.1 - - [27/Mar/2019 15:03:56] "POST /sections HTTP/1.1" 200 - 127.0.0.1 - - [27/Mar/2019 15:03:56] "POST /tuples HTTP/1.1" 200 - 127.0.0.1 - - [27/Mar/2019 15:03:56] "POST /query HTTP/1.1" 200 - dirk:~ > text-fabric quran --mod=q-ran/exercises/mining/tf
  • 32. Quran
  • 33. A. reasons B. a solution C. toy example of a TF datasource D. ministudy: rings and sentiments C'. an easter egg B'. new ways A'. new horizons
  • 34. Sharing and re-using data Text-Fabric has been developed by a DANS-employee as a consequence: Data export is built in ✅ Provenance tracking is built in ✅ Redistribution of newly created data is built in ✅
  • 35. sharing #1: GitHub & NBviewer work done in a Jupyter Notebook inside a GitHub repository is very sharable
  • 36.
  • 37. sharing #2: Export from TF-browser
  • 39. sharing #4: Create new features https://nbviewer.jupyter.org/github/annotation/tutorials/blob/master/quran/share.ipynb • etcbc/valence/tf : the results of the verbal valence work of Janet Dyk in the SYNVAR project; • etcbc/lingo/heads/tf : head words for phrases, work done by Cody Kingham; • ch-jensen/Semantic-mapping-of-participants/actor/tf : participant analysis in progress by Christian Høygaard-Jensen; • cmerwich/bh-reference-system/tf: participant analysis in progress by Christiaan Erwich; • nino-cunei/oldbabylonian/parallels/tf: similar lines by Dirk Roorda • q-ran/quran/parallels/tf: similar lines by Dirk Roorda • q-ran/exercises/mining/tf: sentiments (crude) by Dirk Roorda • you/quran/sentiments/tf: sentiments (refined) by You • cvlit/quran/semantics/tf: semantic fields by cvlit
  • 40. The Text-Fabric Ethos • Open source tool for corpus annotation and analysis. • Corpus data in a repository, with standard license, as free as possible • Researchers: step out of your technological comfort zones and pave the way for the ones after you • Find computational inspiration across disciplines
  • 41. Open Science Rocks thank you Dirk Roorda dirk.roorda@dans.knaw.nl