SlideShare a Scribd company logo
Towards Open Methods: Using Scientific Workflows in LinguisticsRichard Littauer1
Various tools, such as Kepler, Taverna, Vistrails, and many others have been designed in order to allow for scientific workflows to be created, executed, and shared among scientists and laboratories. Introduction2
Scientific workflows are typically used to automate the processing, analysis, and management of scientific data. Introduction3
Scientific workflows are typically used to automate the processing, analysis, and management of scientific data. They provide a way of tracing provenance and methodologies to help foster reproducible science and the publications of executable papers.Introduction4
By providing front-end visualisationsand adaptations of shell scripts and manual steps, it is easier for scientists to do their work, especially when integrating grids and parallel processing or external databases.Introduction5
How does this relate to Linguistics? Workflows in Linguistics6
How does this relate to Linguistics?Many workflow systems I've been looking at would work in the field of corpus linguistics if we merely had open source databases online to mine. Workflows in Linguistics7
How does this relate to Linguistics?Many workflow systems I've been looking at would work in the field of corpus linguistics if we merely had open source databases online to mine. They, most often, provide a way of cleaning data, and a way of processing repetitive tasks. This is directly applicable to Linguistic work.Workflows in Linguistics8
How does this relate to Open Linguistics? Workflows in Linguistics9
Promote the idea and definition, as specified in opendefinition.org of open data in linguistics and in relation to language data.Act as a central point of reference and support for people interested in open linguistic data.Provide guidance on legal issues surrounding linguistic data to the community.Build an index of indexes of open linguistic data sources and tools and link existing resources.Facilitate communication between existing groups.Serve as a mediator between providers and users of of technical infrastructure.Assemble best-practice guidelines / use cases to create, use and distribute data.Open Linguistics10
Promote the idea and definition, as specified in opendefinition.org of open data in linguistics and in relation to language data.Act as a central point of reference and support for people interested in open linguistic data.Provide guidance on legal issues surrounding linguistic data to the community.Build an index of indexes of open linguistic data sources and tools and link existing resources.Facilitate communication between existing groups.Serve as a mediator between providers and users of of technical infrastructure.Assemble best-practice guidelines / use cases to create, use and distribute data.Open Linguistics11
Promote the idea and definition, as specified in opendefinition.org of open data in linguistics and in relation to language data.Act as a central point of reference and support for people interested in open linguistic data.Provide guidance on legal issues surrounding linguistic data to the community.Build an index of indexes of open linguistic data sources and tools and link existing resources.Facilitate communication between existing groups.Serve as a mediator between providers and users of of technical infrastructure.Assemble best-practice guidelines / use cases to create, use and distribute data.Open Linguistics12
Promote the idea and definition, as specified in opendefinition.org of open data in linguistics and in relation to language data.Act as a central point of reference and support for people interested in open linguistic data.Provide guidance on legal issues surrounding linguistic data to the community.Build an index of indexes of open linguistic data sources and tools and link existing resources.Facilitate communication between existing groups.Serve as a mediator between providers and users of of technical infrastructure.Assemble best-practice guidelines / use cases to create, use and distribute data.Open Linguistics13
Promote the idea and definition, as specified in opendefinition.org of open data in linguistics and in relation to language data.Act as a central point of reference and support for people interested in open linguistic data.Provide guidance on legal issues surrounding linguistic data to the community.Build an index of indexes of open linguistic data sources and tools and link existing resources.Facilitate communication between existing groups.Serve as a mediator between providers and users of of technical infrastructure.Assemble best-practice guidelines / use cases to create, use and distribute data.Open Linguistics14
ExamplesExample workflow15
ExamplesExample workflow
This grabs the most recent XKCD comic off the web.
http://www.myexperiment.org/workflows/1370.html16
ExamplesAnother example workflow17
ExamplesAnother example workflow
This workflow retrieves relevant documents, based on a query optimized by adding a string to the original query that will rank the search output according to the most recent years.
http://www.myexperiment.org/workflows/117.html18
Hypothetical Example19
Hypothetical Example20Chinese character from a text
Hypothetical Example21[ zhi1], [zi2], [zhi2], [shi2], [ci1]Chinese character from a textDictionary Database
Hypothetical Example22[ zhi1], [zi2], [zhi2], [shi2], [ci1]Chinese character from a textDictionary DatabaseGeographical data from researcher
Hypothetical Example23[ zhi1], [zi2], [zhi2], [shi2], [ci1]Chinese character from a textDictionary DatabaseGeographical data from researcher
Hypothetical Example24[ zhi1], [zi2], [zhi2], [shi2], [ci1]Chinese character from a textDictionary DatabaseGeographical data from researcherCharacter - Proper dialect reading - definition
Use in LinguisticsSo, if we have a linked network online that is queryable25
Use in LinguisticsSo, if we have a linked network online that is queryable
Hypothetically, it should be possible to use current workflow systems to access and download data26
Use in LinguisticsSo, if we have a linked network online that is queryable
Hypothetically, it should be possible to use current workflow systems to access and download data
My hope is to see how feasible this is27
Use in Linguistics28Other use:
Use in Linguistics29Other use:Shims: data conversion workflows.
Use in Linguistics30Other use:Shims: data conversion workflows.As seen in the LexInfo slides, there are varying definitions for parts of speech (from 5 to 181 different types). Workflows could be used to standardise these after accessing the database…
Use in Linguistics31How does this help Open Methods?
Use in Linguistics32How does this help Open Methods?By keeping track of workflows and workflow systems before they start being popular, we can make sure that users upload and share their workflows to a single repository (like myExperiment.)
Use in Linguistics33How does this help Open Methods?By keeping track of workflows and workflow systems before they start being popular, we can make sure that users upload and share their workflows to a single repository (like myExperiment.)This could then be used by other linguists, along with data supplements, to produce replications, and to check methodology.
Use in Linguistics34How does this help Open Methods?Also, most workflows are now focusing more on providing provenance solutions.
Use in Linguistics35How does this help Open Methods?Also, most workflows are now focusing more on providing provenance solutions.This would make linguistics research more sharable, understandable and repeatable.
Use in LinguisticsWork going on this, currently:36

More Related Content

What's hot

Open Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | FutureOpen Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | Future
Ross Mounce
 
Interpretation, Context, and Metadata: Examples from Open Context
Interpretation, Context, and Metadata: Examples from Open ContextInterpretation, Context, and Metadata: Examples from Open Context
Interpretation, Context, and Metadata: Examples from Open Context
Eric Kansa
 
Web Data Management in the RDF Age
Web Data Management in the RDF AgeWeb Data Management in the RDF Age
Web Data Management in the RDF Age
M. Tamer Özsu
 
LOTUS: Adaptive Text Search for Big Linked Data
LOTUS: Adaptive Text Search for Big Linked DataLOTUS: Adaptive Text Search for Big Linked Data
LOTUS: Adaptive Text Search for Big Linked Data
Filip Ilievski
 
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
Armin Haller
 
Contributing to the Smart City Through Linked Library Data
Contributing to the Smart City Through Linked Library DataContributing to the Smart City Through Linked Library Data
Contributing to the Smart City Through Linked Library Data
Marcia Zeng
 
A Clean Slate?
A Clean Slate?A Clean Slate?
A Clean Slate?
Herbert Van de Sompel
 
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
Marko Rodriguez
 
Probabilistic Topic models
Probabilistic Topic modelsProbabilistic Topic models
Probabilistic Topic models
Carlos Badenes-Olmedo
 
Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...
Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...
Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...
Julien PLU
 
Linking Open Government Data at Scale
Linking Open Government Data at Scale Linking Open Government Data at Scale
Linking Open Government Data at Scale
Bernadette Hyland-Wood
 
Consuming Linked Data by Machines - WWW2010
Consuming Linked Data by Machines - WWW2010Consuming Linked Data by Machines - WWW2010
Consuming Linked Data by Machines - WWW2010
Juan Sequeda
 
Dante al tempo del web semantico
Dante al tempo del web semanticoDante al tempo del web semantico
The Network Data Structure in Computing
The Network Data Structure in ComputingThe Network Data Structure in Computing
The Network Data Structure in Computing
Marko Rodriguez
 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
National Information Standards Organization (NISO)
 
Profiling Web Archives
Profiling Web ArchivesProfiling Web Archives
Profiling Web Archives
Michael Nelson
 
Text and Data Mining explained at FTDM
Text and Data Mining explained at FTDMText and Data Mining explained at FTDM
Text and Data Mining explained at FTDM
petermurrayrust
 
Modern Tools & Rationales for 21st Century Research
Modern Tools & Rationales  for 21st Century ResearchModern Tools & Rationales  for 21st Century Research
Modern Tools & Rationales for 21st Century Research
Ross Mounce
 
Research Data Sharing: A Basic Framework
Research Data Sharing: A Basic FrameworkResearch Data Sharing: A Basic Framework
Research Data Sharing: A Basic Framework
Paul Groth
 
Current advances to bridge the usability-expressivity gap in biomedical seman...
Current advances to bridge the usability-expressivity gap in biomedical seman...Current advances to bridge the usability-expressivity gap in biomedical seman...
Current advances to bridge the usability-expressivity gap in biomedical seman...
Maulik Kamdar
 

What's hot (20)

Open Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | FutureOpen Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | Future
 
Interpretation, Context, and Metadata: Examples from Open Context
Interpretation, Context, and Metadata: Examples from Open ContextInterpretation, Context, and Metadata: Examples from Open Context
Interpretation, Context, and Metadata: Examples from Open Context
 
Web Data Management in the RDF Age
Web Data Management in the RDF AgeWeb Data Management in the RDF Age
Web Data Management in the RDF Age
 
LOTUS: Adaptive Text Search for Big Linked Data
LOTUS: Adaptive Text Search for Big Linked DataLOTUS: Adaptive Text Search for Big Linked Data
LOTUS: Adaptive Text Search for Big Linked Data
 
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
 
Contributing to the Smart City Through Linked Library Data
Contributing to the Smart City Through Linked Library DataContributing to the Smart City Through Linked Library Data
Contributing to the Smart City Through Linked Library Data
 
A Clean Slate?
A Clean Slate?A Clean Slate?
A Clean Slate?
 
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
 
Probabilistic Topic models
Probabilistic Topic modelsProbabilistic Topic models
Probabilistic Topic models
 
Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...
Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...
Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...
 
Linking Open Government Data at Scale
Linking Open Government Data at Scale Linking Open Government Data at Scale
Linking Open Government Data at Scale
 
Consuming Linked Data by Machines - WWW2010
Consuming Linked Data by Machines - WWW2010Consuming Linked Data by Machines - WWW2010
Consuming Linked Data by Machines - WWW2010
 
Dante al tempo del web semantico
Dante al tempo del web semanticoDante al tempo del web semantico
Dante al tempo del web semantico
 
The Network Data Structure in Computing
The Network Data Structure in ComputingThe Network Data Structure in Computing
The Network Data Structure in Computing
 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
 
Profiling Web Archives
Profiling Web ArchivesProfiling Web Archives
Profiling Web Archives
 
Text and Data Mining explained at FTDM
Text and Data Mining explained at FTDMText and Data Mining explained at FTDM
Text and Data Mining explained at FTDM
 
Modern Tools & Rationales for 21st Century Research
Modern Tools & Rationales  for 21st Century ResearchModern Tools & Rationales  for 21st Century Research
Modern Tools & Rationales for 21st Century Research
 
Research Data Sharing: A Basic Framework
Research Data Sharing: A Basic FrameworkResearch Data Sharing: A Basic Framework
Research Data Sharing: A Basic Framework
 
Current advances to bridge the usability-expressivity gap in biomedical seman...
Current advances to bridge the usability-expressivity gap in biomedical seman...Current advances to bridge the usability-expressivity gap in biomedical seman...
Current advances to bridge the usability-expressivity gap in biomedical seman...
 

Viewers also liked

Trends in Use of Scientific Workflows: Insights from a Public Repository and ...
Trends in Use of Scientific Workflows: Insights from a Public Repository and ...Trends in Use of Scientific Workflows: Insights from a Public Repository and ...
Trends in Use of Scientific Workflows: Insights from a Public Repository and ...
Richard Littauer
 
Academic Research in the Blogosphere: Adapting to New Risks and Opportunities...
Academic Research in the Blogosphere: Adapting to New Risks and Opportunities...Academic Research in the Blogosphere: Adapting to New Risks and Opportunities...
Academic Research in the Blogosphere: Adapting to New Risks and Opportunities...
Richard Littauer
 
Composing Domain-Specific Languages
Composing Domain-Specific LanguagesComposing Domain-Specific Languages
Composing Domain-Specific Languages
Eelco Visser
 
Static name resolution
Static name resolutionStatic name resolution
Static name resolution
Eelco Visser
 
Type analysis
Type analysisType analysis
Type analysis
Eelco Visser
 
Dynamic Semantics
Dynamic SemanticsDynamic Semantics
Dynamic Semantics
Eelco Visser
 

Viewers also liked (6)

Trends in Use of Scientific Workflows: Insights from a Public Repository and ...
Trends in Use of Scientific Workflows: Insights from a Public Repository and ...Trends in Use of Scientific Workflows: Insights from a Public Repository and ...
Trends in Use of Scientific Workflows: Insights from a Public Repository and ...
 
Academic Research in the Blogosphere: Adapting to New Risks and Opportunities...
Academic Research in the Blogosphere: Adapting to New Risks and Opportunities...Academic Research in the Blogosphere: Adapting to New Risks and Opportunities...
Academic Research in the Blogosphere: Adapting to New Risks and Opportunities...
 
Composing Domain-Specific Languages
Composing Domain-Specific LanguagesComposing Domain-Specific Languages
Composing Domain-Specific Languages
 
Static name resolution
Static name resolutionStatic name resolution
Static name resolution
 
Type analysis
Type analysisType analysis
Type analysis
 
Dynamic Semantics
Dynamic SemanticsDynamic Semantics
Dynamic Semantics
 

Similar to Towards Open Methods: Using Scientific Workflows in Linguistics

Open sciencerefresher2019
Open sciencerefresher2019Open sciencerefresher2019
Open sciencerefresher2019
heila1
 
UKON 2014
UKON 2014UKON 2014
Final Johnson Research Libraries and Computational Research
Final Johnson Research Libraries and Computational ResearchFinal Johnson Research Libraries and Computational Research
Final Johnson Research Libraries and Computational Research
National Information Standards Organization (NISO)
 
12.10.14 Slides, “Roadmap to the Future of SHARE”
12.10.14 Slides, “Roadmap to the Future of SHARE”12.10.14 Slides, “Roadmap to the Future of SHARE”
12.10.14 Slides, “Roadmap to the Future of SHARE”
DuraSpace
 
Open Opportunities
Open OpportunitiesOpen Opportunities
Open Opportunities
Russ White
 
OpenMinTeD: Making Sense of Large Volumes of Data
OpenMinTeD: Making Sense of Large Volumes of DataOpenMinTeD: Making Sense of Large Volumes of Data
OpenMinTeD: Making Sense of Large Volumes of Data
openminted_eu
 
Data and science
Data and scienceData and science
Data and science
Anand Deshpande
 
Open science / open research
Open science / open researchOpen science / open research
Open science / open research
heila1
 
Research resources: curating the new eagle-i discovery system
Research resources: curating the new eagle-i discovery systemResearch resources: curating the new eagle-i discovery system
Research resources: curating the new eagle-i discovery system
Nicole Vasilevsky
 
AH-XLDBEurope-position-09 jun2011
AH-XLDBEurope-position-09 jun2011AH-XLDBEurope-position-09 jun2011
AH-XLDBEurope-position-09 jun2011
Alex Hardisty
 
Making working thesauri
Making working thesauriMaking working thesauri
Making working thesauri
liddy
 
Open Access: Prospectors Wanted!
Open Access: Prospectors Wanted!Open Access: Prospectors Wanted!
Open Access: Prospectors Wanted!
Amos Kujenga
 
L&P Eric Celeste - SHARE
L&P Eric Celeste -  SHAREL&P Eric Celeste -  SHARE
L&P Eric Celeste - SHARE
CASRAI
 
Overview of open access progress globally
Overview of open access progress globallyOverview of open access progress globally
Overview of open access progress globally
Iryna Kuchma
 
Towards an Open Research Knowledge Graph
Towards an Open Research Knowledge GraphTowards an Open Research Knowledge Graph
Towards an Open Research Knowledge Graph
Sören Auer
 
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
Robert H. McDonald
 
Open access for researchers, policy makers and research managers - Short ver...
Open access  for researchers, policy makers and research managers - Short ver...Open access  for researchers, policy makers and research managers - Short ver...
Open access for researchers, policy makers and research managers - Short ver...
Iryna Kuchma
 
A demonstration of transparent and scalable OpenURL quality metrics for use i...
A demonstration of transparent and scalable OpenURL quality metrics for use i...A demonstration of transparent and scalable OpenURL quality metrics for use i...
A demonstration of transparent and scalable OpenURL quality metrics for use i...
alc28
 
Reshaping the world of scholarly communication by Dr. Usha Munshi
Reshaping the world of scholarly communication by Dr. Usha MunshiReshaping the world of scholarly communication by Dr. Usha Munshi
Reshaping the world of scholarly communication by Dr. Usha Munshi
Ata Rehman
 
OpenAIRE-connect: Services for open science
OpenAIRE-connect: Services for open scienceOpenAIRE-connect: Services for open science
OpenAIRE-connect: Services for open science
Jisc
 

Similar to Towards Open Methods: Using Scientific Workflows in Linguistics (20)

Open sciencerefresher2019
Open sciencerefresher2019Open sciencerefresher2019
Open sciencerefresher2019
 
UKON 2014
UKON 2014UKON 2014
UKON 2014
 
Final Johnson Research Libraries and Computational Research
Final Johnson Research Libraries and Computational ResearchFinal Johnson Research Libraries and Computational Research
Final Johnson Research Libraries and Computational Research
 
12.10.14 Slides, “Roadmap to the Future of SHARE”
12.10.14 Slides, “Roadmap to the Future of SHARE”12.10.14 Slides, “Roadmap to the Future of SHARE”
12.10.14 Slides, “Roadmap to the Future of SHARE”
 
Open Opportunities
Open OpportunitiesOpen Opportunities
Open Opportunities
 
OpenMinTeD: Making Sense of Large Volumes of Data
OpenMinTeD: Making Sense of Large Volumes of DataOpenMinTeD: Making Sense of Large Volumes of Data
OpenMinTeD: Making Sense of Large Volumes of Data
 
Data and science
Data and scienceData and science
Data and science
 
Open science / open research
Open science / open researchOpen science / open research
Open science / open research
 
Research resources: curating the new eagle-i discovery system
Research resources: curating the new eagle-i discovery systemResearch resources: curating the new eagle-i discovery system
Research resources: curating the new eagle-i discovery system
 
AH-XLDBEurope-position-09 jun2011
AH-XLDBEurope-position-09 jun2011AH-XLDBEurope-position-09 jun2011
AH-XLDBEurope-position-09 jun2011
 
Making working thesauri
Making working thesauriMaking working thesauri
Making working thesauri
 
Open Access: Prospectors Wanted!
Open Access: Prospectors Wanted!Open Access: Prospectors Wanted!
Open Access: Prospectors Wanted!
 
L&P Eric Celeste - SHARE
L&P Eric Celeste -  SHAREL&P Eric Celeste -  SHARE
L&P Eric Celeste - SHARE
 
Overview of open access progress globally
Overview of open access progress globallyOverview of open access progress globally
Overview of open access progress globally
 
Towards an Open Research Knowledge Graph
Towards an Open Research Knowledge GraphTowards an Open Research Knowledge Graph
Towards an Open Research Knowledge Graph
 
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
 
Open access for researchers, policy makers and research managers - Short ver...
Open access  for researchers, policy makers and research managers - Short ver...Open access  for researchers, policy makers and research managers - Short ver...
Open access for researchers, policy makers and research managers - Short ver...
 
A demonstration of transparent and scalable OpenURL quality metrics for use i...
A demonstration of transparent and scalable OpenURL quality metrics for use i...A demonstration of transparent and scalable OpenURL quality metrics for use i...
A demonstration of transparent and scalable OpenURL quality metrics for use i...
 
Reshaping the world of scholarly communication by Dr. Usha Munshi
Reshaping the world of scholarly communication by Dr. Usha MunshiReshaping the world of scholarly communication by Dr. Usha Munshi
Reshaping the world of scholarly communication by Dr. Usha Munshi
 
OpenAIRE-connect: Services for open science
OpenAIRE-connect: Services for open scienceOpenAIRE-connect: Services for open science
OpenAIRE-connect: Services for open science
 

More from Richard Littauer

Named Entity Recognition - ACL 2011 Presentation
Named Entity Recognition - ACL 2011 PresentationNamed Entity Recognition - ACL 2011 Presentation
Named Entity Recognition - ACL 2011 Presentation
Richard Littauer
 
Marcu 2000 presentation
Marcu 2000 presentationMarcu 2000 presentation
Marcu 2000 presentation
Richard Littauer
 
Barzilay & Lapata 2008 presentation
Barzilay & Lapata 2008 presentationBarzilay & Lapata 2008 presentation
Barzilay & Lapata 2008 presentation
Richard Littauer
 
Saarland and UdS
Saarland and UdSSaarland and UdS
Saarland and UdS
Richard Littauer
 
Building Corpora from Social Media
Building Corpora from Social MediaBuilding Corpora from Social Media
Building Corpora from Social Media
Richard Littauer
 
Visualising Typological Relationships: Plotting WALS with Heat Maps
Visualising Typological Relationships: Plotting WALS with Heat MapsVisualising Typological Relationships: Plotting WALS with Heat Maps
Visualising Typological Relationships: Plotting WALS with Heat Maps
Richard Littauer
 
On Tocharian Exceptionality to the centum/satem Isogloss
On Tocharian Exceptionality to the centum/satem IsoglossOn Tocharian Exceptionality to the centum/satem Isogloss
On Tocharian Exceptionality to the centum/satem Isogloss
Richard Littauer
 
The Evolution of Morphological Agreement
The Evolution of Morphological AgreementThe Evolution of Morphological Agreement
The Evolution of Morphological Agreement
Richard Littauer
 
Evolution of Morphological Agreement - Peche Kucha
Evolution of Morphological Agreement - Peche KuchaEvolution of Morphological Agreement - Peche Kucha
Evolution of Morphological Agreement - Peche Kucha
Richard Littauer
 
Workflow Classification and Open-Sourcing Methods: Towards a New Publication ...
Workflow Classification and Open-Sourcing Methods: Towards a New Publication ...Workflow Classification and Open-Sourcing Methods: Towards a New Publication ...
Workflow Classification and Open-Sourcing Methods: Towards a New Publication ...
Richard Littauer
 
The Evolution of Speech Segmentation: A Computer Simulation
The Evolution of Speech Segmentation: A Computer SimulationThe Evolution of Speech Segmentation: A Computer Simulation
The Evolution of Speech Segmentation: A Computer Simulation
Richard Littauer
 
A Reanalysis of Anatomical Changes for Language
A Reanalysis of Anatomical Changes for LanguageA Reanalysis of Anatomical Changes for Language
A Reanalysis of Anatomical Changes for Language
Richard Littauer
 

More from Richard Littauer (12)

Named Entity Recognition - ACL 2011 Presentation
Named Entity Recognition - ACL 2011 PresentationNamed Entity Recognition - ACL 2011 Presentation
Named Entity Recognition - ACL 2011 Presentation
 
Marcu 2000 presentation
Marcu 2000 presentationMarcu 2000 presentation
Marcu 2000 presentation
 
Barzilay & Lapata 2008 presentation
Barzilay & Lapata 2008 presentationBarzilay & Lapata 2008 presentation
Barzilay & Lapata 2008 presentation
 
Saarland and UdS
Saarland and UdSSaarland and UdS
Saarland and UdS
 
Building Corpora from Social Media
Building Corpora from Social MediaBuilding Corpora from Social Media
Building Corpora from Social Media
 
Visualising Typological Relationships: Plotting WALS with Heat Maps
Visualising Typological Relationships: Plotting WALS with Heat MapsVisualising Typological Relationships: Plotting WALS with Heat Maps
Visualising Typological Relationships: Plotting WALS with Heat Maps
 
On Tocharian Exceptionality to the centum/satem Isogloss
On Tocharian Exceptionality to the centum/satem IsoglossOn Tocharian Exceptionality to the centum/satem Isogloss
On Tocharian Exceptionality to the centum/satem Isogloss
 
The Evolution of Morphological Agreement
The Evolution of Morphological AgreementThe Evolution of Morphological Agreement
The Evolution of Morphological Agreement
 
Evolution of Morphological Agreement - Peche Kucha
Evolution of Morphological Agreement - Peche KuchaEvolution of Morphological Agreement - Peche Kucha
Evolution of Morphological Agreement - Peche Kucha
 
Workflow Classification and Open-Sourcing Methods: Towards a New Publication ...
Workflow Classification and Open-Sourcing Methods: Towards a New Publication ...Workflow Classification and Open-Sourcing Methods: Towards a New Publication ...
Workflow Classification and Open-Sourcing Methods: Towards a New Publication ...
 
The Evolution of Speech Segmentation: A Computer Simulation
The Evolution of Speech Segmentation: A Computer SimulationThe Evolution of Speech Segmentation: A Computer Simulation
The Evolution of Speech Segmentation: A Computer Simulation
 
A Reanalysis of Anatomical Changes for Language
A Reanalysis of Anatomical Changes for LanguageA Reanalysis of Anatomical Changes for Language
A Reanalysis of Anatomical Changes for Language
 

Recently uploaded

[Talk] Moving Beyond Spaghetti Infrastructure [AOTB] 2024-07-04.pdf
[Talk] Moving Beyond Spaghetti Infrastructure [AOTB] 2024-07-04.pdf[Talk] Moving Beyond Spaghetti Infrastructure [AOTB] 2024-07-04.pdf
[Talk] Moving Beyond Spaghetti Infrastructure [AOTB] 2024-07-04.pdf
Kief Morris
 
Password Rotation in 2024 is still Relevant
Password Rotation in 2024 is still RelevantPassword Rotation in 2024 is still Relevant
Password Rotation in 2024 is still Relevant
Bert Blevins
 
How to Build a Profitable IoT Product.pptx
How to Build a Profitable IoT Product.pptxHow to Build a Profitable IoT Product.pptx
How to Build a Profitable IoT Product.pptx
Adam Dunkels
 
Girls Call Churchgate 9910780858 Provide Best And Top Girl Service And No1 in...
Girls Call Churchgate 9910780858 Provide Best And Top Girl Service And No1 in...Girls Call Churchgate 9910780858 Provide Best And Top Girl Service And No1 in...
Girls Call Churchgate 9910780858 Provide Best And Top Girl Service And No1 in...
maigasapphire
 
Amul milk launches in US: Key details of its new products ...
Amul milk launches in US: Key details of its new products ...Amul milk launches in US: Key details of its new products ...
Amul milk launches in US: Key details of its new products ...
chetankumar9855
 
How RPA Help in the Transportation and Logistics Industry.pptx
How RPA Help in the Transportation and Logistics Industry.pptxHow RPA Help in the Transportation and Logistics Industry.pptx
How RPA Help in the Transportation and Logistics Industry.pptx
SynapseIndia
 
Comparison Table of DiskWarrior Alternatives.pdf
Comparison Table of DiskWarrior Alternatives.pdfComparison Table of DiskWarrior Alternatives.pdf
Comparison Table of DiskWarrior Alternatives.pdf
Andrey Yasko
 
(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...
(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...
(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...
Priyanka Aash
 
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdf
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdfBT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdf
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdf
Neo4j
 
High Profile Girls call Service Pune 000XX00000 Provide Best And Top Girl Ser...
High Profile Girls call Service Pune 000XX00000 Provide Best And Top Girl Ser...High Profile Girls call Service Pune 000XX00000 Provide Best And Top Girl Ser...
High Profile Girls call Service Pune 000XX00000 Provide Best And Top Girl Ser...
bhumivarma35300
 
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
aslasdfmkhan4750
 
ARTIFICIAL INTELLIGENCE (AI) IN MUSIC.pdf
ARTIFICIAL INTELLIGENCE (AI) IN MUSIC.pdfARTIFICIAL INTELLIGENCE (AI) IN MUSIC.pdf
ARTIFICIAL INTELLIGENCE (AI) IN MUSIC.pdf
Inglês no Mundo Digital
 
Pigging Unit Lubricant Oil Blending Plant
Pigging Unit Lubricant Oil Blending PlantPigging Unit Lubricant Oil Blending Plant
Pigging Unit Lubricant Oil Blending Plant
LINUS PROJECTS (INDIA)
 
Choose our Linux Web Hosting for a seamless and successful online presence
Choose our Linux Web Hosting for a seamless and successful online presenceChoose our Linux Web Hosting for a seamless and successful online presence
Choose our Linux Web Hosting for a seamless and successful online presence
rajancomputerfbd
 
The Evolution of Remote Server Management
The Evolution of Remote Server ManagementThe Evolution of Remote Server Management
The Evolution of Remote Server Management
Bert Blevins
 
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and OllamaTirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Zilliz
 
The Role of Technology in Payroll Statutory Compliance (1).pdf
The Role of Technology in Payroll Statutory Compliance (1).pdfThe Role of Technology in Payroll Statutory Compliance (1).pdf
The Role of Technology in Payroll Statutory Compliance (1).pdf
paysquare consultancy
 
(CISOPlatform Summit & SACON 2024) Digital Personal Data Protection Act.pdf
(CISOPlatform Summit & SACON 2024) Digital Personal Data Protection Act.pdf(CISOPlatform Summit & SACON 2024) Digital Personal Data Protection Act.pdf
(CISOPlatform Summit & SACON 2024) Digital Personal Data Protection Act.pdf
Priyanka Aash
 
Understanding Insider Security Threats: Types, Examples, Effects, and Mitigat...
Understanding Insider Security Threats: Types, Examples, Effects, and Mitigat...Understanding Insider Security Threats: Types, Examples, Effects, and Mitigat...
Understanding Insider Security Threats: Types, Examples, Effects, and Mitigat...
Bert Blevins
 
Implementations of Fused Deposition Modeling in real world
Implementations of Fused Deposition Modeling  in real worldImplementations of Fused Deposition Modeling  in real world
Implementations of Fused Deposition Modeling in real world
Emerging Tech
 

Recently uploaded (20)

[Talk] Moving Beyond Spaghetti Infrastructure [AOTB] 2024-07-04.pdf
[Talk] Moving Beyond Spaghetti Infrastructure [AOTB] 2024-07-04.pdf[Talk] Moving Beyond Spaghetti Infrastructure [AOTB] 2024-07-04.pdf
[Talk] Moving Beyond Spaghetti Infrastructure [AOTB] 2024-07-04.pdf
 
Password Rotation in 2024 is still Relevant
Password Rotation in 2024 is still RelevantPassword Rotation in 2024 is still Relevant
Password Rotation in 2024 is still Relevant
 
How to Build a Profitable IoT Product.pptx
How to Build a Profitable IoT Product.pptxHow to Build a Profitable IoT Product.pptx
How to Build a Profitable IoT Product.pptx
 
Girls Call Churchgate 9910780858 Provide Best And Top Girl Service And No1 in...
Girls Call Churchgate 9910780858 Provide Best And Top Girl Service And No1 in...Girls Call Churchgate 9910780858 Provide Best And Top Girl Service And No1 in...
Girls Call Churchgate 9910780858 Provide Best And Top Girl Service And No1 in...
 
Amul milk launches in US: Key details of its new products ...
Amul milk launches in US: Key details of its new products ...Amul milk launches in US: Key details of its new products ...
Amul milk launches in US: Key details of its new products ...
 
How RPA Help in the Transportation and Logistics Industry.pptx
How RPA Help in the Transportation and Logistics Industry.pptxHow RPA Help in the Transportation and Logistics Industry.pptx
How RPA Help in the Transportation and Logistics Industry.pptx
 
Comparison Table of DiskWarrior Alternatives.pdf
Comparison Table of DiskWarrior Alternatives.pdfComparison Table of DiskWarrior Alternatives.pdf
Comparison Table of DiskWarrior Alternatives.pdf
 
(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...
(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...
(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...
 
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdf
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdfBT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdf
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdf
 
High Profile Girls call Service Pune 000XX00000 Provide Best And Top Girl Ser...
High Profile Girls call Service Pune 000XX00000 Provide Best And Top Girl Ser...High Profile Girls call Service Pune 000XX00000 Provide Best And Top Girl Ser...
High Profile Girls call Service Pune 000XX00000 Provide Best And Top Girl Ser...
 
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
 
ARTIFICIAL INTELLIGENCE (AI) IN MUSIC.pdf
ARTIFICIAL INTELLIGENCE (AI) IN MUSIC.pdfARTIFICIAL INTELLIGENCE (AI) IN MUSIC.pdf
ARTIFICIAL INTELLIGENCE (AI) IN MUSIC.pdf
 
Pigging Unit Lubricant Oil Blending Plant
Pigging Unit Lubricant Oil Blending PlantPigging Unit Lubricant Oil Blending Plant
Pigging Unit Lubricant Oil Blending Plant
 
Choose our Linux Web Hosting for a seamless and successful online presence
Choose our Linux Web Hosting for a seamless and successful online presenceChoose our Linux Web Hosting for a seamless and successful online presence
Choose our Linux Web Hosting for a seamless and successful online presence
 
The Evolution of Remote Server Management
The Evolution of Remote Server ManagementThe Evolution of Remote Server Management
The Evolution of Remote Server Management
 
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and OllamaTirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
 
The Role of Technology in Payroll Statutory Compliance (1).pdf
The Role of Technology in Payroll Statutory Compliance (1).pdfThe Role of Technology in Payroll Statutory Compliance (1).pdf
The Role of Technology in Payroll Statutory Compliance (1).pdf
 
(CISOPlatform Summit & SACON 2024) Digital Personal Data Protection Act.pdf
(CISOPlatform Summit & SACON 2024) Digital Personal Data Protection Act.pdf(CISOPlatform Summit & SACON 2024) Digital Personal Data Protection Act.pdf
(CISOPlatform Summit & SACON 2024) Digital Personal Data Protection Act.pdf
 
Understanding Insider Security Threats: Types, Examples, Effects, and Mitigat...
Understanding Insider Security Threats: Types, Examples, Effects, and Mitigat...Understanding Insider Security Threats: Types, Examples, Effects, and Mitigat...
Understanding Insider Security Threats: Types, Examples, Effects, and Mitigat...
 
Implementations of Fused Deposition Modeling in real world
Implementations of Fused Deposition Modeling  in real worldImplementations of Fused Deposition Modeling  in real world
Implementations of Fused Deposition Modeling in real world
 

Towards Open Methods: Using Scientific Workflows in Linguistics

  • 1. Towards Open Methods: Using Scientific Workflows in LinguisticsRichard Littauer1
  • 2. Various tools, such as Kepler, Taverna, Vistrails, and many others have been designed in order to allow for scientific workflows to be created, executed, and shared among scientists and laboratories. Introduction2
  • 3. Scientific workflows are typically used to automate the processing, analysis, and management of scientific data. Introduction3
  • 4. Scientific workflows are typically used to automate the processing, analysis, and management of scientific data. They provide a way of tracing provenance and methodologies to help foster reproducible science and the publications of executable papers.Introduction4
  • 5. By providing front-end visualisationsand adaptations of shell scripts and manual steps, it is easier for scientists to do their work, especially when integrating grids and parallel processing or external databases.Introduction5
  • 6. How does this relate to Linguistics? Workflows in Linguistics6
  • 7. How does this relate to Linguistics?Many workflow systems I've been looking at would work in the field of corpus linguistics if we merely had open source databases online to mine. Workflows in Linguistics7
  • 8. How does this relate to Linguistics?Many workflow systems I've been looking at would work in the field of corpus linguistics if we merely had open source databases online to mine. They, most often, provide a way of cleaning data, and a way of processing repetitive tasks. This is directly applicable to Linguistic work.Workflows in Linguistics8
  • 9. How does this relate to Open Linguistics? Workflows in Linguistics9
  • 10. Promote the idea and definition, as specified in opendefinition.org of open data in linguistics and in relation to language data.Act as a central point of reference and support for people interested in open linguistic data.Provide guidance on legal issues surrounding linguistic data to the community.Build an index of indexes of open linguistic data sources and tools and link existing resources.Facilitate communication between existing groups.Serve as a mediator between providers and users of of technical infrastructure.Assemble best-practice guidelines / use cases to create, use and distribute data.Open Linguistics10
  • 11. Promote the idea and definition, as specified in opendefinition.org of open data in linguistics and in relation to language data.Act as a central point of reference and support for people interested in open linguistic data.Provide guidance on legal issues surrounding linguistic data to the community.Build an index of indexes of open linguistic data sources and tools and link existing resources.Facilitate communication between existing groups.Serve as a mediator between providers and users of of technical infrastructure.Assemble best-practice guidelines / use cases to create, use and distribute data.Open Linguistics11
  • 12. Promote the idea and definition, as specified in opendefinition.org of open data in linguistics and in relation to language data.Act as a central point of reference and support for people interested in open linguistic data.Provide guidance on legal issues surrounding linguistic data to the community.Build an index of indexes of open linguistic data sources and tools and link existing resources.Facilitate communication between existing groups.Serve as a mediator between providers and users of of technical infrastructure.Assemble best-practice guidelines / use cases to create, use and distribute data.Open Linguistics12
  • 13. Promote the idea and definition, as specified in opendefinition.org of open data in linguistics and in relation to language data.Act as a central point of reference and support for people interested in open linguistic data.Provide guidance on legal issues surrounding linguistic data to the community.Build an index of indexes of open linguistic data sources and tools and link existing resources.Facilitate communication between existing groups.Serve as a mediator between providers and users of of technical infrastructure.Assemble best-practice guidelines / use cases to create, use and distribute data.Open Linguistics13
  • 14. Promote the idea and definition, as specified in opendefinition.org of open data in linguistics and in relation to language data.Act as a central point of reference and support for people interested in open linguistic data.Provide guidance on legal issues surrounding linguistic data to the community.Build an index of indexes of open linguistic data sources and tools and link existing resources.Facilitate communication between existing groups.Serve as a mediator between providers and users of of technical infrastructure.Assemble best-practice guidelines / use cases to create, use and distribute data.Open Linguistics14
  • 17. This grabs the most recent XKCD comic off the web.
  • 21. This workflow retrieves relevant documents, based on a query optimized by adding a string to the original query that will rank the search output according to the most recent years.
  • 25. Hypothetical Example21[ zhi1], [zi2], [zhi2], [shi2], [ci1]Chinese character from a textDictionary Database
  • 26. Hypothetical Example22[ zhi1], [zi2], [zhi2], [shi2], [ci1]Chinese character from a textDictionary DatabaseGeographical data from researcher
  • 27. Hypothetical Example23[ zhi1], [zi2], [zhi2], [shi2], [ci1]Chinese character from a textDictionary DatabaseGeographical data from researcher
  • 28. Hypothetical Example24[ zhi1], [zi2], [zhi2], [shi2], [ci1]Chinese character from a textDictionary DatabaseGeographical data from researcherCharacter - Proper dialect reading - definition
  • 29. Use in LinguisticsSo, if we have a linked network online that is queryable25
  • 30. Use in LinguisticsSo, if we have a linked network online that is queryable
  • 31. Hypothetically, it should be possible to use current workflow systems to access and download data26
  • 32. Use in LinguisticsSo, if we have a linked network online that is queryable
  • 33. Hypothetically, it should be possible to use current workflow systems to access and download data
  • 34. My hope is to see how feasible this is27
  • 36. Use in Linguistics29Other use:Shims: data conversion workflows.
  • 37. Use in Linguistics30Other use:Shims: data conversion workflows.As seen in the LexInfo slides, there are varying definitions for parts of speech (from 5 to 181 different types). Workflows could be used to standardise these after accessing the database…
  • 38. Use in Linguistics31How does this help Open Methods?
  • 39. Use in Linguistics32How does this help Open Methods?By keeping track of workflows and workflow systems before they start being popular, we can make sure that users upload and share their workflows to a single repository (like myExperiment.)
  • 40. Use in Linguistics33How does this help Open Methods?By keeping track of workflows and workflow systems before they start being popular, we can make sure that users upload and share their workflows to a single repository (like myExperiment.)This could then be used by other linguists, along with data supplements, to produce replications, and to check methodology.
  • 41. Use in Linguistics34How does this help Open Methods?Also, most workflows are now focusing more on providing provenance solutions.
  • 42. Use in Linguistics35How does this help Open Methods?Also, most workflows are now focusing more on providing provenance solutions.This would make linguistics research more sharable, understandable and repeatable.
  • 43. Use in LinguisticsWork going on this, currently:36
  • 44. Use in LinguisticsWork going on this, currently:Steiner Lydia, Peter F. Stadler, Michael Cysouw. 2011. A Pipeline for Computational Historical Linguistics. Language Dynamics and Change, p. 89-127.37
  • 45. More InformationPlaces to look for more information:http://notebooks.dataone.org/workflows38
  • 46. More InformationPlaces to look for more information:http://notebooks.dataone.org/workflowshttps://kepler-project.org/39
  • 47. More InformationPlaces to look for more information:http://notebooks.dataone.org/workflowshttps://kepler-project.org/http://www.taverna.org.uk/40
  • 48. More InformationPlaces to look for more information:http://notebooks.dataone.org/workflowshttps://kepler-project.org/http://www.taverna.org.uk/http://www.myexperiment.org41
  • 49. More InformationPlaces to look for more information:http://notebooks.dataone.org/workflowshttps://kepler-project.org/http://www.taverna.org.uk/http://www.myexperiment.orghttp://www.mendeley.com/groups/1235381/workflows-in-linguistics/42
  • 50. More InformationPlaces to look for more information:http://notebooks.dataone.org/workflowshttps://kepler-project.org/http://www.taverna.org.uk/http://www.myexperiment.orghttp://www.mendeley.com/groups/1235381/workflows-in-linguistics/Thank you. Questions?43