SlideShare a Scribd company logo
1 of 60
Download to read offline
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Visual Analytics for Linguistics - Day 4
Olga Scrivner
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
What You Will Learn
DAY 1 Introduction to Visual Analytics
DAY 2 Visualization Methods, Design, and Tools
DAY 3 Working with Unstructured Data
DAY 4 Working with Structured Data
DAY 5 Advanced Analytics
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Our Materials - Web Site
http:
//obscrivn.wixsite.com/visualization
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
What We Need
Interactive Text Mining Suite
Language Variation Suite
Download R code
Download files from Day 3 and Day 4
R and Rstudio
R libraries: tm, party, partykit
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
What We Need
Interactive Text Mining Suite
Language Variation Suite
Download R code
Download files from Day 3 and Day 4
R and Rstudio
R libraries: tm, party, partykit
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Text Mining Review
Intro to ITMS - slides Day 3
R coding
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Text Mining Review
1. Open RStudio
2. Open R file textmining.R:
3. Set up working directory
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Text Mining Package - tm
Main structure - corpus
Corpus is constructed via DirSource, VectorSource,
DataframeSource
mycorpus <- Corpus(VectorSource(file.txt))
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Preprocessing with tm
lower case
mycorpus <- tm_map(mycorpus, tolower)
remove punctuation
mycorpus <- tm_map(mycorpus,
removePunctuation)
remove numbers
mycorpus <- tm_map(mycorpus, removeNumbers)
remove stopwords
mycorpus <- tm_map(mycorpus, removeWords,
stopwords(’english’))
mycorpus <- tm_map(mycorpus, stripWhitespace)
mycorpus <- tm_map(mycorpus,
PlainTextDocument)
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
What is LVS?
Language Variation Suite
It is a Shiny web application originally designed for data
analysis in sociolinguistic research.
It can be used for:
Processing spreadsheet data
Visualizing data
Analyzing means, regression, conditional trees ...
(and much more)
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Background
LVS is built in R using Shiny package:
1. R - a free programming language for statistical
computing and graphics
2. Shiny App - a web application framework for R
Computational power of R + Web interactivity
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Background
http://littleactuary.github.io/blog/Web-application-framework-with-Shiny/
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Workspace
Browser
Chrome, Firefox, Safari - recommendable
Explorer may cause instability issues
Accessibility
PC, Mac, Linux
Data files will be uploaded from any location on your
computer
Smart Phone
Data files must be on a cloud platform connected to
your phone account (e.g. dropbox)
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Server
Since LVS is hosted on a server, Shiny idle time-out settings
may stop application when it is left inactive (it will grey out).
Solution: Click reload and re-upload your csv file
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Data Preparation
Important things to consider before data entry:
File format:
Comma separated value (CSV) - faster processing
Excel format will slow processing
Column names should not contain spaces
Permitted: non-accented characters, numbers,
underscore, hyphen, and period
One column must contain your dependent variable
The rest of the columns contain independent variables
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Terminology Review
a. Categorical - non-numerical data with two values
yes - no; male - female
b. Continuous - numerical data
duration, age, year
c. Multinomial - non-numerical data with three or more
values
regions, nationalities
d. Ordinal - scale: currently not supported
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Terminology Review
a. Categorical - non-numerical data with two values
yes - no; male - female
b. Continuous - numerical data
duration, age, year
c. Multinomial - non-numerical data with three or more
values
regions, nationalities
d. Ordinal - scale: currently not supported
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Workshop Files
1. http://obscrivn.wixsite.com/visualization
Download file Day 4: movie_metadata.csv
Simplified set from https://www.kaggle.com/
deepmatrix/imdb-5000-movie-dataset
2. LVS web site: http://languagevariationsuite.com/
http://cl.indiana.edu/~obscrivn/docs/movie_metadata.csv
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Movie Data
Budget
Director
Actor 1
Director facebook likes
Actor 1 facebook likes
Genre
Year
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Language Variation Suite - Structure
1. Data
Upload file, data summary, adjust data, cross tabulation
2. Visual Analysis
Plotting, cluster classification
3. Inferential Statistics
Modeling, regression, conditional trees, random forest
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Language Variation Suite - Structure
1. Data
Upload file, data summary, adjust data, cross tabulation
2. Visual Analysis
Plotting, cluster classification
3. Inferential Statistics
Modeling, regression, conditional trees, random forest
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Upload File
Upload movie_metadata.csv
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Uploaded Dataset
The data content is imported as a table and allows for
sorting columns.
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Summary
Summary provides a quantitative summary for each variable,
e.g. frequency count, mean, median.
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Data Structure
1. Total number of observations (rows)
2. Number of variables (columns)
3. Variable types
Factor - categorical values
Num - numeric values (0.95, 1.05)
Int - integer values (1, 2, 3)
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Cross Tabulation
Cross-tabulation examines the relationship between variables.
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Cross Tabulation Plot
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Cross Tabulation Plot
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Adjusting Browser - Layout
Shiny pages are fluid and reactive.
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Adjusting Browser - Layout
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Language Variation Suite - Structure
1. Data
Upload file, data summary, adjust data, cross tabulation
2. Visual Analysis
Plotting, cluster classification
3. Inferential statistics
Modeling, regression, varbrul analysis, conditional trees,
random forest
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
One Variable Plot
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
One Variable Plot
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Customizing Plot
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Customizing Plot
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Saving Plot
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Cluster Plot
Classification of data into sub-groups is based on
pairwise similarities
Groups are clustered in the form of a tree-like
dendrogram
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Cluster Plot
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Cluster Plot
Group 1 Animation, Biography and Group 2 Action, Drama,
Comedy
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Inferential Statistics
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Language Variation Suite - Structure
1. Data
Upload file, data summary, adjust data, cross tabulation
2. Visual Analysis
Plotting, cluster classification
3. Inferential statistics
Modeling, regression, varbrul analysis, conditional trees,
random forest
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
How to Create a Regression Model
Step 1 Modeling - create a model with dependent
and independent variables
Step 2 Regression - specify the type of regression
(fixed, mixed) and type of dependent variable
(binary, continuous, multinomial)
Step 3 Stepwise Regression - compare models
(Log-likelihood, AIC, BIC)
Step 4 Conditional Trees - apply non-parametric
tests to the model
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Modeling
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Regression Types
Model
a.) Fixed effect
b.) Mixed effect - individual speaker/token variation (within
group)
Type of Dependent Variable
a.) Binary/categorical (only two values)
b.) Continuous (numeric)
c.) Multinomial - categorical with more than two values
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Regression
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Model Output
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Model Output
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Interpretation: Budget and Genre
Genre Action is the reference value
Positive coefficient - positive effect
Negative coefficient - negative effect
http://www.free-online-calculator-use.com/scientific-notation-converter.html
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Interpretation: Budget and Genre
Genre Action is the reference value
Positive coefficient - positive effect
Negative coefficient - negative effect
http://www.free-online-calculator-use.com/scientific-notation-converter.html
exponential notation:
1.46e-7
0.000000146
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Conditional Tree
Conditional tree: a simple non-parametric regression analysis,
commonly used in social and psychological studies
Linear regression: all information is combined linearly
Conditional tree regression: visual splitting to capture
interaction between variables
Recursive splitting (tree branches)
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Conditional Tree
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Conditional Tree
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Conditional Tree
1. Genre is the significant factor for budget
2. Budget distribution is split in two groups:
Action and Animation
Biography, Comedy and Drama
3. Budget is significantly higher for Animation and Action
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Random Forest
1. Variable importance for predictors
2. Robust technique with small n large p data
3. All predictors considered jointly (allows for inclusion of
correlated factors)
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Random Forest
Let’s add more factors!
Return to Modeling
Add independent factors: director facebook likes,
actor 1 facebook likes, title year
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Random Forest
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Random Forest
Genre is the most important predictor for this model.
Close to zero or red-dotted line (cut off values) -
irrelevant for this model
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Assignment
Select LVS or ITMS
Upload your own file (csv for LVS or txt/pdf for ITMS)
Explore your data
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
References I
Baayen, Harald. 2008. Analyzing linguistic data: A practical introduction to statistics.
Cambridge: Cambridge University Press
Gries, Stefan Th. 2015. Quantitative designs and statistical techniques. In Douglas
Biber Randi Reppen (eds.), The Cambridge Handbook of English Corpus Linguistics.
Cambridge: Cambridge University Press
Schnapp, Jeffrey, and Peter Presner. 2009. Digital Humanities Manifesto 2.0.
http://gifsanimados.espaciolatino.com/x_bob_esponja_8.gif
https://daniellestolt.files.wordpress.com/2013/01/are-you-ready1.gif
http://www.martijnwieling.nl/R/sheets.pdf
Visual Analytics
for Linguistics -
Day 4
Olga Scrivner
Course Info
tm Package
Statistical
Visualization
Data Preparation
LVS
Working with
Data
Visual Analytics
Inferential
Analysis
Resources
http://www.rdatamining.com/examples/text-mining
https:
//en.wikibooks.org/wiki/R_Programming/Text_Processing
http://data.library.virginia.edu/
reading-pdf-files-into-r-for-text-mining/
http://www.katrinerk.com/courses/
words-in-a-haystack-an-introductory-statistics-course/
schedule-words-in-a-haystack/
r-code-the-text-mining-package
tm package

More Related Content

Similar to Visual Analytics for Linguistics - Day 4 ESSLLI - structured data

Jingjing Chen Resume_Data Analyst
Jingjing Chen Resume_Data AnalystJingjing Chen Resume_Data Analyst
Jingjing Chen Resume_Data AnalystJingjing Chen
 
Dynamic Search Using Semantics & Statistics
Dynamic Search Using Semantics & StatisticsDynamic Search Using Semantics & Statistics
Dynamic Search Using Semantics & StatisticsPaul Hofmann
 
IRIS.TV Talks Future of Video Personalization at Cross Campus LA
IRIS.TV Talks Future of Video Personalization at Cross Campus LAIRIS.TV Talks Future of Video Personalization at Cross Campus LA
IRIS.TV Talks Future of Video Personalization at Cross Campus LAIRIS.TV
 
A Comprehensive Learning Path to Become a Data Science 2021.pptx
A Comprehensive Learning Path to Become a Data Science 2021.pptxA Comprehensive Learning Path to Become a Data Science 2021.pptx
A Comprehensive Learning Path to Become a Data Science 2021.pptxRajSingh512965
 
Workshop nwav 47 - LVS - Tool for Quantitative Data Analysis
Workshop nwav 47 - LVS - Tool for Quantitative Data AnalysisWorkshop nwav 47 - LVS - Tool for Quantitative Data Analysis
Workshop nwav 47 - LVS - Tool for Quantitative Data AnalysisOlga Scrivner
 
fINAL Lesson_1_Course_Introduction_v1.pptx
fINAL Lesson_1_Course_Introduction_v1.pptxfINAL Lesson_1_Course_Introduction_v1.pptx
fINAL Lesson_1_Course_Introduction_v1.pptxdataKarthik
 
Language Variation Suite - interactive toolkit for quantitative analysis
Language Variation Suite - interactive toolkit for quantitative analysisLanguage Variation Suite - interactive toolkit for quantitative analysis
Language Variation Suite - interactive toolkit for quantitative analysisOlga Scrivner
 
GluonNLP MXNet Meetup-Aug
GluonNLP MXNet Meetup-AugGluonNLP MXNet Meetup-Aug
GluonNLP MXNet Meetup-AugChenguang Wang
 
GluonNLP: A Deep Learning Toolkit for NLP Practitioners
GluonNLP: A Deep Learning Toolkit for NLP PractitionersGluonNLP: A Deep Learning Toolkit for NLP Practitioners
GluonNLP: A Deep Learning Toolkit for NLP PractitionersApache MXNet
 
Your learning ecosystem
Your learning ecosystemYour learning ecosystem
Your learning ecosystemNetDimensions
 
Jisc learning analytics MASHEIN Jan 2017
Jisc learning analytics MASHEIN Jan 2017Jisc learning analytics MASHEIN Jan 2017
Jisc learning analytics MASHEIN Jan 2017Paul Bailey
 
Vasian Markollari Resume 2016
Vasian Markollari Resume 2016Vasian Markollari Resume 2016
Vasian Markollari Resume 2016Vasian Markollari
 
Visual Querying LOD sources with LODeX
 Visual Querying LOD sources with LODeX Visual Querying LOD sources with LODeX
Visual Querying LOD sources with LODeXFabio Benedetti
 
Maoye resume 2017_1_v10_short
Maoye resume 2017_1_v10_shortMaoye resume 2017_1_v10_short
Maoye resume 2017_1_v10_shortMao Ye
 
Language Technologies for Geomatics: From Intelligence to Agility
Language Technologies for Geomatics: From Intelligence to AgilityLanguage Technologies for Geomatics: From Intelligence to Agility
Language Technologies for Geomatics: From Intelligence to AgilityVisionGEOMATIQUE2014
 
Workshop: Data Visualization for Corpus Linguistics via Shiny Framework
Workshop: Data Visualization for Corpus Linguistics via Shiny FrameworkWorkshop: Data Visualization for Corpus Linguistics via Shiny Framework
Workshop: Data Visualization for Corpus Linguistics via Shiny FrameworkOlga Scrivner
 

Similar to Visual Analytics for Linguistics - Day 4 ESSLLI - structured data (20)

Jingjing Chen Resume_Data Analyst
Jingjing Chen Resume_Data AnalystJingjing Chen Resume_Data Analyst
Jingjing Chen Resume_Data Analyst
 
Dynamic Search Using Semantics & Statistics
Dynamic Search Using Semantics & StatisticsDynamic Search Using Semantics & Statistics
Dynamic Search Using Semantics & Statistics
 
PotterResume 2016A
PotterResume 2016APotterResume 2016A
PotterResume 2016A
 
IRIS.TV Talks Future of Video Personalization at Cross Campus LA
IRIS.TV Talks Future of Video Personalization at Cross Campus LAIRIS.TV Talks Future of Video Personalization at Cross Campus LA
IRIS.TV Talks Future of Video Personalization at Cross Campus LA
 
A Comprehensive Learning Path to Become a Data Science 2021.pptx
A Comprehensive Learning Path to Become a Data Science 2021.pptxA Comprehensive Learning Path to Become a Data Science 2021.pptx
A Comprehensive Learning Path to Become a Data Science 2021.pptx
 
Resume_2015
Resume_2015Resume_2015
Resume_2015
 
Workshop nwav 47 - LVS - Tool for Quantitative Data Analysis
Workshop nwav 47 - LVS - Tool for Quantitative Data AnalysisWorkshop nwav 47 - LVS - Tool for Quantitative Data Analysis
Workshop nwav 47 - LVS - Tool for Quantitative Data Analysis
 
fINAL Lesson_1_Course_Introduction_v1.pptx
fINAL Lesson_1_Course_Introduction_v1.pptxfINAL Lesson_1_Course_Introduction_v1.pptx
fINAL Lesson_1_Course_Introduction_v1.pptx
 
Language Variation Suite - interactive toolkit for quantitative analysis
Language Variation Suite - interactive toolkit for quantitative analysisLanguage Variation Suite - interactive toolkit for quantitative analysis
Language Variation Suite - interactive toolkit for quantitative analysis
 
GluonNLP MXNet Meetup-Aug
GluonNLP MXNet Meetup-AugGluonNLP MXNet Meetup-Aug
GluonNLP MXNet Meetup-Aug
 
GluonNLP: A Deep Learning Toolkit for NLP Practitioners
GluonNLP: A Deep Learning Toolkit for NLP PractitionersGluonNLP: A Deep Learning Toolkit for NLP Practitioners
GluonNLP: A Deep Learning Toolkit for NLP Practitioners
 
Your learning ecosystem
Your learning ecosystemYour learning ecosystem
Your learning ecosystem
 
Jisc learning analytics MASHEIN Jan 2017
Jisc learning analytics MASHEIN Jan 2017Jisc learning analytics MASHEIN Jan 2017
Jisc learning analytics MASHEIN Jan 2017
 
Vasian Markollari Resume 2016
Vasian Markollari Resume 2016Vasian Markollari Resume 2016
Vasian Markollari Resume 2016
 
Newest mmis resume
Newest mmis  resumeNewest mmis  resume
Newest mmis resume
 
Visual Querying LOD sources with LODeX
 Visual Querying LOD sources with LODeX Visual Querying LOD sources with LODeX
Visual Querying LOD sources with LODeX
 
Maoye resume 2017_1_v10_short
Maoye resume 2017_1_v10_shortMaoye resume 2017_1_v10_short
Maoye resume 2017_1_v10_short
 
Resume
ResumeResume
Resume
 
Language Technologies for Geomatics: From Intelligence to Agility
Language Technologies for Geomatics: From Intelligence to AgilityLanguage Technologies for Geomatics: From Intelligence to Agility
Language Technologies for Geomatics: From Intelligence to Agility
 
Workshop: Data Visualization for Corpus Linguistics via Shiny Framework
Workshop: Data Visualization for Corpus Linguistics via Shiny FrameworkWorkshop: Data Visualization for Corpus Linguistics via Shiny Framework
Workshop: Data Visualization for Corpus Linguistics via Shiny Framework
 

More from Olga Scrivner

Engaging Students Competition and Polls.pptx
Engaging Students Competition and Polls.pptxEngaging Students Competition and Polls.pptx
Engaging Students Competition and Polls.pptxOlga Scrivner
 
HICSS ATLT: Advances in Teaching and Learning Technologies
HICSS ATLT: Advances in Teaching and Learning TechnologiesHICSS ATLT: Advances in Teaching and Learning Technologies
HICSS ATLT: Advances in Teaching and Learning TechnologiesOlga Scrivner
 
The power of unstructured data: Recommendation systems
The power of unstructured data: Recommendation systemsThe power of unstructured data: Recommendation systems
The power of unstructured data: Recommendation systemsOlga Scrivner
 
Cognitive executive functions and Opioid Use Disorder
Cognitive executive functions and Opioid Use DisorderCognitive executive functions and Opioid Use Disorder
Cognitive executive functions and Opioid Use DisorderOlga Scrivner
 
Introduction to Web Scraping with Python
Introduction to Web Scraping with PythonIntroduction to Web Scraping with Python
Introduction to Web Scraping with PythonOlga Scrivner
 
Call for paper Collaboration Systems and Technology
Call for paper Collaboration Systems and TechnologyCall for paper Collaboration Systems and Technology
Call for paper Collaboration Systems and TechnologyOlga Scrivner
 
Jupyter machine learning crash course
Jupyter machine learning crash courseJupyter machine learning crash course
Jupyter machine learning crash courseOlga Scrivner
 
R and RMarkdown crash course
R and RMarkdown crash courseR and RMarkdown crash course
R and RMarkdown crash courseOlga Scrivner
 
The Impact of Language Requirement on Students' Performance, Retention, and M...
The Impact of Language Requirement on Students' Performance, Retention, and M...The Impact of Language Requirement on Students' Performance, Retention, and M...
The Impact of Language Requirement on Students' Performance, Retention, and M...Olga Scrivner
 
If a picture is worth a thousand words, Interactive data visualizations are w...
If a picture is worth a thousand words, Interactive data visualizations are w...If a picture is worth a thousand words, Interactive data visualizations are w...
If a picture is worth a thousand words, Interactive data visualizations are w...Olga Scrivner
 
Introduction to Interactive Shiny Web Application
Introduction to Interactive Shiny Web ApplicationIntroduction to Interactive Shiny Web Application
Introduction to Interactive Shiny Web ApplicationOlga Scrivner
 
Introduction to Overleaf Workshop
Introduction to Overleaf WorkshopIntroduction to Overleaf Workshop
Introduction to Overleaf WorkshopOlga Scrivner
 
R crash course for Business Analytics Course K303
R crash course for Business Analytics Course K303R crash course for Business Analytics Course K303
R crash course for Business Analytics Course K303Olga Scrivner
 
Gender Disparity in Employment and Education
Gender Disparity in Employment and EducationGender Disparity in Employment and Education
Gender Disparity in Employment and EducationOlga Scrivner
 
CrashCourse: Python with DataCamp and Jupyter for Beginners
CrashCourse: Python with DataCamp and Jupyter for BeginnersCrashCourse: Python with DataCamp and Jupyter for Beginners
CrashCourse: Python with DataCamp and Jupyter for BeginnersOlga Scrivner
 
Data Analysis and Visualization: R Workflow
Data Analysis and Visualization: R WorkflowData Analysis and Visualization: R Workflow
Data Analysis and Visualization: R WorkflowOlga Scrivner
 
Reproducible visual analytics of public opioid data
Reproducible visual analytics of public opioid dataReproducible visual analytics of public opioid data
Reproducible visual analytics of public opioid dataOlga Scrivner
 
Building Effective Visualization Shiny WVF
Building Effective Visualization Shiny WVFBuilding Effective Visualization Shiny WVF
Building Effective Visualization Shiny WVFOlga Scrivner
 
Building Shiny Application Series - Layout and HTML
Building Shiny Application Series - Layout and HTMLBuilding Shiny Application Series - Layout and HTML
Building Shiny Application Series - Layout and HTMLOlga Scrivner
 
Introduction to R - from Rstudio to ggplot
Introduction to R - from Rstudio to ggplotIntroduction to R - from Rstudio to ggplot
Introduction to R - from Rstudio to ggplotOlga Scrivner
 

More from Olga Scrivner (20)

Engaging Students Competition and Polls.pptx
Engaging Students Competition and Polls.pptxEngaging Students Competition and Polls.pptx
Engaging Students Competition and Polls.pptx
 
HICSS ATLT: Advances in Teaching and Learning Technologies
HICSS ATLT: Advances in Teaching and Learning TechnologiesHICSS ATLT: Advances in Teaching and Learning Technologies
HICSS ATLT: Advances in Teaching and Learning Technologies
 
The power of unstructured data: Recommendation systems
The power of unstructured data: Recommendation systemsThe power of unstructured data: Recommendation systems
The power of unstructured data: Recommendation systems
 
Cognitive executive functions and Opioid Use Disorder
Cognitive executive functions and Opioid Use DisorderCognitive executive functions and Opioid Use Disorder
Cognitive executive functions and Opioid Use Disorder
 
Introduction to Web Scraping with Python
Introduction to Web Scraping with PythonIntroduction to Web Scraping with Python
Introduction to Web Scraping with Python
 
Call for paper Collaboration Systems and Technology
Call for paper Collaboration Systems and TechnologyCall for paper Collaboration Systems and Technology
Call for paper Collaboration Systems and Technology
 
Jupyter machine learning crash course
Jupyter machine learning crash courseJupyter machine learning crash course
Jupyter machine learning crash course
 
R and RMarkdown crash course
R and RMarkdown crash courseR and RMarkdown crash course
R and RMarkdown crash course
 
The Impact of Language Requirement on Students' Performance, Retention, and M...
The Impact of Language Requirement on Students' Performance, Retention, and M...The Impact of Language Requirement on Students' Performance, Retention, and M...
The Impact of Language Requirement on Students' Performance, Retention, and M...
 
If a picture is worth a thousand words, Interactive data visualizations are w...
If a picture is worth a thousand words, Interactive data visualizations are w...If a picture is worth a thousand words, Interactive data visualizations are w...
If a picture is worth a thousand words, Interactive data visualizations are w...
 
Introduction to Interactive Shiny Web Application
Introduction to Interactive Shiny Web ApplicationIntroduction to Interactive Shiny Web Application
Introduction to Interactive Shiny Web Application
 
Introduction to Overleaf Workshop
Introduction to Overleaf WorkshopIntroduction to Overleaf Workshop
Introduction to Overleaf Workshop
 
R crash course for Business Analytics Course K303
R crash course for Business Analytics Course K303R crash course for Business Analytics Course K303
R crash course for Business Analytics Course K303
 
Gender Disparity in Employment and Education
Gender Disparity in Employment and EducationGender Disparity in Employment and Education
Gender Disparity in Employment and Education
 
CrashCourse: Python with DataCamp and Jupyter for Beginners
CrashCourse: Python with DataCamp and Jupyter for BeginnersCrashCourse: Python with DataCamp and Jupyter for Beginners
CrashCourse: Python with DataCamp and Jupyter for Beginners
 
Data Analysis and Visualization: R Workflow
Data Analysis and Visualization: R WorkflowData Analysis and Visualization: R Workflow
Data Analysis and Visualization: R Workflow
 
Reproducible visual analytics of public opioid data
Reproducible visual analytics of public opioid dataReproducible visual analytics of public opioid data
Reproducible visual analytics of public opioid data
 
Building Effective Visualization Shiny WVF
Building Effective Visualization Shiny WVFBuilding Effective Visualization Shiny WVF
Building Effective Visualization Shiny WVF
 
Building Shiny Application Series - Layout and HTML
Building Shiny Application Series - Layout and HTMLBuilding Shiny Application Series - Layout and HTML
Building Shiny Application Series - Layout and HTML
 
Introduction to R - from Rstudio to ggplot
Introduction to R - from Rstudio to ggplotIntroduction to R - from Rstudio to ggplot
Introduction to R - from Rstudio to ggplot
 

Recently uploaded

Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service LucknowAminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknowmakika9823
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
Digi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxDigi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxTanveerAhmed817946
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
Predicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationPredicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationBoston Institute of Analytics
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一ffjhghh
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 

Recently uploaded (20)

Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service LucknowAminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
Digi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxDigi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptx
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
Predicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationPredicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project Presentation
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 

Visual Analytics for Linguistics - Day 4 ESSLLI - structured data

  • 1. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Visual Analytics for Linguistics - Day 4 Olga Scrivner
  • 2. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis What You Will Learn DAY 1 Introduction to Visual Analytics DAY 2 Visualization Methods, Design, and Tools DAY 3 Working with Unstructured Data DAY 4 Working with Structured Data DAY 5 Advanced Analytics
  • 3. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Our Materials - Web Site http: //obscrivn.wixsite.com/visualization
  • 4. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis What We Need Interactive Text Mining Suite Language Variation Suite Download R code Download files from Day 3 and Day 4 R and Rstudio R libraries: tm, party, partykit
  • 5. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis What We Need Interactive Text Mining Suite Language Variation Suite Download R code Download files from Day 3 and Day 4 R and Rstudio R libraries: tm, party, partykit
  • 6. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Text Mining Review Intro to ITMS - slides Day 3 R coding
  • 7. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Text Mining Review 1. Open RStudio 2. Open R file textmining.R: 3. Set up working directory
  • 8. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Text Mining Package - tm Main structure - corpus Corpus is constructed via DirSource, VectorSource, DataframeSource mycorpus <- Corpus(VectorSource(file.txt))
  • 9. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Preprocessing with tm lower case mycorpus <- tm_map(mycorpus, tolower) remove punctuation mycorpus <- tm_map(mycorpus, removePunctuation) remove numbers mycorpus <- tm_map(mycorpus, removeNumbers) remove stopwords mycorpus <- tm_map(mycorpus, removeWords, stopwords(’english’)) mycorpus <- tm_map(mycorpus, stripWhitespace) mycorpus <- tm_map(mycorpus, PlainTextDocument)
  • 10. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis What is LVS? Language Variation Suite It is a Shiny web application originally designed for data analysis in sociolinguistic research. It can be used for: Processing spreadsheet data Visualizing data Analyzing means, regression, conditional trees ... (and much more)
  • 11. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Background LVS is built in R using Shiny package: 1. R - a free programming language for statistical computing and graphics 2. Shiny App - a web application framework for R Computational power of R + Web interactivity
  • 12. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Background http://littleactuary.github.io/blog/Web-application-framework-with-Shiny/
  • 13. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Workspace Browser Chrome, Firefox, Safari - recommendable Explorer may cause instability issues Accessibility PC, Mac, Linux Data files will be uploaded from any location on your computer Smart Phone Data files must be on a cloud platform connected to your phone account (e.g. dropbox)
  • 14. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Server Since LVS is hosted on a server, Shiny idle time-out settings may stop application when it is left inactive (it will grey out). Solution: Click reload and re-upload your csv file
  • 15. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Data Preparation Important things to consider before data entry: File format: Comma separated value (CSV) - faster processing Excel format will slow processing Column names should not contain spaces Permitted: non-accented characters, numbers, underscore, hyphen, and period One column must contain your dependent variable The rest of the columns contain independent variables
  • 16. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Terminology Review a. Categorical - non-numerical data with two values yes - no; male - female b. Continuous - numerical data duration, age, year c. Multinomial - non-numerical data with three or more values regions, nationalities d. Ordinal - scale: currently not supported
  • 17. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Terminology Review a. Categorical - non-numerical data with two values yes - no; male - female b. Continuous - numerical data duration, age, year c. Multinomial - non-numerical data with three or more values regions, nationalities d. Ordinal - scale: currently not supported
  • 18. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Workshop Files 1. http://obscrivn.wixsite.com/visualization Download file Day 4: movie_metadata.csv Simplified set from https://www.kaggle.com/ deepmatrix/imdb-5000-movie-dataset 2. LVS web site: http://languagevariationsuite.com/ http://cl.indiana.edu/~obscrivn/docs/movie_metadata.csv
  • 19. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Movie Data Budget Director Actor 1 Director facebook likes Actor 1 facebook likes Genre Year
  • 20. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Language Variation Suite - Structure 1. Data Upload file, data summary, adjust data, cross tabulation 2. Visual Analysis Plotting, cluster classification 3. Inferential Statistics Modeling, regression, conditional trees, random forest
  • 21. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Language Variation Suite - Structure 1. Data Upload file, data summary, adjust data, cross tabulation 2. Visual Analysis Plotting, cluster classification 3. Inferential Statistics Modeling, regression, conditional trees, random forest
  • 22. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Upload File Upload movie_metadata.csv
  • 23. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Uploaded Dataset The data content is imported as a table and allows for sorting columns.
  • 24. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Summary Summary provides a quantitative summary for each variable, e.g. frequency count, mean, median.
  • 25. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Data Structure 1. Total number of observations (rows) 2. Number of variables (columns) 3. Variable types Factor - categorical values Num - numeric values (0.95, 1.05) Int - integer values (1, 2, 3)
  • 26. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Cross Tabulation Cross-tabulation examines the relationship between variables.
  • 27. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Cross Tabulation Plot
  • 28. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Cross Tabulation Plot
  • 29. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Adjusting Browser - Layout Shiny pages are fluid and reactive.
  • 30. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Adjusting Browser - Layout
  • 31. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Language Variation Suite - Structure 1. Data Upload file, data summary, adjust data, cross tabulation 2. Visual Analysis Plotting, cluster classification 3. Inferential statistics Modeling, regression, varbrul analysis, conditional trees, random forest
  • 32. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis One Variable Plot
  • 33. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis One Variable Plot
  • 34. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Customizing Plot
  • 35. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Customizing Plot
  • 36. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Saving Plot
  • 37. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Cluster Plot Classification of data into sub-groups is based on pairwise similarities Groups are clustered in the form of a tree-like dendrogram
  • 38. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Cluster Plot
  • 39. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Cluster Plot Group 1 Animation, Biography and Group 2 Action, Drama, Comedy
  • 40. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Inferential Statistics
  • 41. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Language Variation Suite - Structure 1. Data Upload file, data summary, adjust data, cross tabulation 2. Visual Analysis Plotting, cluster classification 3. Inferential statistics Modeling, regression, varbrul analysis, conditional trees, random forest
  • 42. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis How to Create a Regression Model Step 1 Modeling - create a model with dependent and independent variables Step 2 Regression - specify the type of regression (fixed, mixed) and type of dependent variable (binary, continuous, multinomial) Step 3 Stepwise Regression - compare models (Log-likelihood, AIC, BIC) Step 4 Conditional Trees - apply non-parametric tests to the model
  • 43. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Modeling
  • 44. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Regression Types Model a.) Fixed effect b.) Mixed effect - individual speaker/token variation (within group) Type of Dependent Variable a.) Binary/categorical (only two values) b.) Continuous (numeric) c.) Multinomial - categorical with more than two values
  • 45. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Regression
  • 46. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Model Output
  • 47. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Model Output
  • 48. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Interpretation: Budget and Genre Genre Action is the reference value Positive coefficient - positive effect Negative coefficient - negative effect http://www.free-online-calculator-use.com/scientific-notation-converter.html
  • 49. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Interpretation: Budget and Genre Genre Action is the reference value Positive coefficient - positive effect Negative coefficient - negative effect http://www.free-online-calculator-use.com/scientific-notation-converter.html exponential notation: 1.46e-7 0.000000146
  • 50. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Conditional Tree Conditional tree: a simple non-parametric regression analysis, commonly used in social and psychological studies Linear regression: all information is combined linearly Conditional tree regression: visual splitting to capture interaction between variables Recursive splitting (tree branches)
  • 51. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Conditional Tree
  • 52. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Conditional Tree
  • 53. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Conditional Tree 1. Genre is the significant factor for budget 2. Budget distribution is split in two groups: Action and Animation Biography, Comedy and Drama 3. Budget is significantly higher for Animation and Action
  • 54. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Random Forest 1. Variable importance for predictors 2. Robust technique with small n large p data 3. All predictors considered jointly (allows for inclusion of correlated factors)
  • 55. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Random Forest Let’s add more factors! Return to Modeling Add independent factors: director facebook likes, actor 1 facebook likes, title year
  • 56. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Random Forest
  • 57. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Random Forest Genre is the most important predictor for this model. Close to zero or red-dotted line (cut off values) - irrelevant for this model
  • 58. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Assignment Select LVS or ITMS Upload your own file (csv for LVS or txt/pdf for ITMS) Explore your data
  • 59. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis References I Baayen, Harald. 2008. Analyzing linguistic data: A practical introduction to statistics. Cambridge: Cambridge University Press Gries, Stefan Th. 2015. Quantitative designs and statistical techniques. In Douglas Biber Randi Reppen (eds.), The Cambridge Handbook of English Corpus Linguistics. Cambridge: Cambridge University Press Schnapp, Jeffrey, and Peter Presner. 2009. Digital Humanities Manifesto 2.0. http://gifsanimados.espaciolatino.com/x_bob_esponja_8.gif https://daniellestolt.files.wordpress.com/2013/01/are-you-ready1.gif http://www.martijnwieling.nl/R/sheets.pdf
  • 60. Visual Analytics for Linguistics - Day 4 Olga Scrivner Course Info tm Package Statistical Visualization Data Preparation LVS Working with Data Visual Analytics Inferential Analysis Resources http://www.rdatamining.com/examples/text-mining https: //en.wikibooks.org/wiki/R_Programming/Text_Processing http://data.library.virginia.edu/ reading-pdf-files-into-r-for-text-mining/ http://www.katrinerk.com/courses/ words-in-a-haystack-an-introductory-statistics-course/ schedule-words-in-a-haystack/ r-code-the-text-mining-package tm package