The Voice of Time

Michael Falk
University of Kent and Western Sydney University
1. Australia in the Pacific
2. A Romanticist’s Contribution
3. Workflow: harvest, clean, decipher
1 | Australia in the Pacific
Source: AFP
Source: Patrick Kirch, On the Road of the Winds: An Archaeological History of the Pacific Islands before European Contact (Oakland: University of California Press, 2017), p. 6.
Sources: Fishhook map and photo: Val Attenbrow, ‘Aboriginal fishing in Port Jackson’, in The Natural History of Sydney (Sydney, 2010); Dingo photograph: Henry Whitehead - Original photograph, CC BY-SA 3.0, https://commons.wikimedia.org/w/index.php?curid=12057483; Sahul and Sundaland map: Kirch, p. 57; Linguistic data: Rachel Hendery (personal communication), and POLLEX database.
Language                 Word
Hawaiian                 kapu
NZ Maori                 tapu
Proto southern Vanuatu   *tabur
Gugu Yimidhirr           thabul
2 | A Romanticist’s Contribution
Dunlop’s transcription:
Nge a runba wonung bulkirra umbilinto bulwarra;
Pital burra kultan wirripang buntoa
Modern reconstruction (Wafer 2017, p. 204):
ngayaranpa wanang palkirr yampilintu pulwarra
pital para katan wiripang pantuwa
Dunlop’s poetic translation:
Our home is the gibber-gunyah
Where hill joins hill on high;
…
And the rushing of wings, as the wangas pass,
Sweeps the wallaby’s print from the glistening grass.
Modern literal translation (Wafer 2017, p. 206):
Ours is the place where the mountains cohabit with the heights
The eaglehawks and wallabies are happy
The Sydney Morning Herald, 11 Oct 1848
3 | Workflow: Harvest, Clean, Decipher
© BBC
[Workflow diagram: Harvest, Clean, Decipher]
Harvest: Untold Riches; Public API; Local PostgreSQL Database
Clean: Tokenised text, Clean, Cleaned tokens; Encoder-Decoder Text Correction Model, trained on the ALTA dataset of 6000 hand-corrected articles (Cassidy and Mollá, 2017)
Decipher: Language Classification Model, trained on ???; Infer
Google Colab
RNN Basics: A single time-step
[Diagram: three RNN cells for the previous, current and next time-steps, with input letters k, i, n. The current letter i enters the cell as a ‘one-hot’ vector (0, 0, 0, … 1, … 0, 0, 0). Other labels: a vector of values 0.99, 0.24, 0.01, ...; c<t-1>(n-1); c<t-1>(n); o<t>; c<t>.]
c<t>: the ‘memory cell’ at time-step t. It is updated each time and saved for the next time-step
o<t>: the ‘output’ at time-step t.
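A single time-step can be sketched in a few lines of plain Python. This is a minimal illustration, not the model used in the talk: it assumes a simple tanh cell in which the memory c<t> doubles as the hidden state (the slides do not say which cell type was used, and an LSTM or GRU would add gates). The names rnn_step, W_x, W_c, W_o and the memory size HIDDEN are hypothetical, and the weights are random rather than trained.

```python
import math
import random

ALPHABET = "abcdefghijklmnopqrstuvwxyz"
HIDDEN = 4  # hypothetical memory size, chosen only for illustration


def one_hot(ch):
    """Return a vector of zeros with a single 1 at the letter's position."""
    vec = [0.0] * len(ALPHABET)
    vec[ALPHABET.index(ch)] = 1.0
    return vec


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def random_matrix(rows, cols, rng):
    """Untrained placeholder weights."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def rnn_step(x, c_prev, W_x, W_c, W_o):
    """One time-step: combine the input x with the previous memory c<t-1>."""
    pre = [a + b for a, b in zip(matvec(W_x, x), matvec(W_c, c_prev))]
    c_t = [math.tanh(p) for p in pre]  # updated memory, saved for the next step
    o_t = matvec(W_o, c_t)             # output at this time-step
    return c_t, o_t


rng = random.Random(0)
W_x = random_matrix(HIDDEN, len(ALPHABET), rng)
W_c = random_matrix(HIDDEN, HIDDEN, rng)
W_o = random_matrix(len(ALPHABET), HIDDEN, rng)

# Feed the letters k, i, n one at a time, carrying the memory forward.
c = [0.0] * HIDDEN
for letter in "kin":
    c, o = rnn_step(one_hot(letter), c, W_x, W_c, W_o)
```

After the loop, c holds the memory after the last letter and o holds one raw score per letter of the alphabet.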
Model design #1: A general English-language model
[Diagram: a chain of RNN cells, one per character, labelled k, i, n, d, y, n, e, E, S, each feeding a softmax (σ).]
σ
The ‘softmax activation function’ guesses the next letter based on
the output of the cell, returning a vector of probabilities, e.g.:
(P(a)=0.01, P(b)=0.4, P(c)=0.02, … P(z)=0.001, P(S)=0.1, P(E)=0.002)
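The softmax step can be illustrated with a short, self-contained sketch. It assumes the vocabulary is the 26 lowercase letters plus the two symbols S and E from the slide, read here as start-of-word and end-of-word markers (an interpretation, not something the slide states). The function names and the example scores are hypothetical.

```python
import math

# 26 letters plus the special symbols S and E shown on the slide
# (assumed here to mark the start and end of a word).
VOCAB = list("abcdefghijklmnopqrstuvwxyz") + ["S", "E"]


def softmax(scores):
    """Turn raw scores into probabilities that sum to 1."""
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def next_letter_probs(cell_output):
    """Map each vocabulary symbol to its predicted probability."""
    return dict(zip(VOCAB, softmax(cell_output)))


# Made-up cell output in which 'b' has the highest raw score.
scores = [0.0] * len(VOCAB)
scores[VOCAB.index("b")] = 3.0

probs = next_letter_probs(scores)
best = max(probs, key=probs.get)  # the model's guess for the next letter
```

In a trained model the scores would come from the RNN cell's output at each time-step rather than being set by hand.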
The Problem
Model design #2: Binary classification model
[Diagram: two rows of RNN cells over the characters S, k, i, n, e, E; their outputs are concatenated and passed to a single softmax (σ).]
σ: The ‘softmax activation function’ predicts whether the whole word is English or Australian and simply outputs a two-vector, e.g.:
(P(English)=0.37, P(Australian)=0.63)
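The classification design can be sketched as follows. The two rows of cells and the "Concatenate" label are read here as a bidirectional RNN, with one pass over the word left to right and one right to left. That reading is an assumption, as are the simple tanh cell, the parameter names and the size HIDDEN. The weights are random, so the printed probabilities carry no meaning.

```python
import math
import random

ALPHABET = "abcdefghijklmnopqrstuvwxyz"
HIDDEN = 4  # hypothetical memory size


def one_hot(ch):
    vec = [0.0] * len(ALPHABET)
    vec[ALPHABET.index(ch)] = 1.0
    return vec


def matvec(matrix, vector):
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def softmax(scores):
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def random_matrix(rows, cols, rng):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def run_rnn(letters, W_x, W_c):
    """Read the letters in the given order and return the final memory."""
    c = [0.0] * HIDDEN
    for ch in letters:
        pre = [a + b for a, b in zip(matvec(W_x, one_hot(ch)), matvec(W_c, c))]
        c = [math.tanh(p) for p in pre]
    return c


def classify(word, params):
    """Concatenate forward and backward memories, then apply softmax."""
    forward = run_rnn(word, params["fwd_x"], params["fwd_c"])
    backward = run_rnn(reversed(word), params["bwd_x"], params["bwd_c"])
    features = forward + backward  # list concatenation, length 2 * HIDDEN
    p_english, p_australian = softmax(matvec(params["out"], features))
    return {"English": p_english, "Australian": p_australian}


rng = random.Random(0)
params = {
    "fwd_x": random_matrix(HIDDEN, len(ALPHABET), rng),
    "fwd_c": random_matrix(HIDDEN, HIDDEN, rng),
    "bwd_x": random_matrix(HIDDEN, len(ALPHABET), rng),
    "bwd_c": random_matrix(HIDDEN, HIDDEN, rng),
    "out": random_matrix(2, 2 * HIDDEN, rng),
}

probs = classify("kine", params)
print(probs)
```

Unlike the next-letter model, this design produces a single prediction per word, so the softmax is applied once, after the whole word has been read.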
The Voice of Time
Problems and Promises

Editor's Notes

  1. Kiribati, Cook Islands, Tonga and Solomon Islands
  2. Green = training set, blue = validation set