Visualising Typological Relationships: Plotting WALS with Heat Maps

Presented at EACL 2012.

Published in: Technology, Education

  1. Visualising Typological Relationships: Plotting WALS with Heat Maps
     Richard Littauer¹, Rory Turnbull², Alexis Palmer¹
     ¹ Universität des Saarlandes, ² Ohio State University
  2. Why?
     • Data deluge in science
     • Typology has been shown to be useful for linguistic studies (Greenberg 1963, Chomsky 2000, Dunn et al. 2001).
     • Showing typological diversity visually can help cut down on research time and illuminate new areas of possible research.
  3. Basic Overview
     • Our visualisation technique combines:
       – geographic
       – phylogenetic
       – linguistic data.
     World Atlas of Language Structures (WALS) (Dryer and Haspelmath, 2011).
  4. Previous Work
     Similar visualisation work:
     – Language Typology: Mayer et al., 2010; Rohrdantz et al., 2010
     – Phylogeny: Multitree, 2009
     – Geographical variation: Wieling et al., 2011
     Work with WALS:
     – Daumé & Campbell 2007, Daumé 2009
  5. Pruning
     WALS:
     – 2,678 languages
     – 192 feature options (out of 144 features)
     – 16% of the data filled
     Pruning:
     – 372 languages
     – Average of 96 features
     – Only languages with 30% or more of their features filled
  6. Phylogenetic Distance
     WALS' Tree Hierarchy:
     – Three different levels:
       • Family: 'Sino-Tibetan'
       • Sub-family: 'Tibeto-Burman'
       • Genus: 'Northern Naga'
     – Doesn't take into account language contact.
     – We used geographical proximity as a proxy for language contact.
  7. Geographical Proximity Filtering
     • Each language in WALS is associated with a geographical coordinate.
     • Haversine formula
     • Within limits: geography, fullness in WALS.
  8. Geographical Proximity Filtering
     • First approach:
       – Arbitrary radius from centroid in order to create a decision boundary for clustering neighbouring languages.
       – 500 kilometres provided a sufficient number of examples after cleaning WALS.
  9. Geographical Proximity Filtering
     • Second approach:
       – Arbitrary lower bound for near languages.
       – Sufficient remainder.
       – Under-representative of contact languages.
       – Not as good as the radius method.
  10. WALS Languages and Sparsity
      [Figure: languages plotted on axes labelled Longitude and Latitude. Legend "Language Family": Kiwaian, Kwomtari-Baibai, Lower Sepik-Ramu, Other, Trans-New Guinea. Legend "Feature density": 0.1 to 0.7.]
  11. Geographically Focused Map
      [Figure: heat map of features against languages.
      Features: Alignment of Verbal Person Marking; O & V Ordering and the Adposition & NP Ordering; Person Marking on Adpositions; Gender Distinctions in Independent Personal Pronouns; O & V Ordering and the Adj & N Ordering; Order of Adjective and Noun; Order of Adposition and Noun Phrase; Position of Tense-Aspect Affixes; Order of Genitive and Noun; Negative Morphemes; Position of Negative Word With Respect to S, O, & V; Postverbal Negative Morphemes; Preverbal Negative Morphemes; Order of Negative Morpheme and Verb; Order of Object and Verb.
      Languages: Arapesh (Mountain), Una, Imonda, Waskia, Amele, Usan, Kobon, Yimas, Alamblak, Kewa, Tauya, Hua, Yagaria, Dumo, Awtuw, Hamtai, Sentani.]
  12. Phylogenetic Focused Map
      [Figure: heat map of features against languages.
      Features: Comparative Constructions; Order of Degree Word and Adjective; Adjoined relative clauses; SVNegO Order; SNegVO Order; NegSVO Order; Optional Double Negation in SVO languages; Different word order in negative clauses; Order of Person Markers on the Verb; Position of Polar Question Particles; Correlative relative clauses; Reciprocal Constructions; Double-headed relative clauses; Postnominal relative clauses; Order of Numeral and Noun.
      Languages: Temne, Kisi, Grebo, Bambara, Supyire, Akan, Koromfe, Dagbani, Ewe, Yoruba, Gwari, Igbo, Babungo, Mumuye, Ewondo, Doyayo, Kongo, Gbeya Bossangoa, Sango, Luvale, Nkore-Kiga, Zulu, Swahili, Wolof, Ijo (Kolokuma), Fula (Nigerian), Diola-Fogny, Birom, Fyem.]
  13. More Maps
      [Figure: six heat map panels with the same feature rows as the Geographically Focused Map on slide 11. Some panels spell the labels out in full, e.g. "Relationship between the Order of O & V and the Order of Adp & NP" and "Position of Negative Word With Respect to Subject, Object, and Verb". Each panel shows a different selection and ordering of these languages: Arapesh (Mountain), Una, Imonda, Waskia, Amele, Usan, Kobon, Hamtai, Awtuw, Yimas, Alamblak, Kewa, Tauya, Hua, Yagaria, Dumo, Sentani, Dani (Lower Grand Valley), Asmat, Suena, Marind.]
  14. Conclusion
      • A newly applied method for looking at sparse data
      • Combines phylogenetic, geographic, and typological data
  15. Final Remarks
      Future work:
      • Integrating Ethnologue or Multitree for language families.
      • Further exploration showing more natural organisation of the linguistic features
      All code and visualisations available here: https://github.com/RichardLitt/visualizing-language
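
The pruning step on slide 5 (keep only languages with 30% or more filled) could be sketched roughly as below. This is an illustrative Python sketch, not the authors' code: the in-memory table layout, the function names `fill_ratio` and `prune`, and the choice of denominator `n_features` are all assumptions, since the slides do not say how the fill percentage was computed.

```python
from typing import Dict, Optional

# Hypothetical layout (an assumption): language name -> {feature id: value or None}.
FeatureTable = Dict[str, Dict[str, Optional[str]]]


def fill_ratio(features: Dict[str, Optional[str]], n_features: int) -> float:
    """Fraction of the n_features possible features that have a value."""
    filled = sum(1 for value in features.values() if value is not None)
    return filled / n_features


def prune(table: FeatureTable, n_features: int, min_fill: float = 0.30) -> FeatureTable:
    """Keep only languages whose fill ratio is at least min_fill."""
    return {
        lang: feats
        for lang, feats in table.items()
        if fill_ratio(feats, n_features) >= min_fill
    }


# Toy data, invented for illustration only (not real WALS values).
toy: FeatureTable = {
    "lang_a": {"f1": "x", "f2": "y", "f3": "z"},   # 3 of 10 filled
    "lang_b": {"f1": "x", "f2": None, "f3": "z"},  # 2 of 10 filled
}
kept = prune(toy, n_features=10)
print(sorted(kept))  # → ['lang_a']
```

Passing `n_features` explicitly keeps the sketch neutral about whether the threshold is measured against the 144 features or the 192 feature options mentioned on the slide.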

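The first geographical filtering approach (slides 7 and 8: Haversine distance, 500 kilometre radius from a centroid) might look something like the following. Again this is a hedged Python sketch rather than the authors' implementation: the function names, the mean Earth radius constant, and the toy coordinates are assumptions, and how the centroid is chosen is not specified on the slides.

```python
import math
from typing import Dict, List, Tuple

EARTH_RADIUS_KM = 6371.0  # mean Earth radius; an assumed constant


def haversine_km(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in kilometres between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    # min() guards against tiny floating-point overshoot above 1.0.
    return 2 * EARTH_RADIUS_KM * math.asin(min(1.0, math.sqrt(a)))


def within_radius(
    languages: Dict[str, Tuple[float, float]],
    centre: Tuple[float, float],
    radius_km: float = 500.0,
) -> List[str]:
    """Names of languages whose (lat, lon) lies within radius_km of centre."""
    clat, clon = centre
    return [
        name
        for name, (lat, lon) in languages.items()
        if haversine_km(clat, clon, lat, lon) <= radius_km
    ]


# Hypothetical coordinates, for illustration only.
points = {"near": (-5.0, 144.0), "far": (10.0, 0.0)}
neighbours = within_radius(points, centre=(-5.5, 143.5))
```

With these toy points only `near` falls inside the default 500 km radius; the radius is left as a parameter because the slides describe it as an arbitrary choice.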