CHILDES Overview - Basic

An introduction to the Child Language Data Exchange System (CHILDES), with basic tutorials on using the CLAN program it provides
to study conversational interactions for research.

1. What is CHILDES? (Slide 2)
2. Why do we need CHILDES? (Slide 7)
3. Who started CHILDES? (Slide 10)
4. Why do we need a lot of data with CHILDES? (Slide 16)
5. Where is CHILDES? (Slide 18)
6. How can I get the latest info on CHILDES? (Slide 21)
7. What tools does CHILDES provide? (Slide 23)
8. What related software do I need? (Slide 25)
9. Where is the CHILDES program? (Slide 30)
10. I want to install CLAN on Windows. HOW? (Slide 34)
11. I just want to study the available language database. How? (Slide 39)
12. How can I read transcripts together with audio & video files? (Slide 49)
13. I want to search for words or language structures across various corpora for research. HOW? (Slide 52)
14. Can you introduce some useful COMMANDS? (Slide 55)
15. WOW! Can I create a language corpus for my own kids with CHILDES? (Slide 64)

Website: http://childes.psy.cmu.edu

Published in: Economy & Finance, Technology

  1. 1. CHILDES SYSTEM OVERVIEW - BASIC -
  2. 2. Question 1: What is CHILDES? (Photo source: http://www.sxc.hu/photo/740583 ; royalty-free under usage option)
  3. 5. Child Language Data Exchange System
  4. 6. CHIld Language Data Exchange System
  5. 7. Question 2: Why do we need CHILDES?
  6. 8. Because... you want to study what I say; you want to investigate languages. (Photo source: http://www.flickr.com/photos/klapow/203398273/)
  7. 9. CHILDES provides tools for studying conversational interactions
  8. 10. Question 3: Who started CHILDES?
  9. 11. Founded in 1984 (Concord, MA). Department of Psychology, Carnegie Mellon University.
  10. 12. The team. Director: Brian MacWhinney (contact: [email_address]). Programmers: Leonid Spektor, Franklin Chen.
  11. 13. 4,500 members
  12. 14. 130 corpora
  13. 15. 1,500 published articles
  14. 16. Question 4: Why do we need a lot of data with CHILDES?
  15. 17. We need LOTS of DATA. WHY? Universals and differences. (Photo source: http://www.flickr.com/photos/alvy/69385239/)
  16. 18. Question 5: Where is CHILDES?
  17. 19. Visit the CHILDES website at http://childes.psy.cmu.edu
  18. 21. Question 6: How can I get the latest info on CHILDES?
  19. 22. Subscribe to the CHILDES Mailing Lists now!
  20. 23. Question 7: What tools does CHILDES provide?
  21. 24. The CHILDES system provides tools for studying conversational interactions, including: a transcript database; programs for transcript analysis; methods for linguistic coding; systems for audio and video linking.
  22. 25. Question 8: What related software do I need?
  23. 26. BEFORE installing, you should have: QuickTime Player (to read the media files); Adobe Reader (to view the manual); WinZip (to unzip the corpus); Unicode fonts: Arial, FixedSys (to display the characters).
  24. 27. Download Unicode fonts - STEP ONE
  25. 28. Download Unicode fonts - STEP TWO
  26. 29. Download Unicode fonts - STEP THREE
  27. 30. Question 9: Where is the CHILDES program?
  28. 31. The program available at CHILDES is called CLAN
  29. 32. Download CLAN
  30. 33. 4 versions are available (some no longer supported): ClanWin, ClanX + ClanXu, UnixClan
  31. 34. Question 10: I want to install CLAN on Windows. HOW?
  32. 35. Getting started on Windows: download CLAN from the "Program and Database" section. CLAN is updated frequently, so download the new version. (Photo source: http://www.flickr.com/photos/tanaka/49602421)
  33. 36. After the download: double-click the downloaded *.exe file and follow the instructions given by InstallShield.
  34. 37. Question 10 (continued): I have CLAN now. What should I do?
  35. 38. Download the manuals for details. CLAN Program Manual: how to use the CLAN program. CHAT Transcript System: how to record a conversation in the standard CHILDES format.
  36. 39. Question 11: I just want to study the available language database. How?
  37. 40. Click Database to download corpora from around the world
  38. 41. TWO ways to view the data: (a) view the corpus online using WebData; (b) download the corpus from this page and open the transcripts on your local machine: 1. Unzip the corpus into folders. 2. Use the CLAN program to open the *.cha files.
  39. 42. Local transcripts: download the audio and video files here and place them in the same folder as the transcripts. Download the bilingual corpus here.
  40. 43. e.g. Download the YipMatthews bilingual corpus. On Windows, right-click the link >> Save Target As >> choose the directory for the zip file. On Mac, click the link and it will save automatically.
  41. 44. Unzip the corpus files: right-click the downloaded corpus >> Extract Here.
  42. 45. Unzip the corpus files: 1. After extraction, the corpus folder contains one folder per child under study, named after that child. 2. Each child's folder contains *.cha files, which are the transcripts of that bilingual child.
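The unzip step can also be scripted. Below is a minimal Python sketch, not part of CLAN or the CHILDES tools: the function name `extract_corpus` and the paths in the usage comment are hypothetical, and it only assumes the layout described above (one folder per child, each holding *.cha files).

```python
import zipfile
from pathlib import Path


def extract_corpus(zip_path, target_dir):
    """Unzip a downloaded corpus and return its transcript files, sorted."""
    target = Path(target_dir)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(target)
    # Transcripts sit in per-child subfolders, so search recursively.
    return sorted(p for p in target.rglob("*") if p.suffix.lower() == ".cha")


# Hypothetical usage (paths are placeholders, not real download names):
# for transcript in extract_corpus("corpus.zip", "corpus"):
#     print(transcript.parent.name, transcript.name)
```

The returned paths can then be opened one by one in CLAN, or passed to other scripts.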
  43. 46. This is a transcript in CHAT format (*.cha file)
  44. 47. Zooming in on a transcript: FAT = father, who says “what’s bear doing?”; CHI = child, who says “writing a letter, letter”; %mor = morphological tier, which lists parts of speech: “n” is noun and “PL” is plural, so “friends” is a plural noun.
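To make the tier layout concrete, here is a rough Python sketch that groups each speaker line with the dependent tiers that follow it. It assumes only the basic CHAT conventions (speaker lines start with `*`, dependent tiers with `%`, file headers with `@`); the sample lines and their %mor tags are invented for illustration, and continuation lines are ignored for simplicity.

```python
def parse_chat(lines):
    """Group each utterance with the dependent tiers that follow it."""
    utterances = []
    for raw in lines:
        line = raw.rstrip("\n")
        if line.startswith("*"):
            # Main tier, e.g. "*CHI:<tab>writing a letter ."
            speaker, _, text = line[1:].partition(":")
            utterances.append({"speaker": speaker, "text": text.strip(), "tiers": {}})
        elif line.startswith("%") and utterances:
            # Dependent tier, e.g. "%mor:", attached to the previous utterance.
            name, _, text = line[1:].partition(":")
            utterances[-1]["tiers"][name] = text.strip()
        # Lines starting with "@" are file headers and are skipped here.
    return utterances


# Illustrative sample only; not copied from a real CHILDES transcript.
sample = [
    "@Begin",
    "*FAT:\twhat's bear doing ?",
    "*CHI:\twriting a letter .",
    "%mor:\tpart|write-PRESP det|a n|letter .",
    "@End",
]
for utt in parse_chat(sample):
    print(utt["speaker"], "->", utt["text"])
```

For real work, the CHAT manual remains the reference for the full format.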
  45. 48. Download the Manual for more
  46. 49. Question 12: How can I read transcripts together with audio & video files?
  47. 50. Playback with an audio file: 1. Put the corresponding audio and *.cha files in the same folder. 2. Open the CHA file and click Mode >> Sonic Mode >> locate the audio file. 3. The audio waveform of the sound file will appear inside the CLAN window. 4. Either press Esc+8 or click Mode >> Continuous playback.
  48. 51. Playback with a video file: 1. Put the corresponding video and *.cha files in the same folder. 2. Open the CHA file and click Mode >> Sonic Mode >> locate the video file. 3. The video player will appear inside the CLAN window. 4. Either press Esc+8 or click Mode >> Continuous playback.
  49. 52. Question 13: I want to search for words or language structures across various corpora for research. HOW?
  50. 53. Command window: 1. Click Window >> Commands, or press Ctrl+D. 2. Type the command here.
  51. 54. Basic structure of commands: a command is basically composed of 3 subparts: the command name, the tier(s) (starting with +t), and the target file name (ending with .cha or .cex). Examples: freq +t*CHI 0042.cha and mlu +t*MOT 0042.cha
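The three-part shape of a command can be shown by splitting one apart in Python. This is only an illustration of the structure described on the slide; `split_command` is a hypothetical helper, not a CLAN function, and it does not handle quoted arguments that contain spaces.

```python
def split_command(command):
    """Split a CLAN-style command line into name, tier options, and target files."""
    parts = command.split()
    name = parts[0]
    # Tier selections start with "+t", e.g. "+t*CHI" selects the *CHI tier.
    tiers = [p[2:] for p in parts[1:] if p.startswith("+t")]
    # Target files end with .cha or .cex.
    files = [p for p in parts[1:] if p.lower().endswith((".cha", ".cex"))]
    return {"name": name, "tiers": tiers, "files": files}


print(split_command("freq +t*CHI 0042.cha"))
# {'name': 'freq', 'tiers': ['*CHI'], 'files': ['0042.cha']}
```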
  52. 55. Question 14: Can you introduce some useful COMMANDS?
  53. 56. A. MLU stands for Mean Length of Utterance: the ratio of morphemes to utterances.
  54. 57. 1. Click “WORKING” and locate the files/folder by clicking SELECT DIRECTORY. 2. Type: mlu +t*CHI *.cha 3. Click “RUN”.
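The ratio itself is easy to sketch in Python. This is a simplified illustration, not CLAN's MLU program: it assumes %mor-style tokens of the form `pos|stem-SUFFIX`, counts one morpheme per word plus one per `-` suffix marker, and skips punctuation. CLAN's own counting and exclusion rules are more detailed (see the manual), and the sample tags below are invented.

```python
def count_morphemes(mor_tier):
    """Count morphemes on one %mor-style tier (simplified rule)."""
    total = 0
    for token in mor_tier.split():
        if "|" not in token:  # punctuation such as "." has no pos|stem form
            continue
        total += 1 + token.count("-")  # the stem plus each "-" suffix
    return total


def mlu(mor_tiers):
    """Mean length of utterance: total morphemes divided by number of utterances."""
    if not mor_tiers:
        return 0.0
    return sum(count_morphemes(t) for t in mor_tiers) / len(mor_tiers)


# Two illustrative utterances: "friends" (2 morphemes) and "writing a letter" (4).
tiers = ["n|friend-PL .", "part|write-PRESP det|a n|letter ."]
print(mlu(tiers))  # 3.0
```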
  55. 58. B. FREQ stands for Frequency: it counts the words used in the selected files and calculates the type–token ratio (a measure of lexical diversity).
  56. 59. 1. Click “WORKING” and locate the files/folder by clicking SELECT DIRECTORY. 2. Type: freq +t*CHI (filename).cha 3. Click “RUN”.
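For readers who want to see what a word count and type–token ratio look like, here is a small Python sketch. It is not the FREQ program: it simply lowercases words, drops punctuation-only tokens, and divides the number of distinct words (types) by the total number of words (tokens). The sample utterances are invented.

```python
from collections import Counter


def freq(utterances):
    """Return word counts and the type-token ratio for a list of utterances."""
    words = [
        w.lower()
        for u in utterances
        for w in u.split()
        if any(c.isalpha() for c in w)  # skip punctuation-only tokens
    ]
    counts = Counter(words)
    ttr = len(counts) / len(words) if words else 0.0
    return counts, ttr


counts, ttr = freq(["writing a letter .", "a letter for bear ."])
print(counts.most_common(2))  # [('a', 2), ('letter', 2)]
print(round(ttr, 3))          # 0.714 (5 types / 7 tokens)
```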
  57. 60. C. KWAL is for Keyword and Line searching: it searches the data for user-specified words and outputs those keywords in context.
  58. 61. D. COMBO is used for combination searches: a powerful program that searches the data for specified combinations of words or character strings.
  59. 62. To search a file for the words "what" + "is": 1. Click “WORKING” and locate the files/folder by clicking SELECT DIRECTORY. 2. Type: combo +t*CHI +s"what^is" (filename).cha 3. Click “RUN”.
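The idea behind a `what^is` search can be sketched in Python, reading `^` as "immediately followed by". This is a toy illustration under that assumption, not the COMBO program, which supports a much richer search syntax; the function name and sample utterances are hypothetical.

```python
def combo_followed_by(utterances, first, second):
    """Return utterances where `first` is immediately followed by `second`."""
    hits = []
    for utterance in utterances:
        words = utterance.lower().split()
        # Compare each word with its right-hand neighbour.
        if any(a == first and b == second for a, b in zip(words, words[1:])):
            hits.append(utterance)
    return hits


data = ["what is that ?", "is what here ?", "what a bear is ."]
print(combo_followed_by(data, "what", "is"))  # ['what is that ?']
```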
  60. 63. For more information on command details, two examples: combo +t*MOT +s"kitty^kitty" 0042.cha and kwal +sbunny -w2 +w2 0042.cha. +s"xx^xx" searches for specific combinations of words or character strings; the -w* and +w* options set the number of text lines included before and after the search words.
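The keyword-in-context idea behind `kwal +sbunny -w2 +w2` can be sketched as follows. This Python function is a hypothetical stand-in, not KWAL itself: `before` and `after` play the role of the -w and +w line counts, and matching is a plain whole-word comparison. The transcript lines are invented.

```python
def kwal(lines, keyword, before=0, after=0):
    """Return each matching line with `before`/`after` lines of context."""
    results = []
    for i, line in enumerate(lines):
        if keyword.lower() in line.lower().split():
            start = max(0, i - before)  # do not run past the top of the file
            results.append(lines[start:i + after + 1])
    return results


transcript = [
    "*MOT:\tlook over there .",
    "*CHI:\ta bunny !",
    "*MOT:\tyes .",
    "*CHI:\tmore .",
]
for window in kwal(transcript, "bunny", before=2, after=2):
    print("\n".join(window))
```

Here the match is on the second line, so the window holds one line before it (only one exists) and two lines after it.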
  61. 64. Question 15: WOW! Can I create a language corpus for my own kids with CHILDES?
  62. 65. http://childes.psy.cmu.edu YES!!!
  63. 66. This is the workflow: record the CHILD/INFORMANT as sound/video data; store the digitized audio/video files on the computer; transcribe to TEXT; link the transcript to the media in CLAN; run CHECK in CLAN (“Esc-L”); sound + video + transcript = corpus.
  64. 67. Question 16: What about the details, e.g. how to record data and digitize the sound & video?
  65. 68. Visit http://childes.psy.cmu.edu. CHILDES SYSTEM OVERVIEW - ADVANCED - coming soon!
  66. 69. This introduction was produced by Uta Lam using materials derived from the CHILDES website and the Bilingual Child Language Corpus contributed to CHILDES by Virginia Yip (Chinese University of Hong Kong) and Stephen Matthews (University of Hong Kong). Special thanks to Brian MacWhinney, Virginia Yip, and Stephen Matthews. Contact me at [email_address]. April 2007. I disclaim any responsibility for the photos, content displayed, and links provided in these slides. At the time of review, they were deemed valuable for these slides or their content. By the time of your visit, these slides or their content may have changed or become unavailable.
