Grf corpus project training 1

214 views

Published on

Training workshop presentation

0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
214
On SlideShare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
1
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Grf corpus project training 1

  1. 1. R A T R A I N I N G D A YGRF Corpus project
  2. 2. Sign in to the project Get your user account and log in tohttps://grfcorpus.teamworkpm.net/
  3. 3. Get the software Software download from: http://tla.mpi.nl/tools/tla-tools/elan/ Or from the project page
  4. 4. ELAN working environment ELAN project consists of 2 files .etf file Source audio file Download 2 files from teamwork 1) your personal audio file as per your task 2) standard etf template file
  5. 5. Create your new project File : new -> wav/mp3 + etf. The annotation work consists of 2 parts: 1) segmentation 2) transcription
  6. 6. Segmentation 1 Options -> segmentation mode Listen first. Different participants are recorded.
  7. 7. Segmentation 2 Start with Speaker1 - Sentence tier Each speaker separate. Fine tune boundaries Delete, move merge and split
  8. 8. Transcription 1 Options -> transcription mode Select Speech
  9. 9. Transcription 2 Listen and type
  10. 10. Transcription 3 This phase:
  11. 11. 1st copy of segmentation Options -> Annotation mode Tiers -> Create annotations ondependent tiers Speech -> JyutPing, Translation
  12. 12. More transcription Use this or transcription view to enter text For jyutping transcription use website: http://hktv.cc/hp/cantonesetojyutping/ Pay attention to spaces
  13. 13. Tokenizing Tier ->Tokenize tiers: JyutPing -> Words Adjust segments while pressing Alt
  14. 14. 2nd copy of segmentation Tier -> Create annotations on dependent tiers Words -> English Gloss, IPA, Language Language has Controlled Vocabulary: E, C, P, ?
  15. 15. Last 2 Tiers Code switching types Annotation mode Select a section with your mouse and double click Choose an option Translation Annotation mode or Transcription mode Ctrl+Enter or Configure Verbal Unit Tier
  16. 16. More participants Recreate tier structure for each participant Tier -> Add new participant -> OK Take a break and repeatthe wholetranscriptionprocess. Save your work often Try using a mouse
  17. 17. Finish Upload .eaf file to Teamwork and set the task tocomplete and upload saved file

×