December 2024

Date

Priorities, Tasks, & Assignments 🥼 Notes, Progress, or Issues 📔

🔗

Next meeting

Status: PLANNING

  •  Andrew Collins: list the drop-down options for "Best estimate dialect". Here is a draft: https://utexas.box.com/s/ktgzmnrte2t4uujmpg0uq479ew58mqk8. After consensus at the next meeting, Sonia will add a "Best estimate dialect" column to Smartsheet with all dialect options as a drop-down, intended especially for acoustic analysis (make the list, reach consensus, create the column).
  • Castilian dialect - regional variation

  • Mexico dialect

  • Puerto Rico dialect

  • Catalan - regional variation (català-central, etc.)

Rule: add the participant's dialect to the drop-down.

other info:

  • Hometown (where they grew up), from the sociodemographic questionnaire

CLIPPING

Next tasks pending to assign:


Coordination of meetings

  • Meetings needed, when

    • Connected Speech planning (structure): tasks pending (pre-R01 data, MINT, etc.; see previous notes). No meeting needed, just more support; clipping students can help.

    • Connected speech and VISTA transcription progress + Connected Reliability process

    • Connected Speech Clipping (Shayla lead)

    • Connected Speech Whisper (Whendy lead; Alejandra, Alexa): once they are trained, just check every 2 weeks how they are doing

    • Connected Speech data analysis

Code switches

  • We have noticed that in Spanish samples, code switches are sometimes transcribed as other codes (they are hard to tell apart because the languages are so similar):

    • paraphasias, fragments, xxx

    • SOLUTION:

      • PROPS1: include it within the reliability, Rater 2 should be Helena/Núria/Sonia

      • PROPS2 (CWR): Have a code-switching checker; if she notices a code switch, she marks it in the Smartsheet.

        • Alyne (Brazilian, Spanish-Catalan, can recognize code switching), non-native speaker

        • Do a blinded test (5 samples, transcripts + audio): if she misses one code switch, she is out. Take notes on the influence of Catalan in the samples (grammar, vocabulary). Target: 90% accuracy.

      • PROPS3 (SMK): Helena/Núria/SMK listen to all WAB picnic and change if there are code switches - mark it in column in Smartsheet - Decide task by task – maybe Núria first

  •  Create a column in the CATALAN, SPANISH, and ENGLISH Smartsheets to mark whether there is a code switch

Recommendation: use the chat to ask questions whenever there may be a code switch.
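The checker test in PROPS2 could be scored along these lines. This is only a sketch under assumptions the notes do not settle: "accuracy" is taken here to mean the share of reference code switches that the checker also flagged, and the sample IDs, utterance numbers, and function names are all hypothetical.

```python
def detection_rate(reference, flagged):
    """Share of reference code switches that the checker also flagged.

    Both arguments map a sample ID to a set of utterance numbers.
    Extra flags (false positives) are NOT penalized in this sketch.
    """
    total = sum(len(utts) for utts in reference.values())
    if total == 0:
        raise ValueError("reference contains no code switches")
    hits = sum(len(utts & flagged.get(sample, set()))
               for sample, utts in reference.items())
    return hits / total


def passes_test(reference, flagged, threshold=0.90):
    """True when the detection rate meets the threshold (assumed 90%)."""
    return detection_rate(reference, flagged) >= threshold


# Hypothetical data for 5 test samples.
reference = {"sample1": {3, 7}, "sample2": {1}, "sample3": set(),
             "sample4": {2, 5, 9}, "sample5": {4, 6, 8, 10}}
flagged = {"sample1": {3, 7}, "sample2": {1}, "sample3": set(),
           "sample4": {2, 5}, "sample5": {4, 6, 8, 10}}

print(detection_rate(reference, flagged))
print(passes_test(reference, flagged))
```

If the stricter rule ("one missed code switch and she is out") is the one adopted, the threshold would be 1.0 instead.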

Training - CATALAN transcriptions - Helena

Helena training complete, high reliability

  • Next steps: Diana has to give her access to all systems

  • Which samples to assign?

    • 1. CS picnic scene reliability? - 1 more sample

    • 2. VISTA samples?

Transcription questions chat

  1. How long does the question-and-answer exchange take?

  2. Is there a better system?

  3. Doubts

Discussion: effect on reliability; faster and more efficient

Have the same for Helena

The check by MOR doesn't work

Whisper

RELIABILITY CS transcriptions

  • Ana Elisa Quinonez Camacho created the CS folder structure for Spanish for Kesha’s project and for Dr. Santos' project during the week of 11/25. Note: because this process/structure was developed after reliability was run, the transcriber 1 samples are the final transcripts with consensus.

Ana Elisa Quinonez Camacho created the reliability report.

Question about where to document reliability scores for the WABs (pre and post) used for the VISTA paper.


transcription questions SYSTEM

  • teams work well, Jan excel

  • Ask a student to create a repository of doubts using https://utexas.box.com/s/t7zdxjtgspsbk2ofqvhlda85u6zx104b as a template, with 2 tabs (questions about utterance segmentation, questions about coding) and a column specifying the coding category from the wiki guide (e.g., Punctuation, Capitalization, Numbers, Titles). This way we can filter it, consult previous decisions before asking a question, answer questions consistently, and build a corpus of decisions.
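The "filter and consult previous decisions" idea can be illustrated with a small sketch. It assumes the doubts file is read into simple records with a tab, a wiki-guide category, the question, and the decision; these field names, the `find_decisions` helper, and the placeholder entries are hypothetical and are not taken from the actual template.

```python
from dataclasses import dataclass


@dataclass
class Decision:
    tab: str        # "segmentation" or "coding" (the 2 tabs)
    category: str   # wiki guide category, e.g. "Punctuation"
    question: str
    decision: str


def find_decisions(log, tab=None, category=None):
    """Return earlier decisions matching the given tab and/or category."""
    return [
        d for d in log
        if (tab is None or d.tab == tab)
        and (category is None or d.category.lower() == category.lower())
    ]


# Placeholder entries, for illustration only.
log = [
    Decision("coding", "Punctuation", "(example question 1)", "(decision 1)"),
    Decision("coding", "Numbers", "(example question 2)", "(decision 2)"),
    Decision("segmentation", "Punctuation", "(example question 3)", "(decision 3)"),
]

# Check what was already decided before posting a new question.
for d in find_decisions(log, tab="coding", category="punctuation"):
    print(d.question, "->", d.decision)
```

The same lookup can be done with the spreadsheet's own column filters; the sketch only shows why a dedicated category column makes earlier decisions easy to retrieve.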

Words not in CLAN library

example: ma (madre) detected as copula

mamí is not correct; correct it in the library

  • issues with adjectives in MOR

https://utexas.box.com/s/7cl2a29xk7ddvc2yn0woa6cvthcefyat

Decision: code as paraphasia or keep as-is

Sonia Marqués: add to the guide

TRANSCRIPTION STATUS

SPANISH

Connected Speech Picnic scene:

GENERAL DOUBTS:

  •  Confirm with Stephanie Grasso that Picnic Scene has higher priority than VISTA; for now it’s:

______________________________________________________________________________

SPANISH

  • Connected Speech Picnic scene:

    •  CS Reliability completed
  • VISTA probes:

  • Whendy as of 12/2: Finished transcribing BISE013_Post2 and is making final revisions; not yet run through CLAN. Began transcribing BISE004.

  • Ana Elisa 12/1: Currently working on BISE014_Post1. Need to revise BISE015_Post1, BISE014_Post2, and BISE015_Post2.

    • Note: Catalanism in BISE014_BACC005_LaDificultadParaHablar: *PAR: estoy haciendo ejercicios para retardar la [//] el deterioramento [: deterioro]. Decision on 12/3: code as phonological paraphasia.

  • Dr. Santos samples:

  •  All samples completed?
  •  Reliability completed?

WHISPER