28 April 2024
zoom : https://u-paris.zoom.us/j/85172751178?pwd=Zm5tZm42d0FPN0JHVWFVd3E0MkFoZz09
room 134 (first floor)
Bâtiment Olympe de Gouges
8 Place Paul Ricoeur
75013 PARIS
Directions to the Olympe de Gouges building
This informal workshop discusses various approaches to probing audio LLMs such as Whisper (Radford et al., 2022) or wav2vec 2.0 (Baevski et al., 2020).
PROVISIONAL PROGRAMME
9h Nicolas Ballier & Guillaume Wisniewski (UPCité) : introduction
We briefly present current research in the making on speech language models.
Session 1 : Speech variability and LLM response : some experiments and questions
9h15 Tori (Georgina) Fullerton (UPCité) : How do Whisper models respond to speech variability?
[This talk compares Whisper model transcriptions to human perception of VOT variation
and showcases research in the making on the detection of compact compounds in English.]
9h40 discussion
10h Richard Wright (University of Washington), Nicolas Ballier & Philippe Martin (UPCité) : resynthesising speech and analysing response to segmental and suprasegmental features
[This talk reports research in the making on vowel and pitch synthesis and on how Whisper models respond to the resynthesised speech.]
10h15 discussion
10h30 Behnoosh Namdarzadeh (UPCité) : Whisper transcriptions for lesser-resourced languages : the case of Persian
[This talk reports our findings on the transcription task for a language represented by 24 hours of speech in the training data.]
10h40 coffee break
Session 2 : Probing Speech Language Models : first experiments and results
11h00 Nicolas Ballier and Jean-Baptiste Yunès (UPCité) : a customised C++ implementation of Whisper to probe representations
[This talk presents our reverse engineering method based on the internal representations of the Whisper models.]
11h10 Hosein Mohebbi (Tilburg) : Adapting context mixing methods to explore speech transformers
[This talk presents the workflow used to analyse acoustic and linguistic representations in the EMNLP 2023 paper.]
Mohebbi, H., Chrupała, G., Zuidema, W., & Alishahi, A. (2023). Homophone Disambiguation Reveals Patterns of Context Mixing in Speech Transformers. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (pp. 8249-8260).
11h40 discussion
Session 3: So many parameters, so little time
12h00 round table
[We discuss some LLM parameters and methods for investigating audio LLMs. How can hallucinations and the effects of language models be investigated systematically?]
Antonio Balvet (Lille) : Prompt engineering for spoken transcriptions of French
Peter Uhrig (Erlangen, tbc)
Alternates : Guillaume Wisniewski and the Deeptypo team (tbc) : playing with Wav2vec and raw signal
Contact person : nicolas.ballier AT u-paris DOT fr