From Sound to Discourse: Computer-based modelling of large-scale data to study linguistic change over time
IdEX Emergence en Recherche, January 2021 – December 2022
PI – Ioana Chitoran
Project team
- Ioana Chitoran, Giuseppina Turco, Hiyon Yoo (Université de Paris)
- Ioana Vasilescu, Lori Lamel (LISN – Paris Saclay)
- Elisabeth Degand, Anne-Catherine Simon (Université Catholique de Louvain)
Résumé
The Son – Discours project proposes a multi-level analysis of spoken language at the interface of linguistics and computer science. The goal is to study linguistic variation: (a) at the fine-scale level of sound structure (phonetic and phonological variation); (b) at the larger scale of discourse (variation in the organization of discourse). The working hypothesis is that language is variable, but variation is not random. Variable patterns develop within the context of a linguistic system which is both flexible and constraining. Some patterns may remain circumscribed to the system and disappear, others may evolve by moving further away from the system, leading to a linguistic change. Understanding how variation emerges, and what factors may favor or constrain it, is the general goal of our project. We propose a hybrid methodology, where the hypothesis is tested by both behavioral experimental methods and automatic classification techniques borrowed from automatic speech recognition, in joint, incremental approaches combining linguistic analysis and automatic processing. We use machine learning methods derived from data mining to test the likelihood that a given variation pattern may be related to linguistic change.
Phonetics and Phonology in Europe – PaPE 2021
Workshop: From speech technology to big data phonetics and phonology: A win-win paradigm
À lire aussi
![Politiques linguistiques en Europe – Séminaire de recherche, 2025-2026](https://clillac-arp.u-paris.fr/wp-content/uploads/sites/15/2024/04/LOGOS-OEP-Ministere-Culture-DGLFLF-1080x675.jpg)
Politiques linguistiques en Europe – Séminaire de recherche, 2025-2026
Logos de l'Observatoire Européen du plurilinguisme et de la Délégation générale à la langue française et aux langues de France (DGLFLF) Le professeur José Carlos Herreras anime régulièrement un séminaire de recherche intitulé Poli-tiques linguistiques en Europe....
![Journées d’Étude de la Communication Culturelle et des Innovations Numériques](https://clillac-arp.u-paris.fr/wp-content/uploads/sites/15/2025/02/JECommNumerique-1080x675.jpg)
Journées d’Étude de la Communication Culturelle et des Innovations Numériques
![The 13th International Conference on Technical Communication](https://clillac-arp.u-paris.fr/wp-content/uploads/sites/15/2023/09/Appels_a-1080x675.jpg)
The 13th International Conference on Technical Communication
"Empowering Technical Communicators: Skills, Growth, and Education" March 7, 2025 Contact Ismael RAMOS RUIZ Call for Papers In a field constantly shaped by technological advancements, technical communication professionals must hone their skills continuously. The roles...
![TransQuest (2022-)](https://clillac-arp.u-paris.fr/wp-content/uploads/sites/15/2022/10/Colloque-1080x675.jpg)
TransQuest (2022-)
Ce projet analyse le rôle des questions dans la transmission du savoir dans les TED talks. Il propose une analyse contrastive et multimodale des enchaînements questions-réponses dans les TED talks en anglais et en français. Porteur Agnès Celle Résumé L'objectif est...