Cet évènement est passé.

NLP seminar: « Discrete Representation Models for Speech Data: Semantic vs. Acoustic Tokens »

Name: NLP seminar: « Discrete Representation Models for Speech Data: Semantic vs. Acoustic Tokens »
Start: 2024-10-07T10:00:00+02:00
End: 2024-10-07T11:00:00+02:00
Location: A008

7 octobre 2024 @ 10:00 am - 11:00 am

The next NLP seminar, « Discrete Representation Models for Speech Data: Semantic vs. Acoustic Tokens » organized by Multispeech, Sémagramme, and Synalp teams, will take place at Loria on October 7, at 10 am, in room A008.

Speaker: Robin San Roman, Meta AI Paris & Multispeech.

Abstract: Discrete representation models for audio have emerged as a way to apply natural language processing techniques, such as language models (LMs), to the speech modality. We will discuss two families of representations: semantic tokens and acoustic tokens. Semantic tokens capture the meaning of speech at a high level, while acoustic tokens represent the raw audio signal. We will examine the pros and cons of each approach for speech representation, compression, and modelling. By understanding the strengths and limitations of these representations, we can develop approaches that bridge the gap between semantic and acoustic representations.

Détails

Date : 7 octobre 2024
Heure :
10:00 am - 11:00 am
Catégorie d’Évènement: Séminaire

Lieu

A008

NLP seminar: « Discrete Representation Models for Speech Data: Semantic vs. Acoustic Tokens »

Détails

Lieu

À propos

Contact

L’actualité du Loria

Accès privé