Selected publications
All publications
Authors marked with an asterisk (*) contributed equally.
2023
- Phuong-Hang Le, Hongyu Gong, Changhan Wang, Juan Pino, Benjamin Lecouteux, and Didier Schwab. Pre-training for Speech Translation: CTC Meets Optimal Transport. In International Conference on Machine Learning (ICML 2023, oral).
2022
Solène Evain, Ha Nguyen, Hang Le, Marcely Zanon Boito, Salima Mdhaffar, Sina Alisamir, Ziyi Tong, Natalia Tomashenko, Marco Dinarelli, Titouan Parcollet, Alexandre Allauzen, Benjamin Lecouteux, François Portet, Solange Rossato, Fabien Ringeval, Didier Schwab, Laurent Besacier. Modèles neuronaux pré-appris par auto-supervision sur des enregistrements de parole en français. In Journées d’Études sur la Parole (JEP 2022).
Hang Le, Sina Alisamir, Marco Dinarelli, Fabien Ringeval, Solène Evain, Ha Nguyen, Marcely Zanon Boito, Salima Mdhaffar, Ziyi Tong, Natalia Tomashenko, Titouan Parcollet, Alexandre Allauzen, Yannick Estève, Benjamin Lecouteux, François Portet, Solange Rossato, Didier Schwab, Laurent Besacier. LeBenchmark, un référentiel d’évaluation pour le français oral. In Journées d’Études sur la Parole (JEP 2022).
2021
Solène Evain*, Manh Ha Nguyen*, Hang Le*, Marcely Zanon Boito*, Salima Mdhaffar*, Sina Alisamir*, Ziyi Tong, Natalia Tomashenko*, Marco Dinarelli*, Titouan Parcollet*, Alexandre Allauzen, Yannick Estève, Benjamin Lecouteux, François Portet, Solange Rossato, Fabien Ringeval, Didier Schwab, Laurent Besacier. Task agnostic and task specific self-supervised learning from speech with lebenchmark. In Neural Information Processing Systems (NeurIPS 2021, Datasets and Benchmarks Track).
Hang Le, Florentin Barbier, Ha Nguyen, Natalia Tomashenko, Salima Mdhaffar, Souhir Gahbiche, Bougares Fethi, Benjamin Lecouteux, Didier Schwab, Yannick Estève. ON-TRAC’systems for the IWSLT 2021 low-resource speech translation and multilingual speech translation shared tasks. In International Conference on Spoken Language Translation (IWSLT 2021).
Solène Evain*, Manh Ha Nguyen*, Hang Le*, Marcely Zanon Boito*, Salima Mdhaffar*, Sina Alisamir*, Ziyi Tong, Natalia Tomashenko*, Marco Dinarelli*, Titouan Parcollet*, Alexandre Allauzen, Yannick Estève, Benjamin Lecouteux, François Portet, Solange Rossato, Fabien Ringeval, Didier Schwab, Laurent Besacier. LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech. In Annual Conference of the International Speech Communication Association (INTERSPEECH 2021).
Hang Le, Juan Pino, Changhan Wang, Jiatao Gu, Didier Schwab, Laurent Besacier. Lightweight Adapter Tuning for Multilingual Speech Translation. In Annual Meeting of the Association for Computational Linguistics (ACL 2021).
2020
Hang Le, Juan Pino, Changhan Wang, Jiatao Gu, Didier Schwab, Laurent Besacier. Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation. In International Conference on Computational Linguistics (COLING 2020, oral).
Hang Le, Loïc Vial, Jibril Frej, Vincent Segonne, Maximin Coavoux, Benjamin Lecouteux, Alexandre Allauzen, Benoît Crabbé, Laurent Besacier, Didier Schwab. FlauBERT: des modèles de langue contextualisés pré-entraînés pour le français. In Actes de la 6e conférence conjointe Journées d’Études sur la Parole (JEP 2020), Traitement Automatique des Langues Naturelles (TALN 2020).
2019
Hang Le, Loïc Vial, Jibril Frej, Vincent Segonne, Maximin Coavoux, Benjamin Lecouteux, Alexandre Allauzen, Benoît Crabbé, Laurent Besacier, Didier Schwab. FlauBERT: Unsupervised Language Model Pre-training for French. In The Language Resources and Evaluation Conference (LREC 2020).
Loïc Vial, Benjamin Lecouteux, Didier Schwab, Hang Le, Laurent Besacier. The LIG system for the English-Czech Text Translation Task of IWSLT 2019. In International Conference on Spoken Language Translation (IWSLT 2019).