Luong Hieu Thi's homepage

RADAR Challenge 2026: Robust Audio Deepfake Recognition under Media Transformations

Hieu-Thi Luong, Xuechen Liu, Ivan Kukanov, Zheng Xin Chai, Kong Aik Lee

APSIPA RADAR Challenge 2026

Challenge Preprint

Robust Localization of Partially Fake Speech: Metrics and Out-of-Domain Evaluation

Hieu-Thi Luong, Inbal Rimon, Haim Permuter, Kong Aik Lee, Eng Siong Chng

APSIPA 2025

Code Preprint

LlamaPartialSpoof: An LLM-Driven Fake Speech Dataset Simulating Disinformation Generation

Hieu-Thi Luong, Haoyang Li, Lin Zhang, Kong Aik Lee, Eng Siong Chng

ICASSP 2025

Samples Download Project Preprint

Room Impulse Responses help attackers to evade Deep Fake Detection

Hieu-Thi Luong, Duc-Tuan Truong, Kong Aik Lee, Eng Siong Chng

SLT 2024

Samples Download Code Preprint

Controlling Multi-Class Human Vocalization Generation via a Simple Segment-based Labeling Scheme

Hieu-Thi Luong, Junichi Yamagishi

Interspeech 2023

Samples Paper

LaughNet: synthesizing laughter utterances from waveform silhouettes and a single laughter example

Hieu-Thi Luong, Junichi Yamagishi

arXiv manuscript

Samples Code Preprint

Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance

Hieu-Thi Luong, Junichi Yamagishi

Speech Synthesis Workshop 2021 (SSW11)

Samples Slide Preprint

Latent linguistic embedding for cross-lingual text-to-speech and voice conversion

Hieu-Thi Luong, Junichi Yamagishi

VCC2020 Workshop

Samples Preprint

Deep learning based voice cloning framework for a unified system of text-to-speech and voice conversion (Ph.D. thesis)

Hieu-Thi Luong

Ph.D. thesis, 2020

Slide Preprint Thesis

NAUTILUS: a Versatile Voice Cloning System

Hieu-Thi Luong, Junichi Yamagishi

IEEE/ACM Transactions on Audio, Speech, and Language Processing

Samples Paper

Bootstrapping non-parallel voice conversion from speaker-adaptive text-to-speech

Hieu-Thi Luong, Junichi Yamagishi

ASRU 2019

Samples Poster Preprint

A Unified Speaker Adaptation Method for Speech Synthesis using Transcribed and Untranscribed Speech with Backpropagation

Hieu-Thi Luong, Junichi Yamagishi

arXiv manuscript

Samples Preprint

Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora

Hieu-Thi Luong, Xin Wang, Junichi Yamagishi, Nobuyuki Nishizawa

Interspeech 2019

Samples Preprint

Scaling and bias codes for modeling speaker-adaptive DNN-based speech synthesis systems

Hieu-Thi Luong, Junichi Yamagishi

SLT 2018

Samples Poster Preprint

Multimodal Speech Synthesis Architecture for Unsupervised Speaker Adaptation

Hieu-Thi Luong, Junichi Yamagishi

Interspeech 2018

Samples Poster Preprint

Investigating accuracy of pitch-accent annotations in neural network-based speech synthesis and denoising effects

Hieu-Thi Luong, Xin Wang, Junichi Yamagishi, Nobuyuki Nishizawa

Interspeech 2018

Preprint

Adapting and Controlling DNN-based Speech Synthesis using Input Codes

Hieu-Thi Luong, Shinji Takaki, Gustav Eje Henter, Junichi Yamagishi

ICASSP 2017

Samples Preprint

A non-expert Kaldi recipe for Vietnamese Speech Recognition System

Hieu-Thi Luong, Hai-Quan Vu

WLSI-OIAF4HLT 2016

Corpus Paper

Hieu-Thi Luong

Blog posts [read more]

⅓ espresso [read more]

Selected publications

2026

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

Others

Patents

Artworks

Side projects

ボクラ

bokura.ai

Vo

Vocabulary

ABC

ABC Notation Editor