Email: contact (at) hieuthi.com
CV: pdf (Last updated: 2023-02-01)
I received my Ph.D. degree in Multidisciplinary Science in 2020 from SOKENDAI, Japan and currently is a Research Fellow at Nanyang Technological University, Singapore. My works focus on researching and developing novel solutions for Speech and Language Processing Systems including Automatic Speech Recognition, Speech Synthesis, Fake Speech Detection, etc. I'm interested in Speech Processing, Machine Learning and Natural Language Processing in general.
I also do programming and drawing as hobbies. More below.
For inquiries about research, technology, education, or something else, you can contact me via the email listed above.
Articles written in English
Fleeting notes written in Vietnamese
LlamaPartialSpoof: An LLM-Driven Fake Speech Dataset Simulating Disinformation Generation
Hieu-Thi Luong, Haoyang Li, Lin Zhang, Kong Aik Lee, Eng Siong Chng
submitted to ICASSP 2025
Room Impulse Responses help attackers to evade Deep Fake Detection
Hieu-Thi Luong, Duc-Tuan Trung, Kong Aik Lee, Eng Siong Chng
SLT 2024
Controlling Multi-Class Human Vocalization Generation via a Simple Segment-based Labeling Scheme
Hieu-Thi Luong, Junichi Yamagishi
Interspeech 2023
LaughNet: synthesizing laughter utterances from waveform silhouettes and a single laughter example
Hieu-Thi Luong, Junichi Yamagishi
arXiv manuscript
Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance
Hieu-Thi Luong, Junichi Yamagishi
Speech Synthesis Workshop 2021 (SSW11)
Latent linguistic embedding for cross-lingual text-to-speech and voice conversion
Hieu-Thi Luong, Junichi Yamagishi
VCC2020 Workshop
Deep learning based voice cloning framework for a unified system of text-to-speech and voice conversion (Ph.D. thesis)
Hieu-Thi Luong
Ph.D. thesis, 2020
NAUTILUS: a Versatile Voice Cloning System
Hieu-Thi Luong, Junichi Yamagishi
IEEE/ACM Transactions on Audio, Speech, and Language Processing
Bootstrapping non-parallel voice conversion from speaker-adaptive text-to-speech
Hieu-Thi Luong, Junichi Yamagishi
ASRU 2019
A Unified Speaker Adaptation Method for Speech Synthesis using Transcribed and Untranscribed Speech with Backpropagation
Hieu-Thi Luong, Junichi Yamagishi
arXiv manuscript
Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora
Hieu-Thi Luong, Xin Wang, Junichi Yamagishi, Nobuyuki Nishizawa
Interspeech 2019
Scaling and bias codes for modeling speaker-adaptive DNN-based speech synthesis systems
Hieu-Thi Luong, Junichi Yamagishi
SLT 2018
Multimodal Speech Synthesis Architecture for Unsupervised Speaker Adaptation
Hieu-Thi Luong, Junichi Yamagishi
Interspeech 2018
Investigating accuracy of pitch-accent annotations in neural network-based speech synthesis and denoising effects
Hieu-Thi Luong, Xin Wang, Junichi Yamagishi, Nobuyuki Nishizawa
Interspeech 2018
Adapting and Controlling DNN-based Speech Synthesis using Input Codes
Hieu-Thi Luong, Shinji Takaki, Gustav Eje Henter, Junichi Yamagishi
ICASSP 2017
A non-expert Kaldi recipe for Vietnamese Speech Recognition System
Hieu-Thi Luong, Hai-Quan Vu
WLSI-OIAF4HLT 2016
Learning device, learning method, voice synthesis device, voice synthesis method and program
Inventor: Hieu-Thi Luong, Junichi Yamagishi
P7109071 · Issued Jul 29, 2022
Utility extracts vocabulary from text for learning English. Present in printer-friendly format
An online editor for ABC Notation format for edit, play, and print music sheets.