SLS PUBLICATIONS
Theses (1998 - present)
Many of our students' theses are available here in Adobe Acrobat
(PDF) format.
Ph.D. Theses
2023
R. Haulcy, AI-Based Speech Assessment of Cognitive Impairment
Disorders, MIT Department of Electrical Engineering and Computer
Science, June 2023.
(PDF)
S. Khurana, Transfer Learning for Spoken Language
Processing, MIT Department of Electrical Engineering and Computer
Science, June 2023.
(PDF)
2022
T. He, Towards a Deeper Understanding of Neural Language
Generation, MIT Department of Electrical Engineering and Computer
Science, May 2022.
(PDF)
2019
T. Alhanai, Detecting Cognitive Impairment from Spoken
Language, MIT Department of Electrical Engineering and Computer
Science, June 2019.
(PDF)
M. Korpusik, Deep Learning for Spoken Dialogue Systems:
Application to Nutrition, MIT Department of Electrical Engineering and
Computer Science, June 2019.
(PDF)
2018
Y. Belinkov, On Internal Language Representations in Deep
Learning: An Analysis of Machine Translation and Speech
Recognition, MIT Department of Electrical Engineering and Computer
Science, June 2018.
(PDF)
D. Harwath, Learning Spoken Language Through Vision, MIT
Department of Electrical Engineering and Computer Science, June 2018.
(PDF)
2017
X. Feng, Multi-Modal and Deep Learning for Robust Speech
Recognition, MIT Department of Electrical Engineering and Computer
Science, September 2017.
(PDF)
Y. Zhang, Exploring Neural Network Architectures for Acoustic
Modeling, MIT Department of Electrical Engineering and Computer
Science, September 2017.
(PDF)
S. Li, Improving Learning Experience in MOOCs with Educational
Content Linking, MIT Department of Electrical Engineering and
Computer Science, February 2017.
(PDF)
2016
A. Lee, Language-Independent Methods for Computer-Assisted
Pronunciation Training, MIT Department of Electrical Engineering
and Computer Science, September 2016.
(PDF)
E. Chuangsuwanich, Multilingual Techniques for Low Resource
Automatic Speech Recognition, MIT Department of Electrical Engineering and
Computer Science, June 2016.
(PDF)
M. Price, Energy-scalable Speech Recognition Circuits, MIT
Department of Electrical Engineering and Computer Science, June 2016.
(PDF)
S. Shum, Overcoming Resource Limitations in the Processing of
Unlimited Speech: Applications to Speaker and Language
Recognition, MIT Department of Electrical Engineering and Computer
Science, June 2016.
(PDF)
2014
C. Lee, Discovering Linguistic Structures in Speech: Models and
Applications, MIT Department of Electrical Engineering and
Computer Science, September 2014.
(PDF)
2013
Y. Zhang, Unsupervised Speech Processing with Applications to Query-by-Example Spoken Term Detection, MIT Department of Electrical Engineering and Computer Science, February 2013.
(PDF)
2012
I. McGraw, Crowd-supervised Training of Spoken Language Systems, MIT Department of Electrical Engineering and Computer Science, June 2012.
(PDF)
H. Chang, Multi-level Acoustic Modeling for Automatic Speech Recognition, MIT Department of Electrical Engineering and Computer Science, June 2012.
(PDF)
Y. Xu, Language Technologies in Speech-Enabled Second Language Learning Games: From Reading to Dialogue, MIT Department of Electrical Engineering and Computer Science, June 2012.
(PDF)
J. Liu, Harvesting and Summarizing User-Generated Content for Advanced Speech-Based Human-Computer Interaction, MIT Department of Electrical Engineering and Computer Science, February 2012.
(PDF)
2011
M. Peabody, Methods for Pronunciation Assessment in Computer Aided Language Learning, MIT Department of Electrical Engineering and Computer Science, September 2011.
(PDF)
2010
R. Zbib, Using Linguistic Knowledge in Statistical Machine Translation. MIT Department of Civil and Environmental Engineering, September 2010.
(PDF)
2009
J. Lee, Automatic Correction of Grammatical Errors in Non-native English Text. MIT Department of Electrical Engineering and Computer Science, June 2009.
(PDF)
K. Schutte, Parts-based Models and Local Features for Automatic Speech Recognition. MIT Department of Electrical Engineering and Computer Science, June 2009.
(PDF)
T. Sainath, Applications of Broad Class Knowledge for Noise Robust Speech Recognition. MIT Department of Electrical Engineering and Computer Science, June 2009.
(PDF)
A. Gruenstein, Toward Widely-Available and Usable Multimodal Conversational Interfaces. MIT Department of Electrical Engineering and Computer Science, June 2009.
(PDF)
B. Hsu, Language Modeling for Limited-Data Domains. MIT Department of Electrical Engineering and Computer Science, June 2009.
(PDF)
G. Choueiter, Linguistically-Motivated Sub-word Modeling with Applications to Speech Recognition. MIT Department of Electrical Engineering and Computer Science, February 2009.
(PDF)
2006
A. Park, Unsupervised Pattern Discovery in Speech: Applications to Word Acquisition and Speaker Segmentation. MIT Department of Electrical Engineering and Computer Science, September 2006.
(PDF)
E. Filisko, Developing Attribute Acquisition Strategies in Spoken Dialogue Systems via User Simulation. MIT Department of Electrical Engineering and Computer Science, June 2006.
(PDF)
H. Shu, Multi-Tape Finite-State Transducer for Asynchronous Multi-Stream Pattern Recognition with Application to Speech. MIT Department of Electrical Engineering and Computer Science, May 2006.
(PDF)
2005
K. Livescu, Feature-Based Pronunciation Modeling for Automatic Speech Recognition. MIT Department of Electrical Engineering and Computer Science, September 2005.
(PDF)
M. Tang, Large Vocabulary Continuous Speech Recognition Using Linguistic Features and Constraints. MIT Department of Electrical Engineering and Computer Science, May 2005.
(PDF)
2003
J. Yi, Corpus-Based Unit Selection for Natural-Sounding Speech Synthesis. MIT Department of Electrical Engineering and Computer Science, May 2003.
(PDF)
2002
X. Mou, Towards a Unified Framework for Sub-lexical and Supra-lexical
Linguistic Modeling. MIT Department of Electrical Engineering and Computer
Science, June 2002.
(PDF)
I. Bazzi, Modelling Out-of-Vocabulary Words for Robust Speech Recognition.
MIT Department of Electrical Engineering and Computer Science, June 2002.
(PDF)
2001
G. Chung, Towards Multi-Domain Speech Understanding with Flexible
and Dynamic Vocabulary. MIT Department of Electrical Engineering and
Computer Science, June 2001.
(gzip'd PS), (PDF)
C. Wang, Prosodic Modeling for Improved Speech Recognition and Understanding.
MIT Department of Electrical Engineering and Computer Science, June 2001.
(PDF)
2000
K. Ng, Subword-based Approaches for Spoken Document Retrieval.
MIT Department of Electrical Engineering and Computer Science, February
2000.
(gzip'd PS), (PDF)
M. Spina, Analysis & Transcription of General Audio Data.
MIT Department of Electrical Engineering and Computer Science, June 2000.
(PDF)
1998
T.J. Hazen, The Use of Speaker Correlation Information for Automatic
Speech Recognition. MIT Department of Electrical Engineering and Computer
Science, January 1998. (PDF)
R. Lau, Subword Lexical Modelling for Speech Recognition. MIT
Department of Electrical Engineering and Computer Science, May
1998. (PDF)
G. Flammia, Discourse Segmentation of Spoken Dialogue: An Empirical
Approach. MIT Department of Electrical Engineering and Computer Science,
June 1998. (PDF), Nb Discourse Annotation Tool (zip file)
J. Chang, Near-Miss Modeling: A Segment-Based Approach to Speech
Recognition. MIT Department of Electrical Engineering and Computer Science,
June 1998. (PDF)
M. McCandless, A Model for Interactive Computation: Applications
to Speech Research. MIT Department of Electrical Engineering and Computer
Science, June 1998. (PDF)
A. K. Halberstadt, Heterogeneous Acoustic Measurements and Multiple
Classifiers for Speech Recognition. MIT Department of Electrical Engineering
and Computer Science, November 1998. (PDF)
S.M. and M.Eng. Theses
2024
H. Chang, Perturbation-invariant Speech Representation Learning
by Online Clustering, S. M. Thesis, MIT Department of Electrical
Engineering and Computer Science, February 2024.
(PDF)
Y. Chuang, Information Retrieval with Dense and Sparse
Representations, S. M. Thesis, MIT Department of Electrical
Engineering and Computer Science, February 2024.
(PDF)
2022
C. Lai, Finding Sparse Subnetworks in Self-Supervised Speech
Recognition and Speech Synthesis, S. M. Thesis, MIT Department of Electrical
Engineering and Computer Science, May 2022.
(PDF)
2021
W. Fang, On End-to-end Automatic Fact-checking Systems,
S. M. Thesis, MIT
Department of Electrical Engineering and Computer Science, September
2021.
(PDF)
R. Manna, Constructing Low Resource Approaches to Improve
Speech-to-text Translation from Modern Standard Arabic to English,
M. Eng. Thesis, MIT Department of Electrical Engineering and Computer Science,
September 2021.
(PDF)
A. Rouditchenko, Learning Audio-Video Language
Representations, M. Eng. Thesis, MIT Department of Electrical
Engineering and Computer Science, June 2021.
(PDF)
I. Palmer, Spoken ObjectNet: Creating a Bias-Controlled Spoken
Caption Dataset, M. Eng. Thesis, MIT Department of Electrical
Engineering and Computer Science, June 2021.
(PDF)
M. Nadeem, On Factuality in Neural Language Models, M. Eng. Thesis, MIT
Department of Electrical Engineering and Computer Science, February
2021.
(PDF)
K. Tangri, Using Natural Language to Predict Bias and Factuality
in Media with a Study on Rationalization, M.Eng. Thesis, MIT
Department of Electrical Engineering and Computer Science, February 2021.
(PDF)
2019
E. Azuh Mensah, Towards Multilingual Lexicon Discovery from
Visually Grounded Speech, M. Eng. Thesis, MIT Department of
Electrical Engineering and Computer Science, September 2019.
(PDF)
O. Ren, Detecting Check-Worthy Claims, M. Eng. Thesis, MIT
Department of Electrical Engineering and Computer Science, June 2019.
(PDF)
Y. Chung, Unsupervised Learning of Cross-Modal Mappings between
Speech and Text, S. M. Thesis, MIT Department of Electrical
Engineering and Computer Science, June 2019.
(PDF)
L. Ford, Large-scale Acoustic Scene Analysis with Deep Residual
Networks, M. Eng. Thesis, MIT Department of Electrical Engineering
and Computer Science, June 2019.
(PDF)
H. Luo, Neural Attentions for Natural Language Understanding and
Modeling, S. M. Thesis, MIT Department of Electrical Engineering
and Computer Science, June 2019.
(PDF)
G. Rivera, Automatic Detection of Code-switching in Arabic
Dialects, M. Eng. Thesis, MIT Department of Electrical Engineering
and Computer Science, June 2019.
(PDF)
B. Xu, Combating Fake News with Adversarial Domain Adaptation
and Neural Models, M.Eng. Thesis, MIT Department of Electrical
Engineering and Computer Science, February 2019.
(PDF)
2018
W. Hsu, Unsupervised Learning of Disentangled Representations
for Speech with Neural Variational Inference Models, S. M. Thesis,
MIT Department of Electrical Engineering and Computer Science, June
2018.
(PDF)
S. Koppula, Energy-Efficient Speaker Identification with
Low-Precision Networks, M. Eng. Thesis, MIT Department of
Electrical Engineering and Computer Science, June 2018.
(PDF)
K. Leidal, Neural Techniques for Modeling Visually Grounded
Speech, M.Eng. Thesis, MIT Department of Electrical Engineering
and Computer Science, June 2018.
(PDF)
A. R. Titus, A Study of Adaptive Enhancement Methods for
Improved Distant Speech Recognition, M.Eng. Thesis, MIT Department
of Electrical Engineering and Computer Science, June 2018.
(PDF)
2016
J. Drexler, Deep Unsupervised Learning from Speech,
S. M. Thesis, MIT Department of Electrical Engineering and Computer Science, June
2016.
(PDF)
H. Nassif, Learning Sentiment and Semantic Relatedness in User
Generated Content Using Neural Models, M.Eng. Thesis, MIT
Department of Electrical Engineering and Computer Science, June 2016.
(PDF)
F. Sun, Speech Representation Models for Speech Synthesis and
Multimodal Speech Recognition, M.Eng. Thesis, MIT Department of Electrical
Engineering and Computer Science, June 2016.
(PDF)
2015
M. Korpusik, Spoken Language Understanding in a Nutrition
Dialogue System, S. M. Thesis, MIT Department of Electrical
Engineering and Computer Science, June 2015.
(PDF)
L. Liu, Acoustic Models for Speech Recognition Using Deep Neural
Networks Based on Approximate Math, M.Eng. Thesis, MIT Department of
Electrical Engineering and Computer Science, June 2015.
(PDF)
R. Naphtal, Natural Language Processing Based Nutritional
Application, M.Eng. Thesis, MIT Department of Electrical Engineering
and Computer Science, June 2015.
(PDF)
P. Saylor, Spoke: A Framework for Building Speech-Enabled
Websites, M.Eng. Thesis, MIT Department of Electrical Engineering and
Computer Science, May 2015.
(PDF)
A. Alghunaim, A Vector Space Approach for Aspect-Based Sentiment
Analysis, M.Eng. Thesis, MIT Department of Electrical Engineering and
Computer Science, May 2015.
(PDF)
2014
T. Al Hanai, Lexical and Language Modeling of Diacritics and
Morphemes in Arabic Automatic Speech Recognition, S. M. Thesis,
MIT Department of Electrical Engineering and Computer Science,
February 2014.
(PDF)
2013
X. Fang, Bayesian Distance Metric Learning on i-vector for
Speaker Verification, S. M. Thesis, MIT Department of Electrical
Engineering and Computer Science, September 2013.
(PDF)
C. Cai, Adapting Existing Games for Education Using Speech Recognition, S. M. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2013.
(PDF)
D. Harwath, Unsupervised Modeling of Latent Topics and Lexical Units in Speech Audio, S. M. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2013.
(PDF)
2012
A. Lee, A Comparison-based Approach to Mispronunciation Detection, S. M. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2012.
(PDF)
2011
I. Badr, Pronunciation Learning for Automatic Speech Recognition, S. M. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2011.
(PDF)
S. Shum, Unsupervised Methods for Speaker Diarization, S. M. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2011.
(PDF)
A. Goldie, CHATTER: A Spoken Language Dialogue System for Language Learning Applications, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, May 2011.
(PDF)
C. Varenhorst, Making Speech Recognition Work on the Web, M.Eng. thesis, MIT Department of Electrical Engineering and Computer
Science, May 2011.
(PDF)
Y. Li, Medical Data Mining: Improving Information Accessibility using Online Patient Drug Reviews, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, February 2011.
(PDF)
2010
S. Dyar, A Multimodal Speech Interface for Dynamic Creation and Retrieval of Geographical Landmarks on a Mobile Device, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, June 2010.
(PDF)
K. Luu, Real-Time Noise-Robust Speech Detection, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, August 2010.
(PDF)
C. Lee, Closed-loop Auditory-based Representation for Robust Speech Recognition, S.M. thesis, MIT Department of Electrical Engineering and Computer Science, June 2010.
(PDF)
S. Liu, Multimodal Speech Interfaces for Map-based Applications, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, June 2010.
(PDF)
2009
Y. Zhang, Unsupervised Spoken Keyword Spotting and Learning of Acoustically Meaningful Units, S.M. thesis, MIT Department of Electrical Engineering and Computer Science, September 2009.
(PDF)
G. Matthias, Incremental Speech Understanding in a Multimodal Web-Based Spoken Dialogue System, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, June 2009.
(PDF)
S. Wang, Using Graphone Models in Automatic Speech Recognition, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, June 2009.
(PDF)
S. Pueblo, Videorealistic Facial Animation for Speech-Based Interfaces, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, June 2009.
(PDF)
2008
B. Zhu, Multimodal Speech Recognition with Ultrasonic Sensors, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, September 2008.
(PDF)
G. Yu, Efficient Error Correction for Speech Systems Using Constrained Re-recognition, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, July 2008.
(PDF)
H. Chang, Large-Margin Gaussian Mixture Modeling for Automatic Speech Recognition, S.M. thesis, MIT Department of Electrical Engineering and Computer Science, June 2008.
(PDF)
I. McGraw, Web-based, Speech-enabled Games for Vocabulary Acquisition in a Foreign Language, S.M. thesis, MIT Department of Electrical Engineering and Computer Science, June 2008.
(PDF)
Y. Xu, Combining Linguistics and Statistics for High-Quality Limited Domain English-Chinese Machine Translation, S.M. thesis, MIT Department of Electrical Engineering and Computer Science, June 2008.
(PDF)
2006
D. Schultz, Robust Audio-Visual Person Verification Using Web-Camera Video, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, September 2006.
(PDF)
2005
T. Sainath, Acoustic Landmark Detection and Segmentation using the McAulay-Quatieri Sinusoidal Model, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, August 2005.
(PDF)
R. Woo, Exploration of Small Enrollment Speaker Verification on Handheld Devices, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, June 2005.
(PDF)
2004
E. Saenko, Articulatory Features for Robust Visual Speech Recognition, S.M. thesis, MIT Department of Electrical Engineering and Computer Science, September 2004.
(PDF)
G. Choueiter, A Wavelet and Filter Bank Framework for Phonetic Classification, S.M. thesis, MIT Department of Civil and Environmental Engineering, August 2004.
(PDF)
V. Lee, LanguageLand: A Multimodal Conversational Spoken Language Learning System, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, August 2004.
(PDF)
J. Zhang, Language Generation and Speech Synthesis in Dialogues for Language Learning, M.Eng. Thesis, MIT Department of Electrical Engineering and Computer Science, May 2004.
(PDF)
B. Cowan, PLUTO: A Preprocessor for Multilingual Spoken Language Generation, S.M. thesis, MIT Department of Electrical Engineering and Computer Science, February 2004.
(PDF)
J. Lee, Translingual Grammar Induction for Conversational Systems,
S.M. thesis, MIT Department of Electrical Engineering and Computer Science,
September 2004.
(PDF)
2003
A. Boozer, Characterization of Emotional Speech in Human-Computer Dialogues, S.M. thesis, MIT Department of Electrical Engineering and Computer Science, September 2003.
(PDF)
C. Chuu, LIESHOU: A Mandarin Conversational Task Agent for the Galaxy-II Architecture, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, February 2003.
(PDF)
C. La, Infrastructure Development for Integration of Lip Reading into the SUMMIT Speech Recognizer, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, May 2003.
(PDF)
T. J. Lau, SLLS: An Online Conversational Spoken Language Learning System, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, May 2003.
(PDF)
L. S. Miyakawa, Distributed Speech Recognition within a Segment-Based Framework, S.M. thesis, MIT Department of Electrical Engineering and Computer Science, June 2003.
(PDF)
S. B. Wang, A Multimodal Galaxy-based Geographic System, S.M. thesis, MIT Department of Electrical Engineering and Computer Science, June 2003.
(PDF)
2002
A. S. Park, ASR Dependent Techniques for Speaker Recognition,
M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science,
May 2002.
(PDF)
E. J. Pusateri, Rapid Speaker Adaptation with Speaker Clustering,
M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science,
May 2002.
(PDF)
E. A. Filisko, A Context Resolution Server for the GALAXY Conversational
Systems, M.Eng. thesis, MIT Department of Electrical Engineering and
Computer Science, June 2002.
(PDF)
J. Kuo, An XML Messaging Protocol for Multimodal Galaxy Applications,
M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science,
May 2002.
(PDF)
S. Crockett, Rapid Configuration of Discourse and Dialog Management in
Conversational Systems, M.Eng. thesis, MIT Department of Electrical
Engineering and Computer Science, May 2002.
(PDF)
2001
E. Weinstein, SpeechBuilder: Facilitating Spoken Dialogue System
Development, M.Eng. thesis, MIT Department of Electrical Engineering
and Computer Science, May 2001.
(gzip'd PS), (PDF)
2000
E. Sandness, Discriminative Training of Acoustic models in a Segment-Based
Speech Recognizer, M.Eng. thesis, MIT Department of Electrical Engineering
and Computer Science, May 2000.
(PDF), (gzip'd PS)
T. Burianek, Building a Speech Understanding System Using Word Spotting
Techniques, M.Eng. thesis, MIT Department of Electrical Engineering
and Computer Science, July 2000.
(PDF)
J. Pearlman, SLS-Lite: Enabling Spoken Language Systems Design for
Non-Experts, M.Eng. thesis, MIT Department of Electrical Engineering
and Computer Science, August 2000.
(gzip'd PS)
L. Baptist, Genesis-II: A Language Generation Module for Conversational
Systems, S.M. thesis, MIT Department of Electrical Engineering and
Computer Science, August 2000.
(gzip'd PS), (PDF)
A. Suchato, Framework for Joint Recognition of Pronounced and Spelled
Proper Names, S.M. thesis, MIT Department of Electrical Engineering
and Computer Science, September 2000. (PDF)
1999
S. Kamppari, Word and Phone Level Acoustic Confidence Scoring for
Speech Understanding Systems, M.Eng. thesis, MIT Department of Electrical
Engineering and Computer Science, September 1999.
(PDF), (gzip'd PS)
K. Livescu, Analysis and Modeling of Non-Native Speech for Automatic
Speech Recognition, S.M. thesis, MIT Department of Electrical Engineering
and Computer Science, August 1999.
(PDF), (gzip'd PS)
1998
S. Lee, Probabilistic Segmentation for Segment-based Speech Recognition,
M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science,
May 1998.
(PDF), (gzip'd PS)
J. Yi, Natural-Sounding Speech Synthesis Using Variable-Length Units,
M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science,
May 1998.
(PDF), (gzip'd PS)