SLS
Spoken Language Systems
MIT Computer Science and Artificial Intelligence Laboratory

SLS PUBLICATIONS

Theses (1998 - present)

Many of our students' theses are available here in Adobe Acrobat (PDF) format.

Ph.D. Theses

2023

R. Haulcy, AI-Based Speech Assessment of Cognitive Impairment Disorders, MIT Department of Electrical Engineering and Computer Science, June 2023. (PDF)

S. Khurana, Transfer Learning for Spoken Language Processing, MIT Department of Electrical Engineering and Computer Science, June 2023. (PDF)

2022

T. He, Towards a Deeper Understanding of Neural Language Generation, MIT Department of Electrical Engineering and Computer Science, May 2022. (PDF)

2019

T. Alhanai, Detecting Cognitive Impairment from Spoken Language, MIT Department of Electrical Engineering and Computer Science, June 2019. (PDF)

M. Korpusik, Deep Learning for Spoken Dialogue Systems: Application to Nutrition, MIT Department of Electrical Engineering and Computer Science, June 2019. (PDF)

2018

Y. Belinkov, On Internal Language Representations in Deep Learning: An Analysis of Machine Translation and Speech Recognition, MIT Department of Electrical Engineering and Computer Science, June 2018. (PDF)

D. Harwath, Learning Spoken Language Through Vision, MIT Department of Electrical Engineering and Computer Science, June 2018. (PDF)

2017

X. Feng, Multi-Modal and Deep Learning for Robust Speech Recognition, MIT Department of Electrical Engineering and Computer Science, September 2017. (PDF)

Y. Zhang, Exploring Neural Network Architectures for Acoustic Modeling, MIT Department of Electrical Engineering and Computer Science, September 2017. (PDF)

S. Li, Improving Learning Experience in MOOCs with Educational Content Linking, MIT Department of Electrical Engineering and Computer Science, February 2017. (PDF)

2016

A. Lee, Language-Independent Methods for Computer-Assisted Pronunciation Training, MIT Department of Electrical Engineering and Computer Science, September 2016. (PDF)

E. Chuangsuwanich, Multilingual Techniques for Low Resource Automatic Speech Recognition, MIT Department of Electrical Engineering and Computer Science, June 2016. (PDF)

M. Price, Energy-scalable Speech Recognition Circuits, MIT Department of Electrical Engineering and Computer Science, June 2016. (PDF)

S. Shum, Overcoming Resource Limitations in the Processing of Unlimited Speech: Applications to Speaker and Language Recognition, MIT Department of Electrical Engineering and Computer Science, June 2016. (PDF)

2014

C. Lee, Discovering Linguistic Structures in Speech: Models and Applications, MIT Department of Electrical Engineering and Computer Science, September 2014. (PDF)

2013

Y. Zhang, Unsupervised Speech Processing with Applications to Query-by-Example Spoken Term Detection, MIT Department of Electrical Engineering and Computer Science, February 2013. (PDF)

2012

I. McGraw, Crowd-supervised Training of Spoken Language Systems, MIT Department of Electrical Engineering and Computer Science, June 2012. (PDF)

H. Chang, Multi-level Acoustic Modeling for Automatic Speech Recognition, MIT Department of Electrical Engineering and Computer Science, June 2012. (PDF)

Y. Xu, Language Technologies in Speech-Enabled Second Language Learning Games: From Reading to Dialogue, MIT Department of Electrical Engineering and Computer Science, June 2012. (PDF)

J. Liu, Harvesting and Summarizing User-Generated Content for Advanced Speech-Based Human-Computer Interaction, MIT Department of Electrical Engineering and Computer Science, February 2012. (PDF)

2011

M. Peabody, Methods for Pronunciation Assessment in Computer Aided Language Learning, MIT Department of Electrical Engineering and Computer Science, September 2011. (PDF)

2010

R. Zbib, Using Linguistic Knowledge in Statistical Machine Translation. MIT Department of Civil and Environmental Engineering, September 2010. (PDF)

2009

J. Lee, Automatic Correction of Grammatical Errors in Non-native English Text. MIT Department of Electrical Engineering and Computer Science, June 2009. (PDF)

K. Schutte, Parts-based Models and Local Features for Automatic Speech Recognition. MIT Department of Electrical Engineering and Computer Science, June 2009. (PDF)

T. Sainath, Applications of Broad Class Knowledge for Noise Robust Speech Recognition. MIT Department of Electrical Engineering and Computer Science, June 2009. (PDF)

A. Gruenstein, Toward Widely-Available and Usable Multimodal Conversational Interfaces. MIT Department of Electrical Engineering and Computer Science, June 2009. (PDF)

B. Hsu, Language Modeling for Limited-Data Domains. MIT Department of Electrical Engineering and Computer Science, June 2009. (PDF)

G. Choueiter, Linguistically-Motivated Sub-word Modeling with Applications to Speech Recognition. MIT Department of Electrical Engineering and Computer Science, February 2009. (PDF)

2006

A. Park, Unsupervised Pattern Discovery in Speech: Applications to Word Acquisition and Speaker Segmentation. MIT Department of Electrical Engineering and Computer Science, September 2006. (PDF)

E. Filisko, Developing Attribute Acquisition Strategies in Spoken Dialogue Systems via User Simulation. MIT Department of Electrical Engineering and Computer Science, June 2006. (PDF)

H. Shu, Multi-Tape Finite-State Transducer for Asynchronous Multi-Stream Pattern Recognition with Application to Speech. MIT Department of Electrical Engineering and Computer Science, May 2006. (PDF)

2005

K. Livescu, Feature-Based Pronunciation Modeling for Automatic Speech Recognition. MIT Department of Electrical Engineering and Computer Science, September 2005. (PDF)

M. Tang, Large Vocabulary Continuous Speech Recognition Using Linguistic Features and Constraints. MIT Department of Electrical Engineering and Computer Science, May 2005. (PDF)

2003

J. Yi, Corpus-Based Unit Selection for Natural-Sounding Speech Synthesis. MIT Department of Electrical Engineering and Computer Science, May 2003. (PDF)

2002

X. Mou, Towards a Unified Framework for Sub-lexical and Supra-lexical Linguistic Modeling. MIT Department of Electrical Engineering and Computer Science, June 2002. (PDF)

I. Bazzi, Modelling Out-of-Vocabulary Words for Robust Speech Recognition. MIT Department of Electrical Engineering and Computer Science, June 2002. (PDF)

2001

G. Chung, Towards Multi-Domain Speech Understanding with Flexible and Dynamic Vocabulary. MIT Department of Electrical Engineering and Computer Science, June 2001. (gzip'd PS), (PDF)

C. Wang, Prosodic Modeling for Improved Speech Recognition and Understanding. MIT Department of Electrical Engineering and Computer Science, June 2001. (PDF)

2000

K. Ng, Subword-based Approaches for Spoken Document Retrieval. MIT Department of Electrical Engineering and Computer Science, February 2000. (gzip'd PS), (PDF)

M. Spina, Analysis & Transcription of General Audio Data. MIT Department of Electrical Engineering and Computer Science, June 2000. (PDF)

1998

T.J. Hazen, The Use of Speaker Correlation Information for Automatic Speech Recognition. MIT Department of Electrical Engineering and Computer Science, January 1998. (PDF)

R. Lau, Subword Lexical Modelling for Speech Recognition. MIT Department of Electrical Engineering and Computer Science, May 1998. (PDF)

G. Flammia, Discourse Segmentation of Spoken Dialogue: An Empirical Approach. MIT Department of Electrical Engineering and Computer Science, June 1998. (PDF), Nb Discourse Annotation Tool (zip file)

J. Chang, Near-Miss Modeling: A Segment-Based Approach to Speech Recognition. MIT Department of Electrical Engineering and Computer Science, June 1998. (PDF)

M. McCandless, A Model for Interactive Computation: Applications to Speech Research. MIT Department of Electrical Engineering and Computer Science, June 1998. (PDF)

A. K. Halberstadt, Heterogeneous Acoustic Measurements and Multiple Classifiers for Speech Recognition. MIT Department of Electrical Engineering and Computer Science, November 1998. (PDF)

S.M. and M.Eng. Theses

2024

H. Chang, Perturbation-invariant Speech Representation Learning by Online Clustering, S. M. Thesis, MIT Department of Electrical Engineering and Computer Science, February 2024. (PDF)

Y. Chuang, Information Retrieval with Dense and Sparse Representations, S. M. Thesis, MIT Department of Electrical Engineering and Computer Science, February 2024. (PDF)

2022

C. Lai, Finding Sparse Subnetworks in Self-Supervised Speech Recognition and Speech Synthesis, S. M. Thesis, MIT Department of Electrical Engineering and Computer Science, May 2022. (PDF)

2021

W. Fang, On End-to-end Automatic Fact-checking Systems, S. M. Thesis, MIT Department of Electrical Engineering and Computer Science, September 2021. (PDF)

R. Manna, Constructing Low Resource Approaches to Improve Speech-to-text Translation from Modern Standard Arabic to English, M. Eng. Thesis, MIT Department of Electrical Engineering and Computer Science, September 2021. (PDF)

A. Rouditchenko, Learning Audio-Video Language Representations, M. Eng. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2021. (PDF)

I. Palmer, Spoken ObjectNet: Creating a Bias-Controlled Spoken Caption Dataset, M. Eng. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2021. (PDF)

M. Nadeem, On Factuality in Neural Language Models, M. Eng. Thesis, MIT Department of Electrical Engineering and Computer Science, February 2021. (PDF)

K. Tangri, Using Natural Language to Predict Bias and Factuality in Media with a Study on Rationalization, M.Eng. Thesis, MIT Department of Electrical Engineering and Computer Science, February 2021. (PDF)

2019

E. Azuh Mensah, Towards Multilingual Lexicon Discovery from Visually Grounded Speech, M. Eng. Thesis, MIT Department of Electrical Engineering and Computer Science, September 2019. (PDF)

O. Ren, Detecting Check-Worthy Claims, M. Eng. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2019. (PDF)

Y. Chung, Unsupervised Learning of Cross-Modal Mappings between Speech and Text, S. M. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2019. (PDF)

L. Ford, Large-scale Acoustic Scene Analysis with Deep Residual Networks, M. Eng. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2019. (PDF)

H. Luo, Neural Attentions for Natural Language Understanding and Modeling, S. M. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2019. (PDF)

G. Rivera, Automatic Detection of Code-switching in Arabic Dialects, M. Eng. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2019. (PDF)

B. Xu, Combating Fake News with Adversarial Domain Adaptation and Neural Models, M.Eng. Thesis, MIT Department of Electrical Engineering and Computer Science, February 2019. (PDF)

2018

W. Hsu, Unsupervised Learning of Disentangled Representations for Speech with Neural Variational Inference Models, S. M. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2018. (PDF)

S. Koppula, Energy-Efficient Speaker Identification with Low-Precision Networks, M. Eng. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2018. (PDF)

K. Leidal, Neural Techniques for Modeling Visually Grounded Speech, M.Eng. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2018. (PDF)

A. R. Titus, A Study of Adaptive Enhancement Methods for Improved Distant Speech Recognition, M.Eng. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2018. (PDF)

2016

J. Drexler, Deep Unsupervised Learning from Speech, S. M. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2016. (PDF)

H. Nassif, Learning Sentiment and Semantic Relatedness in User Generated Content Using Neural Models, M.Eng. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2016. (PDF)

F. Sun, Speech Representation Models for Speech Synthesis and Multimodal Speech Recognition, M.Eng. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2016. (PDF)

2015

M. Korpusik, Spoken Language Understanding in a Nutrition Dialogue System, S. M. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2015. (PDF)

L. Liu, Acoustic Models for Speech Recognition Using Deep Neural Networks Based on Approximate Math, M.Eng. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2015. (PDF)

R. Naphtal, Natural Language Processing Based Nutritional Application, M.Eng. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2015. (PDF)

P. Saylor, Spoke: A Framework for Building Speech-Enabled Websites, M.Eng. Thesis, MIT Department of Electrical Engineering and Computer Science, May 2015. (PDF)

A. Alghunaim, A Vector Space Approach for Aspect-Based Sentiment Analysis, M.Eng. Thesis, MIT Department of Electrical Engineering and Computer Science, May 2015. (PDF)

2014

T. Al Hanai, Lexical and Language Modeling of Diacritics and Morphemes in Arabic Automatic Speech Recognition, S. M. Thesis, MIT Department of Electrical Engineering and Computer Science, February 2014. (PDF)

2013

X. Fang, Bayesian Distance Metric Learning on i-vector for Speaker Verification, S. M. Thesis, MIT Department of Electrical Engineering and Computer Science, September 2013. (PDF)

C. Cai, Adapting Existing Games for Education Using Speech Recognition, S. M. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2013. (PDF)

D. Harwath, Unsupervised Modeling of Latent Topics and Lexical Units in Speech Audio, S. M. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2013. (PDF)

2012

A. Lee, A Comparison-based Approach to Mispronunciation Detection, S. M. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2012. (PDF)

2011

I. Badr, Pronunciation Learning for Automatic Speech Recognition, S. M. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2011. (PDF)

S. Shum, Unsupervised Methods for Speaker Diarization, S. M. Thesis, MIT Department of Electrical Engineering and Computer Science, June 2011. (PDF)

A. Goldie, CHATTER: A Spoken Language Dialogue System for Language Learning Applications, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, May 2011. (PDF)

C. Varenhorst, Making Speech Recognition Work on the Web, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, May 2011. (PDF)

Y. Li, Medical Data Mining: Improving Information Accessibility using Online Patient Drug Reviews, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, February 2011. (PDF)

2010

S. Dyar, A Multimodal Speech Interface for Dynamic Creation and Retrieval of Geographical Landmarks on a Mobile Device, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, June 2010. (PDF)

K. Luu, Real-Time Noise-Robust Speech Detection, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, August 2010. (PDF)

C. Lee, Closed-loop Auditory-based Representation for Robust Speech Recognition, S.M. thesis, MIT Department of Electrical Engineering and Computer Science, June 2010. (PDF)

S. Liu, Multimodal Speech Interfaces for Map-based Applications, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, June 2010. (PDF)

2009

Y. Zhang, Unsupervised Spoken Keyword Spotting and Learning of Acoustically Meaningful Units, S.M. thesis, MIT Department of Electrical Engineering and Computer Science, September 2009. (PDF)

G. Matthias, Incremental Speech Understanding in a Multimodal Web-Based Spoken Dialogue System, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, June 2009. (PDF)

S. Wang, Using Graphone Models in Automatic Speech Recognition, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, June 2009. (PDF)

S. Pueblo, Videorealistic Facial Animation for Speech-Based Interfaces, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, June 2009. (PDF)

2008

B. Zhu, Multimodal Speech Recognition with Ultrasonic Sensors, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, September 2008. (PDF)

G. Yu, Efficient Error Correction for Speech Systems Using Constrained Re-recognition, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, July 2008. (PDF)

H. Chang, Large-Margin Gaussian Mixture Modeling for Automatic Speech Recognition, S.M. thesis, MIT Department of Electrical Engineering and Computer Science, June 2008. (PDF)

I. McGraw, Web-based, Speech-enabled Games for Vocabulary Acquisition in a Foreign Language, S.M. thesis, MIT Department of Electrical Engineering and Computer Science, June 2008. (PDF)

Y. Xu, Combining Linguistics and Statistics for High-Quality Limited Domain English-Chinese Machine Translation, S.M. thesis, MIT Department of Electrical Engineering and Computer Science, June 2008. (PDF)

2006

D. Schultz, Robust Audio-Visual Person Verification Using Web-Camera Video, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, September 2006. (PDF)

2005

T. Sainath, Acoustic Landmark Detection and Segmentation using the McAulay-Quatieri Sinusoidal Model, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, August 2005. (PDF)

R. Woo, Exploration of Small Enrollment Speaker Verification on Handheld Devices, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, June 2005. (PDF)

2004

E. Saenko, Articulatory Features for Robust Visual Speech Recognition, S.M. thesis, MIT Department of Electrical Engineering and Computer Science, September 2004. (PDF)

G. Choueiter, A Wavelet and Filter Bank Framework for Phonetic Classification, S.M. thesis, MIT Department of Civil and Environmental Engineering, August 2004. (PDF)

V. Lee, LanguageLand: A Multimodal Conversational Spoken Language Learning System, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, August 2004. (PDF)

J. Zhang, Language Generation and Speech Synthesis in Dialogues for Language Learning, M.Eng. Thesis, MIT Department of Electrical Engineering and Computer Science, May 2004. (PDF)

B. Cowan, PLUTO: A Preprocessor for Multilingual Spoken Language Generation, S.M. thesis, MIT Department of Electrical Engineering and Computer Science, February 2004. (PDF)

J. Lee, Translingual Grammar Induction for Conversational Systems, S.M. thesis, MIT Department of Electrical Engineering and Computer Science, September 2004. (PDF)

2003

A. Boozer, Characterization of Emotional Speech in Human-Computer Dialogues, S.M. thesis, MIT Department of Electrical Engineering and Computer Science, September 2003. (PDF)

C. Chuu, LIESHOU: A Mandarin Conversational Task Agent for the Galaxy-II Architecture, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, February 2003. (PDF)

C. La, Infrastructure Development for Integration of Lip Reading into the SUMMIT Speech Recognizer, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, May 2003. (PDF)

T. J. Lau, SLLS: An Online Conversational Spoken Language Learning System, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, May 2003. (PDF)

L. S. Miyakawa, Distributed Speech Recognition within a Segment-Based Framework, S.M. thesis, MIT Department of Electrical Engineering and Computer Science, June 2003. (PDF)

S. B. Wang, A Multimodal Galaxy-based Geographic System, S.M. thesis, MIT Department of Electrical Engineering and Computer Science, June 2003. (PDF)

2002

A. S. Park, ASR Dependent Techniques for Speaker Recognition, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, May 2002. (PDF)

E. J. Pusateri, Rapid Speaker Adaptation with Speaker Clustering, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, May 2002. (PDF)

E. A. Filisko, A Context Resolution Server for the GALAXY Conversational Systems, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, June 2002. (PDF)

J. Kuo, An XML Messaging Protocol for Multimodal Galaxy Applications, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, May 2002. (PDF)

S. Crockett, Rapid Configuration of Discourse and Dialog Management in Conversational Systems, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, May 2002. (PDF)

2001

E. Weinstein, SpeechBuilder: Facilitating Spoken Dialogue System Development, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, May 2001. (gzip'd PS), (PDF)

2000

E. Sandness, Discriminative Training of Acoustic models in a Segment-Based Speech Recognizer, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, May 2000. (PDF), (gzip'd PS)

T. Burianek, Building a Speech Understanding System Using Word Spotting Techniques, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, July 2000. (PDF)

J. Pearlman, SLS-Lite: Enabling Spoken Language Systems Design for Non-Experts, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, August 2000. (gzip'd PS)

L. Baptist, Genesis-II: A Language Generation Module for Conversational Systems, S.M. thesis, MIT Department of Electrical Engineering and Computer Science, August 2000. (gzip'd PS), (PDF)

A. Suchato, Framework for Joint Recognition of Pronounced and Spelled Proper Names, S.M. thesis, MIT Department of Electrical Engineering and Computer Science, September 2000. (PDF)

1999

S. Kamppari, Word and Phone Level Acoustic Confidence Scoring for Speech Understanding Systems, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, September 1999. (PDF), (gzip'd PS)

K. Livescu, Analysis and Modeling of Non-Native Speech for Automatic Speech Recognition, S.M. thesis, MIT Department of Electrical Engineering and Computer Science, August 1999. (PDF), (gzip'd PS)

1998

S. Lee, Probabilistic Segmentation for Segment-based Speech Recognition, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, May 1998. (PDF), (gzip'd PS)

J. Yi, Natural-Sounding Speech Synthesis Using Variable-Length Units, M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science, May 1998. (PDF), (gzip'd PS)


32 Vassar Street
Cambridge, MA 02139 USA
(+1) 617.253.3049
 


©2020, Spoken Language Systems Group. All rights reserved.
