SLS PUBLICATIONS ARCHIVES
The following is an archive of reports, theses and papers authored by SLS group staff and students. You can follow this link for our recent publications.
Theses
Many of our students' theses
are available either here in Adobe Acrobat (PDF) format and possibly gzip'd PostScript format,
or from the LCS
Publications Office as technical reports (electronic and/or printed
format). The items below are listed by year (descending), then by month (ascending) and
finally by first author.
Ph.D. Theses
1995
H. Meng, Phonological
Parsing for Bi-directional Letter-to-Sound/Sound-to-Letter Generation.
MIT Department of Electrical Engineering and Computer Science, February
1995. MIT-LCS-TR-687. (PDF)
R. Kassel, A Comparison of Approaches to On-line Handwritten Character Recognition. MIT Department
of Electrical Engineering and Computer Science, June 1995. MIT-LCS-TR-661. (PDF)
1989
H. Leung, The Use of Artificial Neural Networks for Phonetic Recognition. MIT Department of Electrical Engineering and Computer Science, May 1989.
(PDF)
1988
J. Glass, Finding Acoustic Regularities in Speech: Applications to Phonetic Recognition. MIT Department of Electrical Engineering and Computer Science, May 1988. (PDF)
S.M. and M.Eng. Theses
1997
S. Sarma, A Segment-based
Speaker Verification System Using Summit. S.M. thesis, MIT Department of Electrical
Engineering and Computer Science, April 1997.
(PDF), (gzip'd PS)
G. Chung, Hierarchical
Duration Modelling for a Speech Recognition System. S.M. thesis, MIT
Department of Electrical Engineering and Computer Science, May 1997.
(PDF), (gzip'd PS)
A. Parmar, A Semi-Automatic System for the Syllabification and
Stress Assignment of Large Lexicons. M.Eng. thesis, MIT Department
of Electrical Engineering and Computer Science, June 1997.
(PDF),
(gzip'd PS)
C. Wang, Porting the Galaxy System to Mandarin Chinese.
S.M. thesis, MIT Department of Electrical Engineering and Computer
Science, May 1997.
(PDF),
(gzip'd PS)
B. Serridge, Context-Dependent
Modeling in a Segment-based Speech Recognition System. M.Eng. thesis,
MIT Department of Electrical Engineering and Computer Science, August 1997.
(PDF), (gzip'd PS)
1996
R. Chun, A Hierarchical
Feature Representation. M.Eng. thesis, MIT Department of Electrical
Engineering and Computer Science, March 1996. MIT-LCS-TR-698.
A. Manos, A Study
on Out-of-Vocabulary Word Modelling for a Segment-based Keyword Spotting
System. S.M. thesis, MIT Department of Electrical Engineering and Computer
Science, April 1996. MIT-LCS-TR-694. (PDF), (gzip'd PS)
M. Muzumdar, Semi-automatic
Acoustic Measurement Optimization for a Segmental Speech Recognition System.
M.Eng. thesis, MIT Department of Electrical Engineering and Computer Science,
June 1996.
1995
J. Chang, Speech Recognition
System Robustness to Microphone Variations. S.M. thesis, MIT Department
of Electrical Engineering and Computer Science, January 1995. MIT-LCS-TR-650.
(PDF)
J. Koppelman, A Statistical
Approach to Language Modelling in the ATIS Domain. M.Eng. thesis, MIT
Department of Electrical Engineering and Computer Science, January 1995.
MIT-LCS-TR-646.
1984
J. Glass, Nasal Consonants and Nasalized Vowels: An Acoustic Study and Recognition Experiment. S.M. thesis, MIT Department of Electrical Engineering and Computer Science, December 1984.
(PDF)
Papers
Many of our papers are available below in Adobe Acrobat (PDF) format and possibly gzip'd PostScript format.
The items below are listed by year (descending), then by month (ascending) and
finally by first author.
1997
T.J. Hazen and V. Zue,
"Segment-Based Automatic Language Identification,"
Journal of the Acoustical Society of America, Vol. 101, No. 4,
pp. 2323-2331, April 1997.
(PDF),
(gzip'd PS)
R. Lau, G. Flammia, C.
Pao and V. Zue, "WebGALAXY: Beyond Point and Click - A Conversational
Interface to a Browser," Proc. Sixth International World Wide Web
Conference, pp. 119-127, Santa Clara, CA, April 1997. (HTML)
A. Manos and V. Zue,
"A Segment-based Wordspotter Using Phonetic Filler Models," Proc.
IEEE International Conference on Acoustics, Speech and Signal Processing,
Munich, Germany, April 1997.
(PDF), (gzip'd PS)
K. Ng and V. Zue, "An
Investigation of Subword Unit Representations for Spoken Document Retrieval,"
Proc. of the ACM SIGIR Conference, p. 139, Philadelphia, PA, July 1997.
(PDF), (gzip'd PS)
J. Chang and J. Glass,
"Segmentation and Modeling in Segment-Based Recognition," Proc.
Eurospeech 97, pp. 1199-1202, Rhodes, Greece, September 1997. (PDF)
G. Chung and S. Seneff,
"Hierarchical Duration Modelling for Speech Recognition Using the
Angie Framework," Proc. Eurospeech 97, pp. 1475-1478, Rhodes, Greece,
September 1997.
(PDF)
G. Flammia and V. Zue,
"Learning the Structure of Mixed Initiative Dialogues Using a Corpus
of Annotated Conversations," Proc. Eurospeech 97, pp. 1871-1874, Rhodes,
Greece, September 1997.
(PDF)
A. Halberstadt and J.
Glass, "Heterogeneous Acoustic Measurements for Phonetic Classification,"
Proc. Eurospeech 97, pp. 401-404, Rhodes, Greece, September 1997.
(PDF)
T.J. Hazen and J. Glass, "A Comparison of Novel Techniques for Instantaneous
Speaker Adaptation," Proc. Eurospeech 97, pp. 2047-2050, Rhodes, Greece,
September 1997.
(PDF), (gzip'd PS)
J. Hugunin and V. Zue,
"On the Design of Effective Speech-Based Interfaces for Desktop Applications,"
Proc. Eurospeech 97, pp. 1335-1338, Rhodes, Greece, September 1997.
(PDF)
R. Lau, G. Flammia, C.
Pao and V. Zue, "Webgalaxy - Integrating Spoken Language and Hypertext
Navigation," Proc. Eurospeech 97, pp. 883-886, Rhodes, Greece, September
1997. (PDF)
R. Lau and S. Seneff,
"Providing Sublexical Constraints for Word Spotting Within the Angie
Framework," Proc. Eurospeech 97, pp. 263-266, Rhodes, Greece, September
1997. (PDF)
M. McCandless and J.
Glass, "MUSE: A Scripting Language for the Development of Interactive
Speech Analysis and Recognition Tools," Proc. Eurospeech 97, pp. 629-632,
Rhodes, Greece, September 1997. (PDF)
K. Ng and V. Zue, "Subword
Unit Representations for Spoken Document Retrieval," Proc. Eurospeech
97, pp. 1607-1610, Rhodes, Greece, September 1997.
(PDF)
S. Sarma and V. Zue, "A
Segment-Based Speaker Verification System Using Summit," Proc. Eurospeech
97, pp. 843-846, Rhodes, Greece, September 1997.
(PDF), (gzip'd PS)
M. Spina and V. Zue, "Automatic
Transcription of General Audio Data: Effect of Environment Segmentation
on Phonetic Recognition," Proc. Eurospeech 97, pp. 1547-1550, Rhodes,
Greece, September 1997. (PDF)
C. Wang, J. Glass, H.
Meng, J. Polifroni, S. Seneff and V. Zue, "YINHE: A Mandarin
Chinese Version of the Galaxy System," Proc. Eurospeech 97,
pp. 351-354, Rhodes, Greece, September 1997.
(PDF)
V. Zue, "Conversational
Interfaces: Advances and Challenges," Proc. Eurospeech 97, p. KN-18,
Rhodes, Greece, September 1997.
(PDF), (gzip'd PS)
V. Zue, S. Seneff, J. Glass, L. Hetherington, E. Hurley,
H. Meng, C. Pao, J. Polifroni, R. Schloming, and P. Schmid,
"From Interface to Content: Translingual Access and Delivery of On-Line Information,"
Proc. Eurospeech 97, pp. 2227-2230, Rhodes, Greece, September
1997. (PDF)
1996
J. Glass, J. Chang,
and M. McCandless, "A Probabilistic Framework for Feature-Based
Speech Recognition," Proc. ICSLP 96, pp. 2277-2280, Philadelphia,
PA, October 1996. (PDF)
E. Hurley, J. Polifroni
and J. Glass, "Telephone Data Collection Using the World Wide Web,"
Proc. ICSLP 96, pp. 1898-1901, Philadelphia, PA, October 1996. (PDF), (gzip'd PS)
H. Meng, S. Busayapongchai, J. Glass,
D. Goddeau, L. Hetherington, E. Hurley, C. Pao, J. Polifroni, S. Seneff, and V. Zue,
"Wheels: A Conversational System in the Automobile Classifieds Domain,"
Proc. ICSLP 96, pp. 542-545, Philadelphia, PA, October
1996. (PDF)
S. Seneff, D. Goddeau, C. Pao, and J. Polifroni,
"Multimodal Discourse Modelling in a Multi-user Multi-domain Environment,"
Proc. ICSLP 96, pp. 192-195, Philadelphia, PA, October
1996. (PDF)
S. Seneff, R. Lau and
H. Meng, "ANGIE: A New Framework for Speech Analysis Based on Morpho-Phonological
Modelling," Proc. ICSLP 96, pp. 110-113, Philadelphia, PA, October
1996. (PDF)
S. Seneff and J. Polifroni,
"A New Restaurant Guide Conversational System: Issues in Rapid Prototyping
for Specialized Domains," Proc. ICSLP 96, pp. 665-668, Philadelphia, PA, October
1996. (PDF)
M. Spina and V. Zue,
"Automatic Transcription of General Audio Data: Preliminary Analysis,"
Proc. ICSLP 96, pp. 594-597, Philadelphia, PA, October 1996. (PDF)
1995
J. Chang and V. Zue,
"A Study of Speech Recognition System Robustness to Microphone Variations,"
Proc. Eurospeech 95, Madrid, Spain, September 1995. (PDF)
J. Glass,
G. Flammia, D. Goodine, M. Phillips, J. Polifroni, S. Sakai, S. Seneff, and V. Zue,
"Multilingual Spoken-Language Understanding in the MIT Voyager System,"
Speech Communication, Vol. 17, No. 1, pp. 1-18, March 1995. (PDF)
1994
V. Zue, S. Seneff, J. Polifroni, M. Phillips, C. Pao, D. Goodine, D.
Goddeau, and J. Glass, "Pegasus: A Spoken Dialogue Interface for
Online Travel Planning," Speech Communication, Vol. 15, pp. 331-340, 1994.
D. Goddeau, E. Brill, J. Glass, C. Pao, M. Phillips, J.
Polifroni, S. Seneff, and V. Zue, "Galaxy: A Human-Language
Interface to On-Line Travel Information," Proc. Int. Conf. on
Spoken Language Processing, pp. 707-710, Yokohama, Japan, 1994.
1990
V. Zue, J. Glass, D. Goodine, H. Leung, M. Phillips, J. Polifroni, and S. Seneff, "Recent Progress on the SUMMIT System," Third DARPA Speech and Natural Language Workshop, pp. 24-27, Hidden Valley, Pennsylvania, USA, June 1990.
(PDF)
1988
J. Glass and V. Zue, "Multi-Level Acoustic Segmentation of Continuous Speech," Proc. ICASSP, pp. 429-432, New York, April 1988.
(PDF)
S. Seneff, "A Joint Synchrony/Mean-rate Model of Auditory Speech Processing," Journal of Phonetics, Vol. 16, pp. 55-76, 1988.
(PDF)