LTI Technical Reports and Memos
2009
CMU-LTI-09-018
Power Iteration Clustering; Frank Lin and William W. Cohen; Fall 2009
CMU-LTI-09-017
Semi-Supervised Classification of Network Data Using Very Few Labels; Frank Lin and William W. Cohen; Fall 2009
CMU-LTI-09-015
Unsupervised Estimation of Classification and Regression Error Rates; Pinar Donmez, Guy Lebanon, and Krishnakumar Balasubramanian; Fall 2009
CMU-LTI-09-013
Question Generation via Overgenerating Transformations and Ranking; Michael Heilman and Noah A. Smith; Spring 2009
CMU-LTI-09-005
Modeling Content from Human-Verified Blacklists for Accurate Zero-Hour Phish Detection; Guang Xiang, Bryan A. Pendleton, and Jason Hong; Spring 2009
2008
CMU-LTI-08-011
Automatically Generating Reading Comprehension Look-Back Strategy Questions from Expository Texts; Donna M. Gates; Fall 2008
CMU-LTI-08-009
Products of Weighted Logic Programs; Shay B. Cohen, Robert J. Simmons, and Noah A. Smith; Fall 2008
CMU-LTI-08-005
SOUR CREAM: Toward Semantic Processing of Recipes; Dan Tasse and Noah A. Smith; Spring 2008
CMU-LTI-08-004
Learning to Extract Gene-Protein Names from Weakly-Labeled Text; Richard Wang, Anthony Tomasic, Robert E. Frederking, Isaac Simmons, and William W. Cohen; Spring 2008
CMU-LTI-08-003
The Multi-Rank Bootstrap Algorithm: Semi-Supervised Political Blog Classification and Ranking Using Semi-Supervised Link Classification; Frank Lin and William W. Cohen; Spring 2008
CMU-LTI-08-002
Learning to Walk Text Networks; Einat Minkov and William W. Cohen; Spring 2008
2007
CMU-LTI-07-017
A Supervised Acoustic Model for Simultaneous Multiparticipant Vocal Activity Detection in Close-Talk Microphone Recordings of Meetings; Kornel Laskowski and Tanja Schultz; Fall 2007
CMU-LTI-07-015
Combining Personalized Agents to Improve Content-Based Recommendations; Jason M. Adams, Paul N. Bennett, and Anthony Tomasic; Fall 2007
CMU-LTI-07-014
A Dual-Use Speech CAPTCHA: Aiding Visually Impaired Web Users While Providing Transcriptions of Audio Streams; Andy Schlaikjer; Fall 2007
CMU-LTI-07-013
Annotation Guide for Laughter in Multi-Party Conversation; Kornel Laskowski, Susanne Burger, and Timothy Notari; Fall 2007
CMU-LTI-07-009
Text-to-speech in Vocabulary Acquisition and Student Knowledge Models: a Classroom Study Using the REAP Intelligent Tutoring System; Carol Sisson; Summer 2007
CMU-LTI-07-005
Recommending Recipients in the Enron E-mail Corpus; Vitor R. Carvalho and William W. Cohen; Summer 2007
2006
CMU-LTI-06-010
Suffix Array and its Applications in Empirical Natural Language Processing; Ying (Joy) Zhang; Fall 2006
CMU-LTI-06-007
A Log-linear Block Transliteration Model based on Bi-Stream HMMs; Bing Zhao, Nguyen Bach, Ian Lane, and Stephan Vogel; Fall 2006
CMU-LTI-06-005
ARGUS: Efficient Scalable Continuous Query Optimization for Large-Volume Data Streams; Chun Jin and Jaime Carbonell; Summer 2006
CMU-LTI-06-002
Notes on Single-Pass Online Learning Algorithms; Vitor R. Carvalho and William W. Cohen; Summer 2006
2005
CMU-LTI-05-197
Full-Text Federated Search in Peer-to-Peer Networks; Jie Lu; Fall 2005
CMU-LTI-05-193
Dynamic Machine Translation Evaluation Methods: Algorithmic Analysis and Generalization; Lucian Vlad Lita; Spring 2005
2004
CMU-LTI-04-188
Gazetteers, WordNet, Encyclopedias, and The Web: Analyzing Question Answering Resources; Warren A. Hunt, Lucian Vlad Lita, and Eric Nyberg; Fall 2004
CMU-LTI-04-186
Integrating Tools for the Creation of Speech-Enabled Tutors; Jonathan C. Brown; Fall 2004
CMU-LTI-04-185
Speech Graffiti: Assessing the User Experience; Stefanie Tomko; Spring 2004
CMU-LTI-04-182
Speech Recognition Technology Applied for English Education in Korea; Jong-Hyun Lee; May 2004
CMU-LTI-04-181
ARGUS: Combining Rete and DBMS for Continuous Profile Matching on Large-Volume Data Streams; Chun Jin and Jaime Carbonell; May 2004
CMU-LTI-04-180
Novelty Detection with Nearest Neighbor, Support Vector Machines, and Kernel Regression; Jian Zhang, Yiming Yang, and Jaime Carbonell; March 2004
2003
CMU-LTI-03-179
FLOOD: A Planning Framework for Reasoning with Linguistic Data; Curtis Huttenhower; December 2003
CMU-LTI-03-178
Cross-lingual Event Tracking; Nianli Ma, Yiming Yang, and Monica Rogati; October 22, 2003
CMU-LTI-03-177*
CMU ARCTIC Databases for Speech Synthesis; John Kominek and Alan Black; September 2003
CMU-LTI-03-177-a*
Ukernel: A Unification Kernel; Benjamin Han and Alon Lavie; September 2003
CMU-LTI-03-176
A Prototype of an English-Polish Machine Translation System; Anna Kupsc, Teruko Mitamura, and Eric Nyberg; July 2003
CMU-LTI-03-175
Improving Speech Recognizer Performance in a Dialog System Using N-best Hypotheses Reranking; Ananlada Chotimongkol; April 2003
CMU-LTI-03-174
Maximal Lattice Overlap in Example-Based Machine Translation; Rebecca Hutchinson, Paul N. Bennett, Jaime Carbonell, Peter Jansen, and Ralf Brown; June 2003
2001
CMU-LTI-01-169
Improving Pronunciation Accuracy of Proper Names with Language Origin Classes; Ariadna Font Llitjos; Fall 2001
1998
CMU-LTI-98-MEMO
CMU Two-party and Three-party Spontaneous Speech Data Collection - Travel Domain; Sondra Ahlen and Anuj Vaidya; September 1998
1997
CMU-LTI-97-155
Hypothesis Driven Lexical Adaptation for Transcribing Multilingual Broadcast News; Petra Geutner, Michael Finke, and Peter Scheytt; December 1997
CMU-LTI-97-154
Speech Recognition on Serbo-Croatian Dictation and Broadcast News Data; Peter Scheytt, Michael Finke, and Petra Geutner; December 1997
CMU-LTI-97-153
Data Collection Scenarios for C-Star Travel Domain; Sondra Ahlen, Brian Connelly, Michelle Corkadel, Rob Malkin, Anuj Vaidya, and Rodolfo Vega; August 1997
CMU-LTI-97-152
Issues in Generating Turkish from Interlingua; Dilek Zeynep Hakkani, Gokhan Tur, Kemal Oflazer, Teruko Mitamura, and Eric Nyberg; August 1997
CMU-LTI-97-150
Vocal Tract Length Normalization for Large Vocabulary Continuous Speech Recognition; Puming Zhan and Alex Waibel; May 1997
* Duplicate numbers due to clerical errors