Lemur Search   
Language Technologies Institute
Carnegie Mellon University
School of Computer Science

LTI Reports by Year

2011 (6) 2010 (5) 2009 (5) 2008 (6) 2007 (6) 2006 (4) 2005 (2)
2004 (6) 2003 (7) 2001 (1) 1998 (1) 1997 (5)

2011 [top]

Ancestry.com Online Forum Test Collection
Jonathan L. Elsas
CMU-LTI-017

Predicting Responses and Discovering Social Factors in Scientific Literature
Dani Yogatama, Michael Heilman, Brendan O'Connor, Chris Dyer, Bryan R. Routledge, and Noah A. Smith
CMU-LTI-11-015

Recall-Oriented Learning for Named Entity Recognition in Wikipedia
Behrang Mohit, Nathan Schneider, Rishav Bhowmick, Kemal Oflazer, and Noah A. Smith
CMU-LTI-11-012

METEOR-Tuned Phrase-Based SMT: CMU French-English and Haitian-English Systems for WMT 2011
Michael Denkowski and Alon Lavie
CMU-LTI-11-011

Parallelization Strategies for a Dynamic Lexical Tree Decoder
Matthias Vogelgesang and Florian Metze
CMU-LTI-01-010

Predicting FED Action From Text
Tal Stramer, Bryan R. Routledge and Noah A. Smith
CMU-LTI-11-005

2010 [top]

Visualizing Topical Quotations Over Time to Understand News Discourse
Nathan Schneider, Rebecca Hwa, Philip Gianfortoni, Dipanjan Das, Michael Heilman, Alan W. Black, Frederick L. Crabbe, and Noah A. Smith
CMU-LTI-01-013

ASR System Combination Techniques, Tools and Experiments
Udhyakumar Nallasamy, Ian Lane and Florian Metze
CMU-LTI-10-010

Evaluating Translations Produced by Amazon Mechanical Turk
Matthias Eck, Ian Lane and Alex Waibel
CMU-LTI-10-009

Softmax-Margin Training for Structured Log-Linear Models
Kevin Gimpel and Noah A. Smith
CMU-LTI-10-008

SEMAFOR 1.0: A Probabilistic Frame-Semantic Parser
Dipanjan Das, Desai Cheny, Nathan Schneider and Noah A. Smith
CMU-LTI-10-001

2009 [top]

Power Iteration Clustering
Frank Lin and William W. Cohen
CMU-LTI-09-018

Semi-Supervised Classification of Network Data Using Very Few Labels
Frank Lin and William W. Cohen
CMU-LTI-09-017

Unsupervised Estimation of Classification and Regression Error Rates
Pinar Donmez, Guy Lebanon, and Krishnakumar Balasubramanian
CMU-LTI-09-015

Question Generation via Overgenerating Transformations and Ranking
Michael Heilman and Noah A. Smith
CMU-LTI-09-013

Modeling Content from Human-Verified Blacklists for Accurate Zero-Hour Phish Detection
Guang Xiang, Bryan A. Pendleton, and Jason Hong
CMU-LTI-09-005

2008 [top]

Automatically Generating Reading Comprehension Look-Back Strategy Questions from Expository Texts
Donna M. Gates
CMU-LTI-08-011

Products of Weighted Logic Problems
Shay B. Cohen, Robert J. Simmons, and Noah A. Smith
CMU-LTI-08-009

SOUR CREAM: Toward Semantic Processing of Recipes
Dan Tasse and Noah A. Smith
CMU-LTI-08-005

Learning to Extract Gene-Protein Names from Weakly-Labeled Text
Richard Wang, Anthony Tomasic, Robert E. Frederking, Isaac Simmons, and William W. Cohen
CMU-LTI-08-004

The Multi-Rank Bootstrap Algorithm: Semi-Supervised Political Blog Classification and Ranking Using Semi-Supervised Link Classification
Frank Lin and William W. Cohen
CMU-LTI-08-003

Learning to Walk Text Networks
Einat Minkov and William W. Cohen
CMU-LTI-08-002

2007 [top]

A Supervised Acoustic Model for Simultaneous Multiparticipant Vocal Activity Detection in Close-Talk Microphone Recordings of Meetings
Kornel Laskowski and Tanja Schultz
CMU-LTI-07-017

Combining Personalized Agents to Improve Content-Based Recommendations
Jason M. Adams, Paul N. Bennett, and Anthony Tomasic
CMU-LTI-07-015

A Dual-Use Speech CAPTCHA: Aiding Visually Impaired Web Users While Providing Transcriptions of Audio Streams
Andy Schliakjer
CMU-LTI-07-014

Annotation Guide for Laughter in Multi-Party Conversation
Kornel Laskowski, Susanne Burger, and Timothy Notari
CMU-LTI-07-013

Text-to-speech in Vocabulary Acquisition and Student Knowledge Models: a Classroom Study Using the REAP Intelligent Tutoring System
Carol Sisson
CMU-LTI-07-009

Recommending Recipients in the Enron E-mail Corpus
Vitor R. Carvalho and William W. Cohen
CMU-LTI-07-005

2006 [top]

Suffix Array and its Applications in Empirical Natural Language Processing
Ying (Joy) Zhang
CMU-LTI-06-010

A Log-linear Block Transliteration Model based on Bi-Stream HMMs
Bing Zhao, Nguyen Bach, Ian Lane, and Stephan Vogel
CMU-LTI-06-007

ARGUS: Efficient Scalable Continuous Query Optimization for Large-Volume Data Streams
Chun Jin and Jaime Carbonell
CMU-LTI-06-005

Notes on Single-Pass Online Learning Algorithms
Vitor R. Carvalho and William W. Cohen
CMU-LTI-06-002

2005 [top]

Full-Text Federated Search in Peer-to-Peer Networks
Jie Lu
CMU-LTI-05-197

Dynamic Machine Translation Evaluation Methods: Algorithmic Analysis and Generalization
Lucian Vlad Lita
CMU-LTI-05-193

2004 [top]

Gazetteers, Word Net, Encyclopedias, and The Web: Analyzing Question Answering Resources
Warren A. Hunt, Lucian Vlad Lita, and Eric Nyberg
CMU-LTI-04-188

Integrating Tools for the Creation of Speech-Enabled Tutors
Jonathan C. Brown
CMU-LTI-04-186

Speech Graffiti: Assessing the User Experience
Stefanie Tomko
CMU-LTI-04-185

Speech Recognition Technology Applied for English Education in Korea
Jong-Hyun Lee
CMU-LTI-04-182

ARGUS: Combining Rete and DBMS for Continuous Profile Matching on Large-Volume Data Streams
Chun Jin and Jaime Carbonell
CMU-LTI-04-181

Novelty Detection with Nearest Neighbor, Support Vector Machines, and Kernel Regression
Jian Zhang, Yiming Yang, and Jaime Carbonell
CMU-LTI-04-180

2003 [top]

FLOOD: A Planning Framework for Reasoning with Linguistic Data
Curtis Huttenhower
CMU-LTI-03-179

Cross-lingual Event Tracking
Nianli Ma, Yiming Yang, and Monica Rogati
CMU-LTI-03-178

CMU ARCTIC Databases for Speech Synthesis
John Kominek and Alan Black
CMU-LTI-03-177

Ukernal: A Unification Kernal
Benjamin Han and Alon Lavie
CMU-LTI-03-177-a

A Prototype of an English-Polish Machine Translation System
Anna Kupsc, Teruko Mitamura, and Eric Nyberg
CMU-LTI-03-176

Improving Speech Recognizer Performance in a Dialog System Using N-best Hypotheses Reranking
Ananlada Chotimongkol
CMU-LTI-03-175

Maximal Lattice Overlap in Example-Based Machine Translation
Rebecca Hutchinson, Paul N. Bennett, Jaime Carbonell, Peter Jansen, and Ralf Brown
CMU-LTI-03-174

2001 [top]

Improving Pronunciation Accuracy of Proper Names with Language Origin Classes
Ariadna Font Llitjos
CMU-LTI-01-169

1998 [top]

CMU Two-party and Three-party Spontaneous Speech Data Collection - Travel Domain
Sondra Ahlen and Anuj Vaidya
CMU-LTI-98-MEMO

1997 [top]

Hypothesis Driven Lexical Adaptation for Transcribing Multilingual Broadcast News
Petra Geutner, Michael Finke, and Peter Scheytt
CMU-LTI-97-155

Speech Recognition on Serbo-Croatian Dictation and Broadcast News Data
Peter Scheytt, Michael Finke, Petra Geutner
CMU-LTI-97-154

Data Collection Scenarios for C-Star Travel Domain
Sondra Ahlen, Brian Connelly, Michelle Corkadel, Rob Malkin, Anuj Vaidya, and Rodolfo Vega
CMU-LTI-97-153

Issues in Generating Turkish from Interlingua
Dilek Zeynep Hakkani, Gorham Tur, Kemal Oflazer, Teruko Mitamura, Eric Nyberg, and Kemal Oflazer
CMU-LTI-97-152

Vocal Tract Length Normalization for Large Vocabulary Continuous Speech Recognition
Puming Zhan and Alex Waibel
CMU-LTI-97-150

Language Technologies Institute • 5000 Forbes Ave • Pittsburgh, PA 15213-3891 • (412) 268-6591