Home


About

   Directions
   Admissions

   How To Apply

   The LTI Brochure


Education

   Ph.D.
   Dual Ph.D. with Portugal
   M.S.

   Undergrad Minor

   Courses

    FYI

LTI Forms

Seminars
   LTI Seminar Series
   Joint Speech Seminar (JSS)

   Machine Translation (MT)

   Student Research Symposium

   Information Retrieval Series

   Large Scale Lunch Seminar

   Intelligence Seminar


Visitor Information
   General
   Maps & Directions
   Hotel Links
   Parking Information


Research
   Projects

   Reports

    Dissertations


People

   Faculty

   Students

   Upcoming Graduates

   Staff

   Visitors   

   Who to See for What


Contacts


LTI Seminar Abstracts
Summer 2008


August 20
Xiaojin (Jerry) Zhu
University of Wisconsin

Text-to-Picture Synthesis

I will present a Text-to-Picture system that synthesizes a picture from general, unrestricted natural language text.  The process is analogous to Text-to-Speech synthesis, but with pictorial output that conveys the gist of the text.  Our system consists of multiple parts, including natural language processing, computer vision, computer graphics, and machine learning. First, it identifies informative and "picturable" keywords in the input text, then searches for a good image per keyword, and finally optimizes the overall collage layout. The effectiveness of our system is assessed with user studies.  I will focus on the machine learning techniques to identify picturable keywords, and to optimize the picture layout. Experiments suggest that Text-to-Picture synthesis has great potential in augmenting human-computer and human-human communication modalities, with applications in education and health care, among others.

July 18
Alice Oh
MIT

Generating Multiple Perspectives in Baseball Summaries by Content Reordering

Every story about an event offers a unique perspective about the event. A popular sporting event, such as a Major League Baseball game, is followed by several summary articles that show different points of view. The goal of this research is to build a computational model of perspective and build a system for automatically generating multiple summary articles showing different perspectives.

My approach is to take a neutral summary article, reorder the content of that summary based on event features extracted from the description of the game, and produce two new summaries showing the local team perspectives. I will present an initial user survey that validated the hypothesis that content ordering has a significant effect on the users' perception of perspective. I will also discuss collecting and analyzing a parallel corpus of baseball game data and summary articles showing local team perspectives. I will then describe the reordering algorithm, the implementation of the system, and a user study to evaluate the output of the system.



 

July 29
Gideon Borensztajn and Stefan Frank
Institute for Logic, Language and Computation
University of Amsterdam

Computational Tools for Cognitive Research into Linguistic Phenomena

Our research group at the Institute for Logic, Language and Computation (ILLC)
at the University of Amsterdam uses techniques from computational linguistics
for psychological investigations in human language acquisition and
processing. In this talk, we present two examples of this research.

In the first part of the talk, Gideon Borensztajn presents his research that
shows children's grammars to grow more abstract with age. A method was
developed for automatic identification of the most probable multi-word
constructions used in children's utterances, given syntactically annotated
utterances from the Brown corpus of CHILDES (Sagae et al., 2007). The
constructions that were found cover many interesting linguistic phenomena from
the language acquisition literature, and show a progression from very concrete
towards abstract constructions. For all children of the Brown corpus,
grammatical abstraction, defined as the percentage of variable slots in the
productive units of their grammar, increases globally with age. This research
was presented at this year's Cognitive Science Conference.

In the second part of the talk, Stefan Frank discusses how the psychological
validity of sentence-processing models can be evaluated using so-called
"surprisal theory" (Hale, 2001; Levy, 2008). According to this theory, the
time required to read a word is inversely logarithmically related to the
word's probability of occurrence given its sentence context. Although such
probabilities can be estimated from large text corpora, the question remains
whether these "objective" probabilities resemble the "subjective"
probabilities assigned to words by readers. Surprisal theory can only be
falsified if we assume that subjective probabilities are indeed similar to
objective probabilities. If both this assumption and surprisal theory are
correct, an objectively more accurate probability model should also provide
more accurate predictions of word-reading times. A comparison of word
probabilities - as estimated by different models - and experimental
reading-time data indicates that such a relation does not necessarily hold. If
we nevertheless hold on to surprisal theory, these findings provide insight
into the nature of the human sentence-processing system.

BIOGRAPHICAL SKETCHES

Stefan Frank is a post-doctoral researcher in Rens Bod's research group at the
ILLC at the University of Amsterdam. He holds a PhD degree from Tilburg
University and the Max Planck Institute for Psycholinguistics in Nijmegen.
Between 2004 and 2007, he was a post-doctoral researcher at the Nijmegen
Institute for Cognition and Information, where he was awarded a 3-year
research grant on computational modeling of sentence comprehension. He has
been at ILLC since November 2007.

Gideon Borensztajn is currently a PhD student in Rens Bod's research group at
the ILLC at the University of Amsterdam. He received a Masters degree in
Cognitive Science from the University of Amsterdam in 2006. Prior to that,
between 1996 and 2004, he worked as a system and software developer at Camera
Obscura, School of the Arts in Tel Aviv, Israel.

 

The LTI Webmaster
 



LTI is part of the School of Computer Science at Carnegie Mellon University.
This page is maintained by The LTI Webmaster.