all.bib file://localhost/Users/chenl/Documents/PAPERS/bib/bib/all.bib BibTeX Bibliography en Copyright 2006 Fri, 05 Jan 2007 14:56:10 -0600 Using Maximum Entropy (ME) Model to Incorporate Gesture Cues for SU Detection Speech and Gesture Analysis for Evaluation of Parkinson Disease Discourse segmentation of multi-party conversation Projecting the end of a speaker's turn: A cognitive cornerstone of conversation Addressee Identification in Face-to-Face Meetings Generation of Natural Response Timing Using Decision Tree Based on Prosodic and Linguistic Information Prosodic Features Which Cue Backchannel Responses in English and Japanese Using Prosodic Clues to decide when to produce back-channel utterances Interactional units in conversation: syntactic, intonational, and pragmatic resources for the management of turns Projection and 'silences': Notes on phonetic and conversational structure Keeping the Floor in Multiparty Conversations: Intonation, Syntax, and Pause Is the speaker done yet? faster and more accurate end-of-utterance detection using prosody in human-computer dialog Prosodic Cues for Interaction Control in Spoken Dialogue Systems Exploring Prosody in Interaction Control A maximum entropy approach to natural language processing The role of gesture in communication and thinking Broadcast news segmentation using MDE and STT information to improve speech recognition Final Report: parsing speech and structural event detection http://www.clsp.jhu.edu/ws2005/groups/eventdetect/documents/finalreport.pdf Maximum {E}ntropy {M}odeling {T}oolkit for {P}ython and {C}++ Gestural Cues for Sentence Segmentation Gesture Mind Markers in ECAs Locating Salient Portions of Meeting Using Multimodal Cues A Multimodal Analysis of Floor Control in Meetings Meeting Recording Quick Transcription Guidelines http://www.nist.gov/speech/test_beds/mr_proj/meeting_corpus_1/documents/pdf/MeetingDataQTRSpec-V1.3.pdf Embodied Conversational Agents Informing the design of embodied 
conversational agents by analysing multimodal politeness behaviors in human-human communication Automatic Detection of discourse structure for speech recognition and understanding A corpus for studying addressing behavior in multi-party dialogues Eye Gaze Patterns in Conversations: There is More to Conversational Agents Than Meets the Eyes Multi-level Dialog Act Tags. A Multimodal Discourse Ontology for Meeting Understanding Turn-taking in social talk dialogues: temporal, formal and functional aspects Melodic cues to turn-taking in English: evidence from perception Testing the Perceptual Relevance of Syntactic Completion and Melodic Configuration for Turn-Taking in Dutch Pitch Accents, Boundary Tones and Turn-taking in Dutch Map Task Dialogues An Open Source Prosodic Feature Extraction Tool The {ICSI} meeting recorder dialog act ({MRDA}) corpus Durational Aspects of Turn-taking in Spontaneous Face-to-Face and Telephone Dialogues Models of Gaze in Multi-party Discourse A simulation of small group discussion Modeling Gaze Behavior as a Function of Discourse Structure Coordinating Turn-Taking With Gaze Offering a Hand to Pragmatic Understanding: The Role of Speech and Gesture in Comprehension and Memory Summarization of Videotaped Presentations: Automatic Analysis of Motion and Gesture Meeting Browser: Tracking and Summarizing Meetings Advances in Automatic Meeting Record Creation and Access Pitch-Based Emphasis Detection for Characterization of Meeting Records Pitch-Based Emphasis Detection for Segmenting Speech Recordings Multimodal Summarization of Meeting Recordings Comparing {HMM}, {M}aximum {E}ntropy, and {C}onditional {R}andom {F}ields for Disfluency Detection Automatic Dialog act segmentation and classification in multiparty meetings Predicting end of utterance in multimodal and unimodal conditions Praat based Prosodic Feature Extraction Toolkit Using Simple Speech-Based Features to Detect the State of a Meeting and the Roles of the Meeting Participants 
{SRILM} - An Extensible Language Modeling Toolkit Articulated Body Tracking Using Dynamic Belief Propagation {SONIC}: The {U}niversity of {C}olorado Continuous Speech Recognizer Gesture, Gaze, and Ground Prosodic Features Extraction {VACE} Multimodal Meeting Corpus Speech and Non-Speech Detection In Meeting Audio for Transcription Accurate Head Pose Tracking in Low Resolution Video Online Updating Appearance Generative Mixture Model for Meanshift Tracking Shared Linguistic Resources for Human Language Technology in the Meeting Domain Automatic Analysis of Multimodal Group Actions in Meetings Meeting Modeling in the Context of Multimodal Research Linguistic Annotation Tools http://www.ldc.upenn.edu/annotation/ Hand modeling, analysis and recognition Wavesurfer software http://www.speech.kth.se/wavesurfer/ Natural Turn-Taking Needs no Manual: Computational Theory and Model Conversational Organization: Interaction Between Speakers and Hearers The {NIST} {M}eeting {R}oom {P}ilot {C}orpus Blackboard Systems A Coding Tool for Multimodal Analysis of Meeting Video Vector Coherence Mapping: Motion Field Extraction by Exploiting Multiple Coherences Designing Autonomous Agents: Theory and Practice from Biology to Engineering and back The elimination of visible behaviour from social interactions: effects on verbal, nonverbal and interpersonal variables Gaze and Mutual Gaze Speech-Gesture Mismatches: Evidence For One Underlying Representation of Linguistic {\&} Nonlinguistic Information Natural Interactivity Resources - Data, Annotation Schemes and Tools Emerging Requirements for Multi-Modal Annotation {\&} Analysis Tools Gestures and speech, interactions and separations: A reply to McNeill How Representational Gestures Help Speaking The Regulation of Speaker Turns in Face-to-Face Conversation: Some Implications for Conversation in Sound-only Communication Channels SignStream: A Database Tool for Research on Visual-Gestural Language Gesture, Speech, and Computational Stages: a 
Reply to {Mc}{N}eill Meaning in Movements: an Investigation into the Interrelationship of Physiographic Gestures and Speech in Seven-year-olds Gazing in Trials - a Powerful Signal in Floor Appointment Transana: a tool for the transcription and qualitative analysis of audio and video data, http://transana.org http://www.transana.org Some Functions of Gaze-direction in Social Interaction Using CLAN http://childes.psy.cmu.edu/clan Durational Aspects in Turn Taking Anvil http://www.dfki.de/~?kipp/anvil/ Speaking while monitoring addressees for understanding Natural Gesture in Descriptive Monologues Effects of Gaze on Multiparty Mediated Communication A system for Situated Temporal Analysis of Multimodal Communication Non-Verbal Cues for Discourse Structure The production of gesture and speech Manual activity during speaking in aphasic subjects Nonverbal Behaviours Improving a Simulation of Small Group Discussion Gesture and Intonation Gesture and the process of speech production: We think, therefore we gesture Development of the user-state conversations for the multimodal corpus in {SmartKom} Gestural Origo and Loci-transitions in Natural Discourse Segmentation {FORM}: a kinematically-based Gesture Annotation Scheme, http://www.ldc.upenn.edu/Projects/FORM/index.html Embodied Conversational Agents:Representation and Intelligence in User Interface Gesture Annotation: {T}ools and {D}ata, http://www.ldc.upenn.edu/annotatio/gesture http://www.ldc.upenn.edu/annotatio/gesture MacVissta: A System for Multimodal Analysis A versatile camera calibration technique for high accuracy {3D} machine vision metrology using off-the-shelf {TV} cameras and lenses The Language-Thought-Hand System Toward Interpretation of natural speech/gesture for spatial planning on a virtual map Gesture as an Indicator of Early Error Detection in Self-Monitoring of Speech Movement coordination in social interaction: some examples described Gestural Spatialization in Natural Discourse Segmentation Hearing 
Gesture: how our hands help us think Pointing Gesture Interpretation in a Multimodal Context The repertoire of nonverbal behavioral categories A probabilistic approach to reference resolution in multimodal user interfaces Space-time gestures Lexical gestures and lexical access A Multimodal Database of Gestures and Speech Language and thought interface: A study of spontaneous gestures and Japanese mimetics Thinking and Speaking Gesture, speech, and lexical access: the role of lexical movements in speech production Topic and focus of a sentence and the patterning of a text Vision based hand gesture interpretation using recursive estimation Distribution of Semantic Features across speech and gesture by humans and machines On the Tip of the Mind: Gesture as a key to Conceptualization Visual display: pointing and natural language: the power of multimodal interaction Biological and cognitive foundations of intelligent sensor fusion Gesture production during stuttered speech: Insights into the nature of gesture-speech integration MediaTagger: Macintosh-based video transcription, http://www.mpi.nl/world/tg/CAVA/mt/MTandDB.html http://www.mpi.nl/world/tg/CAVA/mt/MTandDB.html Hidden Markov model based continuous online gesture recognition CAVA data base http://www.mpi.nl/world/tg/CAVA/mt/CAVA_db.html Dynamical system representation, generation, and recognition of basic oscillatory motion gestures A state-based approach to the representation and recognition of gesture Gesture and the poetics of prose ISLE Natural Interactivity and Multimodality (NIMM) WP8.1- Survey of NIMM Data Resources, Current and Future User Profiles, Markets and User Needs for NIMM Resources Dynamic system representation of basic and nonlinear in parameters oscillatory motion gestures Visual and Linguistic Information in Gesture Classification A state-based technique for the summarization and recognition of gesture Toward multimodal human-computer interface Isolated sign language recognition using hidden 
Markov models Recursive identification of gesture inputs using hidden Markov models Task-specific gesture analysis in real-time using interpolated views Recognizing temporal trajectories using the condensation algorithm Temporal classification of natural gesture and application to video coding A Salience-Based Approach to Gesture-Speech Alignment Understanding people pointing: the Perseus system Neural Architecture for Gesture-Based Human-Machine-Interaction Exploiting speech/gesture co-occurrence for improving continuous gesture recognition in weather narration The {SmartKom} Multimodal Corpus at {BAS} http://www.smartkom.org/reports/Report-NR-34.pdf A Robust Agent-Based Gesture Tracking System Gesture recognition using the Perseus architecture Labeling of gestures in SmartKom - The coding system Synergistic use of direct manipulation and natural language Natural Language with Integrated deictic and graphic gestures Multimodal Corpora for Human-machine interaction research Unification-based multimodal integration Speech/Gesture Interface to a Visual-Computing Environment Natural Language with Integrated deictic and graphic gestures Pointing gesture recognition based on 3D-tracking of face, hands and head orientation Integrating simultaneous input from speech, gaze, and hand gestures Understanding coverbal dimensional gestures in a virtual design environment Hand motion gestural oscillations and multimodal discourse Synchronization of speech and hand gestures during multimodal human-computer interaction Coverbal iconic gestures for object descriptions in virtual environments: an empirical study Speech-gesture driven multimodal interfaces for crisis management Gesture Patterns during Speech Repairs Two-handed gesture in multi-modal natural dialog Multimodal human discourse: {G}esture and speech Integrating simultaneous input from speech, gaze, and hand gestures An approach to natural gesture in virtual environments Recovering the Temporal Structure of Natural 
Gesture Gestural Interface to a visual computing Environment for Molecular biologists (invited speech) Perceptual user interfaces: multimodal interfaces that process what comes naturally Mutual disambiguation of recognition errors in a multimodal architecture Visual Interpretation of Hand Gestures for Human-Computer Interaction: A Review Ten myths of multimodal interaction Integration and synchronization of input modes during multimodal human-computer interaction Shape your imagination: Iconic gestural-based interaction Research Challenges in Gesture: Open Issues and Unsolved Problems Disfluencies in gesture: Gestural correlates to speech silent and filled pauses Eyes in the interface Statistical multimodal integration for intelligent HCI Holds as Gestural Correlates to Empty and Filled Speech Pauses Interaction with on-screen objects using visual gesture recognition Gestural Trajectory Symmetries and Discourse Segmentation Hand gesture recognition using combined features of location, angle and velocity Speech Pauses and Gestural Holds in Parkinson's Disease Gesture-based interaction and communication: Automated classification of hand gesture contours Oscillatory Gestures and Discourse Real-time input of 3D pose and gestures of a user's hand and its applications for HCI Video camera-based dynamic gesture recognition for HCI The Catchment Feature Model: A Device for Multimodal Fusion and A Bridge Between Signal and Sense Gestural Hand Trajectory Symmetries and Discourse Segmentation Gestural Hand Trajectory Symmetries and Discourse Segmentation Talkbank Program, Grant No.BCS-996009 KDI SBE http://www.talkbank.org {KDI}: {C}ross-model {A}nalysis {S}ignal and {S}ense- {D}ata and {C}omputational {R}esources for {G}esture, {S}peech and {G}aze {R}esearch, http://vislab.cs.vt.edu/KDI http://vislab.cs.vt.edu/KDI {ARDA VACE} project http://vislab.cs.wright.edu/Projects/MEETING-ANALYSIS/Overview.html {DARPA} {EARS} {P}rogram, http://www.darpa.mil/ipto/programs/ears/ 
Put-that-there ICONIC: Speech and Depictive Gestures at the Human-Machine Interface Multi-Modal Natural Dialogue Continuous Recognition of Deictic Gestures for Multimodal Interfaces FingerMouse: A freehand pointing interface Toward the use of gesture in traditional user interfaces Television control by hand gestures Computer vision in interactive computer graphics Unencumbered gestural interaction Recognizing human action in time-sequential images using hidden markov model Parametric hidden Markov models for gesture recognition Research challenges in gesture: open issues and unsolved problems Synergistic use of direct manipulation and natural language QuickSet: Multimodal Interaction for Distributed Applications Multimodal interfaces that process what comes naturally Hand and Mind: What Gestures Reveal about Thought Growth Points, Catchments, and Contexts Catchments and Context: Non-modular Factors in Speech and Gesture Growth points in thinking-for-speaking Instructions for annotating discourses Catchments, prosody and discourse Dynamic imagery in speech and gesture Gestures cues for conversational interaction in monocular video Gestural trajectory symmetries and discourse segmentation A Comparison of Disfluency Distribution in a Unimodal and a Multimodal Speech Interface Finite-state multimodal parsing and understanding User-Centered Modeling for Spoken Language and Multimodal Interfaces User-Centered Modeling for Spoken Language and Multimodal Interfaces Put That Where? 
Voice and Gesture at the Graphics Interface Gesture with Speech for Graphics Manipulation ALIVE: Artificial Life Interactive Video Environment Smart Rooms Effects of the Restriction of Hand Gestures on Disfluency Visual Gesture Recognition Data Driven Gesture Model Acquisition using Minimum Description Length The curvature primal sketch ANVIL: A Generic Annotation Tool for Multimodal Dialogue VisSTA: A Tool for Analyzing Multimodal Discourse Data Learning classification trees {L}iving {H}and to {M}outh: {P}sychological {T}heories about {S}peech and {G}esture in {I}nteractive {D}ialogue {S}ystems Automatic Hand Hold Detection in Natural Conversation A Parallel Algorithm for Dynamic Gesture Tracking Formal Syntax of Gesture : CoGesT1.1 Visual Prosody and Speech Intelligibility Some relationships between body motion and speech Current Issues in the Study of Gesture Prosody Based Co-analysis for Continuous Recognition of Coverbal Gestures Glove-talk II - a neural-network interface which maps gestures to parallel formant speech synthesizer controls Achieving Effective Floor Control with a Low-Bandwidth Gesture-Sensitive Videoconferencing System Multimodal Model Integration for Sentence Unit Detection Prosody Based Audio-Visual Co-analysis for Coverbal Gesture Recognition Improving continuous gesture recognition with spoken prosody {L}inguistic {A}nnotation: {S}urvey by {LDC} http://www.ldc.upenn.edu/annotation/ http://www.ldc.upenn.edu/annotation/ A Prosodic Analysis of Discourse Segments in Direction-Giving Monologues {S}tructural {M}etadata {R}esearch in the {EARS} {P}rogram To{BI}: A Standard for Labeling English Prosody The {ICSI/SRI/UW} {RT}-04 {S}tructural {M}etadata {E}xtraction {S}ystem Multi-stream adaptive evidence combination for noise robust ASR Predicting spoken disfluencies during human-computer interaction An algorithm for pronominal anaphora resolution A simplest Systematics for the Organisation of Turn Taking for Conversation Experiments on sentence 
boundary detection Cyperpunc: A lightweight punctuation annotation system for speech Robust speech detection and segmentation for real-time ASR applications On the Structure of Speaker-Auditor Interaction During Speaking Turns Some Signals and Rules for Taking Speaking Turns in Conversations A comparison between syntactic and prosodic phrasing Comparing and combining generative and posterior probability models: Some advances in sentence boundary detection in speech Structural Event Detection for Rich Transcription of Speech Speech Recognition with Automatic Punctuation Strategies for automatic segmentation of audio data Speaker segmentation on conversational telephone speech www.nist.gov/speech/tests/rt/%20rt2003/spring/presentations/RT03_Slides.pdf Automatic Time Alignment of Phonemes Using Acoustic-Phonetic Information A linked-HMM model for robust voicing and speech detection Automatic Segmentation and Labeling of Speech. Multiple Media Correlation: Theory and Application Multispeaker speech activity detection for the {ICSI} meeting recorder Robust speech recognition in noisy environments: the 2001 IBM SPINE evaluation system Transcriber: Development and use of a tool for assisting speech corpora production Resegmentation of switchboard Automatic phonemic transcription and linguistic annotation from known text with Hidden Markov Models, An Aligner for German An HMM-based system for automatic segmentation and alignment of speech Perceptual Evaluation of Automatic Segmentation in Text-To-Speech Synthesis An Introduction to the Diagnostic Evaluation of SwitchBoard-Corpus Automatic Speech Recognition System Linguistic Dissection of Switchboard-Corpus Automatic Speech Recognition System Flexible Transcription Alignment The CU-HTK MARCH 2000 HUB5E TRANSCRIPTION SYSTEM The {SRI} {M}arch 2000 {H}ub-5 Conversational Speech Transcription System Improving the phonetic annotation by means of prosodic phrasing The Effect of Pruning and Compression on Graphical 
Representation of the Output of a Speech Recognizer The {HTK} Book The {\emph{Aligner}} Praat, a system for doing phonetics by computer. Simple Metadata Annotation Specification Transcriber : Development and use of a tool for assisting speech corpora production The {VIC} Corpus of Conversational Speech Creation of an {A}nnotated {G}erman {B}roadcast {S}peech {D}atabase for {S}poken {D}ocument {R}etrieval On Developing New Text and Audio Corpora and Speech Recognition Tools for the Turkish Language Maximum likelihood linear transformations for HMM-based speech recognition citeseer.nj.nec.com/article/gales98maximum.html A Compact Model for Speaker-Adaptive Training {ISIP} 2000 Conversational Speech Evaluation System On automatic phonetic transcription quality: lower word error rates do not guarantee better transcriptions A RECURSIVE ALGORITHM FOR THE FORCED ALIGNMENT OF VERY LONG AUDIO SEGMENTS Adaptive sentence boundary disambiguation Sentence boundary detection in broadcast speech transcript Prosody-based automatic segmentation of speech into sentences and topics Automatic detection of sentence boundaries and disfluencies based on recognition words Prosody modeling for automatic speech recognition and understanding Prosody-based automatic detection of annoyance and frustration in human-computer dialog Dialogue act modeling for automatic tagging and recognition of conversational speech Speaking: from intention to articulation The structure of words and syllables: Evidence from errors in speech Repeating words in spontaneous speech How listeners compensate for disfluencies in spontaneous speech The effects of false starts and repetitions on the processing of subsequent words in spontaneous speech Preliminaries to A Theory of Speech Disfluencies Integrating multiple knowledge sources for detecting and correction of repairs in human-computer dialog Edit detection and parsing for transcribed speech A syntactic framework for speech repairs and other disruption Automatic 
Summarization of Spoken Dialogues in Unrestricted Domains An Introduction to Hidden {M}arkov Models Statistical Language Modeling for Speech disfluencies Speech repairs, intonational phrases and discourse markers: Modeling speakers' utterances in spoken dialogue How and when are disfluencies found When can listeners detect disfluency in spontaneous speech A corpus-based study of repair cues in spontaneous speech Analysis and automatic recognition of false starts in spontaneous speech A prosody-only decision-tree model for disfluency detection Deterministic parsing of syntactic nonfluencies Phonetic consequences of speech disfluency Juncture cues to disfluency On not recognizing disfluencies in dialog Disfluency Rates in Conversation: Effects of Age, Relationship, Topic, Role and Gender To errr is human: ecology and acoustics of speech disfluencies Annotation of a Multichannel noisy speech corpus A Multimodal Database of Gesture and Speech Multimodal Corpora for Human-Machine Interaction Research Video-Tracking and recognition of pointing gesture using Hidden Markov Models Gesture and speech dysfluencies Gesture production during stuttered speech: Insights into the nature of gesture-speech integration National Center Sign Language and Gesture Resource http://www.bu.edu/asllrp/cslgr/ Survey of NIMM data Resources, Current and Future User Profiles, Markets and User Needs for NIMM Resources The {ICSI} {M}eeting {C}orpus The Relation between Dialogue Acts and Hot Spots in Meetings Spotting "Hot Spots" in Meetings: Human Judgements and Prosodic Cues Location based Speaker Segmentation Feature Selection for the Classification of Crosstalk in Multi-Channel Audio {M}ulti-{C}hannel Source Separation by Factorial {HMMS} Detection of Agreement vs. 
Disagreement in Meetings: Training with Unlabeled data {R}esampling Techniques for {S}entence {B}oundary {D}etection: {A} {C}ase {S}tudy in {M}achine {L}earning from {I}mbalanced {D}ata for {S}poken {L}anguage {P}rocessing Evaluating Factors Impacting the Accuracy of Forced Alignments in a Multimodal Corpus The Use of Prosody In a Combined System for Punctuation Generation and Speech Recognition Modeling dynamic prosodic variation for speaker verification Punctuation Annotation using Statistical prosody models The Use of Prosody In a Combined System for Punctuation Generation and Speech Recognition Prosody-Based Automatic Segmentation of Speech into Sentences and Topics Automatic Linguistic segmentation of conversational speech Class-Based n-gram Models of Natural Language {TnT}-a Statistical part-of-speech Tagger Broadcast news Segmentation Using MDE and STT Information to Improve Speech Recognition {MDE Research at ICSI+SRI+UW, NIST RT-03F Workshop} Resampling Techniques for Sentence Boundary Detection: A Case Study in Machine Learning from Imbalanced Data for Spoken Language Processing Disfluency Annotation Stylebook for the Switchboard Corpus Direct Modeling of Prosody: An Overview of Applications in Automatic Speech Processing Automatic Analysis of Multimodal Group Actions in Meetings Clustering and segmenting speakers and their locations in meetings Recognition of meeting action using information obtained from different modalities Towards automatic addressee identification in multi-party dialogues Head orientation and gaze direction in meetings A multimodal database framework for multimedia meeting annotations The {ISL} Meeting Corpus: The impact of Meeting Type on Speech Type Microphone array speech recognition: Experiments on overlapping speech in meetings Modeling human interaction in meetings Finding presentations in recorded meetings using audio and video features Towards a multimodal meeting record GAZE Groupware System: Mediating joint attention in 
multiparty communication and collaboration Progress in automatic meeting transcription Can Prosody Aid the Automatic Processing of Multi-Party Meetings: Evidence from Predicting Punctuation, Disfluencies, and Overlapping Speech Multimodal people {ID} for a multimedia meeting browser Meetings about meetings: Research at {ICSI} on speech in multiparty conversations Audio-visual speaker tracking with importance particle filters The {ISL} Meeting Room System Detecting Emotions in Speech Prosody-based Automatic Detection of Annoyance and Frustration in Human-Computer Dialog Location based speaker segmentation On automatic annotation of meeting databases Tracking Focus of Attention in Meetings Audio information access from meeting rooms Automatic Punctuation and Disfluency Detection in Multi-Party Meetings Using Prosodic and Lexical Cues Dynamic Bayesian networks for meeting structuring Observations on Overlap: Findings and Implications for Automatic Processing of Multi-Party Conversation The {ICSI} meeting corpus Automatically generated Prosodic Cues to Lexically Ambiguous Dialog Acts in Multiparty Meetings Lecture and Presentation Tracking in an intelligent Meeting Room The Meeting Project at {ICSI} Double the Trouble: Handling Noise and Reverberation in Far-Field Automatic Speech Recognition