Normal view MARC view ISBD view

Machine Learning for Multimodal Interaction [electronic resource] : 4th International Workshop, MLMI 2007, Brno, Czech Republic, June 28-30, 2007, Revised Selected Papers / edited by Andrei Popescu-Belis, Steve Renals, Hervé Bourlard.

Contributor(s): Popescu-Belis, Andrei [editor.] | Renals, Steve [editor.] | Bourlard, Hervé [editor.] | SpringerLink (Online service).
Material type: materialTypeLabelBookSeries: Information Systems and Applications, incl. Internet/Web, and HCI: 4892Publisher: Berlin, Heidelberg : Springer Berlin Heidelberg : Imprint: Springer, 2008Edition: 1st ed. 2008.Description: XI, 308 p. online resource.Content type: text Media type: computer Carrier type: online resourceISBN: 9783540781554.Subject(s): Artificial intelligence | Signal processing | User interfaces (Computer systems) | Human-computer interaction | Natural language processing (Computer science) | Computers and civilization | Computer vision | Artificial Intelligence | Signal, Speech and Image Processing | User Interfaces and Human Computer Interaction | Natural Language Processing (NLP) | Computers and Society | Computer VisionAdditional physical formats: Printed edition:: No title; Printed edition:: No titleDDC classification: 006.3 Online resources: Click here to access online
Contents:
Invited Paper -- Robust Real Time Face Tracking for the Analysis of Human Behaviour -- Multimodal Processing -- Conditional Sequence Model for Context-Based Recognition of Gaze Aversion -- Meeting State Recognition from Visual and Aural Labels -- Object Category Recognition Using Probabilistic Fusion of Speech and Image Classifiers -- HCI, User Studies and Applications -- Automatic Annotation of Dialogue Structure from Simple User Interaction -- Interactive Pattern Recognition -- User Specific Training of a Music Search Engine -- An Ego-Centric and Tangible Approach to Meeting Indexing and Browsing -- Integrating Semantics into Multimodal Interaction Patterns -- Towards an Objective Test for Meeting Browsers: The BET4TQB Pilot Experiment -- Image and Video Processing -- Face Recognition in Smart Rooms -- Gaussian Process Latent Variable Models for Human Pose Estimation -- Discourse and Dialogue Processing -- Automatic Labeling Inconsistencies Detection and Correction for Sentence Unit Segmentation in Conversational Speech -- Term-Weighting for Summarization of Multi-party Spoken Dialogues -- Automatic Decision Detection in Meeting Speech -- Czech Text-to-Sign Speech Synthesizer -- Speech and Audio Processing -- Using Prosodic Features in Language Models for Meetings -- Posterior-Based Features and Distances in Template Matching for Speech Recognition -- A Study of Phoneme and Grapheme Based Context-Dependent ASR Systems -- Transfer Learning for Tandem ASR Feature Extraction -- Spoken Term Detection System Based on Combination of LVCSR and Phonetic Search -- Frequency Domain Linear Prediction for QMF Sub-bands and Applications to Audio Coding -- Modeling Vocal Interaction for Segmentation in Meeting Recognition -- Binaural Speech Separation Using Recurrent Timing Neural Networks for Joint F0-Localisation Estimation -- PASCAL Speech Separation Challenge II -- To Separate Speech -- Microphone Array Beamforming Approach to Blind Speech Separation.
In: Springer Nature eBook
    average rating: 0.0 (0 votes)
No physical items for this record

Invited Paper -- Robust Real Time Face Tracking for the Analysis of Human Behaviour -- Multimodal Processing -- Conditional Sequence Model for Context-Based Recognition of Gaze Aversion -- Meeting State Recognition from Visual and Aural Labels -- Object Category Recognition Using Probabilistic Fusion of Speech and Image Classifiers -- HCI, User Studies and Applications -- Automatic Annotation of Dialogue Structure from Simple User Interaction -- Interactive Pattern Recognition -- User Specific Training of a Music Search Engine -- An Ego-Centric and Tangible Approach to Meeting Indexing and Browsing -- Integrating Semantics into Multimodal Interaction Patterns -- Towards an Objective Test for Meeting Browsers: The BET4TQB Pilot Experiment -- Image and Video Processing -- Face Recognition in Smart Rooms -- Gaussian Process Latent Variable Models for Human Pose Estimation -- Discourse and Dialogue Processing -- Automatic Labeling Inconsistencies Detection and Correction for Sentence Unit Segmentation in Conversational Speech -- Term-Weighting for Summarization of Multi-party Spoken Dialogues -- Automatic Decision Detection in Meeting Speech -- Czech Text-to-Sign Speech Synthesizer -- Speech and Audio Processing -- Using Prosodic Features in Language Models for Meetings -- Posterior-Based Features and Distances in Template Matching for Speech Recognition -- A Study of Phoneme and Grapheme Based Context-Dependent ASR Systems -- Transfer Learning for Tandem ASR Feature Extraction -- Spoken Term Detection System Based on Combination of LVCSR and Phonetic Search -- Frequency Domain Linear Prediction for QMF Sub-bands and Applications to Audio Coding -- Modeling Vocal Interaction for Segmentation in Meeting Recognition -- Binaural Speech Separation Using Recurrent Timing Neural Networks for Joint F0-Localisation Estimation -- PASCAL Speech Separation Challenge II -- To Separate Speech -- Microphone Array Beamforming Approach to Blind Speech Separation.

There are no comments for this item.

Log in to your account to post a comment.