• Contact

  • Newsletter

  • About us

  • Delivery options

  • Prospero Book Market Podcast

  • News

  • 0
    Man-Machine Speech Communication: 17th National Conference, NCMMSC 2022, Hefei, China, December 15?18, 2022, Proceedings

    Man-Machine Speech Communication by Zhenhua, Ling; Jianqing, Gao; Kai, Yu;

    17th National Conference, NCMMSC 2022, Hefei, China, December 15?18, 2022, Proceedings

    Series: Communications in Computer and Information Science; 1765;

      • GET 20% OFF

      • The discount is only available for 'Alert of Favourite Topics' newsletter recipients.
      • Publisher's listprice EUR 85.59
      • The price is estimated because at the time of ordering we do not know what conversion rates will apply to HUF / product currency when the book arrives. In case HUF is weaker, the price increases slightly, in case HUF is stronger, the price goes lower slightly.

        36 307 Ft (34 578 Ft + 5% VAT)
      • Discount 20% (cc. 7 261 Ft off)
      • Discounted price 29 046 Ft (27 662 Ft + 5% VAT)

    36 307 Ft

    db

    Availability

    Estimated delivery time: In stock at the publisher, but not at Prospero's office. Delivery time approx. 3-5 weeks.
    Not in stock at Prospero.

    Why don't you give exact delivery time?

    Delivery time is estimated on our previous experiences. We give estimations only, because we order from outside Hungary, and the delivery time mainly depends on how quickly the publisher supplies the book. Faster or slower deliveries both happen, but we do our best to supply as quickly as possible.

    Product details:

    • Edition number 1st ed. 2023
    • Publisher Springer
    • Date of Publication 11 May 2023
    • Number of Volumes 1 pieces, Book

    • ISBN 9789819924004
    • Binding Paperback
    • No. of pages332 pages
    • Size 235x155 mm
    • Weight 528 g
    • Language English
    • Illustrations 5 Illustrations, black & white; 86 Illustrations, color
    • 502

    Categories

    Long description:

    This book constitutes the refereed proceedings of the 17th National Conference on Man?Machine Speech Communication, NCMMSC 2022, held in China, in December 2022.


    The 21 full papers and 7 short papers included in this book were carefully reviewed and selected from 108 submissions. They were organized in topical sections as follows: MCPN: A Multiple Cross-Perception Network for Real-Time Emotion Recognition in Conversation.- Baby Cry Recognition Based on Acoustic Segment Model, MnTTS2 An Open-Source Multi-Speaker Mongolian Text-to-Speech Synthesis Dataset.

    More

    Table of Contents:

    MCPN: A Multiple Cross-Perception Network for Real-Time Emotion Recognition in Conversation.- Baby Cry Recognition Based on Acoustic Segment Model.- A Multi-feature Sets Fusion Strategy with Similar Samples Removal for Snore Sound Classification.- Multi-Hypergraph Neural Networks for Emotion Recognition in Multi-Party Conversations.- Using Emoji as an Emotion Modality in Text-Based Depression Detection.- Source-Filter-Based Generative Adversarial Neural Vocoder for High Fidelity Speech Synthesis.- Semantic enhancement framework for robust speech recognition.- Achieving Timestamp Prediction While Recognizing with Non-Autoregressive End-to-End ASR Model.- Predictive AutoEncoders are Context-Aware Unsupervised Anomalous Sound Detectors.- A pipelined framework with serialized output training for overlapping speech recognition.- Adversarial Training Based on Meta-Learning in Unseen Domains for Speaker Verification.- Multi-Speaker Multi-Style Speech Synthesis with Timbre and Style Disentanglement.- Multiple Confidence Gates for Joint Training of SE and ASR.- Detecting Escalation Level from Speech with Transfer Learning and Acoustic-Linguistic Information Fusion.- Pre-training Techniques For Improving Text-to-Speech Synthesis By Automatic Speech Recognition Based Data Enhancement.- A Time-Frequency Attention Mechanism with Subsidiary Information for Effective Speech Emotion Recognition.- Interplay between prosody and syntax-semantics: Evidence from the prosodic features of Mandarin tag questions.- Improving Fine-grained Emotion Control and Transfer with Gated Emotion Representations in Speech Synthesis.- Violence Detection through Fusing Visual Information to Auditory Scene.- Mongolian Text-to-Speech Challenge under Low-Resource Scenario for NCMMSC2022.- VC-AUG  Voice Conversion based Data Augmentation for Text-Dependent Speaker Veri?cation.- Transformer-based potential emotional relation mining network for emotion recognition in conversation.- FastFoley Non-Autoregressive Foley Sound Generation Based On Visual Semantics.- Structured Hierarchical Dialogue Policy with Graph Neural Networks.- Deep Reinforcement Learning for On-line Dialogue State Tracking.- Dual Learning for Dialogue State Tracking.- Automatic Stress Annotation and Prediction For Expressive Mandarin TTS.- MnTTS2 An Open-Source Multi-Speaker Mongolian Text-to-Speech Synthesis Dataset.

    More
    Recently viewed
    previous
    Man-Machine Speech Communication: 17th National Conference, NCMMSC 2022, Hefei, China, December 15?18, 2022, Proceedings

    Man-Machine Speech Communication: 17th National Conference, NCMMSC 2022, Hefei, China, December 15?18, 2022, Proceedings

    Zhenhua, Ling; Jianqing, Gao; Kai, Yu;(ed.)

    36 307 HUF

    next