• Contact

  • Newsletter

  • About us

  • Delivery options

  • Prospero Book Market Podcast

  • Computer Vision: Cognitive Models for Visual Commonsense

    Computer Vision by Zhu, Yixin; Zhu, Song-Chun;

    Cognitive Models for Visual Commonsense

      • GET 20% OFF

      • The discount is only available for 'Alert of Favourite Topics' newsletter recipients.
      • Publisher's listprice EUR 96.29
      • The price is estimated because at the time of ordering we do not know what conversion rates will apply to HUF / product currency when the book arrives. In case HUF is weaker, the price increases slightly, in case HUF is stronger, the price goes lower slightly.

        39 936 Ft (38 034 Ft + 5% VAT)
      • Discount 20% (cc. 7 987 Ft off)
      • Discounted price 31 949 Ft (30 427 Ft + 5% VAT)

    39 936 Ft

    db

    Availability

    Not yet published.

    Why don't you give exact delivery time?

    Delivery time is estimated on our previous experiences. We give estimations only, because we order from outside Hungary, and the delivery time mainly depends on how quickly the publisher supplies the book. Faster or slower deliveries both happen, but we do our best to supply as quickly as possible.

    Long description:

    This volume on visual commonsense reasoning, part of a comprehensive three-volume series, presents a computational framework for bridging the gap between modern computer vision capabilities and human-like visual understanding. While current AI systems excel at pattern recognition tasks, they often lack the sophisticated reasoning capabilities that humans demonstrate effortlessly in understanding and interacting with their environment. This work addresses this limitation by integrating physical, social, and abstract reasoning within a unified computational framework.

    The volume is organized into three parts. The first part establishes the theoretical foundations of visual commonsense through a systematic examination of physical understanding, including affordances, intuitive physics, causality, and tool use. These components form the basis for understanding how objects and environments behave and interact. The second part delves into social reasoning aspects, exploring intent, theory of mind, and nonverbal communication - crucial capabilities for AI systems to interpret and predict human behavior. The third part investigates abstract visual reasoning, examining higher-level cognitive capabilities.

    Drawing from cognitive science, computer vision, and artificial intelligence, this work:

    • Provides a systematic treatment of visual commonsense ranging from foundational theories to practical implementations
    • Introduces computational frameworks integrating multiple forms of reasoning
    • Demonstrates applications through extensive examples and case studies
    • Highlights current challenges and future directions in developing human-like visual AI

    This carefully crafted volume serves as an invaluable resource for researchers, graduate students, and practitioners in computer vision, artificial intelligence, cognitive science, and related fields. It offers both theoretical insights and practical guidance for developing AI systems with more sophisticated visual understanding capabilities, moving closer to human-like visual intelligence.

    More

    Table of Contents:

    Introduction.- Affordance and Functionality.- Physical Commonsense Reasoning.- Causality in Daily Activities.- Tool-use.- Mirroring and Immitation.- Utility.- Nonverbal Communication: Gaze, Pointing and Drawing.- Intention.- Animacy: Physical vs. Social Perception.- Theory of Mind Representations.- Explainable AI.- Communicative Learning.- Abstract Reasoning.- The Current State and Challenges.

    More