Computer Vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022, Proceedings, Part XXXVII (Lecture Notes in Computer Science)

個数:

Computer Vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022, Proceedings, Part XXXVII (Lecture Notes in Computer Science)

  • 提携先の海外書籍取次会社に在庫がございます。通常3週間で発送いたします。
    重要ご説明事項
    1. 納期遅延や、ご入手不能となる場合が若干ございます。
    2. 複数冊ご注文の場合は、ご注文数量が揃ってからまとめて発送いたします。
    3. 美品のご指定は承りかねます。

    ●3Dセキュア導入とクレジットカードによるお支払いについて
  • 【入荷遅延について】
    世界情勢の影響により、海外からお取り寄せとなる洋書・洋古書の入荷が、表示している標準的な納期よりも遅延する場合がございます。
    おそれいりますが、あらかじめご了承くださいますようお願い申し上げます。
  • ◆画像の表紙や帯等は実物とは異なる場合があります。
  • ◆ウェブストアでの洋書販売価格は、弊社店舗等での販売価格とは異なります。
    また、洋書販売価格は、ご注文確定時点での日本円価格となります。
    ご注文確定後に、同じ洋書の販売価格が変動しても、それは反映されません。
  • 製本 Paperback:紙装版/ペーパーバック版/ページ数 753 p.
  • 商品コード 9783031198359

Full Description

The 39-volume set, comprising the LNCS books 13661 until 13699, constitutes the refereed proceedings of the 17th European Conference on Computer Vision, ECCV 2022, held in Tel Aviv, Israel, during October 23-27, 2022.

 

The 1645 papers presented in these proceedings were carefully reviewed and selected from a total of 5804 submissions. The papers deal with topics such as computer vision; machine learning; deep neural networks; reinforcement learning; object recognition; image classification; image processing; object detection; semantic segmentation; human pose estimation; 3d reconstruction; stereo vision; computational photography; neural networks; image coding; image reconstruction; object recognition; motion estimation.

Contents

Most and Least Retrievable Images in Visual-Language Query Systems.- Sports Video Analysis on Large-Scale Data.- Grounding Visual Representations with Texts for Domain Generalization.- Bridging the Visual Semantic Gap in VLN via Semantically Richer Instructions.- StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story Continuation.- VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language Guidance.- Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation.- End-to-End Active Speaker Detection.- Emotion Recognition for Multiple Context Awareness.- Adaptive Fine-Grained Sketch-Based Image Retrieval.- Quantized GAN for Complex Music Generation from Dance Videos.- Uncertainty-Aware Multi-modal Learning via Cross-Modal Random Network Prediction.- Localizing Visual Sounds the Easy Way.- Learning Visual Styles from Audio-Visual Associations.- Remote Respiration Monitoring of Moving Person Using Radio Signals.- Camera Pose Estimation and Localization with Active Audio Sensing.- PACS: A Dataset for Physical Audiovisual Commonsense Reasoning.- VoViT: Low Latency Graph-Based Audio-Visual Voice Separation Transformer.- Telepresence Video Quality Assessment.- MultiMAE: Multi-modal Multi-task Masked Autoencoders.- AudioScopeV2: Audio-Visual Attention Architectures for Calibrated Open-Domain On-Screen Sound Separation.- Audio—Visual Segmentation.- Unsupervised Night Image Enhancement: When Layer Decomposition Meets Light-Effects Suppression.- Relationformer: A Unified Framework for Image-to-Graph Generation.- GAMa: Cross-view Video Geo-localization.- Revisiting a kNN-based Image Classification System with High-capacity Storage.- Geometric Representation Learning for Document Image Rectification.- S2-VER: Semi-Supervised Visual Emotion Recognition.- Image Coding for Machines with Omnipotent Feature Learning.- Feature Representation Learning for Unsupervised Cross-Domain Image Retrieval.- Fashionformer: A Simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition.- Semantic-Guided Multi-Mask Image Harmonization.- Learning an Isometric Surface Parameterization for Texture Unwrapping.- Towards Regression-Free Neural Networks for Diverse Compute Platforms.- Relationship Spatialization for Depth Estimation.- Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models.- FAR: Fourier Aerial Video Recognition.- Translating a Visual LEGO Manual to a Machine-Executable Plan.- Fabric Material Recovery from Video Using Multi-Scale Geometric Auto-Encoder.- MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment.- The One Where They Reconstructed 3D Humans and Environments in TV Shows.

最近チェックした商品