Computer Vision – ECCV 2024

18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part LVI

Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol (Herausgeber)

Buch | Softcover

LXXXV, 499 Seiten

2024
Springer International Publishing (Verlag)
978-3-031-72991-1 (ISBN)

Artikel merken

The multi-volume set of LNCS books with volume numbers 15059 up to 15147 constitutes the refereed proceedings of the 18th European Conference on Computer Vision, ECCV 2024, held in Milan, Italy, during September 29-October 4, 2024.

The 2387 papers presented in these proceedings were carefully reviewed and selected from a total of 8585 submissions. They deal with topics such as computer vision; machine learning; deep neural networks; reinforcement learning; object recognition; image classification; image processing; object detection; semantic segmentation; human pose estimation; 3d reconstruction; stereo vision; computational photography; neural networks; image coding; image reconstruction; motion estimation.

HowToCaption: Prompting LLMs to Transform Video Annotations at Scale.- LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection.- Beyond the Data Imbalance: Employing the Heterogeneous Datasets for Vehicle Maneuver Prediction.- On Pretraining Data Diversity for Self-Supervised Learning.- Look Around and Learn: Self-Training Object Detection by Exploration.- Bayesian Self-Training for Semi-Supervised 3D Segmentation.- Motion and Structure from Event-based Normal Flow.- ParCo: Part-Coordinating Text-to-Motion Synthesis.- Learning to Complement and to Defer to Multiple Users.- Tiny Models are the Computational Saver for Large Models.- DragVideo: Interactive Drag-style Video Editing.- Multi-Sentence Grounding for Long-term Instructional Video.- Do Generalised Classifiers really work on Human Drawn Sketches?.- KMTalk: Speech-Driven 3D Facial Animation with Key Motion Embedding.- Head360: Learning a Parametric 3D Full-Head for Free-View Synthesis in 360°.- MotionDirector: Motion Customization of Text-to-Video Diffusion Models.- Text2LiDAR: Text-guided LiDAR Point Clouds Generation via Equirectangular Transformer.- Enhanced Motion Forecasting with Visual Relation Reasoning.- Rate-Distortion-Cognition Controllable Versatile Neural Image Compression.- Temporal As a Plugin: Unsupervised Video Denoising with Pre-Trained Image Denoisers.- LiDAR-based All-weather 3D Object Detection via Prompting and Distilling 4D Radar.- MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models.- Post-training Quantization with Progressive Calibration and Activation Relaxing for Text-to-Image Diffusion Models.- Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer.- Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors.- Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation.- StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion.

Erscheinungsdatum	31.10.2024
Reihe/Serie	Lecture Notes in Computer Science
Zusatzinfo	LXXXV, 499 p. 181 illus., 177 illus. in color.
Verlagsort	Cham
Sprache	englisch
Maße	155 x 235 mm
Themenwelt	Informatik ► Grafik / Design ► Digitale Bildverarbeitung
	Mathematik / Informatik ► Informatik ► Netzwerke
	Technik ► Elektrotechnik / Energietechnik
Schlagworte	Artificial Intelligence • Computer Networks • Computer systems • computer vision • Education • Human-Computer Interaction (HCI) • Image Analysis • image coding • Image Processing • image reconstruction • Image Segmentation • learning • machine learning • Object recognition • pattern recognition • reconstruction • Signal Processing • Software engineering
ISBN-10	3-031-72991-9 / 3031729919
ISBN-13	978-3-031-72991-1 / 9783031729911
Zustand	Neuware