Covariances in Computer Vision and Machine Learning
Springer International Publishing (Verlag)
978-3-031-00692-0 (ISBN)
Covariance matrices play important roles in many areas of mathematics, statistics, and machine learning, as well as their applications. In computer vision and image processing, they give rise to a powerful data representation, namely the covariance descriptor, with numerous practical applications.
In this book, we begin by presenting an overview of the {it finite-dimensional covariance matrix} representation approach of images, along with its statistical interpretation. In particular, we discuss the various distances and divergences that arise from the intrinsic geometrical structures of the set of Symmetric Positive Definite (SPD) matrices, namely Riemannian manifold and convex cone structures. Computationally, we focus on kernel methods on covariance matrices, especially using the Log-Euclidean distance.
We then show some of the latest developments in the generalization of the finite-dimensional covariance matrix representation to the {it infinite-dimensional covariance operator} representation via positive definite kernels. We present the generalization of the affine-invariant Riemannian metric and the Log-Hilbert-Schmidt metric, which generalizes the Log-Euclidean distance. Computationally, we focus on kernel methods on covariance operators, especially using the Log-Hilbert-Schmidt distance. Specifically, we present a two-layer kernel machine, using the Log-Hilbert-Schmidt distance and its finite-dimensional approximation, which reduces the computational complexity of the exact formulation while largely preserving its capability. Theoretical analysis shows that, mathematically, the approximate Log-Hilbert-Schmidt distance should be preferred over the approximate Log-Hilbert-Schmidt inner product and, computationally, it should be preferred over the approximate affine-invariant Riemannian distance.
Numerical experiments on image classification demonstrate significant improvements of the infinite-dimensional formulation over the finite-dimensional counterpart. Given the numerous applications of covariance matrices in many areas of mathematics, statistics, and machine learning, just to name a few, we expect that the infinite-dimensional covariance operator formulation presented here will have many more applications beyond those in computer vision.
Ha Quang Minh received the Ph.D. degree in mathematics from Brown University, Providence, RI, USA, in May 2006, under the supervision of Steve Smale. He is currently a Researcher in the Department of Pattern Analysis and Computer Vision (PAVIS) with the Istituto Italiano di Tecnologia (IIT), Genova, Italy. Prior to joining IIT, he held research positions at the University of Chicago, the University of Vienna, Austria, and Humboldt University of Berlin, Germany. He was also a Junior Research Fellow at the Erwin Schrodinger International Institute for Mathematical Physics in Vienna and a Fellow at the Institute for Pure and Applied Mathematics (IPAM) at the University of California, Los Angeles (UCLA). His current research interests include applied and computational functional analysis, applied and computational di erential geometry, machine learning, computer vision, and image and signal processing. His recent research contributions include the infinite-dimensional Log-Hilbert-Schmidt metric and Log-Determinant divergences between positive definite operators, along with their applications in machine learning and computer vision in the setting of kernel methods. He received the Microsoft Best Paper Award at the Conference on Uncertainty in Artificial Intelligence (UAI) in 2013 and the IBM Pat Goldberg Memorial Best Paper Award in Computer Science, Electrical Engineering, and Mathematics in 2013.Vittorio Murino is full professor and head of the Pattern Analysis and Computer Vision (PAVIS) department at the Istituto Italiano di Tecnologia (IIT), Genoa, Italy. He received the Ph.D. in Electronic Engineering and Computer Science in 1993 at the University of Genoa, Italy. Then, he was first at the University of Udine and, since 1998, at the University of Verona, where he was chairman of the Department of Computer Science from 2001 to 2007. His research interests are in computer vision and machine learning, in particular, probabilistic techniques for image and video analysis with applications to video surveillance, biomedical image analysis and bio-informatics. He is currently a member of the editorial board of Computer Vision and Image Understanding, Pattern Analysis and Applications, and Machine Vision & Applications journals. He was also associate editor of Pattern Recognition and of the IEEE Transactions on Systems, Man, and Cybernetics until 2016. Finally, he is a Senior Member of the IEEE since 2002 and IAPR Fellow since 2006.
Acknowledgments.- Introduction.- Data Representation by Covariance Matrices.- Geometry of SPD Matrices.- Kernel Methods on Covariance Matrices.- Data Representation by Covariance Operators.- Geometry of Covariance Operators.- Kernel Methods on Covariance Operators.- Conclusion and Future Outlook.- Bibliography.- Authors' Biographies.
Erscheinungsdatum | 06.06.2022 |
---|---|
Reihe/Serie | Synthesis Lectures on Computer Vision |
Zusatzinfo | XIII, 156 p. |
Verlagsort | Cham |
Sprache | englisch |
Maße | 191 x 235 mm |
Gewicht | 336 g |
Themenwelt | Informatik ► Grafik / Design ► Digitale Bildverarbeitung |
Informatik ► Theorie / Studium ► Künstliche Intelligenz / Robotik | |
ISBN-10 | 3-031-00692-5 / 3031006925 |
ISBN-13 | 978-3-031-00692-0 / 9783031006920 |
Zustand | Neuware |
Haben Sie eine Frage zum Produkt? |
aus dem Bereich