Multimodal Interaction in Image and Video Applications (Intelligent Systems Reference Library, 48, Band 48) - Hardcover

Sappa, Angel D.; Vitrià, Jordi

 
9783642359316: Multimodal Interaction in Image and Video Applications (Intelligent Systems Reference Library, 48, Band 48)

Inhaltsangabe

Traditional Pattern Recognition (PR) and Computer Vision (CV) technologies have mainly focused on full automation, even though full automation often proves elusive or unnatural in many applications, where the technology is expected to assist rather than replace the human agents. However, not all the problems can be automatically solved being the human interaction the only way to tackle those applications.

Recently, multimodal human interaction has become an important field of increasing interest in the research community. Advanced man-machine interfaces with high cognitive capabilities are a hot research topic that aims at solving challenging problems in image and video applications. Actually, the idea of computer interactive systems was already proposed on the early stages of computer science. Nowadays, the ubiquity of image sensors together with the ever-increasing computing performance has open new and challenging opportunities for research in multimodal human interaction.

This book aims to show how existing PR and CV technologies can naturally evolve using this new paradigm. The chapters of this book show different successful case studies of multimodal interactive technologies for both image and video applications. They cover a wide spectrum of applications, ranging from interactive handwriting transcriptions to human-robot interactions in real environments.

Die Inhaltsangabe kann sich auf eine andere Ausgabe dieses Titels beziehen.

Über die Autorin bzw. den Autor

Michael Teutsch received his diploma degree in computer science and his Ph.D. degree from the Karlsruhe Institute of Technology (KIT) in 2009 and 2014, respectively. From 2009-2016 he worked as a research scientist and a postdoc at the Fraunhofer IOSB, Karlsruhe, Germany. Since 2016, he has been with Hensoldt Optronics, Oberkochen, Germany. His research interests include computer vision, visual surveillance, object detection, object tracking, and machine learning. Michael has been organizing and co-chairing the annual IEEE International Workshop on Perception Beyond the Visible Spectrum (PBVS) in conjunction with the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) since 2018. He is active as lecturer in computer vision currently at the Baden-Wuerttemberg Cooperative State University (DHBW ) Heidenheim, Germany. Michael serves as reviewer for several journals and conferences such as IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), or IEEE Transactions on Geoscience and Remote Sensing (TGRS). He has authored or co-authored more than 30 scientific publications.

Angel D. Sappa received his Electro Mechanical Engineering degree (1995) from National University of La Pampa, Argentina, and his Ph.D. degree in Industrial Engineering (1999) from Polytechnic University of Catalonia, Barcelona, Spain. In 2003, after research positions in France (LAAS-CNRS), the UK (UK Advanced Robotics), and Greece (ITI-CERTH), he joined the Computer Vision Center, Barcelona, Spain, where he currently holds a Senior Scientist position. Since 2016 he has been a full professor at the ESPOL Polytechnic University, Guayaquil, Ecuador, where he leads the computer vision team at CIDIS research center; he is the director of the Electrical Engineering Ph.D. program. His research interests include crossspectral image processing and representation; 3D data acquisition, processing, and modeling; and computer vision applications. He published about 200 papers in international journals and conference proceedings and served as program committee member in several international conferences. He has been involved in several national, regional, and international research projects and several technological transfer projects; he has been the cofounder of VINTRA Inc. (San Francisco, USA) and Crowdmobile S.L. (Barcelona, Spain). He is a Senior Member of the Institute of Electrical and Electronics Engineers (IEEE).
Riad I. Hammoud received an M.S. degree in Controls of Systems and a Ph.D. in Computer Vision and Robotics from UTC and INRIA (France) late 1997 and early 2001, respectively. He did his postdoc at Indiana University in 2002. Since early 2003, he has been working on several projects involving infrared imaging for defense, automotive, and robotics applications. Early 2019, he joined TuSimple to develop autonomous driving systems. From 2012-2019, he worked at BAE Systems (Boston, MA, USA), on DARPA, AFRL, and other U.S. government agencies' advanced research projects as principal investigator (PI), team lead, and research scientist. Before joining BAE Systems, Riad was at Tobii-Dynavox (Pittsburgh, PA, USA) and Delphi Automotive Systems (Kokomo, IN, USA) working on Assistive Technologies and Active Safety Systems. He joined Seth Teller's team at MIT as a collaborating Researcher to work on the DARPA Robotics Challenge (2012-2015). Dr. Riad Hammoud served as guest editor of several special issues of top journals in computer vision including CVIU and IJCV. He authored several edited book including the Springer book on Augmented Vision Perception in Infrared. Since 2004, he has been organizing and chairing a workshop series in conjunction with the IEEE CVPR on Perception Beyond the Visible Spectrum (PBVS). He also serves as the general chair of SPIE Automatic Target Recognition conference (2018-2021).

Von der hinteren Coverseite

Traditional Pattern Recognition (PR) and Computer Vision (CV) technologies have mainly focused on full automation, even though full automation often proves elusive or unnatural in many applications, where the technology is expected to assist rather than replace the human agents. However, not all the problems can be automatically solved being the human interaction the only way to tackle those applications.

Recently, multimodal human interaction has become an important field of increasing interest in the research community. Advanced man-machine interfaces with high cognitive capabilities are a hot research topic that aims at solving challenging problems in image and video applications. Actually, the idea of computer interactive systems was already proposed on the early stages of computer science. Nowadays, the ubiquity of image sensors together with the ever-increasing computing performance has open new and challenging opportunities for research in multimodal human interaction.

This book aims to show how existing PR and CV technologies can naturally evolve using this new paradigm. The chapters of this book show different successful case studies of multimodal interactive technologies for both image and video applications. They cover a wide spectrum of applications, ranging from interactive handwriting transcriptions to human-robot interactions in real environments.

„Über diesen Titel“ kann sich auf eine andere Ausgabe dieses Titels beziehen.

Weitere beliebte Ausgaben desselben Titels

9783642439834: Multimodal Interaction in Image and Video Applications (Intelligent Systems Reference Library, Band 48)

Vorgestellte Ausgabe

ISBN 10:  3642439837 ISBN 13:  9783642439834
Verlag: Springer, 2015
Softcover