The volume addresses issues concerning prosody generation in speech synthesis, including prosody modeling, how we can convey para- and non-linguistic information in speech synthesis, and prosody control in speech synthesis (including prosody conversions). A high level of quality has already been achieved in speech synthesis by using selection-based methods with segments of human speech. Although the method enables synthetic speech with various voice qualities and speaking styles, it requires large speech corpora with targeted quality and style.
Accordingly, speech conversion techniques are now of growing interest among researchers. HMM/GMM-based methods are widely used, but entail several major problems when viewed from the prosody perspective; prosodic features cover a wider time span than segmental features and their frame-by-frame processing is not always appropriate. The book offers a good overview of state-of-the-art studies on prosody in speech synthesis.
Die Inhaltsangabe kann sich auf eine andere Ausgabe dieses Titels beziehen.
Professor Keikichi Hirose received the B. E. degree in electrical engineering in 1972, and the M. E. and Ph. D. degrees in electronic engineering respectively in 1974 and 1977 from the University of Tokyo. From 1977, he is a faculty member at the University of Tokyo, and was a Professor of the Department of Electronic Engineering from 1994. Currently he is professor at the Department of Information and Communication Engineering, Graduate School of Information Science and Technology, University of Tokyo. From March 1987 to January 1988, he was Visiting Scientist at the Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge, U.S.A. He has been engaged in a wide range of research on spoken language processing, including analysis, synthesis, recognition, dialogue systems, and computer-assisted language learning. From 2000 to 2004, he was Principal Investigator of the national project “Realization of advanced spoken language information processing utilizing prosodic features,” supported by the Japanese Government. He served as Chair of Speech Committee, Institute of Electronics, Information and Communication Engineers (IEICE)/Acoustical Society of Japan (ASJ) from 2003 to 2005. He is Chair of Speech Prosody Special Interest Group (SPro-SIG), ISCA, from October 2010. He has been on the editorial board of Speech Communication journal since 2004 and on the editorial board of ETRI Journal since 2009. He is a Fellow of Institute of Information and Communication Engineering and a member of a number of academic societies, including IEEE, International Speech Communication Association (Board member), Acoustical Society of America, Acoustical Society of Japan, Information Processing Society of Japan, Japanese Society for Artificial Intelligence, and Research Institute of Signal Processing Japan (Board member).
Jianhua Tao received the M.S. degree from Nanjing University in 1996 and the Ph.D. in Computer Science from TsinghuaUniversity in 2001. He is currently the professor at National Laboratory of Pattern Recognition (NLPR) of Chinese Academy of Sciences where he chairs the human computer speech interaction group. He developed quite several earliest versions of Speech systems, multimodal interaction system in China, and published more than 90 papers in IEEE Trans. on ASLP, ICASSP, Interspeech, ICME, ICPR, ICCV, ICIP, etc. He has been the main researcher and contributor of several national scientific projects supported by National Natural Science Foundation of China (NSFC), National High-Tech Program and International Cooperation Projects (863). Currently, He is one of the editorial board members of "International Journal on Computational Linguistics and Chinese Language Processing", “Journal on Multimodal User Interfaces (JMUI)”, “International Journal of Synthetic Emotions (IJSE)”, and the Steering Committee Member for the IEEE Transactions on Affective Computing. He was elected as vice-chair of ISCA Special Interesting Group of Chinese Spoken Language Processing from 2006, the executive committee member of HUMAINE association from 2007, the board member of COCOSDA from 2007, and is also the Council member of Chinese Speech Information Processing Society and the Acoustical Society of China.
The volume addresses issues concerning prosody generation in speech synthesis, including prosody modeling, how we can convey para- and non-linguistic information in speech synthesis, and prosody control in speech synthesis (including prosody conversions). A high level of quality has already been achieved in speech synthesis by using selection-based methods with segments of human speech. Although the method enables synthetic speech with various voice qualities and speaking styles, it requires large speech corpora with targeted quality and style.
Accordingly, speech conversion techniques are now of growing interest among researchers. HMM/GMM-based methods are widely used, but entail several major problems when viewed from the prosody perspective; prosodic features cover a wider time span than segmental features and their frame-by-frame processing is not always appropriate. The book offers a good overview of state-of-the-art studies on prosody in speech synthesis.
„Über diesen Titel“ kann sich auf eine andere Ausgabe dieses Titels beziehen.
Anbieter: Brook Bookstore On Demand, Napoli, NA, Italien
Zustand: new. Questo è un articolo print on demand. Bestandsnummer des Verkäufers 3b17d2cd5c218e29f7abb1c09ccb563d
Anzahl: Mehr als 20 verfügbar
Anbieter: Ria Christie Collections, Uxbridge, Vereinigtes Königreich
Zustand: New. In. Bestandsnummer des Verkäufers ria9783662452578_new
Anzahl: Mehr als 20 verfügbar
Anbieter: GreatBookPricesUK, Woodford Green, Vereinigtes Königreich
Zustand: New. Bestandsnummer des Verkäufers 21993871-n
Anzahl: Mehr als 20 verfügbar
Anbieter: GreatBookPrices, Columbia, MD, USA
Zustand: New. Bestandsnummer des Verkäufers 21993871-n
Anzahl: 15 verfügbar
Anbieter: moluna, Greven, Deutschland
Gebunden. Zustand: New. Bestandsnummer des Verkäufers 20937850
Anzahl: Mehr als 20 verfügbar
Anbieter: Books Puddle, New York, NY, USA
Zustand: New. 213. Bestandsnummer des Verkäufers 26372225649
Anzahl: 4 verfügbar
Anbieter: buchversandmimpf2000, Emtmannsberg, BAYE, Deutschland
Buch. Zustand: Neu. This item is printed on demand - Print on Demand Titel. Neuware -The volume addresses issues concerning prosody generation in speech synthesis, including prosody modeling, how we can convey para- and non-linguistic information in speech synthesis, and prosody control in speech synthesis (including prosody conversions). A high level of quality has already been achieved in speech synthesis by using selection-based methods with segments of human speech. Although the method enables synthetic speech with various voice qualities and speaking styles, it requires large speech corpora with targeted quality and style.Accordingly, speech conversion techniques are now of growing interest among researchers. HMM/GMM-based methods are widely used, but entail several major problems when viewed from the prosody perspective; prosodic features cover a wider time span than segmental features and their frame-by-frame processing is not always appropriate. The book offers a good overview of state-of-the-art studies on prosody in speech synthesis.Springer-Verlag KG, Sachsenplatz 4-6, 1201 Wien 224 pp. Englisch. Bestandsnummer des Verkäufers 9783662452578
Anzahl: 1 verfügbar
Anbieter: Majestic Books, Hounslow, Vereinigtes Königreich
Zustand: New. Print on Demand 213. Bestandsnummer des Verkäufers 374868398
Anzahl: 4 verfügbar
Anbieter: Biblios, Frankfurt am main, HESSE, Deutschland
Zustand: New. PRINT ON DEMAND 213. Bestandsnummer des Verkäufers 18372225659
Anzahl: 4 verfügbar
Anbieter: AHA-BUCH GmbH, Einbeck, Deutschland
Buch. Zustand: Neu. Druck auf Anfrage Neuware - Printed after ordering - The volume addresses issues concerning prosody generation in speech synthesis, including prosody modeling, how we can convey para- and non-linguistic information in speech synthesis, and prosody control in speech synthesis (including prosody conversions). A high level of quality has already been achieved in speech synthesis by using selection-based methods with segments of human speech. Although the method enables synthetic speech with various voice qualities and speaking styles, it requires large speech corpora with targeted quality and style.Accordingly, speech conversion techniques are now of growing interest among researchers. HMM/GMM-based methods are widely used, but entail several major problems when viewed from the prosody perspective; prosodic features cover a wider time span than segmental features and their frame-by-frame processing is not always appropriate. The book offers a good overview of state-of-the-art studies on prosody in speech synthesis. Bestandsnummer des Verkäufers 9783662452578
Anzahl: 1 verfügbar