[PDF] AUDIO RECOGNITION PROBLEMS

Y Preobrazhenskiy, A Preobrazhenskiy… - European …, 2022 - desymp.promonograph.org
Machine learning is being actively adopted in the products of companies such as Google [1],
Facebook [2], Netflix [3], and others. The fundamental reasons for the explosive growth …

[PDF] Audio-visual automatic speech recognition: An overview

G Potamianos, C Neti, J Luettin… - Issues in visual and audio …, 2004 - academia.edu
audio-visual ASR. As an application of speaker adaptation, we consider the problem of
automatic recognition … of audio-visual ASR, and on what we view as open problems in this area. …

[PDF] Audio visual speech recognition

C Neti, G Potamianos, J Luettin, I Matthews, H Glotin… - 2000 - infoscience.epfl.ch
… ; and (b) The design of audio-visual information fusion algorithms … audio-only LVCSR systems,
under all possible audio-visual … art in audio-visual ASR by seriously tackling the problem of …

Deep audio-visual speech recognition

T Afouras, JS Chung, A Senior… - IEEE transactions on …, 2018 - ieeexplore.ieee.org
audio-visual models the audio signal dominates, because speech recognition is a significantly
easier problem … babble noise with 0 dB SNR to the audio stream with probability p_n = 0.…

PANNs: Large-scale pretrained audio neural networks for audio pattern recognition

Q Kong, Y Cao, T Iqbal, Y Wang… - … on Audio, Speech …, 2020 - ieeexplore.ieee.org
audio pattern recognition tasks. Previous researchers have previously investigated transfer
learning for audio … To solve this problem, we design a balanced sampling strategy to train …

Environmental sound recognition with time–frequency audio features

S Chu, S Narayanan, CCJ Kuo - IEEE Transactions on Audio …, 2009 - ieeexplore.ieee.org
… We showed in [5] that the use of all features for classification does not always produce
good performance for the audio classification problems of our interest. This in turn leads to the …

Audio-visual speech modeling for continuous speech recognition

S Dupont, J Luettin - IEEE transactions on multimedia, 2000 - ieeexplore.ieee.org
… In Section III, we tackle the problem of integrating the information obtained from the …
audio-visual speech recognition systems. Finally, results on a multispeaker digit strings recognition …

A survey of affect recognition methods: audio, visual and spontaneous expressions

Z Zeng, M Pantic, GI Roisman, TS Huang - Proceedings of the 9th …, 2007 - dl.acm.org
… Next, we examine the available approaches to solving the problem of machine understanding
of human affective behavior occurring in real-world settings. We finally outline some …

Robust audio-visual speech recognition under noisy audio-video conditions

D Stewart, R Seymour, A Pass… - IEEE transactions on …, 2013 - ieeexplore.ieee.org
… Summary and Conclusion: This paper dealt with the problem of audio-visual speech integration
given that the relative reliabilities of the two modalities may fluctuate due to corruption in …

[PDF] Audio recognition using mel spectrograms and convolution neural networks

B Thornton - 2019 - noiselab.ucsd.edu
audio time series with a short-time Fourier transform to create a spectrogram which was used
as an input to the CNN. The problem … approaches to the audio classification problem. The …