Repository logo

Application of deep learning for array microphone processing

dc.contributorGraduate Program in Electrical and Electronic Engineering.
dc.contributor.advisorArslan, Levent M.
dc.contributor.authorAkyürek, Muhammed Furkan.
dc.date.accessioned2023-03-16T10:20:41Z
dc.date.available2023-03-16T10:20:41Z
dc.date.issued2020.
dc.description.abstractArray microphone processing is a complex application with multiple interlinked components like direction of arrival for the audio sources, beamforming and postfiltering that are dependent on the array geometry. The array microphones gained popularity by the advent of the smart speakers. In this thesis, an end-to-end solution is provided containing all of the array microphone processing components along with the denoising integrated to the core of the system using a deep learning method called autoencoders. The neural network system is trained on the magnitude spectra generated by a dataset created exclusively for this thesis by combining some of the publicly available speech and noise datasets. This thesis proposes a single channel and a multichannel speech enhancement model to solve the beamforming problem. The multichannel autoencoder model is shown to perform better than some of the common conventional beamforming methods by objective evaluation methods. Results from this thesıs indicate the room for improvement in this field by the use of neural networks.
dc.format.extent30 cm.
dc.format.pagesxvii, 71 leaves ;
dc.identifier.otherEE 2020 A58
dc.identifier.urihttps://hdl.handle.net/20.500.14908/12984
dc.publisherThesis (M.S.) - Bogazici University. Institute for Graduate Studies in Science and Engineering, 2020.
dc.subject.lcshMicrophone arrays.
dc.subject.lcshSpeech processing systems.
dc.titleApplication of deep learning for array microphone processing

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
b2714083.035507.001.PDF
Size:
9.31 MB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
b2714083.035508.001.zip
Size:
5.3 MB
Format:
Unknown data format

Collections