Orhun Olgun, Methods For Recording, Interpolation And Reproduction Of 6 Degrees-Of-Freedom Immersive Audio Using Rigid Spherical Microphone Arrays

Ph.D. Candidate: Orhun Olgun
Program: Multimedia Informatics
Date: 25.08.2025 / 13:30
Place: 
B-223

Abstract: Future immersive media systems must support six degrees-of-freedom (6DoF) user movement to deliver realistic audiovisual experiences. While rigid spherical microphone arrays (SMAs) are standard for 3DoF spatial audio capture, extending to 6DoF reproduction requires new models and interpolation methods. This thesis presents a comprehensive framework for 6DoF audio capture and reproduction, using higher-order Ambisonics and rigid spherical microphone arrays. A key contribution of this thesis is SPWIN (Sparse Plane-Wave Interpolation), a novel method that reconstructs sound fields between microphone positions with high perceptual accuracy. The proposed system is validated through interaural coherence analysis for the objective tests and MUSHRA-based perceptual tests in head-tracked virtual reality for the subjective tests. Additionally, the thesis introduces a method for estimating sound source directivity using spherical harmonic decomposition. These contributions address critical gaps in spatial audio reproduction and support emerging standards such as MPAI-CAE.