Volltext-Downloads (blau) und Frontdoor-Views (grau)

Audio synchronisation with a tunnel matrix for time series and dynamic programming

  • Precise multimodal studies require precise synchronisation between audio and video signals. However, raw audio and audio from video recordings can be out of sync for several reasons. In order to re-synchronise them, a dynamic programming (DP) approach is presented here. Traditionally, DP is performed on the rectangular distance matrix comparing each value in signal A with each value in signal B. Previous work limited the search space using for example the Sakoe Chiba Band (Sakoe and Chiba, 1978). However, the overall space of the distance matrix remains identical. Here, a tunnel matrix and its according DP-algorithm are presented. The matrix contains merely the computed distance of two signals to a pre-specified bandwidth and the computational cost is equally reduced. An example implementation demonstrates the functionality on artificial data and on data from real audio and video recordings.

Export metadata

Additional Services

Share in Twitter Search Google Scholar

Statistics

frontdoor_oas
Metadaten
Author:Jan Gorisch, Laurent Prévot
URN:urn:nbn:de:bsz:mh39-60964
URL:http://ieeexplore.ieee.org/xpl/mostRecentIssue.jsp?punumber=7158221
ISBN:978-1-4673-6997-8
Parent Title (German):International Conference on Acoustics, Speech and Signal Processing (Brisbane, Australia): IEEE 2015
Publisher:IEEE
Place of publication:New York
Document Type:Conference Proceeding
Language:English
Year of first Publication:2015
Date of Publication (online):2017/04/25
Publicationstate:Postprint
Reviewstate:Peer-Review
Tag:Audio-video Synchronisation; Imageloss Compensation; Storage Requirements; Tunnel DP-algorithm; Tunnel Matrix
First Page:3846
Last Page:3850
Dewey Decimal Classification:400 Sprache / 400 Sprache, Linguistik
Leibniz-Classification:Sprache, Linguistik
Linguistics-Classification:Computerlinguistik
Open Access?:Ja
Licence (German):Es gilt das UrhG