Multimodal spatio-temporal-spectral fusion for deep learning applications in physiological time series processing : a case study in monitoring the depth of anesthesia

Bahador, Nooshin; Jokelainen, Jarno; Mustola, Seppo; Kortelainen, Jukka

Multimodal spatio-temporal-spectral fusion for deep learning applications in physiological time series processing : a case study in monitoring the depth of anesthesia

Bahador, Nooshin; Jokelainen, Jarno; Mustola, Seppo; Kortelainen, Jukka (2021-03-30)

Avaa tiedosto

nbnfi-fe2022031022836.pdf (22.81Mt)

nbnfi-fe2022031022836_meta.xml (38.73Kt)

nbnfi-fe2022031022836_solr.xml (38.23Kt)

Lataukset:

URL:

https://doi.org/10.1016/j.inffus.2021.03.001

Bahador, Nooshin

Jokelainen, Jarno

Mustola, Seppo

Kortelainen, Jukka

Elsevier

30.03.2021

Bahador, N., Jokelainen, J., Mustola, S., & Kortelainen, J. (2021). Multimodal spatio-temporal-spectral fusion for deep learning applications in physiological time series processing: A case study in monitoring the depth of anesthesia. Information Fusion, 73, 125–143. https://doi.org/10.1016/j.inffus.2021.03.001

https://creativecommons.org/licenses/by/4.0/
© 2022 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
https://creativecommons.org/licenses/by/4.0/

doi:https://doi.org/10.1016/j.inffus.2021.03.001

Näytä kaikki kuvailutiedot

Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi-fe2022031022836

Tiivistelmä

Abstract

Physiological signals processing brings challenges including dimensionality (due to the number of channels), heterogeneity (due to the different range of values) and multimodality (due to the different sources). In this regard, the current study intended, first, to use time-frequency ridge mapping in exploring the use of fused information from joint EEG-ECG recordings in tracking the transition between different states of anesthesia. Second, it investigated the effectiveness of pre-trained state-of-the-art deep learning architectures for learning discriminative features in the fused data in order to classify the states during anesthesia. Experimental data from healthy-brain patients undergoing operation (N = 20) were used for this study. Data was recorded from the BrainStatus device with single ECG and 10 EEG channels. The obtained results support the hypothesis that not only can ridge fusion capture temporal-spectral progression patterns across all modalities and channels, but also this simplified interpretation of time-frequency representation accelerates the training process and yet improves significantly the efficiency of deep models. Classification outcomes demonstrates that this fusion could yields a better performance, in terms of 94.14% precision and 0.28 s prediction time, compared to commonly used data-level fusing methods. To conclude, the proposed fusion technique provides the possibility of embedding time-frequency information as well as spatial dependencies over modalities and channels in just a 2D array. This integration technique shows significant benefit in obtaining a more unified and global view of different aspects of physiological data at hand, and yet maintaining the desired performance level in decision making.

Kokoelmat

Avoin saatavuus [32049]

Ellei muuten mainita, aineiston lisenssi on https://creativecommons.org/licenses/by/4.0/