Abstract
Electrocardiogram (ECG)-based diagnostics are pivotal in early cardiac disorder detection, yet existing models often fail to integrate temporal, spectral, and spatial dynamics inherent in complex arrhythmic patterns. Most traditional approaches are unimodal, relying either on time-domain signal processing or spatially limited CNN models, thereby overlooking cross-domain dependencies and subtle morphological cues. Addressing this gap, this research proposes FusionHeartNet, a unified deep learning framework that fuses signal- and image-based representations using a dual-spectrum feature embedding (DSFE) strategy. DSFE synergistically extracts morphological descriptors and spectral signatures via Fourier and wavelet transforms, while spatial morphology is preserved through GAF and CWT scalograms. These dual-domain features are refined by a multi-focus attention module (MFAM) and classified through the heart fusion classifier (HFC), which is optimized using Bayesian optimization with adaptive learning rate scheduling (BO-ALRS). Experimental validation on the MIT-BIH Arrhythmia Database demonstrates an accuracy of 98.47%, F1-score of 91.67%, and kappa of 0.9311, significantly outperforming baseline models. FusionHeartNet sets a new benchmark for robust, multi-dimensional ECG analysis, offering clinically viable precision in early heart disease detection.
Keywords
Get full access to this article
View all access options for this article.
