Assistive Audio Enhancement for Neurodivergent Individuals

Audio Comparison Table

The table below provides audio samples for each stimulus in four versions: Neural Network Model (NN), Dynamic Range Compression (DRC), Ground Truth (GT), and Raw Mixture (Mix).
Click the play button to listen to the corresponding audio files.

Audio Samples for Listening Test
Trigger               Non-Trigger        NN    DRC    GT    Mix
tap                   cricket
squeak                waves
sniff                 camera noise
finger-snapping       raining
slam                  river flowing
cutlery-silverware    crows
chewing-mastication   speech
breathing             footsteps
alarm                 humming/singing
bark                  birds chirping

Analysis

Figures

Mean Triggerability across Trigger Labels

Overall Mean Triggerability

Overall Mean Perceived Improvement

Mean Perceived Improvement across Trigger Labels

T-bars represent bootstrapped 95% confidence intervals. Stars indicate the significance level at which an algorithm received a higher perceived value than the algorithm it is compared against (*p < 0.05, **p < 0.01, ***p < 0.001). The color of the star indicates the algorithm of comparison. For the comparison, a Friedman test was conducted, followed by paired Wilcoxon post hoc tests. P-values were adjusted using the Bonferroni correction. The stimulus versions are abbreviated as follows: Ground Truth (GT), Active Noise Cancelling (ANC), Dynamic Range Compression (DRC), Auto-Encoder Neural Network (NN), Raw Mixture (Mix).
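
For readers who want to see how statistics of this kind are typically computed, the following is a minimal Python sketch using only the standard library. It is not the analysis code behind the figures. It assumes a percentile bootstrap for the mean (the caption does not say which bootstrap variant was used), computes the Friedman chi-square statistic without a tie correction, and applies a Bonferroni adjustment to a list of p-values. The function names and the example ratings are hypothetical. The p-values themselves, for the Friedman test and the paired Wilcoxon post hoc tests, would normally come from a statistics library and are not computed here.

```python
import random
from statistics import mean


def bootstrap_ci(values, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for the mean (assumed variant)."""
    rng = random.Random(seed)
    n = len(values)
    # Resample with replacement and record the mean of each resample.
    means = sorted(mean(rng.choices(values, k=n)) for _ in range(n_resamples))
    lower = means[int((alpha / 2) * n_resamples)]
    upper = means[int((1 - alpha / 2) * n_resamples) - 1]
    return lower, upper


def average_ranks(row):
    """Rank the values of one row (1 = smallest), averaging tied ranks."""
    order = sorted(range(len(row)), key=lambda i: row[i])
    ranks = [0.0] * len(row)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and row[order[j + 1]] == row[order[i]]:
            j += 1
        tied_rank = (i + j) / 2 + 1
        for pos in range(i, j + 1):
            ranks[order[pos]] = tied_rank
        i = j + 1
    return ranks


def friedman_statistic(ratings):
    """Friedman chi-square statistic, without tie correction.

    ratings: one row per participant, one column per algorithm.
    """
    n = len(ratings)
    k = len(ratings[0])
    rank_sums = [0.0] * k
    for row in ratings:
        for col, rank in enumerate(average_ranks(row)):
            rank_sums[col] += rank
    return 12.0 / (n * k * (k + 1)) * sum(s * s for s in rank_sums) - 3.0 * n * (k + 1)


def bonferroni(p_values):
    """Multiply each p-value by the number of comparisons, capped at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]


# Hypothetical ratings: 4 participants (rows) x 3 algorithms (columns).
example_ratings = [[1, 2, 3], [1, 2, 3], [1, 2, 3], [1, 2, 3]]
example_statistic = friedman_statistic(example_ratings)
example_ci = bootstrap_ci([row[2] for row in example_ratings])
example_adjusted = bonferroni([0.01, 0.02, 0.5])
```

With k algorithms, the Friedman statistic is compared against a chi-square distribution with k - 1 degrees of freedom. The Bonferroni step is deliberately conservative: it multiplies each post hoc p-value by the number of pairwise comparisons before the star thresholds are applied.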