Neurodivergent Audio

Audio Comparison Table

Below is a comparison table with audio samples of different stimuli, each in four versions: Neural Network Model (NN), Dynamic Range Compression (DRC), Ground Truth (GT), and Raw Mixture (Mix).
Click the play button to listen to the corresponding audio files.

Audio Samples for Listening Test

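Trigger               Non-Trigger        DRC       Mix
tap                   cricket            [audio]   [audio]
squeak                waves              [audio]   [audio]
sniff                 camera noise       [audio]   [audio]
finger-snapping       raining            [audio]   [audio]
slam                  river flowing      [audio]   [audio]
cutlery-silverware    crows              [audio]   [audio]
chewing-mastication   speech             [audio]   [audio]
breathing             footsteps          [audio]   [audio]
alarm                 humming/singing    [audio]   [audio]
bark                  birds chirping     [audio]   [audio]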

Analysis

Figures

Mean Triggerability across Trigger Labels

Overall Mean Triggerability

Overall Mean Perceived Improvement

Mean Perceived Improvement across Trigger Labels

T-bars represent bootstrapped 95% confidence intervals. Stars indicate the significance level at which an algorithm has a higher perceived value than the comparison algorithm (*p < 0.05, **p < 0.01, ***p < 0.001). The color of each star identifies the comparison algorithm. For the comparisons, a Friedman test was conducted, followed by paired Wilcoxon signed-rank post hoc tests. P-values were adjusted with the Bonferroni correction. The stimulus versions are abbreviated as follows: Ground Truth (GT), Active Noise Cancelling (ANC), Dynamic Range Compression (DRC), Auto-Encoder Neural Network (NN), Raw Mixture (Mix).
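The post-processing described in the caption can be sketched roughly as follows. This is a minimal illustration, not the analysis code used for the figures: it assumes a percentile bootstrap for the confidence interval of a mean, and the function names (`bootstrap_ci`, `bonferroni`, `stars`), the resample count, and the example numbers are all hypothetical. The Friedman and Wilcoxon tests themselves are left to a statistics library (for example, SciPy provides `friedmanchisquare` and `wilcoxon`); only the steps around them are shown.

```python
import random
import statistics


def bootstrap_ci(values, n_resamples=2000, level=0.95, seed=0):
    """Percentile bootstrap confidence interval for the mean of `values`.

    Assumption: a plain percentile bootstrap; the source does not say
    which bootstrap variant was used.
    """
    rng = random.Random(seed)
    n = len(values)
    # Resample with replacement and record the mean of each resample.
    means = sorted(
        statistics.fmean(rng.choices(values, k=n)) for _ in range(n_resamples)
    )
    alpha = 1.0 - level
    lo_idx = int((alpha / 2) * n_resamples)
    hi_idx = int((1 - alpha / 2) * n_resamples) - 1
    return means[lo_idx], means[hi_idx]


def bonferroni(p_values):
    """Multiply each p-value by the number of comparisons, capped at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]


def stars(p):
    """Map an adjusted p-value to the star notation used in the caption."""
    if p < 0.001:
        return "***"
    if p < 0.01:
        return "**"
    if p < 0.05:
        return "*"
    return ""


# Made-up ratings and raw p-values, for illustration only (not study data).
ratings = [3, 4, 2, 5, 4, 3, 4, 2, 3, 4]
low, high = bootstrap_ci(ratings)

raw_p = [0.0002, 0.004, 0.03]
adjusted = bonferroni(raw_p)
labels = [stars(p) for p in adjusted]
print(f"95% CI for the mean: [{low:.2f}, {high:.2f}]")
print(labels)
```

With three comparisons, each raw p-value is multiplied by three before the star thresholds are applied, so a raw p-value of 0.03 no longer earns a star after adjustment.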