From: Channel and temporal-frequency attention UNet for monaural speech enhancement
Model | #Para. | Year | With reverb | Without reverb | ||||||
---|---|---|---|---|---|---|---|---|---|---|
 |  |  | WB-PESQ | NB-PESQ | STOI(%) | SI-SDR | WB-PESQ | NB-PESQ | STOI(%) | SI-SDR |
Noisy | - | - | 1.822 | 2.753 | 86.62 | 9.033 | 1.582 | 2.454 | 91.52 | 9.07 |
DCCRN [51] | 3.7M | 2020 | - | 3.077 | - | - | - | 3.266 | - | - |
DCCRN+ [52] | 4.7M | 2021 | - | 3.30 | - | - | - | 3.33 | - | - |
Conv-TasNet [8] | 5.1M | 2019 | 2.75 | - | - | - | 2.73 | - | - | - |
PoCoNet [53] | 50M | 2020 | 2.832 | - | - | - | 2.748 | - | - | - |
CTS-Net [54] | 4.4M | 2021 | 3.02 | 3.47 | 92.7 | 15.58 | 2.94 | 3.42 | 96.66 | 17.99 |
FullSubNet [14] | 5.6M | 2021 | 3.057 | 3.584 | 92.11 | 16.04 | 2.882 | 3.428 | 96.32 | 17.30 |
GaGNet [55] | 5.9M | 2022 | 3.18 | 3.57 | 93.22 | 16.57 | 3.17 | 3.56 | 97.13 | 18.91 |
FullSubNet+ [15] | 8.7M | 2022 | 3.177 | 3.648 | 93.64 | 16.44 | 3.002 | 3.503 | 96.67 | 18.00 |
FS-CANet [56] | 4.2M | 2022 | 3.218 | 3.665 | 93.93 | 16.82 | 3.017 | 3.513 | 96.74 | 18.08 |
CTFUNet | 6.1M | 2023 | 3.367 | 3.741 | 94.39 | 17.16 | 3.176 | 3.639 | 97.17 | 18.66 |