From: Channel and temporal-frequency attention UNet for monaural speech enhancement
Layer name | Input size | Hyperparameters | Output size |
---|---|---|---|
conv2d-1 | \(C \times F \times L\) | (1,1),(1,1) | \(3C \times F \times L\) |
conv2d-2 | \(3C \times F \times L\) | (3,3),(1,1) | \(3C \times F \times L\) |
conv2d-3 | \(C \times F \times L\) | (1,1),(1,1) | \(C \times F \times L\) |