From: Channel and temporal-frequency attention UNet for monaural speech enhancement
Layer name | Input size | Hyperparameters | Output s |
---|---|---|---|
conv2d-1 | \(C \times F \times L\) | (1,1),(1,1),(1,1) | \(C \times F \times L\) |
conv2d-2 | \(C \times F \times L\) | (3,3),(1,1),(1,\(2^{i-1}\)) | \(C \times F \times L\) |
conv2d-3 | \(C \times F \times L\) | (1,1),(1,1),(1,1) | \(C \times F \times L\) |