EURASIP Journal on Audio, Speech, and Music Processing

Table 2 Architecture of MCHCA

From: Channel and temporal-frequency attention UNet for monaural speech enhancement

Layer name	Input size	Hyperparameters	Output size
conv2d-1	\(C \times F \times L\)	(1,1),(1,1)	\(3C \times F \times L\)
conv2d-2	\(3C \times F \times L\)	(3,3),(1,1)	\(3C \times F \times L\)
conv2d-3	\(C \times F \times L\)	(1,1),(1,1)	\(C \times F \times L\)

Back to article page