EURASIP Journal on Audio, Speech, and Music Processing

Table 1 Architecture of the ith TFCN

From: Channel and temporal-frequency attention UNet for monaural speech enhancement

Layer name	Input size	Hyperparameters	Output size
conv2d-1	\(C \times F \times L\)	(1,1),(1,1),(1,1)	\(C \times F \times L\)
conv2d-2	\(C \times F \times L\)	(3,3),(1,1),(1,\(2^{i-1}\))	\(C \times F \times L\)
conv2d-3	\(C \times F \times L\)	(1,1),(1,1),(1,1)	\(C \times F \times L\)
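
The shape bookkeeping in the table can be checked with a short sketch. It assumes the three hyperparameter tuples are (kernel size, stride, dilation), each given as (frequency, time), which the table itself does not state. It also assumes symmetric zero padding chosen to preserve size; padding is not listed in the table, and a causal variant would pad differently. The helper names `conv_out`, `same_padding`, and `tfcn_shapes` are hypothetical.

```python
def conv_out(n, kernel, stride, dilation, padding):
    """Standard convolution output length along one axis."""
    return (n + 2 * padding - dilation * (kernel - 1) - 1) // stride + 1


def same_padding(kernel, dilation):
    """Symmetric padding that preserves length for stride 1 and an odd kernel.

    Assumption: the table does not list padding; this is one choice that
    reproduces the stated output sizes.
    """
    return dilation * (kernel - 1) // 2


def tfcn_shapes(i, C, F, L):
    """Return (layer name, output shape) for the three layers of the i-th TFCN."""
    # Assumed tuple order: kernel, stride, dilation; each as (frequency, time).
    layers = [
        ("conv2d-1", (1, 1), (1, 1), (1, 1)),
        ("conv2d-2", (3, 3), (1, 1), (1, 2 ** (i - 1))),
        ("conv2d-3", (1, 1), (1, 1), (1, 1)),
    ]
    shapes = []
    f, l = F, L
    for name, kernel, stride, dilation in layers:
        pad_f = same_padding(kernel[0], dilation[0])
        pad_l = same_padding(kernel[1], dilation[1])
        f = conv_out(f, kernel[0], stride[0], dilation[0], pad_f)
        l = conv_out(l, kernel[1], stride[1], dilation[1], pad_l)
        # The channel count C is unchanged in every row of the table.
        shapes.append((name, (C, f, l)))
    return shapes


if __name__ == "__main__":
    # Illustrative sizes only; C, F, and L here are not taken from the article.
    for name, shape in tfcn_shapes(i=4, C=64, F=161, L=100):
        print(name, shape)
```

Under these assumptions, every layer returns \(C \times F \times L\) for any block index i, matching the table: the time-axis dilation \(2^{i-1}\) widens the span covered by conv2d-2 without changing the feature-map size.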
