Fig. 1From: Multi-task deep cross-attention networks for far-field speaker verification and keyword spottingModel architecture of the proposed multi-task deep cross-attention network (MTCANet). In this instance, it uses the KWS branch as query and the speaker verification branch as key and value, effectively enhancing the utilization efficiency of intermediate embedded features extracted by the two tasksBack to article page