Separate a target speaker's speech from a mixture of two speakers

Deep Learning/Papers2read 2020. 5. 18. 10:01


For the project, code, or API requests: https://www.catalyzex.com/paper/arxiv:2005.07074

(FaceFilter: Audio-visual speech separation using still images)

The separation is performed by a deep audio-visual speech separation network. Unlike previous works that used lip movement in video clips or pre-enrolled speaker information as an auxiliary conditional feature, this paper uses a single face image of the target speaker.
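To make the idea of conditioning on a face image more concrete, here is a minimal toy sketch. It is not the FaceFilter architecture, which this post does not describe in detail. It assumes a mask-based approach (a common choice in speech separation, swapped in here for illustration), a hypothetical 2-dimensional `face_embedding` standing in for the output of a face encoder, and random untrained weights in place of a learned network. All function names are made up for this example.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def make_weights(n_out, n_in, seed=0):
    """Random, untrained weights: a stand-in for a learned layer (assumption)."""
    rng = random.Random(seed)
    return [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]


def estimate_mask(mixture_mag, face_embedding, weights):
    """Predict a soft mask in (0, 1) for every time-frequency bin.

    The same face embedding is concatenated to every frame, so the
    identity cue conditions the mask at each time step.
    """
    mask = []
    for frame in mixture_mag:
        features = frame + face_embedding  # list concatenation
        mask.append(
            [sigmoid(sum(w * f for w, f in zip(row, features))) for row in weights]
        )
    return mask


def apply_mask(mixture_mag, mask):
    """Scale each bin of the mixture by its mask value."""
    return [
        [m * x for m, x in zip(mask_row, frame)]
        for mask_row, frame in zip(mask, mixture_mag)
    ]


# Toy mixture magnitude spectrogram: 3 frames x 4 frequency bins (made-up values).
mixture_mag = [
    [0.9, 0.1, 0.4, 0.0],
    [0.2, 0.8, 0.3, 0.5],
    [0.6, 0.6, 0.1, 0.7],
]
# Hypothetical identity embedding derived from a single face image.
face_embedding = [0.3, -1.2]

weights = make_weights(n_out=4, n_in=4 + 2)
mask = estimate_mask(mixture_mag, face_embedding, weights)
target_mag = apply_mask(mixture_mag, mask)
```

In a real system the weights would be trained so that the mask keeps the bins dominated by the speaker matching the face, and the embedding would come from a trained face encoder rather than hand-picked numbers.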
