ABOUT
MEMBERS
PUBLICATIONS
RESEARCH
ACTIVITY
CONTACT
SEMINAR
SpeakerLM: End-to-End Versatile Speaker Diarization and Recognition with Multimodal Large Language Models
Yejin Kwon
2025.09.26
MLLM
VENUE
2025 arXiv
PAPER LINK
arXiv
PDF
PDF 다운로드
이전 글
Improving Adversarial Robustness Requires Revisiting Misclassified Examples
다음 글
VRA: Variational Rectified Activation for Out-of-distribution Detection
목록으로