SEMINAR

SpeakerLM: End-to-End Versatile Speaker Diarization and Recognition with Multimodal Large Language Models

Yejin Kwon
2025.09.26
MLLM
VENUE: 2025 arXiv
PAPER LINK: arXiv