Abstract
Transcriptions of phone calls are of significant value across diverse fields, such as sales, customer service, healthcare, and law enforcement. Nevertheless, the analysis of these recorded conversations can be an arduous and time-intensive process, especially when dealing with long and multifaceted dialogues. In this work, we propose a novel method, which we name SegLLM, for efficient and accurate call segmentation and topic extraction. SegLLM is composed of offline and online phases. The offline phase is applied once to a given list of topics and involves generating a distribution of synthetic sentences for each topic using a large language model (LLM). The online phase is applied to every call separately and scores the similarity between the transcripted conversation and the topic anchors found in the offline phase. The proposed paradigm provides an accurate and efficient method for call segmentation and topic extraction that does not require labeled data, thus making it a versatile approach applicable to various domains.
| Original language | English |
|---|---|
| Title of host publication | ICASSP |
| Subtitle of host publication | 2024 IEEE International Conference on Acoustics, Speech and Signal Processing |
| Pages | 11361-11365 |
| Number of pages | 5 |
| DOIs | |
| State | Published - 2024 |
| Event | 49th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Seoul, Korea, Republic of Duration: 14 Apr 2024 → 19 Apr 2024 |
Conference
| Conference | 49th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 |
|---|---|
| Country/Territory | Korea, Republic of |
| City | Seoul |
| Period | 14/04/24 → 19/04/24 |
Bibliographical note
DBLP License: DBLP's bibliographic metadata records provided through http://dblp.org/ are distributed under a Creative Commons CC0 1.0 Universal Public Domain Dedication. Although the bibliographic metadata records are provided consistent with CC0 1.0 Dedication, the content described by the metadata records is not. Content may be subject to copyright, rights of privacy, rights of publicity and other restrictions.Keywords
- Law enforcement
- Oral communication
- Medical services
- Tagging
- Signal processing
- Data mining
- Speech processing
- LLM
- Call Segmentation
- Transformers