The objective of the current on-going research is to automatically identify the role played by a speaker in a dialogue, and to explore potential conditions that might impose higher speaker’s role identification. We use an interactive Map Task setup with two potential roles: followers and leaders, where each speaker participated twice thus acting in both roles with the same interlocutor. The paper aims to identify speaker’s role, and to explore potential influence of the gender of the speaker, the gender of the interlocutor, and the order of the roles played by the speaker. By using deep learning procedures over a set of acoustic features, we automatically trace the footprints of the role through the speech signal. Results show an average of 73.3% role’s classification rate. We further show that there is a significant difference in the role’s classification rates, depending on the interlocutor’s gender. On average, when the interlocutor is a male, the speaker tends to identify with his or her role more clearly – 77.5% versus 69.9% when the interlocutor is a woman.
|Title of host publication||Speech and Computer - 20th International Conference, SPECOM 2018, Proceedings|
|Editors||Rodmonga Potapova, Oliver Jokisch, Alexey Karpov|
|Number of pages||10|
|State||Published - 2018|
|Event||20th International Conference on Speech and Computer, SPECOM 2018 - Leipzig, Germany|
Duration: 18 Sep 2018 → 22 Sep 2018
|Name||Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)|
|Conference||20th International Conference on Speech and Computer, SPECOM 2018|
|Period||18/09/18 → 22/09/18|
Bibliographical noteFunding Information:
Acknowledgments. This work was supported by the Open Media and Information Lab (OMILab) at The Open University of Israel [Grant Number 20184] and by research grant #507761 from the Research Authority at The Open University of Israel.
- Acoustic features
- Deep learning
- Map Task corpus
- Role identification