INVERSYNTH II: SOUND MATCHING VIA SELF-SUPERVISED SYNTHESIZER-PROXY AND INFERENCE-TIME FINETUNING

Oren Barkan, Shlomi Shvartzman, Noy Uzrad, Moshe Laufer, Almog Elharar, Noam Koenigstein

نتاج البحث: فصل من :كتاب / تقرير / مؤتمرمنشور من مؤتمرمراجعة النظراء

ملخص

Synthesizers are widely used electronic musical instruments. Given an input sound, inferring the underlying synthesizer's parameters to reproduce it is a difficult task known as sound-matching. In this work, we tackle the problem of automatic sound matching, which is otherwise performed manually by professional audio experts. The novelty of our work stems from the introduction of a novel differentiable synthesizer-proxy that enables gradient-based optimization by comparing the input and reproduced audio signals. Additionally, we introduce a novel self-supervised finetuning mechanism that further refines the prediction at inference time. Both contributions lead to state-of-the-art results, outperforming previous methods across various metrics. Our code is available at: https://github.com/inversynth/ InverSynth2.

اللغة الأصليةالإنجليزيّة
عنوان منشور المضيف24th International Society for Music Information Retrieval Conference, ISMIR 2023 - Proceedings
المحررونAugusto Sarti, Fabio Antonacci, Mark Sandler, Paolo Bestagini, Simon Dixon, Beici Liang, Gael Richard, Johan Pauwels
ناشرInternational Society for Music Information Retrieval
الصفحات642-648
عدد الصفحات7
رقم المعيار الدولي للكتب (الإلكتروني)9781732729933
حالة النشرنُشِر - 2023
الحدث24th International Society for Music Information Retrieval Conference, ISMIR 2023 - Milan, إيطاليا
المدة: ٥ نوفمبر ٢٠٢٣٩ نوفمبر ٢٠٢٣

سلسلة المنشورات

الاسم24th International Society for Music Information Retrieval Conference, ISMIR 2023 - Proceedings

!!Conference

!!Conference24th International Society for Music Information Retrieval Conference, ISMIR 2023
الدولة/الإقليمإيطاليا
المدينةMilan
المدة٥/١١/٢٣٩/١١/٢٣

ملاحظة ببليوغرافية

Publisher Copyright:
© Barkan et al.

بصمة

أدرس بدقة موضوعات البحث “INVERSYNTH II: SOUND MATCHING VIA SELF-SUPERVISED SYNTHESIZER-PROXY AND INFERENCE-TIME FINETUNING'. فهما يشكلان معًا بصمة فريدة.

قم بذكر هذا