Abstract
In recent years, Deep Learning (DL) methods for object and feature classification have been shown to outperform previous state-of-the-art classification techniques in multiple areas, such as image classification and speech recognition. In our previous paper, MESRS (Model Ensemble Speech Recognition System), we described a speech recognition system for automatic classification of voice commands. This paper presents a novel classification method that continues that work by extending the supported input from the audio space to the image space. In addition to supporting multiple input types, the paper describes an automated method of model ensemble selection based on the K-Nearest Neighbors (KNN) algorithm. The automatic ensemble selection was added to improve the system’s running times and to achieve the highest possible accuracy. The work shows that applying dynamic, input-based classification over multiple architectures can significantly improve the final classification results. Since models with different architectures can achieve different results on different inputs, the best results can be obtained by selecting the best-fitted model for each given input. The method was tested on multiple datasets, including the Chest X-Ray Pneumonia Dataset, the Malaria Cells Dataset, the Road Potholes Dataset, and the Voice Commands Dataset, which we also used in our previous work. The results show that the method improves classification quality on all of these datasets. Compared with results reported by similar works on the same datasets, it achieved a significant improvement on every dataset tested. These findings demonstrate the effectiveness of the method and motivate us to develop it further in order to achieve even better results in future work.
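The input-based selection idea can be illustrated with a minimal sketch. This is not the authors' implementation: the feature vectors, the per-model correctness table, the function names `knn_select_model` and `classify`, and the toy models are all hypothetical, and the sketch assumes that each model's correctness on a held-out validation set is known in advance. For a new input, it finds the k nearest validation samples and picks the model that was correct most often among them.

```python
import math

# Hypothetical validation data: one feature vector per sample, and for each
# sample a tuple of flags saying whether model 0 / model 1 classified it correctly.
VAL_FEATURES = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1),
                (5.0, 5.0), (5.1, 4.9), (4.8, 5.2)]
VAL_CORRECT = [(1, 0), (1, 0), (1, 1),
               (0, 1), (0, 1), (1, 1)]


def knn_select_model(query, val_features, val_correct, k=3):
    """Return the index of the model that was correct most often
    among the k validation samples nearest to `query`."""
    # Rank validation samples by Euclidean distance to the query.
    order = sorted(range(len(val_features)),
                   key=lambda i: math.dist(query, val_features[i]))
    neighbors = order[:k]
    n_models = len(val_correct[0])
    # Count correct predictions per model over the neighborhood.
    scores = [sum(val_correct[i][m] for i in neighbors)
              for m in range(n_models)]
    # Ties resolve to the lowest model index.
    return max(range(n_models), key=lambda m: scores[m])


def classify(query, models, val_features, val_correct, k=3):
    """Run only the locally best model on the query and return its prediction."""
    best = knn_select_model(query, val_features, val_correct, k)
    return models[best](query)


# Toy stand-ins for trained classifiers (assumptions for illustration only).
MODELS = [lambda x: "model_a_label", lambda x: "model_b_label"]

label_near_origin = classify((0.05, 0.1), MODELS, VAL_FEATURES, VAL_CORRECT)
label_far = classify((5.0, 5.1), MODELS, VAL_FEATURES, VAL_CORRECT)
```

In this sketch only the selected model is evaluated for each input, which is one way such a selection step could reduce running time compared with running every model in the ensemble.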
Original language | English |
---|---|
Title of host publication | Advances in Information and Communication - Proceedings of the 2021 Future of Information and Communication Conference, FICC |
Editors | Kohei Arai |
Publisher | Springer Science and Business Media Deutschland GmbH |
Pages | 536-557 |
Number of pages | 22 |
ISBN (Print) | 9783030731021 |
State | Published - 2021 |
Event | Future of Information and Communication Conference, FICC 2021 - Virtual, Online. Duration: 29 Apr 2021 → 30 Apr 2021 |
Publication series
Name | Advances in Intelligent Systems and Computing |
---|---|
Volume | 1364 AISC |
ISSN (Print) | 2194-5357 |
ISSN (Electronic) | 2194-5365 |
Conference
Conference | Future of Information and Communication Conference, FICC 2021 |
---|---|
City | Virtual, Online |
Period | 29/04/21 → 30/04/21 |
Bibliographical note
Publisher Copyright: © 2021, The Author(s), under exclusive license to Springer Nature Switzerland AG.
Keywords
- Data mining
- Deep Learning
- Ensemble classifier
- KNN