Musical Instrument Recognition in Polyphonic Audio Through Convolutional Neural Networks and Spectrograms

aut.relation.conference: International Conference on Audio, Language and Image Processing
aut.relation.issue: 07
aut.relation.volume: 18
dc.contributor.author: Rujia, Chen
dc.contributor.author: Ghobakhlou, Ali
dc.contributor.author: Narayanan, Ajit
dc.date.accessioned: 2024-07-16T21:23:43Z
dc.date.available: 2024-07-16T21:23:43Z
dc.date.issued: 2024-07-04
dc.description.abstract: This study investigates the task of identifying musical instruments in polyphonic compositions using Convolutional Neural Networks (CNNs) from spectrogram inputs, focusing on binary classification. The model showed promising results, with an accuracy of 97% on solo instrument recognition. When applied to polyphonic combinations of 1 to 10 instruments, the overall accuracy was 64%, reflecting the increasing challenge with larger ensembles. These findings contribute to the field of Music Information Retrieval (MIR) by highlighting the potential and limitations of current approaches in handling complex musical arrangements. Future work aims to include a broader range of musical sounds, including electronic and synthetic sounds, to improve the model's robustness and applicability in real-time MIR systems.
dc.identifier.citation: World Academy of Science, Engineering and Technology, International Journal of Electronics and Communication Engineering, Vol:18, No:07, 2024
dc.identifier.uri: http://hdl.handle.net/10292/17794
dc.publisher: World Academy of Science, Engineering and Technology
dc.relation.uri: https://publications.waset.org/abstracts/185822/musical-instrument-recognition-in-polyphonic-audio-through-convolutional-neural-networks-and-spectrograms
dc.rights: © 2024 World Academy of Science, Engineering and Technology. Creative Commons Attribution 4.0 International License.
dc.rights.accessrights: OpenAccess
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/
dc.subject: binary classifier, CNN
dc.title: Musical Instrument Recognition in Polyphonic Audio Through Convolutional Neural Networks and Spectrograms
dc.type: Conference Contribution
pubs.elements-id: 561349

Files

Original bundle

Name: 24CZ0701631.pdf
Size: 797.38 KB
Format: Adobe Portable Document Format
Description: Conference contribution