AudioProtoPNet: An interpretable deep learning model for bird sound classification
arxiv(2024)
摘要
Recently, scientists have proposed several deep learning models to monitor
the diversity of bird species. These models can detect bird species with high
accuracy by analyzing acoustic signals. However, traditional deep learning
algorithms are black-box models that provide no insight into their
decision-making process. For domain experts, such as ornithologists, it is
crucial that these models are not only efficient, but also interpretable in
order to be used as assistive tools. In this study, we present an adaption of
the Prototypical Part Network (ProtoPNet) for audio classification that
provides inherent interpretability through its model architecture. Our approach
is based on a ConvNeXt backbone architecture for feature extraction and learns
prototypical patterns for each bird species using spectrograms of the training
data. Classification of new data is done by comparison with these prototypes in
latent space, which simultaneously serve as easily understandable explanations
for the model's decisions.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要