Explaining Neural Networks by Decoding Layer Activations

ADVANCES IN INTELLIGENT DATA ANALYSIS XIX, IDA 2021(2021)

引用 9|浏览12
暂无评分
摘要
We present a `CLAssifier-DECoder' architecture (ClaDec) which facilitates the comprehension of the output of an arbitrary layer in a neural network (NN). It uses a decoder to transform the non-interpretable representation of the given layer to a representation that is more similar to the domain a human is familiar with. In an image recognition problem, one can recognize what information is represented by a layer by contrasting reconstructed images of ClaDec with those of a conventional autoencoder(AE) serving as reference. We also extend ClaDec to allow the trade-off between human interpretability and fidelity. We evaluate our approach for image classification using Convolutional NNs. We show that reconstructed visualizations using encodings from a classifier capture more relevant information for classification than conventional AEs.
更多
查看译文
关键词
neural networks
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要