Enhancing Visual Coding Through Collaborative Perception

IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS(2023)

引用 0|浏览7
暂无评分
摘要
A central challenge facing the nature human-computer interaction involves understanding how neural circuits process visual perceptual information to improve the user's operation ability under complex tasks. Visual coding models aim to explore the biological characteristics of retinal ganglion cells to provide quantitative predictions of responses to a range of visual stimuli. The existing visual coding models lack adaptability in natural and complex scenes. Therefore this article proposes an enhanced visual coding model through collaborative perception. Our model first extracts the multimodal spatiotemporal features of the input video to simulate the retinal response characteristics adaptively. Second, it uses the basis function to compile the input stimulus into a multimodal stimulus matrix. Afterward, the upstream and downstream filters reform the stimulus matrix to generate the spike sequence. Experiments show that the proposed model reproduces the physiological characteristics of ganglion cells in the biological retina, leading to the high accuracy, good adaptability, and biological interpretability in comparison with its rivals.
更多
查看译文
关键词
Feature compilation,multimodal stimulus,nonlinearity,visual coding
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要