Involution Based Speech Autoencoder: Investigating the Advanced Vision Operator Performance on Speech Feature Extraction

Tianle Zhong, Israel Mendoza Velázquez,Yoichi Haneda

GCCE(2021)

引用 0|浏览0
暂无评分
摘要
Feature extraction is an important aspect of deep learning. In recent studies, various methods from image processing have been applied to speech feature extraction. Although the involution operator has achieved great success in vision recognition, the use of the involution operator in speech tasks has not been investigated. To determine whether the performance improvement of this operator on visual tasks is reflected directly in the speech processing domain, this study investigated autoencoder networks operating on Mel-spectrograms utilizing involution and convolution. It was determined that convolution continues to outperform involution as a direct replacement. The reasons for discrepancies in the input feature dimensions of different fields are discussed. Based on its channel-agnostic property, involution in combination with convolution requires additional research.
更多
查看译文
关键词
Speech processing,Autoencoder,Feature Extraction
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要