MusicFactory: Application of a Convolutional Neural Network for the Generation of Soundscapes from Images

New Trends in Disruptive Technologies, Tech Ethics and Artificial Intelligence (2022)

Abstract
A soundscape is a sonic description of a specific environment. Soundscapes are therefore always tied to a visual component, since they may capture the sounds of a city, the countryside, or a domestic space. In this work, we present a system that generates soundscapes from images. First, the system recognizes objects in the image. Second, it searches for the sounds that best match the entities identified in the picture. Finally, it synthesizes a soundscape by combining the short sound files it found. The results of a subjective evaluation are promising and encourage further research on soundscape generation.
Keywords
Deep learning, Soundscapes, Music generation, Image recognition
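
The second and third steps described in the abstract (matching recognized entities to sounds, then combining short clips into one soundscape) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the labels are hard-coded rather than produced by an image recognizer, the hypothetical `SOUND_LIBRARY` holds synthetic sine tones in place of real sound files, and the helper names `select_sounds`, `mix`, and `generate_soundscape`, along with the even spacing of clips, are invented for this example.

```python
import math

SAMPLE_RATE = 8000  # Hz; an assumption chosen to keep the example small


def tone(freq_hz, seconds, rate=SAMPLE_RATE):
    """Synthesize a sine tone as a stand-in for a short sound file."""
    n = int(seconds * rate)
    return [math.sin(2 * math.pi * freq_hz * i / rate) for i in range(n)]


# Hypothetical sound library: label -> short clip. A real system would
# search a sound database instead of using synthetic tones.
SOUND_LIBRARY = {
    "car": tone(110.0, 0.5),
    "bird": tone(1760.0, 0.25),
    "dog": tone(440.0, 0.3),
}


def select_sounds(labels, library=SOUND_LIBRARY):
    """Step 2: keep the clips for labels that the library knows about."""
    return [library[label] for label in labels if label in library]


def mix(clips, offsets, total_len):
    """Step 3: overlay clips at the given sample offsets, then peak-normalize."""
    out = [0.0] * total_len
    for clip, start in zip(clips, offsets):
        for i, sample in enumerate(clip):
            if start + i < total_len:
                out[start + i] += sample
    peak = max((abs(s) for s in out), default=0.0)
    if peak > 0:
        out = [s / peak for s in out]
    return out


def generate_soundscape(labels, seconds=2.0, rate=SAMPLE_RATE):
    """Run steps 2 and 3 for labels assumed to come from an image recognizer."""
    clips = select_sounds(labels)
    total_len = int(seconds * rate)
    # Spread clip start times evenly over the soundscape (an arbitrary choice).
    step = total_len // max(len(clips), 1)
    offsets = [i * step for i in range(len(clips))]
    return mix(clips, offsets, total_len)


# Step 1 (object recognition) is outside this sketch, so labels are hard-coded.
# "unicorn" has no entry in the library and is silently skipped.
soundscape = generate_soundscape(["car", "bird", "unicorn"])
```

The result is a list of float samples in the range [-1, 1]; it could be scaled to 16-bit integers and written to disk with Python's standard `wave` module if an audio file is needed.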