Exploring Data Quantity Requirements for Domain Adaptation in the Classification of Satellite Image Time Series

2019 10th International Workshop on the Analysis of Multitemporal Remote Sensing Images (MultiTemp)(2019)

引用 3|浏览33
暂无评分
摘要
Land cover maps are a vital input variable in all types of environmental research and management. However the modern state-of-the-art machine learning techniques used to create them require substantial training data to produce optimal accuracy. Domain Adaptation is one technique researchers might use when labelled training data are unavailable or scarce. This paper looks at the result of training a convolutional neural network model on a region where data are available (source domain), and then adapting this model to another region (target domain) by retraining it on the available labelled data, and in particular how these results change with increasing data availability. Our experiments performing domain adaptation on satellite image time series, draw three interesting conclusions: (1) a model trained only on data from the source domain delivers 73.0% test accuracy on the target domain; (2) when all of the weights are retrained on the target data, over 16,000 instances were required to improve upon the accuracy of the source-only model; and (3) even if sufficient data is available in the target domain, using a model pretrained on a source domain will result in better overall test accuracy compared to a model trained on target domain data only-88.9% versus 84.7%.
更多
查看译文
关键词
source domain,data availability,domain adaptation,test accuracy,target domain data,data quantity requirements,land cover maps,environmental research,machine learning techniques,substantial training data,labelled training data,convolutional neural network model,satellite image time series classification
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要