Multi-source and multimodal data fusion for improved management of a wastewater treatment plant

JOURNAL OF ENVIRONMENTAL CHEMICAL ENGINEERING(2023)

引用 0|浏览4
暂无评分
摘要
Wastewater treatment plants (WWTPs) are complex systems presenting stochastic, non-linear, and non stationary behavior, which makes their operational management very challenging. In this context, data collected from distributed sources across the plant play a central role in the optimized operation and control of WWTPs. However, even when available, the use of collected data is far from trivial due to the coexistence of asynchronous measurements, data with different granularity, measurements of different quality (precision, accuracy), multimodal sources (sensors, spectra, images, hyphenated instrumentation), among other aspects related to the data life cycle. Such heterogeneity in process data characteristics hinders the application of most off-the-shelf data analytics methods. Flexible solutions able to cope with the complexity of systems and of the data they generate are therefore necessary to overcome these limitations and enable an effective analysis and operation of WWTPs. In this article, data-fusion approaches for handling multiple heterogeneous sources of process data are developed and comparatively tested. Priority is given to solutions that can flexibly be adapted to different specific operational contexts. The methodologies are tested on an industrial case study (WWTP), where the concentration of a toxin in the effluent stream is to be predicted from available heterogeneous data. Single and multi-source modeling approaches are contemplated and a nested cross-validation method was developed to handle the time-series nature of the models. Bayesian fusion synergistically combines data from different sources considering their uncertainty, standing out among the methodologies tested as offering a good balance in terms of accuracy (RMSEP = 1.34), stability (prequential IQR = 0.034), and flexibility (to accommodate missing and new sources).
更多
查看译文
关键词
Heterogeneous data fusion,Multimodal data,Wastewater treatment plant,Industry 4.0,Data-driven modeling
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要