Generating Generic Data Sets for Machine Learning Applications in Building Services Using Standardized Time Series Data

Proceedings of the 36th International Symposium on Automation and Robotics in Construction (ISARC), IAARC (2019)

Florian Stinner, Yingying Yang, Thomas Schreiber, Gerrit Bode, Marc Baranski and Dirk Müller
Pages 226-233 (2019 Proceedings of the 36th ISARC, Banff, Canada, ISBN 978-952-69524-0-6, ISSN 2413-5844)

Abstract: Machine learning (ML) algorithms offer high potential for discovering appropriate energy efficiency measures for buildings with little manual effort. Although many building automation systems (BAS) record a large amount of data, technical systems such as boilers provide only a few data points per building. However, ML algorithms require training on a sufficient number of instances of a technical system in order to enable cross-building use. In contrast to electrical systems, few data sets of the actual operation of thermal systems are publicly available. Since 2012, the monitoring system in our test object has continuously provided threshold-based data with a maximum resolution of 1 minute. We monitor the plants, energy consumption and comfort parameters with 9239 data points in total. In this paper, we show how our published data set from this building is structured. In order to facilitate the use of ML, each data point receives a uniform label according to a previously developed approach. Since the documentation of ML data sets varies in the building sector, we show an approach to standardize data sets with special datasheets for thermal systems to provide sufficient information for the application of ML. We use the Brick Schema, a unified ontology standard for the description of topology in buildings, which is part of the future ASHRAE Standard 223P. We couple this with an approach we developed for the structured labeling of data points in buildings. We show how to semi-automatically generate physical models based on an open-source Modelica library from this ontology-based model. We show that the models, enriched with real time series data and data sheets, are in good agreement with the measured data. Finally, we show with an ML example that our approach based on Brick Schema and Modelica is able to deliver ML-compliant data sets.

Keywords: Standardized data sets; Machine Learning; Simulation; Modelica; Building Energy Systems
DOI: https://doi.org/10.22260/ISARC2019/0031
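The abstract notes that the monitoring system records threshold-based data at a maximum resolution of 1 minute, whereas ML workflows typically expect regularly sampled series. The following is a minimal sketch of one common way to bridge that gap: forward-filling the last recorded value onto a fixed 1-minute grid. The function name `to_regular_grid`, the `(timestamp, value)` input format, and the sample readings are all assumptions made for illustration; they do not describe the layout of the published data set or the authors' preprocessing.

```python
from datetime import datetime, timedelta


def to_regular_grid(events, start, end, step=timedelta(minutes=1)):
    """Forward-fill change-of-value samples onto a regular time grid.

    events: list of (timestamp, value) tuples sorted by timestamp
    (hypothetical input format, not the layout of the published data set).
    Grid points before the first event get None.
    """
    grid = []
    idx = 0
    last = None
    t = start
    while t <= end:
        # Consume every event recorded at or before this grid point.
        while idx < len(events) and events[idx][0] <= t:
            last = events[idx][1]
            idx += 1
        grid.append((t, last))
        t += step
    return grid


# Made-up supply temperature readings, logged only when the value changes.
events = [
    (datetime(2019, 1, 1, 0, 0), 60.0),
    (datetime(2019, 1, 1, 0, 3), 61.5),
]
grid = to_regular_grid(
    events,
    start=datetime(2019, 1, 1, 0, 0),
    end=datetime(2019, 1, 1, 0, 5),
)
values = [value for _, value in grid]
```

Forward-filling assumes a value stays constant until the next recorded change, which matches threshold-based logging in principle; whether it suits a particular data point depends on the threshold used for that point.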
Keywords
standardized time series data, generic data sets, building services, machine learning, machine learning applications