Synthetic Generation of Trip Data: The Case of Smart Card

Minh Kieu, Iris Brighid Meredith,Andrea Raith

Data Science for Transportation(2023)

引用 0|浏览0
暂无评分
摘要
While individual data are key for epidemiology, social simulation, economics, and various other fields, data owners are increasingly required to protect the personally identifiable information from data. Simple data de-identification or ‘data masking’ measures are limited, because they both reduce the utility of the dataset and are not sufficient to protect individual confidentiality. This paper provides detail on the creation of a synthetic trip data in transportation, with the Smart Card data as the case study. It discusses and compares two machine learning methods, a Generative Adversarial Network and a Bayesian Network for modelling and generating this dataset. The synthetic data retain important utility of the real dataset, e.g., the origin, destination, and time of travel, while each data point does not represent a real trip in the original dataset. The synthetic dataset can be used in various applications, including microsimulation of public transport systems, analysing travel behaviours, model predictive control of transit flows, or evaluation of transport policies.
更多
查看译文
关键词
Synthetic data, Generative Adversarial Network, Bayesian Network, Smart Card data
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要