G2A2: Graph Generator with Attributes and Anomalies
dc.contributor.author | Dey, Saikat | en |
dc.contributor.author | Jha, Sonal | en |
dc.contributor.author | Feng, Wu-chun | en |
dc.date.accessioned | 2024-08-07T12:10:37Z | en |
dc.date.available | 2024-08-07T12:10:37Z | en |
dc.date.issued | 2024-05-07 | en |
dc.date.updated | 2024-08-01T07:51:15Z | en |
dc.description.abstract | Many data-mining applications use dynamic attributed graphs to represent relational information; but due to security and privacy concerns, there is a dearth of publicly available datasets that can be represented as dynamic attributed graphs. Even when such datasets are available, they do not have ground truth that can be useful for classification problems, e.g., anomaly detection. Thus, researchers commonly generate synthetic graphs using either statistical or deep generative (DG) methods. However, neither approach produces ground truth. Statistical methods struggle to replicate intricate patterns found in real-world dynamic attributed graphs, while DG methods require a significant number of graphs for training. To address these shortcomings, we present G2A2, an automated graph generator with attributes and anomalies, which encompasses (1) probabilistic models to generate a dynamic bipartite graph, representing realistic time-evolving connections between two independent sets of entities, (2) realistic injection of anomalies for ground truth using a novel algorithm that captures the general properties of graph anomalies across domains, and (3) generative adversarial network (GAN) model to produce realistic attributes, learned from an existing real-world dataset. We also show that G2A2 is scalable and can generate a graph with a million edges in under a minute of computing time. Using the maximum mean discrepancy (MMD) metric to evaluate the realism of a G2A2-generated graph against three real-world graphs, G2A2 outperforms Kronecker graph generation by reducing the MMD distance by up to six-fold (6×). | en |
dc.description.version | Published version | en |
dc.format.mimetype | application/pdf | en |
dc.identifier.doi | https://doi.org/10.1145/3649153.3649206 | en |
dc.identifier.uri | https://hdl.handle.net/10919/120878 | en |
dc.language.iso | en | en |
dc.publisher | ACM | en |
dc.relation.ispartof | CF '24: Proceedings of the 21st ACM International Conference on Computing Frontiers | en |
dc.rights | Creative Commons Attribution 4.0 International | en |
dc.rights.holder | The author(s) | en |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | en |
dc.title | G2A2: Graph Generator with Attributes and Anomalies | en |
dc.type | Article - Refereed | en |
dc.type.dcmitype | Text | en |