Shojaee, ParshinZeng, YingyanWahed, MuntasirSeth, AviJin, RanLourentzou, Ismini2023-02-082023-02-082023-01http://hdl.handle.net/10919/113725Industrial Internet provides a collaborative computational platform for participating enterprises, allowing the collection of big data for machine learning tasks. Despite the promise of training and deployment acceleration, and the potential to optimize decision-making processes through data-sharing, the adoption of such technologies is impacted by the increasing concerns about information privacy. As enterprises prefer to keep data private, this limits interoperability. While prior work has largely explored privacy-preserving mechanisms, the proposed methods naively average or randomly sample data shared from all participants instead of selecting the most well-suited subsets for a particular downstream learning task. Motivated by the lack of effective data-sharing mechanisms for heterogeneous machine learning tasks in Industrial Internet, we propose PriED, a task-driven data-sharing framework that selectively fuses shared data and local data from participants to improve supervised learning performance. PriED utilizes privacy-preserving data distillation to facilitate data exchange, and dynamic data selection to optimize downstream machine learning tasks. We demonstrate performance improvements on a real semiconductor manufacturing case study.application/pdfenIn CopyrightAttentionData sharingData distillationIndustrial internetReinforcement learningA Task-Driven Privacy-Preserving Data-Sharing Framework for the Industrial InternetArticle - Refereed2023-02-07Jin, Ran [0000-0003-3847-4538]