Mastering Long-Tail Complexity on Graphs: Characterization, Learning, and Generalization

dc.contributor.authorWang, Haohuien
dc.contributor.authorJing, Baoyuen
dc.contributor.authorDing, Kaizeen
dc.contributor.authorZhu, Yadaen
dc.contributor.authorCheng, Weien
dc.contributor.authorZhang, Sien
dc.contributor.authorFan, Yonghuien
dc.contributor.authorZhang, Liqingen
dc.contributor.authorZhou, Daweien
dc.date.accessioned2024-09-04T12:15:12Zen
dc.date.available2024-09-04T12:15:12Zen
dc.date.issued2024-08-25en
dc.date.updated2024-09-01T07:48:29Zen
dc.description.abstractIn the context of long-tail classification on graphs, the vast majority of existing work primarily revolves around the development of model debiasing strategies, intending to mitigate class imbalances and enhance the overall performance. Despite the notable success, there is very limited literature that provides a theoretical tool for characterizing the behaviors of long-tail classes in graphs and gaining insight into generalization performance in real-world scenarios. To bridge this gap, we propose a generalization bound for long-tail classification on graphs by formulating the problem in the fashion of multi-task learning, i.e., each task corresponds to the prediction of one particular class. Our theoretical results show that the generalization performance of long-tail classification is dominated by the overall loss range and the task complexity. Building upon the theoretical findings, we propose a novel generic framework Hier- Tail for long-tail classification on graphs. In particular, we start with a hierarchical task grouping module that allows us to assign related tasks into hypertasks and thus control the complexity of the task space; then, we further design a balanced contrastive learning module to adaptively balance the gradients of both head and tail classes to control the loss range across all tasks in a unified fashion. Extensive experiments demonstrate the effectiveness of HierTail in characterizing long-tail classes on real graphs, which achieves up to 12.9% improvement over the leading baseline method in balanced accuracy.en
dc.description.versionPublished versionen
dc.format.mimetypeapplication/pdfen
dc.identifier.doihttps://doi.org/10.1145/3637528.3671880en
dc.identifier.urihttps://hdl.handle.net/10919/121070en
dc.language.isoenen
dc.publisherACMen
dc.rightsIn Copyrighten
dc.rights.holderThe author(s)en
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.titleMastering Long-Tail Complexity on Graphs: Characterization, Learning, and Generalizationen
dc.typeArticle - Refereeden
dc.type.dcmitypeTexten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
3637528.3671880.pdf
Size:
1.48 MB
Format:
Adobe Portable Document Format
Description:
Published version
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.5 KB
Format:
Item-specific license agreed upon to submission
Description: