Projects per year
Abstract / Description of output
Hierarchical clustering studies a recursive partition of a data set into clusters of successively smaller size, and is a fundamental problem in data analysis. In this work we study the cost function for hierarchical clustering introduced by Dasgupta, and present two polynomial-time approximation algorithms: Our first result is an O(1)-approximation algorithm for graphs of high conductance. Our simple construction bypasses complicated recursive routines of finding sparse cuts known in the literature. Our second and main result is an O(1)-approximation algorithm for a wide family of graphs that exhibit a well defined structure of clusters. This result generalises the previous state-of-the art, which holds only for graphs generated from stochastic models. The significance of our work is demonstrated by the empirical analysis on both synthetic and real-world data sets, on which our presented algorithm outperforms the previously proposed algorithm for graphs with a well-defined cluster structure.
Original language | English |
---|---|
Title of host publication | Advances in Neural Information Processing Systems |
Editors | M. Ranzato, A. Beygelzimer, Y. Dauphin, P.S. Liang, J. Wortman Vaughan |
Publisher | Curran Associates Inc |
Pages | 9278-9289 |
Number of pages | 12 |
Volume | 34 |
Publication status | Published - 6 Dec 2021 |
Event | Thirty-fifth Conference on Neural Information Processing Systems - Virtual Duration: 6 Dec 2021 → 14 Dec 2021 https://nips.cc/Conferences/2021 |
Publication series
Name | Advances in Neural Information Processing Systems |
---|---|
Volume | 34 |
ISSN (Print) | 1049-5258 |
Conference
Conference | Thirty-fifth Conference on Neural Information Processing Systems |
---|---|
Abbreviated title | NeurIPS 2021 |
Period | 6/12/21 → 14/12/21 |
Internet address |
Fingerprint
Dive into the research topics of 'Hierarchical Clustering: O(1)-Approximation for Well-Clustered Graphs'. Together they form a unique fingerprint.Projects
- 1 Active