Abstract
We address the problem of simultaneously learning a k-means clustering and a deep feature representation from unlabelled data, which is of interest because deep k-means has the potential to outperform traditional two-step strategies that perform feature extraction followed by shallow clustering. We achieve this by developing a gradient estimator for the non-differentiable k-means objective via the Gumbel-Softmax reparameterisation trick. In contrast to previous attempts at deep clustering, our concrete k-means model can be optimised with respect to the canonical k-means objective and is easily trained end-to-end without resorting to time-consuming alternating optimisation techniques. We demonstrate the efficacy of our method on standard clustering benchmarks.
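The relaxation mentioned in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the choice of negative squared distances as logits, the temperature value, and the helper names `gumbel_softmax`, `sq_dist`, and `relaxed_kmeans_loss` are all assumptions made for illustration. The sketch replaces the hard, non-differentiable nearest-centroid assignment with a relaxed one-hot sample, so that the k-means loss becomes a smooth function of the features and centroids.

```python
import math
import random


def gumbel_softmax(logits, temperature, rng):
    """Draw one relaxed one-hot sample from the Gumbel-Softmax distribution."""
    noisy = []
    for logit in logits:
        # Clamp the uniform draw away from 0 and 1 so both logs are finite.
        u = min(max(rng.random(), 1e-12), 1.0 - 1e-12)
        gumbel = -math.log(-math.log(u))  # Gumbel(0, 1) noise
        noisy.append((logit + gumbel) / temperature)
    # Numerically stable softmax over the perturbed, temperature-scaled logits.
    peak = max(noisy)
    exps = [math.exp(v - peak) for v in noisy]
    total = sum(exps)
    return [e / total for e in exps]


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def relaxed_kmeans_loss(features, centroids, temperature, rng):
    """Average of sum_k w_k * ||z - mu_k||^2, with w a Gumbel-Softmax sample.

    Assumption: logits are negative squared distances, so nearer centroids
    receive more weight. The source does not specify the parameterisation.
    """
    total = 0.0
    for z in features:
        dists = [sq_dist(z, mu) for mu in centroids]
        weights = gumbel_softmax([-d for d in dists], temperature, rng)
        total += sum(w * d for w, d in zip(weights, dists))
    return total / len(features)


# Toy data: four 2-D "features" standing in for encoder outputs, two centroids.
rng = random.Random(0)
features = [[0.0, 0.1], [0.2, 0.0], [5.0, 5.1], [5.2, 4.9]]
centroids = [[0.1, 0.05], [5.1, 5.0]]
loss = relaxed_kmeans_loss(features, centroids, temperature=0.5, rng=rng)
```

In an automatic-differentiation framework, the same computation would let gradients flow through the relaxed weights to both the encoder parameters and the centroids, which is what makes end-to-end training possible without alternating updates. Lowering the temperature pushes the weights towards a hard one-hot assignment.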
Original language | English |
---|---|
Title of host publication | ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
Publisher | Institute of Electrical and Electronics Engineers |
Pages | 4252-4256 |
Number of pages | 5 |
ISBN (Electronic) | 978-1-5090-6631-5 |
ISBN (Print) | 978-1-5090-6632-2 |
Publication status | Published - 14 May 2020 |
Event | 2020 IEEE International Conference on Acoustics, Speech, and Signal Processing - Barcelona, Spain; Duration: 4 May 2020 → 8 May 2020; Conference number: 45 |
Publication series
Publisher | IEEE |
---|---|
ISSN (Print) | 1520-6149 |
ISSN (Electronic) | 2379-190X |
Conference
Conference | 2020 IEEE International Conference on Acoustics, Speech, and Signal Processing |
---|---|
Abbreviated title | ICASSP 2020 |
Country/Territory | Spain |
City | Barcelona |
Period | 4/05/20 → 8/05/20 |
Keywords / Materials (for Non-textual outputs)
- Deep Clustering
- Unsupervised Learning
- Gradient Estimation