Abstract / Description of output
Estimating the geographical range of a species from sparse observations is a challenging and important geospatial prediction problem. Given a set of locations where a species has been observed, the goal is to build a model to predict whether the species is present or absent at any location. This problem has a long history in ecology, but traditional methods struggle to take advantage of emerging large-scale crowdsourced datasets which can include tens of millions of records for hundreds of thousands of species. In this work, we use Spatial Implicit Neural Representations (SINRs) to jointly estimate the geographical range of 47k species simultaneously. We find that our approach scales gracefully, making increasingly better predictions as we increase the number of species and the amount of data per species when training. To make this problem accessible to machine learning researchers, we provide four new benchmarks that measure different aspects of species range estimation and spatial representation learning. Using these benchmarks, we demonstrate that noisy and biased crowdsourced data can be combined with implicit neural representations to approximate expert-developed range maps for many species.
Original language | English |
---|---|
Title of host publication | Proceedings of the 40th International Conference on Machine Learning |
Publisher | PMLR |
Pages | 6320-6342 |
Number of pages | 23 |
Volume | 202 |
Publication status | Published - 10 Jul 2023 |
Event | The Fortieth International Conference on Machine Learning - Honolulu, United States Duration: 23 Jul 2023 → 29 Jul 2023 Conference number: 40 https://icml.cc/ |
Publication series
Name | Proceedings of Machine Learning Research |
---|---|
ISSN (Electronic) | 2640-3498 |
Conference
Conference | The Fortieth International Conference on Machine Learning |
---|---|
Abbreviated title | ICML 2023 |
Country/Territory | United States |
City | Honolulu |
Period | 23/07/23 → 29/07/23 |
Internet address |