Projects per year
Abstract / Description of output
We are entering the era of mega-scale genomics, which is causing computational issues for standard genomic evaluation models due to their cubic computational complexity. A number of scalable genomic evaluation models have been proposed, like the APY model, where genotyped animals are randomly partitioned into core and noncore subsets. While the APY model is a good approximation of the full standard model, the random partitioning can make results unstable, possibly affecting accuracy or even reranking individuals. In this contribution, we present alternative optimised constructions of the core subset and show how to use them to update the core subset with the arrival of new data. We compared constructions that were either (1) random; (2) optimised based on the value of diagonals of genomic relationship matrix; (3) optimised via random sampling with weights from (2); and (4) optimised using conditional sequential sampling algorithm. We have compared proposed constructions with the GBLUP setting and assessed their effect on accuracy and continuous rank probability scores (CRPS) of predictions. To understand the different constructions we have visualised the core subsets using non-linear dimension reduction technique UMAP - uniform manifold approximation and projection for dimension reduction. While the accuracy and CRPS of the proposed core subset constructions were mainly governed by the size of the core subset, the optimisation reduced variation compared to the standard random sampling construction. In addition to addressing the challenges caused by random sampling, sequential sampling algorithm was equally accurate when applied to the reduced-rank genotype matrix instead of the full one, and was easily expandable with the arrival of new data. Furthermore, there is an indication that the sequential sampling strategy is capturing the fine-scale population structure (e.g., paternal half-sib families in our study) as visualised by UMAP, spreading the core individuals across the given genotype space. We are further exploring the benefits of the proposed core subset constructions in non-homogeneous populations or populations with the unbalanced structure.
Original language | English |
---|---|
Pages | 458-458 |
Number of pages | 1 |
DOIs | |
Publication status | E-pub ahead of print - 3 Sept 2021 |
Event | European Association of Animal Science 72nd Annual Meeting : EAAP 2021 - Duration: 30 Aug 2021 → 3 Sept 2021 https://www.eaap.org |
Conference
Conference | European Association of Animal Science 72nd Annual Meeting |
---|---|
Period | 30/08/21 → 3/09/21 |
Internet address |
Fingerprint
Dive into the research topics of 'Optimised core subset construction for the APY model'. Together they form a unique fingerprint.Projects
- 2 Finished
Prizes
-
Best Presentation Award for the EAAP Genetic Commission
Pocrnic, Ivan (Recipient), 2021
Prize: Prize (including medals and awards)