TY - JOUR
T1 - The DoGA consortium expression atlas of promoters and genes in 100 canine tissues
AU - DoGA Consortium
AU - Hörtenhuber, Matthias
AU - Hytönen, Marjo K
AU - Mukarram, Abdul Kadir
AU - Arumilli, Meharji
AU - Araujo, César L
AU - Quintero, Ileana
AU - Syrjä, Pernilla
AU - Airas, Niina
AU - Kaukonen, Maria
AU - Kyöstilä, Kaisa
AU - Niskanen, Julia
AU - Jokinen, Tarja S
AU - Mottaghitalab, Faezeh
AU - Takan, Işıl
AU - Salokorpi, Noora
AU - Raman, Amitha
AU - Stevens, Irene
AU - Iivanainen, Antti
AU - Yoshihara, Masahito
AU - Gusev, Oleg
AU - Bannasch, Danika
AU - Sukura, Antti
AU - Schoenebeck, Jeffrey J
AU - Ezer, Sini
AU - Katayama, Shintaro
AU - Daub, Carsten O
AU - Kere, Juha
AU - Lohi, Hannes
N1 - © 2024. The Author(s).
PY - 2024/12
Y1 - 2024/12
N2 - The dog, Canis lupus familiaris, is an important model for studying human diseases. Unlike many model organisms, the dog genome has a comparatively poor functional annotation, which hampers gene discovery for development, morphology, disease, and behavior. To fill this gap, we established a comprehensive tissue biobank for both the dog and wolf samples. The biobank consists of 5485 samples representing 132 tissues from 13 dogs, 12 dog embryos, and 24 wolves. In a subset of 100 tissues from nine dogs and 12 embryos, we characterized gene expression activity for each promoter, including alternative and novel, i.e., previously not annotated, promoter regions, using the 5' targeting RNA sequencing technology STRT2-seq. We identified over 100,000 promoter region candidates in the recent canine genome assembly, CanFam4, including over 45,000 highly reproducible sites with gene expression and respective tissue enrichment levels. We provide a promoter and gene expression atlas with interactive, open data resources, including a data coordination center and genome browser track hubs. We demonstrated the applicability of Dog Genome Annotation (DoGA) data and resources using multiple examples spanning canine embryonic development, morphology and behavior, and diseases across species.
AB - The dog, Canis lupus familiaris, is an important model for studying human diseases. Unlike many model organisms, the dog genome has a comparatively poor functional annotation, which hampers gene discovery for development, morphology, disease, and behavior. To fill this gap, we established a comprehensive tissue biobank for both the dog and wolf samples. The biobank consists of 5485 samples representing 132 tissues from 13 dogs, 12 dog embryos, and 24 wolves. In a subset of 100 tissues from nine dogs and 12 embryos, we characterized gene expression activity for each promoter, including alternative and novel, i.e., previously not annotated, promoter regions, using the 5' targeting RNA sequencing technology STRT2-seq. We identified over 100,000 promoter region candidates in the recent canine genome assembly, CanFam4, including over 45,000 highly reproducible sites with gene expression and respective tissue enrichment levels. We provide a promoter and gene expression atlas with interactive, open data resources, including a data coordination center and genome browser track hubs. We demonstrated the applicability of Dog Genome Annotation (DoGA) data and resources using multiple examples spanning canine embryonic development, morphology and behavior, and diseases across species.
KW - Animals
KW - Dogs/genetics
KW - Promoter Regions, Genetic/genetics
KW - Genome
KW - Wolves/genetics
KW - Molecular Sequence Annotation
KW - Organ Specificity
KW - Gene Expression Profiling/methods
U2 - 10.1038/s41467-024-52798-1
DO - 10.1038/s41467-024-52798-1
M3 - Article
C2 - 39433728
SN - 2041-1723
VL - 15
SP - 1
EP - 12
JO - Nature Communications
JF - Nature Communications
IS - 1
M1 - 9082
ER -