A crowdsourcing database for the copy‑number variation of the Spanish population
Metadatos
Mostrar el registro completo del ítemEditorial
Springer Nature
Fecha
2023-03-09Referencia bibliográfica
López‑López et al. A crowdsourcing database for the copy‑number variation of the Spanish population. Human Genomics (2023) 17:20. [https://doi.org/10.1186/s40246‑023‑00466‑8]
Patrocinador
Ministry of Science and Innovation, Spain (MICINN) Spanish Government; Instituto de Salud Carlos III; European Commission; European Regional Development Fund (ERDF, "A way to make Europe"); PID2020-117979RB-I00 IMP/00019 IMP/00009 PI20/01305Resumen
Background Despite being a very common type of genetic variation, the distribution of copy-number variations (CNVs) in the population is still poorly understood. The knowledge of the genetic variability, especially at the level of the local population, is a critical factor for distinguishing pathogenic from non-pathogenic variation in the discovery of new disease variants.Results Here, we present the SPAnish Copy Number Alterations Collaborative Server (SPACNACS), which currently contains copy number variation profiles obtained from more than 400 genomes and exomes of unrelated Spanish individuals. By means of a collaborative crowdsourcing effort whole genome and whole exome sequencing data, produced by local genomic projects and for other purposes, is continuously collected. Once checked both, the Spanish ancestry and the lack of kinship with other individuals in the SPACNACS, the CNVs are inferred for these sequences and they are used to populate the database. A web interface allows querying the database with different filters that include ICD10 upper categories. This allows discarding samples from the disease under study and obtaining pseudo-control CNV profiles from the local population. We also show here additional studies on the local impact of CNVs in some phenotypes and on pharmacogenomic variants. SPACNACS can be accessed at: .Conclusion SPACNACS facilitates disease gene discovery by providing detailed information of the local variability of the population and exemplifies how to reuse genomic data produced for other purposes to build a local reference database.