Mostrar el registro sencillo del ítem

dc.contributor.authorC. Leist, Ivo
dc.contributor.authorRivas Torrubia, María
dc.contributor.authorAlarcón Riquelme, Marta Eugenia 
dc.contributor.authorBarturen Briñas, Guillermo
dc.contributor.authorG. Gut, Ivo
dc.contributor.authorRueda, Manuel
dc.date.accessioned2024-12-20T07:55:30Z
dc.date.available2024-12-20T07:55:30Z
dc.date.issued2024-12-04
dc.identifier.citationC. Leist, I. et. al. BMC Bioinformatics (2024) 25:373. [https://doi.org/10.1186/s12859-024-05993-2]es_ES
dc.identifier.urihttps://hdl.handle.net/10481/98317
dc.description.abstractBackground: Phenotypic data comparison is essential for disease association studies, patient stratification, and genotype–phenotype correlation analysis. To support these efforts, the Global Alliance for Genomics and Health (GA4GH) established Phenopackets v2 and Beacon v2 standards for storing, sharing, and discovering genomic and phenotypic data. These standards provide a consistent framework for organizing biological data, simplifying their transformation into computer-friendly formats. However, matching participants using GA4GH-based formats remains challenging, as current methods are not fully compatible, limiting their effectiveness. Results: Here, we introduce Pheno-Ranker, an open-source software toolkit for individual- level comparison of phenotypic data. As input, it accepts JSON/YAML data exchange formats from Beacon v2 and Phenopackets v2 data models, as well as any data structure encoded in JSON, YAML, or CSV formats. Internally, the hierarchical data structure is flattened to one dimension and then transformed through one-hot encoding. This allows for efficient pairwise (all-to-all) comparisons within cohorts or for matching of a patient’s profile in cohorts. Users have the flexibility to refine their comparisons by including or excluding terms, applying weights to variables, and obtaining statistical significance through Z-scores and p-values. The output consists of text files, which can be further analyzed using unsupervised learning techniques, such as clustering or multidimensional scaling (MDS), and with graph analytics. Pheno-Ranker’s performance has been validated with simulated and synthetic data, showing its accuracy, robustness, and efficiency across various health data scenarios. A real data use case from the PRECISESADS study highlights its practical utility in clinical research. Conclusions: Pheno-Ranker is a user-friendly, lightweight software for semantic similarity analysis of phenotypic data in Beacon v2 and Phenopackets v2 formats, extendable to other data types. It enables the comparison of a wide range of variables beyond HPO or OMIM terms while preserving full context. The software is designed as a command-line tool with additional utilities for CSV import, data simulation, summary statistics plotting, and QR code generation. For interactive analysis, it also includes a web-based user interface built with R Shiny. Links to the online documentation, including a Google Colab tutorial, and the tool’s source code are available on the project home page: https:// github. com/ CNAG-​Biome dical-​Infor matics/ pheno-​ranker.es_ES
dc.description.sponsorshipProject 3TR ith funding from the Innovative Medicines Initiative 2 Joint Undertaking (JU) under grant agreement No 831434es_ES
dc.description.sponsorshipEuropean Union’s Horizon 2020 research and innovation programme and EFPIAes_ES
dc.description.sponsorshipSpanish Instituto de Salud Carlos III, Fondo de Investigaciones Sanitarias and cofunded with ERDF funds (PI19/01772)es_ES
dc.description.sponsorshipSpanish Ministry of Science and Innovation through the Instituto de Salud Carlos III and the 2014–2020 Smart Growth Operating Programes_ES
dc.description.sponsorshipEuropean Regional Development Fund (MINECO/FEDER, BIO2015-71792-P)es_ES
dc.description.sponsorshipGeneralitat de Catalunya through the Departament de Salut and the Departament d’Empresa i Coneixementes_ES
dc.description.sponsorshipEU/EFPIA Innovative Medicines Initiative Joint Undertaking (PRECISESADS, grant n. 115565) including in-kind contributions from the EFPIA members involvedes_ES
dc.language.isoenges_ES
dc.publisherBMCes_ES
dc.rightsAtribución 4.0 Internacional*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/*
dc.subjectGA4GHes_ES
dc.subjectPhenopacket v2es_ES
dc.subjectBeacon v2es_ES
dc.titlePheno‑Ranker: a toolkit for comparison of phenotypic data stored in GA4GH standards and beyondes_ES
dc.typejournal articlees_ES
dc.rights.accessRightsopen accesses_ES
dc.identifier.doi10.1186/s12859-024-05993-2
dc.type.hasVersionVoRes_ES


Ficheros en el ítem

[PDF]

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem

Atribución 4.0 Internacional
Excepto si se señala otra cosa, la licencia del ítem se describe como Atribución 4.0 Internacional