The weights of PRS for idiopathic pulmonary arterial hypertension
Data files
Oct 25, 2025 version files 56.58 MB
-
README.md
718 B
-
Supplementary_Data.zip
56.58 MB
Abstract
This study aimed to develop a comprehensive prediction model integrating polygenic risk scores (PRS) and clinical factors to identify individuals at high risk for incident idiopathic pulmonary arterial hypertension (IPAH). A PRS was constructed using summary statistics derived from the largest genome-wide association study for pulmonary arterial hypertension in Europeans and validated in the UK Biobank (732 cases, 458,258 controls). After excluding individuals with IPAH at baseline, 316,073 participants (316 incident IPAH patients) were split into training and testing sets (7:3). Variable selection was performed using least absolute shrinkage and selection operator in the training set, and a prediction model for incident IPAH was established using the Cox proportional hazards model. The dataset here is the Supplementary Data for the manuscript titled "Polygenic risk score and clinical factors predict incident idiopathic pulmonary arterial hypertension". The dataset contains the weight for each single-nucleotide polymorphism used for constructing PRS for IPAH.
Dataset DOI: 10.5061/dryad.wm37pvmzx
Supplementary Data for PRS Construction
(Supplementary_Data.zip)
Description of the data and file structure
We constructed a genome-wide polygenic risk score (PRS) for idiopathic pulmonary arterial hypertension (IPAH) using summary statistics from a published pulmonary arterial hypertension (PAH) genome-wide association study (GWAS) (doi: 10.1016/S2213-2600(18)30409-0). The dataset comprises the variant-specific weights required for PRS construction. The file contains the assigned weight for every single-nucleotide polymorphism (SNP) included in the IPAH PRS model.
