Genetic risk scores can reveal hidden DNA information

Researchers have found that polygenic risk scores, which summarize a person's likelihood of developing diseases like diabetes and cancer, can be reverse-engineered to uncover underlying genetic data. This vulnerability raises privacy concerns, potentially allowing identification through public databases or reconstruction by insurers. The discovery highlights risks in sharing such scores, even anonymously.

Polygenic risk scores (PRS) aggregate the effects of numerous single-nucleotide polymorphisms (SNPs) in the genome to estimate disease predispositions. Companies like 23andMe and researchers use these scores to outline health risks, and individuals sometimes share them publicly for interpretation advice.

Traditionally viewed as low-risk for privacy due to the computational complexity of the knapsack problem—akin to deducing a phone number from its digits' sum—PRS are now shown to be exploitable. The key lies in the precise weights, up to 16 digits long, assigned to each SNP's contribution to disease risk, particularly in smaller models.

Gamze Gürsoy at Columbia University in New York explained: “Because the final polygenic risk score is constrained by a finite number of ways you could arrive at that number, and a statistically likely arrangement of the underlying SNPs, it can be deduced with a high degree of accuracy.” Alongside Kirill Nikitin, Gürsoy tested 298 PRS models using 50 or fewer SNPs on genetic data from 2353 individuals. By calculating possible genomes and filtering improbable mutations, they daisy-chained attacks across models, achieving 94.6 percent accuracy in reconstructing genotypes and predicting 2450 SNPs per person.

Notably, just 27 SNPs sufficed to identify someone in a database of 500,000 samples, with up to 90 percent precision for relatives. Individuals of African and East Asian descent faced higher identification risks due to underrepresentation in genetic databases. Gürsoy noted that 447 small, high-precision models in a public database are vulnerable.

“We wanted to point out that the risk is low, but under [some conditions], there might still be some leakage,” Gürsoy said, urging caution in research designs involving vulnerable groups. Ying Wang at Massachusetts General Hospital acknowledged existing data protections and computational limits but recommended treating small models as sensitive in clinical contexts and consent processes.

The findings stem from a preprint on bioRxiv (DOI: 10.64898/2026.02.16.706191).

Related Articles

Doctor reviewing PAM gene DNA model next to diabetes patient using Ozempic, showing genetic resistance to the drug
Image generated by AI

Study links PAM gene variants to weaker blood-sugar response to GLP-1 drugs such as Ozempic

Reported by AI Image generated by AI Fact checked

Researchers say genetic variants in the PAM gene may help explain why some people with Type 2 diabetes get less blood-sugar benefit from GLP-1 receptor agonist drugs such as Ozempic, a phenomenon they describe as “GLP-1 resistance.”

Researchers at the University of Geneva have developed MangroveGS, an AI model that predicts cancer metastasis risk with nearly 80% accuracy. The tool analyzes gene expression patterns in tumor cells, initially from colon cancer, and applies to other types like breast and lung. Published in Cell Reports, it aims to enable more personalized treatments.

Reported by AI Fact checked

Researchers from the University of Barcelona and the University of Oregon report that short DNA molecules known as polypurine reverse Hoogsteen hairpins (PPRHs) suppressed the PCSK9 gene and reduced blood cholesterol in a mouse model. In transgenic mice carrying the human PCSK9 gene, a single injection of one candidate (HpE12) cut plasma PCSK9 by 50% and total cholesterol by 47% three days later, according to findings published in Biochemical Pharmacology.

Researchers at Rockefeller University report that a new single-cell screening platform, PerturbFate, can trace how many different genetic disruptions converge on common regulatory programs that drive resistance to the melanoma drug vemurafenib, pointing to potential combination-therapy targets.

This website uses cookies

We use cookies for analytics to improve our site. Read our privacy policy for more information.
Decline