Genetic risk scores can reveal hidden DNA information

Researchers have found that polygenic risk scores, which summarize a person's likelihood of developing diseases like diabetes and cancer, can be reverse-engineered to uncover underlying genetic data. This vulnerability raises privacy concerns, potentially allowing identification through public databases or reconstruction by insurers. The discovery highlights risks in sharing such scores, even anonymously.

Polygenic risk scores (PRS) aggregate the effects of numerous single-nucleotide polymorphisms (SNPs) in the genome to estimate disease predispositions. Companies like 23andMe and researchers use these scores to outline health risks, and individuals sometimes share them publicly for interpretation advice.

Traditionally viewed as low-risk for privacy due to the computational complexity of the knapsack problem—akin to deducing a phone number from its digits' sum—PRS are now shown to be exploitable. The key lies in the precise weights, up to 16 digits long, assigned to each SNP's contribution to disease risk, particularly in smaller models.

Gamze Gürsoy at Columbia University in New York explained: “Because the final polygenic risk score is constrained by a finite number of ways you could arrive at that number, and a statistically likely arrangement of the underlying SNPs, it can be deduced with a high degree of accuracy.” Alongside Kirill Nikitin, Gürsoy tested 298 PRS models using 50 or fewer SNPs on genetic data from 2353 individuals. By calculating possible genomes and filtering improbable mutations, they daisy-chained attacks across models, achieving 94.6 percent accuracy in reconstructing genotypes and predicting 2450 SNPs per person.

Notably, just 27 SNPs sufficed to identify someone in a database of 500,000 samples, with up to 90 percent precision for relatives. Individuals of African and East Asian descent faced higher identification risks due to underrepresentation in genetic databases. Gürsoy noted that 447 small, high-precision models in a public database are vulnerable.

“We wanted to point out that the risk is low, but under [some conditions], there might still be some leakage,” Gürsoy said, urging caution in research designs involving vulnerable groups. Ying Wang at Massachusetts General Hospital acknowledged existing data protections and computational limits but recommended treating small models as sensitive in clinical contexts and consent processes.

The findings stem from a preprint on bioRxiv (DOI: 10.64898/2026.02.16.706191).

Makala yanayohusiana

A new book by bioethicist Daphne O. Martschenko and sociologist Sam Trejo explores the implications of polygenic scores in genetic testing, highlighting potential inequalities and myths surrounding genetics. Through their 'adversarial collaboration,' the authors debate whether such research can promote equity or entrench social divides. They call for stricter regulation to ensure responsible use.

Imeripotiwa na AI

Researchers at the University of Geneva have developed MangroveGS, an AI model that predicts cancer metastasis risk with nearly 80% accuracy. The tool analyzes gene expression patterns in tumor cells, initially from colon cancer, and applies to other types like breast and lung. Published in Cell Reports, it aims to enable more personalized treatments.

Researchers have discovered that DNA in newly fertilized eggs forms a structured 3D scaffold before the genome activates, challenging long-held assumptions. Using a new technique called Pico-C, scientists mapped this organization in fruit fly embryos. A related study shows that disrupting this structure in human cells triggers an immune response as if under viral attack.

Imeripotiwa na AI

Researchers at UC San Francisco and Wayne State University found that generative AI can process complex medical datasets faster than traditional human teams, sometimes yielding stronger results. The study focused on predicting preterm birth using data from over 1,000 pregnant women. This approach reduced analysis time from months to minutes in some cases.

Jumapili, 3. Mwezi wa tano 2026, 19:21:24

PhilHealth expands Z Benefits for rare diseases

Ijumaa, 1. Mwezi wa tano 2026, 03:57:54

DNA hairpin molecules lowered cholesterol by 47% in mice by silencing PCSK9, researchers report

Jumatatu, 13. Mwezi wa nne 2026, 11:00:31

Genes account for half of human lifespan variation, study shows

Jumatano, 1. Mwezi wa nne 2026, 08:41:16

SCORE study reveals analytical variability in social sciences

Jumanne, 10. Mwezi wa tatu 2026, 20:27:47

AI system tests century-old theory on cancer origins

Tovuti hii inatumia vidakuzi

Tunatumia vidakuzi kwa uchambuzi ili kuboresha tovuti yetu. Soma sera ya faragha yetu kwa maelezo zaidi.
Kataa