Generative AI outperforms human teams in analyzing medical data

Researchers at UC San Francisco and Wayne State University found that generative AI can process complex medical datasets faster than traditional human teams, sometimes yielding stronger results. The study focused on predicting preterm birth using data from over 1,000 pregnant women. This approach reduced analysis time from months to minutes in some cases.

Scientists at UC San Francisco and Wayne State University conducted a real-world test of generative AI in health research, comparing its performance to human experts. The task involved predicting preterm birth, a leading cause of newborn death in the United States, where about 1,000 babies are born prematurely each day. The researchers used microbiome data compiled from approximately 1,200 pregnant women across nine studies, sourced from the March of Dimes Preterm Birth Data Repository.

To evaluate AI capabilities, the team drew on datasets from the DREAM crowdsourcing competition, which previously involved over 100 global teams developing machine learning models for preterm birth risks and gestational age estimation. Human participants in that competition took about three months to build models, followed by nearly two years to consolidate and publish findings.

In the new study, eight AI chatbots were given natural language prompts to generate analytical code without direct human programming. Only four of the systems produced usable code, but those that succeeded matched or exceeded the performance of human teams. For instance, a junior pair—a UCSF master's student, Reuben Sarwal, and a high school student, Victor Tarca—developed prediction models with AI support, generating functional code in minutes rather than hours or days required by experienced programmers.

The entire process, from inception to journal submission, took just six months. "These AI tools could relieve one of the biggest bottlenecks in data science: building our analysis pipelines," said Marina Sirota, PhD, professor of Pediatrics at UCSF and principal investigator of the March of Dimes Prematurity Research Center. Co-senior author Adi L. Tarca, PhD, from Wayne State University, added, "Thanks to generative AI, researchers with a limited background in data science won't always need to form wide collaborations or spend hours debugging code. They can focus on answering the right biomedical questions."

The study, co-authored by Sirota and Tarca, emphasizes that AI requires human oversight to avoid misleading results. It was published in Cell Reports Medicine on February 17, highlighting potential for faster progress in understanding preterm birth risk factors.

相关文章

Radiologist and AI system struggling to identify deepfake X-ray images in a medical study.
AI 生成的图像

Study finds radiologists and AI models struggle to spot AI-generated “deepfake” X-rays

由 AI 报道 AI 生成的图像 事实核查

A study published March 24, 2026 in *Radiology* reports that AI-generated “deepfake” X-rays can be convincing enough to mislead radiologists and several multimodal AI systems. In testing, radiologists’ average accuracy rose from 41% when they were not told fakes were included to 75% when they were warned, highlighting potential risks for medical imaging security and clinical decision-making.

Researchers at the University of Michigan have developed an AI system called Prima that interprets brain MRI scans in seconds, identifying neurological conditions with up to 97.5% accuracy. The tool also flags urgent cases like strokes and brain hemorrhages, potentially speeding up medical responses. Findings from the study appear in Nature Biomedical Engineering.

由 AI 报道

At the Game Developers Conference 2026 in San Francisco, generative AI tools drew mixed reactions, with demos from Google highlighting potential uses amid widespread developer skepticism. A recent industry report showed 52% of companies using the technology, but only 36% of workers incorporating it into their jobs, and 52% viewing it as harmful to the sector.

Researchers at the University of Geneva have developed MangroveGS, an AI model that predicts cancer metastasis risk with nearly 80% accuracy. The tool analyzes gene expression patterns in tumor cells, initially from colon cancer, and applies to other types like breast and lung. Published in Cell Reports, it aims to enable more personalized treatments.

由 AI 报道

韩国商会工业联合会(KCCI)的一项民调显示,韩国工作者借助生成式AI平台,平均每周工时减少8.4小时,整体17.8%。超过一半受访者每天使用此类工具,信息和电信行业采用率最高。

A New York Times analysis shows Google's AI Overviews, powered by Gemini, answering correctly only 90% to 91% of questions in a standard benchmark. This translates to tens of millions of incorrect responses daily across searches. Google disputes the test's relevance.

此网站使用 cookie

我们使用 cookie 进行分析以改进我们的网站。阅读我们的 隐私政策 以获取更多信息。
拒绝