Hoping more folks like him defend the guardrails.
ill-incentives have always influenced academia, but I’m hoping we’re able to walk it back a bit
In closing, my Redfin escapism has shifted to LLM medical escapism, I know better but if you don’t or you are in even more dire straits, it provides such an illusion of hope and that’s dangerous.
peer review is built to assume good faith work by people who are all part of a community of scholarship, it can partially hold up to people within the community gaming metrics. if people are just going to appear, game the system to publish some papers, and then disappear into their real careers, there's no hope of this working.
i don't understand why residencies want med students to publish papers anyway. it's very difficult to do good scientific research, it requires training, time, and almost always apprenticeship. none of this is part of the medical school curriculum, which is why we need special MD-PhD programs for people who want to do both. nobody expects that doing a PhD in biology or epidemiology would give you any clinical know-how, why is it reasonable to expect the reverse?
If medical residents, or teaching hospitals, want people to do research, they should go get funding from established research funding sources that have standards and practices for funding and monitoring research.
For residency, the two most important things are: 1) board scores. 2) research output.
It's not uncommon to see 40-50 publications for competitive residencies.
incentives, incentives, incentives.
1. You get a 4 year degree in college. You hopefully get a very good GPA. You need to do so-called pre-med classes that really don't have much to do with medical education but are known as "weed out" classes, particularly Organic Chemistry. If you don't do these in your 4 year degree, you can do a program afterwards called a post-BAC;
2. At some point you take the MCAT. You may need to take it multiple times to get a sufficient score;
3. You apply to med school with your transcripts, any relevant experience, your MCAT, a personal statement and letters of recommendation. This is an onerous process. Demand greatly exceeds supply. You will need to do an interview (if you get that far);
4. If you get accepted you will do a 4 year program that's broadly characterized as MD or DO. It's easier to get into a DO school but they have worse match rates into residency, particularly for competitive specialties. There's also the international option, particularly Caribbean schools. They have even worse match rates;
5. Now begins the US Medical License Exam ("USMLE") process to become a doctor. You take Step 1 as an M2 (second year medical student). Typically the first 2 years of med school are academic. The last 2 are mostly clinical where you do rotations in various specialties;
6. As an M4 you have to do these rotations as well as take Step 2 (of the USMLE) and do your residency applications. This is probably the most stressful part because you can end up unmatched and then you've spent $400-800k+ to not become a doctor, at least not immediately and probably not in your preferred specialty;
7. To apply for residency you apply to programs, hopefully get an interview and then submit an application for each program you're interested in. This again includes letters of recommendation (very important), transcripts, your Step 2 results (Step 1 is now pass/fail, more on this below), research, etc. Applicants rank their programs. Programs rank their applicants. A matchin algorithm compares the two and attempts to essentially place each applicant in their most preferred program. Not all specialties do this. You can also attempt to match as a couple (usually used by married people);
8. If you match you're now contractually obligated to do that program. Depending on the specialty it's going to be 3-7 years, more if you do a fellowship afterwards. You basically get paid minimum wage for that entire time. Somewhere in there you need to take Step 3 and at the end do your medical boards to be licensed to operate independently as a medical doctor.
9. If you don't match, it gets real awkward. You either scramble for an open spot (a process called the SOAP), extend medical school for a 5th year (so you don't have the stink of having failed to match, seriously) or do a research year to improve your odds next year. Note that you can match into incomplete programs (eg an intern year only program).
So, let's do the math. In a perfect world you graduate high school at 18, college at 22, get accepted immediately, graduate medical school at 26, match immediately and then complete residency at 33 (for a general surgery residency program). That's a lot of education and training. You likely have $400k=$1M in debt by this point. And only now do you earn a real income.
But it often doesn't go that way. You may fail to get into medical school the first time. You may not have realized you wanted to have been a doctor so you had to do 1-2 years of a post-BAC. So you might be 25-26 before you start medical school. You may fail to match or not try and do a research year. Or you might do an MD-PhD program and take a few extra years to graduate. Combined with a fellowship, that 33 years of age might turn into 40 years old.
So one thing that changed in the last few years is that Step 1 went from a score to pass/fail. This is ostensibly to reduce the stress of having a bad score. Some med schools are also pass/fail rather than having a class ranking. What this means in practice is that school reputation and ranking become more important. These are harder to get into obviously so it has a knock-on effect into undergrad. So if you go to Harvard undergrad, you'll generally have a better chance of going to a T20 med school. But how do you get into Harvard?
But let me bring this long-winded thing back to research. Over the past decade, the number of research items for each matched resident has massively increased, more than doubled in some cases. Some med schools are research-heavy so going to those has become a competitive advantage. It means people who successfully match into a competitive specialty are more likely to take a research year before applying. This is particularly true for neurosurgery.
Income potential and lifestyle massively vary. Primary care (family medicine) and pediatrics have awful earning potential. Any surgical specialty, dermatology (I honestly don't understand this one) and radiology have much higher earning potential. The difference can be 5x or more.
So I guess this is a really long way of saying that churning out low-quality research is resume-padding. Residency programs don't even tend to care about the quality of your research. It's just the number of research items you have. Increased competitiveness of certain programs combined with reduced signal in other areas (particularly Step 1 going pass/fail) may have exacerbated the situation.
So anyone who complains about how much doctors earn should look at the time it takes and the years of exploitation as a resident. Maybe a doctor wouldn't be so expensive if it wasn't so expensive to become a doctor. You will also find a large number of physicians who would take a big pay cut if they didn't have to deal with insurance.
By your definition, every human endeavor is dismal and always has been - all are corrupted and flawed to some degree. Is there evidence that current science is more dismal than others or than before? You can look at any day in history and see people saying the same things about how it's so dismal and not like the good old days.
> No wonder the general public distrusts "the intelectual elite", we deserved it.
The general public has no idea about scientific publishing, publish or perish, or the distorted incentives it creates. Science has delivered at an incredible level for centuries, arguably more than any other human enterprise. Covid-19 vaccines were available in record time - it wasn't the science that caused it to go somewhat off the rails.
People are people.
It obviously has flaws, and we should never stop trying to improve it, but I think AI can be a great way to help connect a bunch of information they have to a bunch of information they don't, or to help spot patterns and potential avenues that they happened to miss. And obviously you want to be careful about becoming over-reliant on it, or being too trusting when it can be wrong. But I think we've been at the point for a long time where a doctor using a search engine to find medical literature should be a very reasonable thing to do, and I think AI can at least be an incremental (but massive) improvement on that workflow.
But I hope the end result of that is that doctors can not only deliver better (and maybe better-informed and more open-minded) treatment, but can spend the time focusing on patient care, managing expectations / risk management around uncertainty, managing the emotions inherent in someone who may be losing their life or the lives of their loved ones. Those are things AI is definitely not well-equipped to do as you point out.
It's medical students, not residents, who take research years, and that's only for extremely competitive specialties.
The lack of doctors, as it has always been, is because of the shortage of residency spots.
That cannot be addressed without Congress reversing 50+ years of neoliberalism trends and political failure and refusal to invest in public services and/or a communist revolution. Good luck fixing that.
Exactly. The people at the John Hopkins Camel-Winston Center for Respiratory Research know that Camel and Winston are gonna stop cutting the checks if they don't get something they can trot out to support them when they get hauled in front of congress even if that's only a minority of the work output they funded.
Even if nobody means to do evil the evil will be done just as a result of people factoring that in subconsciously and the pressure that applies systemically.
Where is really gets spicy is when you have dueling funding sources. Where I went to school you had the environmental compliance industry funding the public policy research to say that nobody should be allowed to install a fencepost without paying their way through some hoops while the Kochs (through some indirection) were funding different research in the same department to say no akshually that compliance stuff is making us all poorer.
MD doctors poll at extraordinarily high levels of trust, over almost any other professional group in the United States. So it really isn't correct to directly link this article's topic to "distrust". The effect you're talking about may exist in science, but this article is essentially a counter example to the effect you propose: clinicians publishing bullshit, but retain a high level of public trust.
Especially because the article is basically entirely quoting practicing scientists who identified this problem in the first place! More real scientific training or collaborating for clinicians who want to (or have to) do research could potentially improve the situation.
For certain specialties, the number of residency positions is so limited that medical students have to publish research to be competitive, even if they have no interest in doing so later in their career.
But altogether I sort of agree, the incentives are pretty maligned such that for many it's just easier to become a bad scientist with more publications than a good one with fewer.
Attendings and existing residents are consulted in the ranking process, they are picking people they will have to work pretty closely with for 4 years, they have skin in the game. Why does anyone put any weight on such a clearly bogus metric?
Botox and other cosmetic procedures. In any big city you can find swanky dermatology practices offering expensive cosmetic procedures to rich people.
Totally different if someone's self image is that of a researcher for benefit of humankind or if they pick the career because they want to drive a Porsche.
Such obvious common sense appears not obvious after all.
But certainly we should always approach the literature critically, including the author list, journal of publication and its peer-review practices, and the methods.
Do most medical students publish useless case studies trying to jockey for residency spots and signal hustle/devotion? No doubt!
But there are a good handful of medical students who are still (surprisingly) in it for the medicine and not the money. And that handful is exceedingly capable; no reason they can’t publish valuable work with the right collaborators and resources.
- Albert Einstein
But there are multiple issues that contribute to shortages. Just like with homes (as you brought up), there can be homes where nobody wants to live. Likewise, some specialties never fill all their places. So earning potential is a factor. Not wanting to live rural is often also a factor, despite efforts to attract people to both using things like PSLF. PSLF itself is on shaky ground under this administration and you will see physicians unwilling to sacrifice career potential for a program that won't trust will be there to forgive their debt.
And then there's burn out. Many doctors leave the profession in their 40s and 50s. And if you didn't really become a doctor until your 30s, that's a relatively short professional life. But why do they burn out? Insane hours, administration, insurance, work-life balance are all up there.
I'm mostly saying that being reproducible should become a higher badge of quality, right now reviewers in cartels can boost a researchers credibility by accepting each others articles to papers to let them become "influential" and money is then redirected even more to bullshit research (ie pure waste).
If up to 50% of research grants is spent on bullshit research based on fraud, spending 10% by earmarking it for reproduction to weed out irreproducible fraud is money well spent.
How do you propose the interested public make the distinction between genuine engagement and forced encouragement? Isn't it the task of journals to make that distinction before publishing? I don't think you can fault the public for dismissing everything out of hand when both academia and the journals are actively turning scientific publishing into a market for lemons.
Would you publish if the head honcho of your double-blind study insists to know what treatment a certain patient is receiving?
You have this discussion about research ethics and subsequent beratement once, and then you either mentally check out or go to another hospital.
"I'm a software engineer, I'm sure if I had the time to study Neuroscience, I'd figure out what all of these researchers failed to realize all these decades! I (alone) have the magic of critical and logical thinking"
Despite h-index claiming to balance quantity and quality, it obviously incentives quantity over quality (no single publication can increment h-index as much as churning out a few worthless publications that cite each other); med students overwhelmingly follow those incentives trying to secure better residencies
Medicine was among the worst if not the worst according to him. Didn't really want much to do with it anymore. Basically a case of subpar statistical knowledge and bad incentives.
But I agree, when youre on the internet no ones knows you're a dog.
Every morning, Joshua Wang sits down at his computer with a pastry and a can of cold, black coffee to look for the latest papers based on data from a popular research platform called TriNetX. Studies based on the platform—which provides access to anonymized electronic health records for more than 300 million patients in the United States and abroad—have skyrocketed in recent years. Wang, a neuroscientist at Taipei Tzu Chi Hospital who trains researchers there to use TriNetX, has noticed another trend, too. Some results, he says, look “a bit dodgy.”
He and others say the easy-to-use platform may be allowing inexperienced researchers—potentially aided by artificial intelligence (AI)—to churn out unreliable and bias-ridden studies with unrivaled speed. “We’ve seen a lot of these TriNetX studies, and they all seem to have very similar flaws,” says Samy Suissa, a pharmacoepidemiologist at McGill University. “They seem to always find these spectacular effects, remarkable benefits for drugs on all kinds of outcomes.”
In 2025, nearly 2700 publications mentioned TriNetX in the title or abstract, up from just 33 only 5 years prior, according to the Dimensions database, which tracks abstracts and citations. Less than halfway through this year, the number already exceeds 2100.
The rise mirrors recently reported trends seen in papers using publicly available health data sets, which are authored primarily by researchers in China. But TriNetX is only open to users at participating health care organizations, and most TriNetX papers come from authors at U.S. medical schools, often with a physician-in-training as lead author. Medical schools use TriNetX as a research training ground, and the resulting papers are a relatively easy way for medical students to boost their CVs before applying for residencies. “There is no substitute for learning this process than by doing it,” says Lisa Howley of the Association of American Medical Colleges (AAMC).
But the combination of inexperienced users and TriNetX’s push-button analysis tools can lead to shoddy publications, which often do not correct for potential biases that can make treatments appear more effective than they are. And because the data can be analyzed so quickly, users can easily cherry-pick positive results for publication, a practice known as p-hacking. “The flow of false discoveries is hugely greater,” says Matt Spick, a health-data scientist at the University of Surrey.
“My biggest concern,” Wang says, “is that doctors in 10 years’ time want to look into a particular concept and they go into the literature and everything is just associated with everything.”
TriNetX Chief Scientific Officer Jeffrey Brown agrees users need epidemiologic and statistical expertise and that papers should undergo robust peer review. But, he adds, “There’s more research happening, and I think that that’s good.”
Wang and many other researchers disagree. As one example, they point to a TriNetX paper published in the MDPI journal Cancers that made the news for finding what the authors described as “compelling evidence” that popular GLP-1 weight-loss drugs lower the risk for a long list of cancers in obese people. The paper failed to mention, let alone correct for, two key biases that can skew results in favor of the treatment being studied, called collider bias and immortal-time bias.
Collider bias can arise when both an exposure—for example, to a weight-loss drug—and an outcome such as cancer drive health care use, the so-called collider. The bias can create a spurious negative correlation between the exposure and outcome. Immortal-time bias can occur when researchers compare outcomes between patients who receive a certain treatment after a health event—say, a heart attack—and those who do not get the treatment, because any patient who dies before treatment automatically becomes part of the untreated group. That group then appears to have higher mortality. It’s “just an awful paper,” Suissa says.
Spick notes the drug being “miraculously protective” across several unrelated organ systems is “implausible,” given that “cancers are wildly different and have wildly different causes.” Neither of the paper’s two corresponding authors—one of whom had undisclosed ties to a manufacturer of weight-loss drugs—responded to emailed questions from Science.
In other cases, papers claim to have used TriNetX to do analyses the platform doesn’t in fact offer. Wang came across a paper published last year in Angiology suggesting diabetes drugs called gliflozins could reduce the risk of death after a heart attack. The authors, physicians at three top-tier U.S. medical schools, wrote they had conducted a key step to correct for immortal-time bias within TriNetX. Wang knew TriNetX offers no such tool.
“It really got me going,” says Wang, who has written dozens of letters to the editor pointing out problematic methods in published research using the software. “Either they have falsified their methods or they have uncritically copied a method sentence from a different article or from an AI output. … I think both are pretty scary.”
In response to questions from Science, the study’s first author, Rochell Issa, a final-year internal medicine resident at the Cleveland Clinic, defended the work, reiterating the methods as written in the paper. She stopped responding to follow-up questions about exactly how they had executed the analysis on the platform. Another author of the Angiology paper, David Kaelber, chief health informatics officer at the MetroHealth System in Cleveland and associate professor at Case Western Reserve University, denied using AI “to generate the methods or approach used in this or any of our studies.”
But Wang and colleagues asked seven large language models (LLMs) how to use TriNetX to complete a key step in correcting for immortal-time bias. Six suggested methods that were impossible to implement on the platform, they reported in the European Journal of Epidemiology. The researchers then searched TriNetX papers for the impossible approaches suggested by the LLMs and found eight papers, including the Angiology study. In five cases, medical students or residents in the U.S. appeared on the papers’ author lists, typically as first authors. Wang has since found five more papers suffering from the same problem.
TriNetX argued in a published response to Wang’s findings that the eight studies represent “a tiny fraction” of the work done with its software. In addition, the TriNetX authors write, descriptions of methods that are impossible to execute “can plausibly arise from misunderstanding, ambiguous terminology, incomplete reporting, or analysis performed outside the platform.”
The number of papers published annually that mention TriNetX has spiked in recent years, from single digits in the late 2010s to thousands in 2025 and ’26.
(Graphic) C. Bickel/Science; (Data) Dimensions database, Digital Science & Research Solutions Inc.
Kaelber, who has amassed 125 TriNetX publications, according to the Dimensions database—more than anyone else in the world—told Science that concerns about low-quality, unnecessary, and bias-ridden studies done on the platform are “totally valid.” One key “is transparency as to all of the design and TriNetX platform configuration decisions.” But neither he nor any other author contacted for this story agreed to share their TriNetX query parameters.
The problematic studies can influence patient care, says Brian VanderBeek, an ophthalmologist at the University of Pennsylvania who recently highlighted potential biases in a pair of TriNetX studies that suggested the food supplements turmeric and melatonin could drastically cut the risk of serious eye disease. “There may be some danger in that a physician could be falsely led to believe that there’s a protective effect,” VanderBeek says.
For its part, AAMC, which governs the residency application process in the U.S., is trying to address the problem of quick-and-dirty papers, Howley says. For the coming cycle, it will ask applicants to shift the focus of their publications list “from quantity to quality, emphasizing meaningful contributions, depth of involvement, and the impact of applicants’ work.”
Meanwhile, Wang continues his daily vigil, and he is working to promote best practices. At his own hospital, researchers seeking access to TriNetX must first complete a 1-hour training session with him. A lot of that time, he says, is spent illustrating how easy it is to get “beautiful-looking” but meaningless results. The hope, he says, is to “try and instill a little bit of fear so that they don’t run off and churn it out.”