AI Uncovers Hidden Side Effects in Weight-Loss Drugs via Social Media Analysis

Patients using semaglutida and tirzepatida experienced unreported side effects including irregular menstrual cycles, intermenstrual bleeding, chills, fever-like sensations, and fatigue.
Clinical trials catch what kills you. Social media catches what makes you miserable.
The difference between formal drug testing and what patients actually experience in their daily lives.

Em meio à explosão global do uso de medicamentos para perda de peso, pesquisadores da Universidade da Pensilvânia voltaram sua atenção não para laboratórios, mas para as conversas cotidianas de dezenas de milhares de pacientes na internet. Usando inteligência artificial para decifrar mais de 400 mil publicações no Reddit ao longo de cinco anos, eles encontraram algo que os ensaios clínicos formais não haviam registrado: um padrão consistente de efeitos colaterais — ciclos menstruais irregulares, calafrios, fadiga persistente — vividos por pessoas reais, mas ausentes das bulas oficiais. O achado levanta uma questão antiga com urgência renovada: quem define o que conta como evidência médica, e a quem essa definição serve?

  • Milhões de pessoas tomam semaglutida e tirzepatida sem saber que efeitos como sangramento irregular e sensações de febre sequer constam nos registros oficiais dos medicamentos.
  • A linguagem fragmentada e variada com que pacientes descrevem seus sintomas online tornava invisível, até agora, um padrão que se repetia em milhares de conversas.
  • Modelos de linguagem como GPT e Gemini conseguiram atravessar essa variação linguística e identificar queixas recorrentes que pesquisadores humanos dificilmente conectariam.
  • Os autores do estudo, publicado na revista Nature Health, argumentam que a análise de redes sociais pode sinalizar problemas em semanas — enquanto ensaios clínicos tradicionais levam anos.
  • O próximo passo é expandir a pesquisa para além do Reddit e do inglês, buscando confirmar se os mesmos padrões surgem em outras línguas, plataformas e populações ao redor do mundo.

Por mais de cinco anos, uma equipe da Universidade da Pensilvânia vasculhou o Reddit com o auxílio de inteligência artificial, analisando mais de 400 mil publicações de cerca de 70 mil usuários que tomavam semaglutida ou tirzepatida — medicamentos amplamente prescritos para perda de peso e controle glicêmico. O que emergiu dessa análise foi um conjunto de queixas físicas ausentes dos ensaios clínicos formais e das bulas: ciclos menstruais irregulares, sangramentos intermenstruais, calafrios, sensações de febre e fadiga persistente.

O obstáculo central sempre foi a linguagem. Pacientes descrevem os mesmos sintomas de formas radicalmente diferentes — um diz que sente frio o tempo todo, outro fala em ondas de calor, um terceiro menciona que está sempre cansado. Modelos de linguagem de grande escala, como GPT e Gemini, conseguem atravessar essa variação, reconhecer o padrão subjacente e quantificá-lo. É uma capacidade que escapa à leitura humana em escala.

Lyle Ungar, professor de sistemas de informação da Penn e coautor do estudo, destacou que ensaios clínicos são projetados para capturar os efeitos mais graves, mas frequentemente ignoram o que de fato incomoda os pacientes no dia a dia. Sharath Chandra Guntuku, autor sênior da pesquisa, acrescentou outro argumento: velocidade. Quando medicamentos como esses saltaram de uso restrito para adoção massiva quase da noite para o dia, a janela para detectar problemas pelos meios convencionais já havia se fechado.

Os pesquisadores deixam claro que não propõem substituir a ciência clínica rigorosa, mas complementá-la. O próximo passo é expandir a análise para outras plataformas e idiomas, verificando se os padrões se repetem em diferentes populações. Se confirmados, esses dados poderiam alertar médicos sobre efeitos que seus pacientes vivenciam, mas não relatam na consulta — por não associarem os sintomas ao medicamento, ou simplesmente por não acharem relevante mencionar.

A team of researchers at the University of Pennsylvania spent more than five years combing through Reddit, collecting and analyzing over 400,000 posts written by roughly 70,000 people taking popular weight-loss and diabetes medications. What they found, using artificial intelligence to sort through the noise, was a catalog of physical complaints that never made it into official clinical trials or onto medication labels.

The drugs in question—semaglutide and tirzepatida—have become ubiquitous in recent years, prescribed for weight loss and blood sugar control. But the side effects people reported online told a different story than the one documented in formal medical research. Users described irregular menstrual cycles, bleeding between periods, chills, sensations of fever, waves of heat, and persistent fatigue. These were not rare complaints buried in a handful of posts. They were patterns, repeated across thousands of conversations, in a language that varied enough to slip past traditional analysis.

The challenge, until now, was that people describe their symptoms in wildly different ways. One person might say they felt feverish; another might report waves of heat; a third might simply note they were cold all the time. A human researcher reading through hundreds of thousands of posts would struggle to connect these scattered descriptions into a coherent picture. But large language models like GPT and Gemini could do it. They could read across the variation, recognize the underlying complaint, and count it.

Lyle Ungar, a professor of information systems at Penn and one of the study's authors, explained the value of this approach in a statement: clinical trials are designed to catch the most dangerous side effects, but they often miss what actually bothers patients day to day. Social media, while imperfect and not necessarily representative of the broader population, can capture those concerns when enough people are talking about them. The research was published in Nature Health.

Sharath Chandra Guntuku, the study's senior author and an associate research professor of computer science at Penn Engineering, emphasized another advantage: speed. Traditional clinical trials take years. By the time a medication moves from a niche treatment to mainstream use—which happened almost overnight with these weight-loss drugs—the window for catching unexpected problems through conventional means has already closed. An AI-powered analysis of social media can flag concerns in weeks or months.

The researchers are not claiming this method replaces rigorous clinical science. But they are arguing it can complement it, filling gaps that formal testing leaves behind. Their next step is to expand beyond Reddit and beyond English-language communities. They want to see if the same patterns emerge in other languages and on other platforms, and whether those patterns hold up across different populations. If they do, doctors could be alerted to side effects their patients are experiencing but not reporting in the clinic—either because they don't think to mention them, or because they assume the symptoms are unrelated to the medication.

For the millions of people taking these drugs, the implications are significant. It suggests that the full picture of what these medications do to the body may not yet be visible in the official record. It also suggests that paying attention to what patients say online, rather than dismissing it as anecdotal noise, might be one way to make sure that picture becomes clearer.

Clinical trials generally identify the most dangerous side effects of medications. But they may not identify which symptoms concern patients most; while social media is not necessarily representative, a large volume of posts can reflect additional concerns.
— Lyle Ungar, professor of information systems, University of Pennsylvania
This approach is much faster than traditional trials. That speed is crucial when a medication moves from niche to mainstream almost overnight.
— Sharath Chandra Guntuku, senior author and associate research professor, University of Pennsylvania
La Conversación del Hearth Otra perspectiva de la historia
Inventor

Why does it matter that these side effects weren't caught in clinical trials? Aren't those trials supposed to be comprehensive?

Model

They are comprehensive for safety—they're designed to catch serious harm. But a clinical trial enrolls a few thousand people for a few months. Reddit has 70,000 people talking over five years. The trials catch what kills you. Social media catches what makes you miserable.

Inventor

So the AI is just reading Reddit posts and counting complaints?

Model

It's more than that. It's recognizing that when someone says they feel feverish and someone else says they have heat waves, they might be describing the same thing. A human would miss half of those connections. The AI doesn't.

Inventor

But Reddit users aren't a random sample. They're people who chose to post about their experience. Doesn't that skew the results?

Model

Absolutely. That's why the researchers are careful to say this doesn't replace clinical trials. But it does tell you what's on people's minds. If thousands of Reddit users are reporting irregular periods, that's a signal worth investigating, even if it's not a perfect count.

Inventor

What happens next? Do doctors just start warning patients about these side effects?

Model

Not immediately. The researchers want to see if the same patterns show up in other languages and other communities. If they do, that strengthens the signal. Then yes, doctors get alerted, and the conversation with patients changes.

Inventor

Does this change how we should think about drug safety?

Model

It suggests that safety monitoring doesn't end when a drug gets approved. It suggests we should be listening to patients, not just in clinics but in the places where they actually talk about their lives. That's a shift in how we think about what counts as evidence.

Quieres la nota completa? Lee el original en CNN Brasil ↗
Contáctanos FAQ