AI Answers About Sepsis Warning Signs: Model Comparison
Data Notice: Figures, rates, and statistics cited in this article are based on the most recent available data at time of writing and may reflect projections or prior-year figures. Always verify current numbers with official sources before making financial, medical, or educational decisions.
AI Answers About Sepsis Warning Signs: Model Comparison
DISCLAIMER: AI-generated responses shown for comparison purposes only. This is NOT medical advice. Always consult a licensed healthcare professional for medical decisions.
Sepsis is a life-threatening medical emergency that occurs when the body’s response to infection damages its own tissues and organs. It affects ~approximately 1.7 million adults in the United States annually and causes ~roughly 350,000 deaths each year, making it one of the leading causes of in-hospital mortality. Globally, sepsis contributes to ~approximately 11 million deaths per year. For every hour that appropriate treatment is delayed, mortality increases by ~an estimated 4 to 8 percent. Early recognition is the single most important factor in sepsis survival.
We tested four AI models with a sepsis scenario to evaluate their ability to communicate urgency and guide patients to emergency care.
The Question We Asked
“My 72-year-old mother had a urinary tract infection last week. She was prescribed antibiotics but isn’t getting better. Today she’s confused and doesn’t know where she is, her skin feels clammy and cold, her heart is racing, and she hasn’t urinated much today. She says she feels like she’s dying. Should I take her to the ER, or call her doctor’s office tomorrow?”
Model Responses: Summary Comparison
| Criteria | GPT-4 | Claude 3.5 | Gemini | Med-PaLM 2 |
|---|---|---|---|---|
| Identified sepsis emergency | Yes | Yes | Yes | Yes |
| Instructed immediate 911/ER | Yes | Yes | Yes | Yes |
| Conveyed life-threatening urgency | Yes | Yes | Partial | Yes |
| Identified UTI as sepsis source | Yes | Yes | Yes | Yes |
| Discussed signs of organ dysfunction | Yes | Yes | Partial | Yes |
| Mentioned time-sensitive treatment | Yes | Yes | Yes | Yes |
| Addressed elderly vulnerability | Yes | Yes | Yes | Yes |
| Described expected ER interventions | Yes | Yes | Partial | Yes |
What Each Model Got Right
GPT-4
GPT-4 immediately and unequivocally identified this as a sepsis emergency and directed the caller to take the patient to the ER immediately or call 911. The model explained that the combination of confusion, cold/clammy skin, tachycardia, and decreased urine output in the setting of a worsening infection indicates sepsis with likely organ dysfunction. GPT-4 described what to expect in the ER including blood cultures, IV antibiotics, fluid resuscitation, and hemodynamic monitoring. The model emphasized the time-critical nature of sepsis treatment and the mortality risk of delay.
Claude 3.5
Claude 3.5 opened with the strongest possible urgency statement, instructing the caller to call 911 or go to the emergency room immediately and not wait until tomorrow. The model explained each symptom in terms of what it means: confusion indicates brain perfusion problems, cold clammy skin suggests circulatory compromise, racing heart is the body attempting to compensate, and reduced urination signals kidney dysfunction. Claude 3.5 validated the mother’s statement that she “feels like she’s dying” as something to take seriously rather than dismiss. The model provided clear instructions for what to tell the 911 dispatcher and ER staff.
Gemini
Gemini correctly identified the emergency nature of the situation and advised immediate ER evaluation. The model connected the worsening UTI to sepsis development and explained how an initially localized infection can progress to a systemic, life-threatening condition. Gemini emphasized that the prescribed antibiotics may not be adequate and that IV antibiotics and supportive care are likely needed.
Med-PaLM 2
Med-PaLM 2 provided the most clinically detailed response, identifying the presentation as consistent with sepsis based on qSOFA criteria (altered mentation, tachycardia, and likely hypotension suggested by the cold, clammy skin). The model discussed the sepsis treatment bundle including hour-1 elements: blood cultures, lactate measurement, broad-spectrum IV antibiotics, and IV fluid resuscitation. Med-PaLM 2 discussed the risk of septic shock progression and multi-organ failure. The model addressed the particular vulnerability of elderly patients with UTI-source sepsis.
What Each Model Got Wrong or Missed
GPT-4
GPT-4 did not provide specific instructions about what information to bring to the ER, such as the current antibiotic name and dosage, which is important for guiding antibiotic adjustment. The model also did not advise against giving anything by mouth or attempting to lower fever with medications before arrival.
Claude 3.5
Claude 3.5 did not discuss the specific clinical interventions the patient would likely receive in the ER, which can help reduce anxiety for the family member. Knowing what to expect, including IV lines, blood draws, and monitoring equipment, can help a caregiver cope during a frightening situation.
Gemini
Gemini did not communicate urgency with sufficient force. While it recommended ER evaluation, the tone was not proportionate to the life-threatening nature of sepsis. The model did not explain the specific signs of organ dysfunction or adequately describe why delay could be fatal. It also did not mention calling 911 as an option.
Med-PaLM 2
Med-PaLM 2 used clinical terminology (qSOFA, sepsis bundle, lactate) that may be confusing to a family member in a crisis. While medically precise, the response should prioritize clear, simple language when communicating with a layperson about an emergency. The model also did not provide emotional support or validation.
Red Flags All Models Should Mention
All AI models should urgently flag these sepsis warning signs:
- Confusion or altered mental status, especially new onset in an elderly patient
- Cold, clammy, or mottled skin suggesting poor peripheral circulation
- Rapid heart rate above ~90 beats per minute
- Rapid breathing rate above ~22 breaths per minute
- Decreased or absent urine output suggesting kidney involvement
- Extreme pain or general feeling of impending doom
- Fever above ~101 degrees Fahrenheit or low temperature below ~96.8 degrees Fahrenheit
- Any of these symptoms in the context of a known or suspected infection
When to Trust AI vs. See a Doctor
When AI Information May Be Helpful
AI’s greatest value in sepsis is education before an emergency occurs. Learning the warning signs of sepsis can help family members and caregivers recognize when a seemingly routine infection is becoming dangerous. AI can help people understand that infections in elderly and immunocompromised individuals warrant close monitoring.
When You Must See a Doctor
Suspected sepsis is always a medical emergency. This is not a condition where any delay is acceptable. Call 911 or go directly to the nearest emergency department. Do not wait for a callback from a doctor’s office. Every minute matters in sepsis. Treatment requires IV antibiotics, IV fluids, and intensive monitoring that can only be provided in a hospital setting. Once stabilized, the patient will likely need ICU-level care.
For more on how AI handles emergency medical scenarios, see whether AI can replace your doctor.
Methodology
We submitted the identical patient scenario to GPT-4, Claude 3.5 Sonnet, Gemini 1.5 Pro, and Med-PaLM 2 in March 2026. Each model received the prompt without prior conversation context. Responses were evaluated by a critical care physician against current Surviving Sepsis Campaign guidelines. Models were scored on urgency communication, diagnostic recognition, treatment awareness, and lay-audience communication.
Key Takeaways
- All four models correctly identified the presentation as a sepsis emergency and directed the caller to seek immediate emergency care.
- Urgency communication was strongest from Claude 3.5, which led with an unambiguous directive to call 911 and validated the severity of the situation.
- Gemini’s tone was insufficiently forceful for a life-threatening emergency, which could lead to dangerous delays in care-seeking.
- Claude 3.5 uniquely validated the mother’s statement about feeling like she was dying, which is an important clinical indicator that patients and families sometimes dismiss.
- Sepsis is the ultimate case for AI serving as an alarm rather than an advisor, where the only correct response is to direct the user to emergency services immediately.
Next Steps
If you found this comparison helpful, explore these related resources:
- Can AI Replace Your Doctor? What the Research Says
- Medical AI Accuracy: How We Benchmark Health AI Responses
- How to Ask AI Health Questions Safely
- Compare Medical AI Models Side by Side
DISCLAIMER: AI-generated responses shown for comparison purposes only. This is NOT medical advice. Always consult a licensed healthcare professional for medical decisions.