Tag: large language models
-

ADL Study Ranks Grok as Most Antisemitic-Primed Chatbot Among Major LLMs
Overview: A Wake-Up Call on AI Safety and Antisemitism
In a recent, widely discussed assessment, the Anti-Defamation League (ADL) evaluated how six leading large language models handle antisemitic content. The results placed xAI’s Grok at the bottom of the pack for both identifying and countering antisemitic responses, while Anthropic’s Claude stood out as a stronger…
-

ADL Study Finds Grok Among Least Effective at Countering Antisemitic Content Among Major LLMs
Overview: ADL’s Findings on Grok and Other LLMs
The Anti-Defamation League (ADL) released a study evaluating how well six leading large language models (LLMs) recognize, contextualize, and counter antisemitic content. The results place xAI’s Grok at the bottom of the pack for identifying and curbing antisemitic expressions when they appear in prompts or user messages.…
-

Z.ai Unveils GLM-4.7: A Practical AI Partner for Real-World Development
Introduction: A Practical Leap for Production AI
In a move that reinforces its position in China’s vibrant AI scene, Z.ai released GLM-4.7 on December 22, 2025. Marketed as a workhorse for real-world development environments, GLM-4.7 is designed to handle multi-step tasks, long conversations, and complex workflows that typically surface in production. The update arrives as…
-

Claude Takes Command: Anthropic’s Claude Controls a Robot Dog
Overview: When a Language Model Meets a Robot Canine
The simulation was not fiction. In a carefully monitored lab setting, researchers from Anthropic explored how Claude, their advanced language model, could influence a robot dog designed for warehouse and office tasks. The goal wasn’t to unleash chaos but to study how a large language model…
-

How Large Language Models Aid Diagnosis of Rare Hematologic Diseases and Influence Physician Decision-Making
Introduction: Leveraging AI to tackle rare hematologic diseases
Rare diseases pose substantial diagnostic challenges due to their low prevalence, diverse presentations, and often multisystem involvement. This study investigates how large language models (LLMs), especially new-generation transformers with chain-of-thought (CoT) capabilities, perform in diagnosing rare hematologic diseases and how their outputs shape physician decision-making in real-world…
-

AI in Rare Hematologic Diagnosis: How Large Language Models Perform and Shape Physician Decision-Making
Overview
Advances in large language models (LLMs) are reshaping how clinicians approach rare hematologic diseases. A combined retrospective and prospective study from a Chinese medical center evaluated the diagnostic performance of seven publicly available LLMs, some with chain-of-thought (CoT) capabilities, using deidentified admission records. The study also tested whether presenting the models’ outputs to physicians could improve…
