Multimodal AI – Hour24 News

Can Alphabet Overtake Apple in AI Leadership with Gemini Powering Siri?

Alphabet’s Gemini Powering Siri: A Strategic Pivot in AI Leadership Alphabet has long positioned itself as a cornerstone of AI innovation, but its status in the public imagination often lagged behind the explosive momentum of newer AI models. The recent news that Gemini—Alphabet’s advanced AI family—could power Apple’s Siri marks a potential turning point. If…

Jan 21, 2026

Technology & AI in Creative Industries

From Firefly to Graph: How Adobe Envisions Creative AI in 2026

From Firefly to Graph: A snapshot of Adobe’s AI journey Adobe kicked off a new era for creative AI with Firefly, its image-generation model. Fast forward to 2026, and the company talks about a broader, more integrated AI ecosystem driven by what it calls Graph — a unifying framework designed to blend image, video, sound,…

Jan 2, 2026

Technology

From vibe coding to faster models: what’s new in Google’s Gemini update

Google’s Gemini update aims to sharpen AI performance for everyday users As the holiday season approaches, Google is rolling out a fresh wave of updates to its Gemini AI-powered assistant. The new features emphasize speed, reliability, and smarter interactions, signaling Google’s continued push to keep pace with a rapidly evolving AI landscape. The release combines…

Dec 19, 2025

Technology, AI

Gemini 3 Flash: Google’s Big Upgrade to the Gemini App

Introduction: A major leap for the Gemini app Google is rolling out a substantial upgrade to its Gemini app with Gemini 3 Flash. Marketed as a huge efficiency boost, the new model aims to deliver faster responses, better resource use, and more capable handling of complex requests. Gemini 3 Flash replaces Gemini 2.5 Flash as…

Dec 18, 2025

Technology/AI

Google Launches Nano Banana Pro: A Leap in AI Reasoning and Text Generation

Overview: What Nano Banana Pro Means for AI Tools Google has introduced Nano Banana Pro, an upgraded iteration of its AI-driven image editing and generation platform built on the Gemini 3 Pro architecture. The update promises more accurate reasoning and clearer text within generated content, addressing two long-standing pain points for creators and developers: reasoning…

Nov 21, 2025

Technology / Artificial Intelligence

Google Rolls Out Gemini 3 Pro Image Nano Banana Pro: What It Means for AI Image Creation

Google expands Gemini 3’s reach with Nano Banana Pro Google has introduced Nano Banana Pro, a continuation of its Gemini 3 Pro family, positioning it as a capable image generation and editing model. Officially branded as Gemini 3 Pro Image, the new model inherits the robust capabilities of its predecessor while aiming to deliver more…

Nov 21, 2025

Technology

How Google is Cleverly Harnessing AI to Redefine Everyday Tech

AI Thinking, Google-Style: A New Era of Practical Innovation For years, artificial intelligence felt like a distant, sometimes overhyped frontier. Lately, Google has shifted that perception by weaving AI into everyday tools in practical, user-centric ways. Rather than flashy headlines alone, Google’s latest AI-driven features aim to enhance clarity, speed, and reliability across search, productivity,…

Nov 17, 2025

Technology / Artificial Intelligence

Google’s Gemini 3: The AI Race Could Be Reshaped as Google Signals a Major Leap

Google’s Gemini 3: A Milestone Awaited by the AI World As the AI landscape continues to evolve at a breakneck pace, Google’s next major rollout—Gemini 3—has a growing chorus of anticipation. Industry insiders and analysts alike expect the new large language model to push Google back into a leading position in the AI race. The…

Nov 15, 2025

Artificial Intelligence / Computer Vision

MIT researchers teach AI to locate personalized objects in scenes

Overview: Beyond general recognition Vision-language models (VLMs) blend visual understanding with language processing, enabling them to recognize broad categories like “dog” or “car.” But users increasingly want these systems to locate a specific, personalized object—think your French bulldog Bowser or a child’s backpack—across different moments in time. A team from MIT and the MIT-IBM Watson…

Oct 16, 2025

Health Tech / Wearable Health Monitoring

Cough Detection Gets Smarter with Wearable Multimodal AI

Overview: A Leap Forward in Cough Detection for Wearables Researchers have unveiled a new approach to detecting coughs with wearable health monitors that combine audio data and accelerometer movement. This multimodal strategy improves the accuracy of cough detection, a vital capability for monitoring chronic respiratory conditions and predicting events such as asthma exacerbations. By better…

Oct 13, 2025

Tag: Multimodal AI