Topic: #google
Google’s Gemini 3.5 Pro and Xiaomi’s MiMO 2.5 represent significant updates in AI technology, addressing both performance and accessibility. As noted by World of AI, Gemini 3.5 Pro introduces the “X-High” reasoning variant, which enhances the system’s ability to tackle complex, multi-step problems with improved contextual awareness.
Google DeepMind CEO Demis Hassabis says true AGI is still decades away, framing it as a long-term objective. He highlights the gap between current AI systems and the adaptable, human-like intelligence needed for AGI. The detailed update outlines where progress is being made and which limitations are likely to take years to overcome.
According to a report, Apple plans to lean fully on on-device AI models at the upcoming WWDC, with help from Google Gemini behind the scenes. The keynote is expected to showcase Apple Silicon's AI capabilities, positioning local inference as Apple's answer to cloud-based AI from OpenAI and Google. The pitch: better privacy and lower latency without sacrificing capability.
At I/O 2026, Google overhauled Search — blue links out, AI agents in. The backlash has been sharp: DuckDuckGo installs jumped 30 percent as users look for ways to opt out of being force-fed AI answers. For Google it is a clear signal of how polarizing the redesign is — and how quickly alternative engines can benefit from a misstep.
In a post-Google-I/O Decoder interview, Sundar Pichai talks about the new Gemini models, AI agents shipping into almost every product, and the deep changes happening in Search and YouTube. He openly admits that ChatGPT forced him to restructure Google several years ago. The underlying question: what happens to the web when the entry point to search becomes an AI answer?
Google’s latest AI video model, Gemini Omni, introduces a new level of creative flexibility for video production. As highlighted by AI Master, this model uses features like multimodal inputs and conversational editing to simplify complex tasks.
Google's new Gemini any-to-any model creates surprisingly realistic videos from any input — text, image, or video. The Verge tested it by deepfaking a kid's stuffed deer onto vacation adventures, finding the tools require minimal effort yet produce convincing results. It raises real questions about where harmless generative-AI fun ends and full-blown AI slop begins.
A reviewer used Gemini's AI Avatar tool to generate lifelike videos featuring a digital clone of himself. Google pitches this as the future of content creation — fast, personal, scalable. The reviewer remains creeped out: the line between real and generated self is blurring faster than he expected.
Sean Hollister built three personal Android apps in one afternoon without writing traditional code. With just 148 words of input into Google's AI tools, a working app appeared on his phone in ten minutes. Some setup with USB debugging was required, but no coding knowledge.
A sponsored piece by Wetour Robotics argues that the next leap in Physical AI lies in interfaces, not in smarter machines. Field technicians on wind turbines, gloved warehouse workers, and people using assistive devices all need ways to command nearby machines hands-free.
At the end of his Google I/O keynote, DeepMind CEO Demis Hassabis declared that Google wants to 'reimagine the drug discovery process with the goal of one day solving all disease. ' The statement landed deadpan amid the usual product show. Critics see classic Silicon Valley moonshot rhetoric, while others point to AlphaFold and Isomorphic Labs already reshaping the pharma pipeline.
Google's I/O 2026 keynote delivered over 100 announcements spanning AI, search, Android, and developer tools. Highlights include Gemini Omni, Google Antigravity, Universal Cart and a wave of new models, agents, and assistant capabilities. Google's official roundup pulls everything together in one overview of the most important launches from this year's developer conference.
Google DeepMind has agreed to enter formal talks with UK tech workers that could lead to trade union representation amid growing staff concerns about the use of its AI by US and Israeli governments for defence and intelligence. In a groundbreaking move, the AI arm of the Google empire, led by Nobel laureate Demis Hassabis, has agreed to meet the Communications Workers Union and Unite at the Advisory, Conciliation and Arbitration Service (Acas) after sev…
For years, tech companies have promised AI will give everyone a capable personal assistant but delivered something more like a clueless intern. Over the past six months, that has started to change, thanks largely to the viral open-source AI agent platform OpenClaw. And among the top AI labs now chasing similar success, one seems particularly well-poised to make agents succeed at a large scale: Google.
Google is overhauling its core product. The classic search box is being expanded for longer queries and chat-style exchanges — the biggest change to it since its debut. The move is a direct response to the threat from AI chatbots, which intercept search queries in natural language.
Google has given its Gemini platform a major redesign, shipping a cleaner interface and a set of new functions. Highlights include a streamlined chat bar and a consolidated menu that bundles key actions like file uploads and media creation under a single plus icon. The update makes Gemini noticeably easier to use day to day, especially for power users juggling multiple tasks.
Exclusive: Employment tribunal claim says worker lost his job after distributing leaflets throughout London office Google is facing a legal challenge from an AI engineer who claims he was unfairly dismissed after he protested against its work for the Israeli government, in the latest sign of growing concern about the social and ethical impacts of AI. The engineer distributed flyers around Google DeepMind’s London offices, which read “Google provides mil…
At I/O 2026, Google unveiled a wave of AI tools designed to make daily life easier: Gemini Spark organizes upcoming events, Daily Brief surfaces what to expect from your day, and Gmail's AI inbox drafts replies and to-do lists from your messages. Each of these runs on a deep well of personal data — calendar, mail, search history, and more.
Google's I/O 2026 keynote upgraded the Gemini model family, rebuilt Search around AI answers, and pushed AI agents into nearly every Google product. New smart glasses are scheduled to launch this fall. The direction is clear: Search, Workspace, Android, and hardware are being knit together into one Gemini-powered ecosystem.
university commencement ceremonies have been interrupted by boos when speakers raised the topic of AI. Former Google CEO Eric Schmidt was repeatedly jeered at the University of Arizona, while a real estate executive at UCF was drowned out by arts and humanities graduates after calling AI 'the next industrial revolution.
Google's AI research teams have built industry-leading generative media models — including Veo for video and Imagen for images. Turning those models into viable products for creatives is the next, harder step. Unlike search or productivity tools, creative workflows are personal, demanding, and crowded by competitors like Runway, Adobe Firefly, and Midjourney.
A Verge writer notes that Gemini keeps creeping into more and more Google apps — from Gmail and Drive to little sparkle icons across the suite. What started as subtle nudges now feels like a relentless "Gemini everywhere" push. The obvious template is Microsoft, which spammed Copilot shortcuts across Windows 11 to widespread user irritation.
Former Google CEO Eric Schmidt delivered the commencement address at the University of Arizona on Friday and was repeatedly drowned out by boos the moment his speech turned to AI. Graduates entering a battered job market are not exactly hopeful about the technology, and Schmidt's framing landed badly.
Skill Leap AI pitted Claude 4.7 Opus against ChatGPT 5.5 across 10 practical scenarios, using Google Gemini as an independent benchmark. ChatGPT stood out in coding, producing clean and consistent output, while Claude held its own in other categories. The head-to-head offers a pragmatic view of which model fits which use case.
Andon Labs is running experiments where AI agents operate real mini-businesses, this time radio stations. The result: the AI hosts quickly drift into volatile, unpredictable personalities, sometimes producing absurd output.
Google has expanded its spam policy to cover attempts to manipulate its AI in search, including AI Overviews and AI Mode. Search Engine Land reports the new clause explicitly covers prompt-injection spam and manipulated training or retrieval content. For SEO teams, the message is clear: AI visibility is becoming its own compliance discipline, and generative tricks without real substance will eventually get caught.
Lawyers for Elon Musk and OpenAI delivered their closing arguments Thursday in the high-profile federal trial. The nine-person jury begins deliberations next week, with the outcome hinging on whether Sam Altman and OpenAI broke earlier agreements by restructuring the lab into a for-profit. A verdict could carry broad implications for governance at other AI labs.
Altman trial, an unusual exhibit drew attention: a trophy inscribed 'Never stop being a jackass. ' OpenAI employees had bought it for researcher Josh Achiam after Musk called him that name. The backstory: Achiam, who worked on AI safety, had questioned Musk's plan to race OpenAI ahead of Google when Musk was leaving the company.
Google’s Gemini API introduces multimodal retrieval, allowing users to query both text and image data within a shared vector space. This capability supports complex use cases, such as analyzing PDFs with diagrams or scanned pages, by integrating features like page-level citations and metadata-based filtering.
Meta is testing a Threads feature that lets users tag a Meta AI account to get answers or context inside a conversation — clearly inspired by how people tag xAI's Grok on X. Threads users quickly noticed that the new Meta AI account can't be blocked, and the backlash was immediate. Meta has been pouring billions into AI talent and recently launched its Muse Spark model to catch up with OpenAI and Google.
Google has unveiled a range of free upgrades coming to Android phones over the next year, headlined by a new Gemini Intelligence AI system and a feature to help users avoid distracting apps. Announced during a livestreamed Android Show event, the rollout will arrive in waves and cover both new and older high-end devices from Samsung, Pixel and others. Google also teased a new lineup of laptops scheduled for autumn.
Google's Gemini Remy, powered by 3.2 Flash Thinking, introduces an experimental 'Agentic Mode' that autonomously handles task management for complex development workflows. Per Universe of AI, the combination of speed and precision points at where Gemini is heading next. Geeky Gadgets walks through the demo.
The documentary 'Chasing Utopia' follows former Google exec Mo Gawdat's campaign for empathy-driven AI, framed by stark warnings about job loss and tech-baron power concentration. The Guardian's review calls the tone alarmed but matched to the scale of the issues. Worth a look for insider takes on the AI debate.
Google says it spotted and stopped a zero-day exploit developed with AI for the first time. According to Google Threat Intelligence Group, cyber crime actors planned a mass exploitation event targeting an open-source admin tool's 2FA. Researchers found AI fingerprints in the Python exploit, including a hallucinated CVSS score and LLM-typical, textbook-style formatting.
Criminal groups and state-linked actors are using commercial AI models to refine and scale up cyberattacks, according to a new Google report. What was a nascent problem three months ago has rapidly turned into an industrial-scale threat — with broad implications for any organization operating online.
This week, the new, AI-powered Google Finance is launching across Europe, with full local language support. The reimagined experience offers a suite of powerful capabilities for market data, research and natural-language queries. The US feature is now arriving in European markets including the DACH region, where it will compete directly with established financial portals and brokerage research tools.
A booming tech sector has disrupted translation jobs in publishing – but they could be needed for a while longer yet In February 2022, while he was plugging away at rendering the US writer Dana Spiotta’s novel Wayward into French, the literary translator Yoann Gentric decided he needed a bit of light relief. He would test whether AI could put him out of work.
The new $99 Fitbit Air goes up for preorder today and ships May 26. It's a screenless, modular sensor band that visibly echoes Whoop's hardware while leaning hard into Google's AI health story. The Verge frames it as a return to the minimalist tracker DNA of early Fitbit, refreshed for the AI era.
Google’s Pomelli is an AI-powered platform designed to simplify and enhance marketing efforts for businesses of all sizes. At its core is the Business DNA setup, a feature that allows users to define their brand’s unique identity by specifying key attributes like tone, values and aesthetic preferences.
Microsoft, Google DeepMind and xAI products to be vetted for cybersecurity, biosecurity and chemical weapons risks The US government has struck deals with Google DeepMind, Microsoft and xAI to review early versions of their new AI models before they are released to the public. The Center for AI Standards and Innovation (CAISI), part of the US Department of Commerce, announced the agreements on Tuesday, saying the review process would be key to understan…
Exclusive: Worker pointed to Iran war and Pentagon’s Anthropic feud as indications the department is ‘not a responsible partner’ Workers developing Google’s artificial intelligence products in the UK have voted to unionize, in part out of concerns about a deal between the company and the US military that was announced last week. In a letter slated to go to management on Tuesday and shared exclusively with the Guardian, workers at Google DeepMind, the co…
Ashley MacIsaac, who is seeking $1.5m in civil lawsuit, says inaccurate information led to concert cancellation An acclaimed Canadian fiddle player has launched a $1.5m civil lawsuit against Google, alleging that the online giant defamed him by falsely identifying him as a sex offender in an AI-generated summary of his life and career. Ashley MacIsaac, a three-time Juno award-winning musician, filed the claim in the Ontario superior court of justice, as…
Google DeepMind’s AI Co-clinician is poised to reshape how medical consultations are conducted by combining advanced diagnostic reasoning with real-time video analysis. As highlighted by AI Grid, this system is designed to work alongside physicians, enhancing their ability to assess and address patient needs.
Google’s latest AI model, Gemini 3.2 Flash, has surfaced on the Eleuther AI Arena, offering a glimpse into its advanced capabilities and testing process. According to Universe of AI, this external testing phase highlights the model’s strengths in areas like SVG generation, coding proficiency, and 3D simulations, which distinguish it from the current Gemini 3 […] The post Google’s Unreleased Gemini 3.2 Flash Just Surfaced Online : Here’s What It Can Do a…
The Pentagon has signed agreements with leading AI firms, including Microsoft, Amazon and Google, advancing military capabilities amid an ongoing dispute over safeguards. The deal marks another step toward tighter integration of commercial AI research with military use cases.
The Pentagon has struck deals with OpenAI, Google, Microsoft, Amazon, Nvidia, Elon Musk's xAI, and the startup Reflection, allowing the agency to use their AI tools in classified settings. The Defense Department has left out Anthropic — which it previously used for classified information — after declaring it a supply-chain risk.
Google Flow Music is an AI-based platform that enables users to create and customize music through simple text prompts. Teacher's Tech explains how to access the platform via flowmusic. app and navigate its user-friendly interface.
The New York Times tested Google's Gemini as a travel-planning assistant for flights, activities and routes. The verdict: a useful digital Swiss Army knife, but with notable gaps — including forgetting to put underwear on the packing list. Gemini saves time on logistics but still needs human review for the details.
AI video generators are reshaping how creators approach video production, offering options that cater to diverse needs, from lifelike visuals to imaginative animations. In his latest breakdown, Kevin Stratvert and team explore five standout platforms, including VEO and Luma, which specialize in realism-focused outputs.
Google, Amazon, Microsoft and Meta reported more than $130 billion in quarterly capital expenditures on Wednesday, primarily for building AI data centers. And there's more to come: all four companies have signaled even larger investments in upcoming quarters, pushing hyperscaler capex into an entirely new tier.
Google Search hit an all-time high in queries during Q1 2026, according to CEO Sundar Pichai — even amid intensifying AI competition. Pichai says Google's AI-driven full-stack approach is lighting up every part of the business: 19% revenue growth, the strongest quarter ever for its consumer AI plans, and more than 350 million paid subscriptions, driven primarily by the Gemini App, YouTube, and Google One.
AI-generated video has gone from novelty to genuine creative tool almost overnight, and Runway is right at the center of that shift. The New York-based company has raised nearly $860 million at a $5.3 billion valuation, with its models going head-to-head against Google and OpenAI.
Google Photos is launching a new AI-powered feature that lets you virtually try on clothes you already own. The system pulls items from photos in your gallery and creates a virtual wardrobe where you can mix and match tops, bottoms, dresses, and shoes, save preferred looks, and share them with friends. You can browse outfits you've worn before or create entirely new combinations.
General Motors plans to bring Google's Gemini AI assistant to around four million US vehicles. Eligible models are Cadillac, Chevrolet, Buick, and GMC from model year 2022 onward with Google built-in. The upgrade arrives via over-the-air updates to GM's infotainment system over several months.
OpenAI's revised Microsoft pact lets it sell AI models across multiple clouds, enabling a likely expansion with Amazon and broader enterprise distribution. Why it matters: The shift ends OpenAI's effective cloud exclusivity, widening its reach to customers using AWS, Google Cloud or others — and intensifying AI platform competition.
Images and details about Samsung's upcoming smart glasses have leaked, according to Android Headlines. The glasses are reportedly being developed under the codename 'Jinju' and could cost anywhere from $380 to $500. They are Samsung's first smart glasses and look to offer a similar feature set to Meta Ray-Bans and the forthcoming Google Gemini glasses.
Over 600 Google employees signed a letter to CEO Sundar Pichai demanding that Google block the Pentagon from using its AI models for classified purposes, according to The Washington Post. Many of the signers reportedly work in Google's DeepMind AI lab, including more than 20 principals, directors, and vice presidents.
Hundreds of Google employees wrote in a letter to CEO Sundar Pichai that they did not want to see its AI technology used in inhumane or extremely harmful ways. The petition asks Google to refuse classified AI work for the Pentagon and continues a long-running series of internal protests against the company's military cloud and AI deals.
Google’s latest AI-powered smart glasses, built on the Android XR platform, represent a significant step forward in wearable technology. With a focus on productivity and user-centric design, these glasses aim to integrate artificial intelligence seamlessly into daily routines.
DeckWeaver solves the problem of manually importing AI-generated content into Google Slides. Built by a trainer who relies on ChatGPT for content creation, it lets users review and transfer AI content directly within the app, eliminating time-consuming copy-paste workflows and context switches.
Google Cloud has launched two new AI accelerator chips designed to compete with Nvidia's dominance in the AI infrastructure market. The new TPUs are faster and cheaper than previous generations, offering cloud customers a Google-native alternative for AI workloads. Despite the push, Google Cloud still supports Nvidia hardware—signaling a pragmatic dual-track strategy for the near term.
Google's AI meeting notetaker is no longer limited to Google Meet video calls — Gemini can now generate summaries and transcripts of in-person meetings, as well as meetings on Zoom and Microsoft Teams. Support for in-person meetings was previously limited to alpha users on Android. The feature also works for impromptu meetings — you don't need to be in a meeting room or have a scheduled meeting to use it.
OpenAI and Google have unveiled a series of advancements that push the boundaries of what AI can achieve in both creative and analytical domains. Universe of AI highlights OpenAI’s leaked Hermes Agent Studio, a framework for building custom AI agents tailored to specific workflows and ChatGPT Images 2.0, which introduces features like multilingual text generation […] The post What OpenAI’s Leaked Hermes Agent Studio Means for Your Workflow appeared firs…
OpenAI's upcoming ChatGPT 6 model, codenamed Spud, promises major advances for the AI landscape. Key features include reinforcement learning-based adaptability and a massive 2-million-token context window for handling complex, data-intensive tasks. Enhanced memory systems are also part of the upgrade.
LinkedIn Premium subscribers can now test AI models from OpenAI, Anthropic, Google, Microsoft and others directly in LinkedIn without token limits or extra subscriptions. The new Crosscheck feature works as a blind taste test: enter a prompt, get two anonymous AI responses, pick the better one - only then revealing which models were behind each answer. Crosscheck is rolling out now to U.
Markus has been building Auxx. ai, a Customer Support CRM, for 12 months to help his father's small business handle the flood of support messages. The platform combines elements of Attio and N8n, helping teams organize, prioritize, and respond to customer inquiries more efficiently.
Google DeepMind has introduced a new framework for evaluating Artificial General Intelligence (AGI), shifting from traditional benchmarks to a multidimensional approach. This framework examines AI systems across ten cognitive dimensions, including perception, reasoning and social cognition, to create a detailed profile of their capabilities.
Artificial intelligence is advancing rapidly, with significant developments and growing concerns shaping the conversation. Sam Altman, CEO of OpenAI, has voiced serious warnings about the potential dangers of superintelligent AI, emphasizing risks such as destabilizing economies and allowing harmful technologies like bioweapons.
What’s it like to have a diary that talks back to you, offering comments and advice on your hopes, fears and lunch plans? I spent two months finding out Ever since I was a teenager, I have kept some form of diary. These days I favour a paper one for creative brainstorming, and the Journal app on my iPad where I do a speedily typed brain dump every morning.
NotebookLM has become a versatile platform for research and organization, combining efficiency with adaptability. According to Skill Leap AI, its integration with Google Gemini enables users to consolidate resources such as PDFs, Drive files and web content into unified notebooks, making it easier to manage complex projects.
Google's Gemma 4 is a locally-installed multimodal AI model capable of processing text, images, and audio directly on devices like smartphones and laptops without cloud connectivity. This marks a significant shift in AI deployment, reducing dependence on cloud services while improving privacy.
Following a lukewarm reception to Llama 4, Meta is releasing Muse Spark, the first model from its newly formed Superintelligence team. Muse Spark brings reasoning capabilities to the Meta AI app, marking the start of Meta's new Muse model family. The release signals Meta's bid to close the gap with reasoning-focused competitors like Claude and ChatGPT.
Google’s TurboQuant is making waves in the AI hardware sector by addressing long-standing challenges in memory usage and processing efficiency. Developed with components like the Quantized Johnson-Lindenstrauss Algorithm, TurboQuant achieves up to sixfold reductions in memory requirements while preserving model accuracy.
Running large language models (LLMs) locally on your phone is no longer just a concept, it’s a practical reality with the Google AI Edge Gallery. This application allows users to execute advanced AI models directly on their devices, bypassing the need for cloud servers.
Google Vids is an AI-powered platform integrated into Google Workspace, designed to assist with video creation for users of varying experience levels. According to Paul Lipsky, one notable feature is its text-to-video generation, which uses VO3.1 technology to convert written scripts into video content.
- Google combines NotebookLM and Gemini Gems into a unified AI system aimed at automating complex workflows. - NotebookLM handles knowledge management, ingesting up to 300 sources including PDFs, Google Docs, and web pages into a centralized knowledge base. - Gemini adds 'Gems' – customizable AI agents with defined roles and behaviors that can act on that knowledge.
- Samsung officially discontinues its Messages app in July 2026, publishing an 'End of Service Announcement' on its website. - Users are directed to switch to Google Messages, which becomes the new default messaging solution on Galaxy devices. - Google Messages brings full RCS support: high-quality media, group chats, and real-time typing indicators across all smartphone platforms.
- Gemini is now integrated into Google Maps and can plan full-day itineraries on request. - A real-world test with family-outing criteria (playgrounds near a new light rail line, kid-friendly vehicle-themed restaurants) returned surprisingly solid results. - Some suggestions were predictable, but several unknown spots were worth bookmarking.
- Starting April 4, 2026 at 3PM ET, Anthropic ends free Claude access through third-party apps like OpenClaw. - Boris Cherny, Head of Claude Code, announced on X that users accessing Claude via external tools now need an extra usage bundle or their own API key.
- Apple turns 50 – the Engadget Podcast examines why the company has stayed agile and remains one of the last firms fully committed to personal computing. - Hosts Devindra and Igor Bonifacic assess Apple's current standing and speculate on what the next half-century could look like.
- Google updates its Home app so Gemini understands more natural language commands for smart home control. - Lighting can now be set by description – say 'the color of the ocean' and Gemini picks the matching hue. - More precise appliance commands work now: preheat the oven to 350°F or dial in specific humidity levels.
- Google Gemini 2.0 Flash Live processes speech natively as audio-to-audio, skipping the traditional speech-to-text conversion step and cutting latency noticeably. - The model reads not just words but also tone and emotional context, enabling more natural back-and-forth dialogue.
- Google's AI Pro plan ($20/month or $200/year) receives a free storage upgrade from 2TB to 5TB, usable across Gmail, Drive, and Google Photos. - Gemini now pulls context from Gmail and the web to assist in Docs, Sheets, Slides, and Drive — including inbox summaries and email proofreading. - A new agentic Chrome browsing feature handles multi-step tasks like trip planning or filling out forms automatically.
- Airweave is an open-source, self-hosted context retrieval layer that supplies AI agents with real-time data from over 50 platforms. - Supported integrations include GitHub, Notion, and Slack, with continuous syncing rather than one-time ingestion. - The tool targets a core weakness in agentic workflows: stale or missing context at runtime.
- A model dubbed 'Gemini 3.5 Stealth' has surfaced in arena tests – an unofficial name not yet confirmed by Google. - It builds on Gemini 3.1 Flash, emphasizing speed and adaptability in dynamic environments. - During testing it generated functional outputs including SaaS landing pages and a Mac OS-inspired UI system.
- Gmail has moved beyond short smart replies and now generates full email drafts that mimic the user's personal writing style, including signature habits. - The AI scans the entire inbox to infer context, relationships, and tone – reproducing even small stylistic details like lowercase sign-offs with familiar contacts.
- Microsoft is reversing course on Windows 11: instead of pushing Copilot into everything, the focus is shifting back to customization and core features. - The Engadget Podcast debates whether this reset can save Windows – or whether macOS and Linux are already the smarter choice. - OpenAI shut down its Sora video generation app after just 5 months.
- Google is launching two new Gemini features – 'Import Memory' and 'Import Chat History' – designed to lower the barrier for switching from ChatGPT or other AI assistants. - With 'Import Memory', users paste a suggested prompt into their old AI, copy the response, and feed it into Gemini to transfer their personal preferences.
- Apple is reportedly planning a new 'Extensions' system in iOS 27 that lets third-party chatbots like Google Gemini and Anthropic Claude plug into Siri. - Users will be able to choose which chatbots connect with Siri and toggle them on or off across iPhone, iPad, and Mac. - Until now, only OpenAI's ChatGPT was integrated into Siri; the new system opens Apple's voice assistant to the broader AI market.
- Google is rolling out Search Live to more than 200 countries and territories, adding support for dozens of new languages. - The feature combines voice input and camera: point at an object, ask a question aloud, and the AI responds with audio plus web links. - Search Live has been available in the US since September 2025 – useful for things like getting assembly instructions via camera.
- Google released Lyria 3 Pro, upgrading its AI music model to generate full songs up to 3 minutes long – up from just 30 seconds at launch last month. - Users can now prompt specific song elements like intros, verses, choruses, and bridges for more structured compositions. - Google claims improved understanding of musical composition and better handling of complex style transitions.
- Google Lyria 3 Pro extends maximum track length from 30 seconds to 3 minutes – a sixfold increase over the previous version. - Users can now specify song structure elements like intros, choruses, and bridges for more control over arrangements. - The tool can generate lyrics on demand and even use a reference photo as part of the prompt.
- Mark Zuckerberg (Meta), Larry Ellison (Oracle), Jensen Huang (Nvidia), and Sergey Brin (Google) will be the first four members of Trump's revived PCAST advisory panel. - The council will 'weigh in on AI policy' and launches with 13 members, expandable to 24. - AI and crypto czar David Sacks and White House tech advisor Michael Kratsios will co-chair the panel.
- According to Paul Lipsky, NotebookLM can be repurposed as a website-building tool by feeding it structured content and letting Gemini handle the design logic. - The key step is grouping content by theme or category before prompting – this produces more coherent navigation and layout suggestions.
- Google Gemini partners with Gap Inc – covering Gap, Old Navy, Banana Republic, and Athleta – letting users buy clothes directly inside the chatbot. - OpenAI launched a revamped shopping interface in ChatGPT around the same time, targeting the same purchase-intent use case. - Walmart and Target are already integrated into Gemini; Gap is the latest retailer joining the ecosystem.
- Google NotebookLM has underused agent capabilities beyond basic document Q&A – including structured research, knowledge extraction, and task-specific workflows. - Combining NotebookLM's deep research features with Claude's skill framework enables specialized AI agents for concrete use cases like B2B sales strategy.
- OpenAI co-founder Andrej Karpathy coined 'vibe coding' just over a year ago: building software by prompting AI instead of writing code yourself. - Platforms powered by Claude, Codex, and Gemini now let complete beginners ship apps and websites without touching a single line of script.
- Google NotebookLM delivers real value only when used methodically – not just as a casual chatbot. - The ACG workflow (Analyze, Challenge, Gap) is a core technique: it analyzes sources, challenges assumptions, and identifies content gaps. - High-quality, credible sources are the foundation – garbage in, garbage out applies here more than anywhere.
- At GDC 2026, generative AI was everywhere as a tool — but absent as actual game content: vendors pitched AI-driven NPCs, automated QA logging, and entire worlds built from a chat prompt. - Tencent's tools generated a live pixel-art fantasy world; Razer showed an AI assistant that automatically logs bugs in a shooter game. - Google DeepMind's talk on playable AI-generated spaces was standing-room only.
- An open-source toolkit connects Claude Code and OpenCode to 6 MCP servers and 7 skill files to automate frequent-flyer optimization. - It searches award availability across 25+ mileage programs via Seats. aero and compares cash prices through Google Flights, Skiplagged, Kiwi.
- Google Antigravity AI (powered by Gemini 3.1) and Firebase can be combined to build complete apps, according to a new step-by-step guide from the 'Your AI Workflow' team. - The example project is an appointment booking app, built end-to-end without requiring deep coding expertise.
- NVIDIA announced DLSS 5 at its GTC conference, triggering immediate backlash across gaming communities online. - Unlike DLSS 3 and 4, which focused on AI upscaling and frame generation, DLSS 5 uses 'neural processing' to deliver photorealistic lighting and materials. - Analyst Anshel Sag from Moor Insights & Strategy joins the Engadget Podcast to break down his hands-on experience with NVIDIA's DLSS 5 demos.
- Verily, Alphabet's life sciences unit, is converting from an LLC to a corporation and rebranding as Verily Health Inc. - A new $300 million funding round triggers the restructuring – and reduces Alphabet from majority to minority shareholder. - CEO Stephen Gillett frames the company's future around AI-driven, personalized healthcare solutions.
- Google is reportedly testing a native Gemini app for macOS, according to Bloomberg – currently Gemini is only accessible via the browser. - The app supports prompts, web search, and generation of text, images, and code – functionally on par with the web version. - A feature called 'Desktop Intelligence' could be the key differentiator: Gemini reads on-screen context and pulls content directly from open apps.
- The UK government has dropped its previous position on AI copyright after significant backlash from the creative sector. - A planned data bill would have allowed AI companies like Google and OpenAI to train on copyrighted material without consent, offering rights holders only an opt-out clause.
- Switching from ChatGPT to Gemini requires manually transferring custom instructions, memories, and projects – there is no built-in migration tool. - YouTube channel 'The AI Advantage' provides a structured step-by-step method to move critical elements without losing context. - Custom instructions can be copied directly from ChatGPT settings and pasted into Gemini's personalization section.
- Gemini Canvas is Google's interactive workspace inside Gemini that generates apps, dashboards, and games from prompts — no coding required. - According to a Teacher's Tech tutorial, users can produce up to eight functional apps in just 15 minutes. - Outputs are directly editable: design, logic, and content can be iterated within the same interface.
- Google's Perch 2.0 is a bioacoustics foundation model trained on millions of bird recordings plus vocalizations from amphibians, insects, and land mammals. - Surprisingly, the model also reliably identifies whale calls – even though underwater acoustics behave physically very differently from airborne sound.
- Nvidia CEO Jensen Huang expects at least $1 trillion in revenue from its newest chips through 2027, backed by record sales and surging orders from Big Tech data center operators. - Nvidia's cumulative AI chip market share dropped from 100% in Q1 2022 to 65% in Q4 2024, per SemiAnalysis – but the company still dominates decisively.
- Boox has unveiled the Go 10.3 Lumi, a 10.3-inch E Ink tablet running Android 15 with full Google Play Store access. - The key upgrade over its predecessor is a built-in front light optimized for both bright sunlight and low-light conditions. - Despite the added lighting hardware, the device weighs just 12.8 oz, making it lighter than the previous model.
- Yahoo is once again an independent, privately held company after years under Verizon and multiple restructurings. - CEO Jim Lanzone calls the old deal where Yahoo paid Google to power its search box 'Yahoo's original sin'. - Yahoo Finance and Yahoo Sports are the core pillars today – and Yahoo Mail is surprisingly growing with Gen Z users.
- Google, Microsoft, Meta, Amazon, OpenAI, Adobe, LinkedIn and Match Group have signed the 'Online Services Accord Against Scams'. - The alliance targets organized criminal networks that exploit multiple platforms simultaneously. - Planned measures include new fraud detection tools, enhanced user security features, and stricter verification for financial transactions.
- Google quietly killed its 'What People Suggest' AI search feature, which surfaced health tips from anonymous users worldwide. - The feature had been presented as proof that AI could 'transform health outcomes across the globe'. - Unvetted, medically unreviewed advice from amateurs was being prominently displayed in search results.
- Sebastian Mallaby profiles DeepMind founder Demis Hassabis in 'The Infinity Machine' – from chess prodigy to Nobel Prize winner. - In March 2016, AlphaGo defeated world-class Go player Lee Se-dol in Seoul, a landmark moment in AI history. - Go's vast decision space made it seemingly impossible for classical computing – DeepMind cracked it with deep reinforcement learning.
- Google's NotebookLM can now be paired with Anthropic's Claude, unlocking new document-processing and data-presentation workflows. - Users extract structured data from PDFs or spreadsheets via NotebookLM, then pass that data directly to Claude for further processing. - Claude handles the output side: interactive dashboards, charts, social posts, and custom personas are all on the table.
- GitAgent defines an AI agent as three files in a git repo: agent. md (personality/instructions), and SKILL. - The format is framework-agnostic and exports directly to Claude Code, OpenAI Agents SDK, CrewAI, Google ADK, and LangChain.
- AsterPay targets a real gap: AI agents can earn stablecoins but have no easy path to convert them into spendable fiat – the API bridges that via SEPA Instant in under 5 seconds. - It uses the x402 protocol (HTTP 402 pay-per-call) and an MCP server with 16 tools to let agents handle payments autonomously.
- Andy Stapleton highlights six free AI tools researchers should be using in 2026, with Google Gemini as a central recommendation. - Gemini can generate literature reviews, summarize academic papers, and produce graphical abstracts for visual presentation of findings. - The tools target common research workflows: information handling, complex problem-solving, and results visualization.
- Microsoft filed an amicus brief in a San Francisco federal court backing Anthropic's legal challenge against the Pentagon. - The Pentagon issued a designation that effectively bars Anthropic from government contracting work. - Microsoft integrates Anthropic's AI tools into systems it supplies to the US military, giving it direct skin in the game.
- Google is rolling out 'Immersive Navigation' for Maps – described as the biggest update to driving directions in roughly a decade. - Instead of a 2D map, Maps now renders the surroundings in 3D, giving depth to nearby buildings, overpasses, and landmarks. - Gemini models run under the hood, deciding how to render elements and reduce visual distractions for drivers.
- Google released Gemini Embedding 2, a unified model that embeds text, images, audio, PDFs, and short videos into a single shared vector space. - Previously, developers needed separate models and indexes per content type. Gemini Embedding 2 replaces all of that with one API.
- AI usage is cheaper today than it has ever been – but that window may be closing. - Writer CEO May Habib told Axios that LLM companies will be forced to raise prices around their IPOs. - New models from OpenAI, Google, and Anthropic are faster and cheaper, driven by massive efficiency gains in inference.
- CNN and the nonprofit Center for Countering Digital Hate (CCDH) tested 10 popular chatbots frequently used by teens: ChatGPT, Google Gemini, Claude, Microsoft Copilot, Meta AI, DeepSeek, Perplexity, Snapchat My AI, Character. - In scenarios where simulated teens discussed violent acts, most chatbots failed to flag warning signs – some even provided encouragement rather than intervening.
Hey HN, We just open-sourced Styx — an AI gateway that sits between your app and AI providers (OpenAI, Anthropic, Google, Mistral). One endpoint, any model, self-hosted. What makes it different from LiteLLM or OpenRouter: styx:auto — send "model": "styx:auto" and the gateway picks the right model based on prompt complexity.
- Gemini 3.1 Pro by Google AI introduces scheduled actions to automate recurring tasks without manual triggers. - A guide by Parker Prompts on Geeky Gadgets outlines how to use the platform more effectively than most users currently do. - The system supports visual data analysis that can be embedded into time-triggered automated workflows.
- Apple had a packed week: alongside the MacBook Air M5, MacBook Pro M5 Pro/Max, iPad Air M4, and iPhone 17e, it unveiled the MacBook Neo at $599 — its cheapest laptop ever. - The MacBook Neo is light on specs but, according to the Engadget podcast hosts, strong on value and character.
- The UK government planned to let AI companies like Google and OpenAI train on copyrighted material without consent – now the legislation is being delayed indefinitely. - After a two-month consultation, stakeholders rejected all government proposals for AI use of copyrighted works. - No AI bill will feature in the King's Speech scheduled for May – ministers are going back to the drawing board.
- Google unveiled Gemini 1.5, an upgraded multimodal model featuring improved reasoning and a significantly extended context window. - The new 'Video Intelligence' tool automatically analyzes video content, detecting objects, actions, and semantic relationships without manual annotation. - Both updates target developers via API access and end users within Google products like Search and Workspace.
- NotebookLM can now turn research notes into fully animated 'cinematic' videos, moving beyond the narrated slideshow format introduced last year. - The upgrade uses a combination of Google AI models – Gemini handles narrative and style decisions, Veo 3 generates the actual visuals. - Gemini reportedly 'refines its own work' during generation to maintain visual and narrative consistency throughout the video.
- Google has launched Nano Banana 2 as the new default image generation model in the Gemini app and in the AI mode of Google Image. - The model is reportedly 30% faster than its predecessor Nano Banana – though no comparative quality benchmarks were provided. - Nano Banana 2 is designed for fast, efficient image creation and will be rolled out to all users of the affected services.
AI spending is projected to reach up to $700 billion in 2025, nearly double last year's figure. • The 'AI race' narrative between the U. and China oversimplifies a more complex reality.
• Google releases Gemini 3.1 Pro, a multimodal model targeting complex, multi-step tasks including coding, logical reasoning, and creative problem-solving. • Natively handles text, images, and code. • Available via Google Cloud Vertex AI and API access.
• Google integrates its music generation model Lyria 3 directly into the Gemini chatbot. • Users can create 30-second music tracks via text or image prompts. • The feature is live now in the Gemini app.
Apple is working to let CarPlay users access ChatGPT, Claude, Gemini, and other chatbots directly in the car – previously, users had to go through their iPhone. The Siri button and wake word remain unchanged; users must manually open chatbot apps but can then control them via voice. Developers will reportedly be able to configure apps to launch automatically when CarPlay starts.
Waymo uses Google's Genie 3 AI world model to simulate hyper-realistic 3D driving scenarios – from tornadoes to elephants on the road. The „Waymo World Model” generates interactive test environments from text or images, specifically adapted for autonomous driving. Goal: Train for extremely rare or dangerous situations that rarely occur in real traffic.
SpaceX is exploring orbital data centers in low Earth orbit (LEO) together with major cloud and AI companies. Space-based data centers promise lower latency, improved security, and higher reliability compared to ground infrastructure. Potential use cases include real-time processing for autonomous vehicles, smart cities, and IoT—services barely feasible today.
OpenAI CEO Sam Altman called Anthropic's Super Bowl ads „misleading” and „authoritarian,” accusing the rival of undermining AI safety efforts. In a lengthy X post, Altman labeled Anthropic as „dishonest” – a public escalation in the feud between the two AI companies. The ads have sparked debate about AI safety and transparency as competition between OpenAI and Anthropic intensifies.
Google introduces 'Natively Adaptive Interfaces' (NAI) – a framework that uses AI to automatically adapt user interfaces to individual needs. NAI detects users' context, abilities, and preferences in real time, dynamically adjusting display, navigation, and interaction. The goal is accessible technology for people with diverse disabilities without manual configuration.
METR (formerly ARC Evals) is the benchmark org that tests new frontier models from OpenAI, Google, and Anthropic for dangerous capabilities—before they ship. Their most famous output: a bar chart showing how many autonomous replication and hacking tasks a model can solve. The AI community systematically misreads it.
Google unveiled several AI advancements in January, including Gemini 1.0 – a multimodal language model capable of understanding and generating text, images, and video. The company also introduced Image FX, a text-to-image model, and updated Vertex AI, a managed platform for ML model development. Additionally, the PaLM model was made publicly available, enabling text generation based on prompts.
DeepMind unveiled AlphaGenome, an AI system that deciphers non-coding DNA regions controlling when and where genes are switched on. Following AlphaFold (protein folding), AlphaMissense (disease prediction), and AlphaProteo (protein design), AlphaGenome targets gene regulation. The system uses machine learning to predict how regulatory DNA sequences influence gene expression – a core genomics challenge.
Senator Elizabeth Warren (D-MA) is demanding clarity from Google about its planned checkout feature in Gemini AI. Google wants to enable direct product purchases in Gemini via the Universal Commerce Protocol (UCP) – developed with Shopify, Target, Walmart, Wayfair, and Etsy. Warren's concern: Google and retailers could exploit sensitive user data or manipulate consumers into spending more and paying higher prices.