Google Launches Gemini 3.5 Flash to Power AI Agents and Coding Automation

AI isn't just answering questions anymore—it's working on your behalf
Google's new model is designed to execute multi-step tasks autonomously while remaining under user supervision.

In the ongoing effort to transform artificial intelligence from a question-answering instrument into an autonomous collaborator, Google has released Gemini 3.5 Flash — a model designed not merely to respond, but to plan, iterate, and act across complex sequences of tasks. Now the default experience within both the Gemini app and Search worldwide, the model represents a quiet but significant reorientation: AI as a working partner rather than a conversational mirror. The speed at which it operates — four times faster than comparable frontier models — suggests that the measure of intelligence is shifting from depth of knowledge to velocity of execution.

  • The race to move AI beyond chat has intensified, and Google is staking its position with a model built explicitly for multi-step, autonomous task execution rather than single-turn dialogue.
  • Enterprises in banking and data science are already compressing weeks-long workflows into hours using agent-based systems, and Gemini 3.5 Flash is engineered to accelerate that compression further.
  • Outperforming its predecessor Gemini 3.1 Pro across coding and agentic benchmarks while generating output four times faster than rival frontier models, the release raises the competitive bar for the entire industry.
  • Google is weaving agentic capability into consumer life through Gemini Spark, a continuously operating personal AI agent currently in limited testing before a broader US rollout.
  • Safety frameworks, interpretability tools, and new training methods are being applied to inspect the model's internal reasoning before output — an acknowledgment that autonomous systems require a different kind of oversight than passive ones.

Google has released Gemini 3.5 Flash, an AI model built for work that demands planning and autonomous execution across multiple steps. It is now the default experience in the Gemini app and Search globally — a signal that the company is repositioning AI not as something you query, but as something that acts on your behalf while you retain oversight.

The model is accessible through the Gemini app, AI Mode in Search, developer tools including the Gemini API and Android Studio, and enterprise platforms. Its defining qualities are speed and iterative performance: it outpaces the previous Gemini 3.1 Pro on coding and agent-focused benchmarks, and produces output four times faster than leading competitor models — a distinction that matters enormously at the scale of automated enterprise workflows.

Google is targeting industries where that speed translates directly into business value. Banks and financial technology firms are already using agent systems to compress processes that once took weeks. Data science teams are applying similar tools to complex datasets. An updated version of Google's Antigravity platform enables developers to deploy collaborative subagents — smaller AI systems working in concert — to handle demanding, multi-step workloads with sustained performance.

The agentic ambition extends into consumer products as well. Gemini Spark, a personal AI agent designed to manage digital tasks on an ongoing basis, operates continuously rather than responding to individual prompts. It is currently in testing with a limited group before a wider beta rollout to US subscribers. The same automation logic is being embedded into Search itself, pointing toward a future where agentic AI is distributed across Google's entire product ecosystem rather than confined to a standalone tool.

The model was developed under Google's Frontier Safety Framework, with new training methods and interpretability tools designed to examine the model's reasoning before it generates a response — reducing harmful outputs while also limiting unnecessary refusals. A more advanced Gemini 3.5 Pro is already in internal development. The release reflects a broader industry contest in which speed, reliability, and practical utility have become as decisive as benchmark scores in determining which AI systems graduate from research achievements to everyday business assets.

Google has released Gemini 3.5 Flash, a new artificial intelligence model built to handle the kind of work that requires planning, thinking through multiple steps, and executing tasks with some autonomy. The model is now the default across Gemini's consumer app and Search globally, marking a shift in how the company is positioning AI—not as a conversational tool you ask questions of, but as something that can work on your behalf, carrying out complex sequences of actions while you maintain oversight.

The model is available through multiple channels: the Gemini app itself, AI Mode in Search, Google's developer tools including the Gemini API and Android Studio, and enterprise products like the Gemini Enterprise Agent Platform. What distinguishes Gemini 3.5 Flash is its speed and its performance on tasks that demand iteration and planning. In benchmark tests, it outperformed the previous generation Gemini 3.1 Pro on several coding and agent-focused measures—scoring 76.2% on Terminal-Bench 2.1, 1656 Elo on GDPval-AA, 83.6% on MCP Atlas, and 84.2% on CharXiv Reasoning for multimodal understanding. More practically, the model generates output at four times the speed of other leading models when measured by tokens per second, a metric that matters enormously when you're running large-scale automated workflows.

Google is positioning this model squarely at enterprise use cases. Banks and financial technology companies are already using agent-based systems to compress workflows that once took weeks into much shorter timelines. Data science teams are applying the technology to parse complex datasets. The company has also built an updated version of its Antigravity platform that allows developers to deploy what Google calls collaborative subagents—essentially smaller AI systems working in concert—to handle demanding workloads. These systems can execute multi-step workflows while maintaining performance over longer sequences of actions, and they can generate more interactive web interfaces and graphics, building on multimodal capabilities introduced in earlier versions.

Beyond the enterprise realm, Google is embedding this agentic functionality into consumer products. Gemini Spark, described as a personal AI agent, is designed to help users manage digital tasks on an ongoing basis. Unlike a chatbot you query, Spark operates continuously and can take actions on a user's behalf while remaining under the user's direction. The service is currently in testing with trusted users before a wider beta rollout to certain subscribers in the United States. The same automation features are being woven into Search itself, where they will power information agents and more dynamic generative interfaces—a signal that Google intends to move beyond standalone chat tools and distribute agentic AI across its entire product ecosystem.

Google developed Gemini 3.5 Flash under what it calls its Frontier Safety Framework, with particular attention to cybersecurity and chemical, biological, radiological, and nuclear safeguards. The company says it used new safety training methods and interpretability tools designed to inspect the model's internal reasoning before it produces a response, aiming to reduce harmful outputs while also cutting down on false refusals to answer legitimate questions. A more advanced version, Gemini 3.5 Pro, is already in development and being tested internally.

The release reflects a broader competition among major technology companies to demonstrate that AI models can move beyond answering prompts to executing structured work across coding, research, and administrative processes. In that contest, speed, cost, and reliability have become as important as raw benchmark scores. For companies trying to turn AI systems into everyday business tools, the ability to complete complex tasks in a fraction of the time previously required is what separates a research achievement from a practical asset. Google is betting that Gemini 3.5 Flash, with its combination of speed and performance, is that asset.

Google positioned the model as a system for developing applications, maintaining codebases and assisting with financial document preparation
— Google
Gemini Spark operates continuously and can take actions on a user's behalf while remaining under the user's direction
— Google
A Conversa do Hearth Outra perspectiva sobre a história
Inventor

What does it mean for a model to be "agentic"? That word keeps appearing but it's doing a lot of work.

Model

It means the AI isn't just answering a question you ask it. It's taking on a task that has multiple steps, figuring out what needs to happen in what order, and executing those steps with some independence—while you're still watching and can intervene. A bank might ask it to process a loan application, which involves checking documents, verifying information, flagging inconsistencies, and preparing a summary. The AI does all that without you having to prompt it at each stage.

Inventor

So it's less like ChatGPT and more like hiring someone to do a job.

Model

Exactly. Except it's much faster and much cheaper. And it can work on multiple tasks simultaneously through these subagents Google mentions. The speed matters too—four times faster than competitors means you can run more workflows in parallel without the system becoming a bottleneck.

Inventor

Why is Google making this the default everywhere? That seems like a big bet.

Model

Because they're trying to prove that AI isn't just a novelty or a research project anymore. If Gemini 3.5 Flash becomes the thing people use every day without thinking about it—in Search, in their email, in their work tools—then Google has moved AI from "interesting experiment" to "essential infrastructure." That's where the real value is.

Inventor

The safety measures mentioned—what's actually different there?

Model

They're using tools to look inside the model's reasoning before it answers, trying to catch problems before they happen. They're also being more careful about what they train it not to do, because the previous generation was apparently refusing to answer some legitimate questions. It's a balance: you want to prevent harmful outputs, but you don't want the system so cautious it becomes useless.

Inventor

What happens to Gemini 3.1 Pro now that this is the default?

Model

It doesn't disappear, but it becomes the option you choose when you need something different. Gemini 3.5 Pro, the more powerful version, is coming and will probably become the premium tier. This is how these companies manage their product lines—you have the fast, efficient model for everyday work, and the more capable one for harder problems.

Fale Conosco FAQ