MLXIO
AI / ML · May 23, 2026 · 7 min read · By MLXIO Insights Team

72% Fara1.5 AI Crushes OpenAI and Google on Web Tasks

MLXIO Intelligence

Analysis Snapshot

MLXIO Impact: 66 (Moderate)
Confidence: Low · Trend: 20 · Freshness: 98 · Source Trust: 75 · Factual Grounding: 91 · Signal Cluster: 20

Moderate MLXIO Impact based on trend velocity, freshness, source trust, and factual grounding.

Thesis

High Confidence

Microsoft’s open-weight Fara1.5-27B is positioned as a stronger web-task automation model than OpenAI Operator and Google Gemini 2.5 Computer Use, based on the article’s reported live-web benchmark scores.

Evidence

  • Fara1.5-27B scored 72% on a live-web task benchmark, versus 58.3% for OpenAI Operator and 57.3% for Google’s Gemini 2.5 Computer Use.
  • The Fara1.5 family was released on May 22, 2026 and includes 4B, 9B, and 27B parameter models.
  • Microsoft Research describes Fara1.5 as browser agents for searching, clicking, filling forms, comparing information, and completing multi-step workflows on real websites.
  • The source notes that the models are open-weight, which could improve accessibility for advanced web task automation.

Uncertainty

  • The article does not provide full benchmark methodology or test conditions.
  • Open-weight does not necessarily mean fully open-source, transparent training data, or unrestricted commercial use.
  • Real-world reliability on sensitive workflows such as purchases or account changes is not established in the provided text.

What To Watch

  • Independent replications of the live-web benchmark results.
  • License and deployment terms for the 4B, 9B, and 27B models.
  • Adoption by startups, research labs, or enterprise teams building browser agents.

Verified Claims

Microsoft’s Fara1.5-27B scored 72% on a live-web task benchmark, outperforming OpenAI Operator and Google’s Gemini 2.5 Computer Use in the article’s comparison.
📎 “Fara1.5-27B scored 72%... beating OpenAI Operator at 58.3% and Google’s Gemini 2.5 Computer Use at 57.3%.” (High)

The Fara1.5 model family includes 4B, 9B, and 27B parameter models.
📎 “The Fara1.5 family... includes 4B, 9B, and 27B parameter models.” (High)

Fara1.5 is described as an open-weight browser-agent model family designed for web task automation.
📎 “Microsoft Research frames Fara1.5 as a family of computer use agent models for the browser.” (High)

Fara1.5 is intended to perform browser actions such as searching, clicking, filling forms, comparing information, and completing multi-step workflows.
📎 “Built to search, click, fill forms, compare information, and complete multi-step workflows on real websites.” (High)

Fara1.5 uses an observe-think-act loop that considers conversation history and the three most recent browser screenshots before predicting the next action.
📎 “At each step, Fara1.5 takes in the conversation history and the three most recent screenshots... then... predicts the next single-step action.” (High)

Frequently Asked

What is Microsoft Fara1.5?

Fara1.5 is Microsoft Research’s open-weight family of browser-focused computer use agent models built to automate web tasks.

How did Fara1.5-27B perform against OpenAI Operator and Google Gemini 2.5 Computer Use?

The article says Fara1.5-27B scored 72% on a live-web task benchmark, compared with 58.3% for OpenAI Operator and 57.3% for Google’s Gemini 2.5 Computer Use.

What model sizes are available in the Fara1.5 family?

The article lists three Fara1.5 sizes: 4B, 9B, and 27B parameter models.

Why does Fara1.5 being open-weight matter?

Open-weight means developers can more directly download, adapt, and deploy the model than with a fully closed API-only system, though license and deployment terms still matter.

How is Fara1.5 different from a chatbot?

A chatbot mainly answers in text, while Fara1.5 is built to act in browser environments by searching, clicking, filling forms, comparing options, and moving multi-step tasks forward.

Updated on May 23, 2026

Microsoft’s Fara1.5-27B scored 72% on a live-web task benchmark, beating OpenAI Operator at 58.3% and Google’s Gemini 2.5 Computer Use at 57.3% — a gap that matters most to builders trying to automate messy browser work without depending entirely on closed systems.

The Fara1.5 family was released on May 22, 2026, and includes 4B, 9B, and 27B parameter models, according to CryptoBriefing. The short version: Microsoft Research says these are browser agents built to search, click, fill forms, compare information, and complete multi-step workflows on real websites.

That pushes web automation into a more competitive phase. The old model was brittle scripts and rule-based robotic process automation. The newer model is an AI agent that can interpret a web page, decide the next action, and ask for help when the task becomes sensitive or ambiguous.


Why do builders care that Fara1.5 is open-weight?

The most important word in Microsoft’s release is not “browser.” It is open-weight.

Open-weight means the model parameters are released for broader use, so developers can download, adapt, and deploy the model more directly than they can with a fully closed API-only system. It does not automatically mean every part of the system is fully open-source, that training data is transparent, or that commercial use is unrestricted. The license and deployment terms still matter.

For builders, the appeal is practical: can a startup, research lab, or enterprise team run a capable web agent without routing every workflow through a proprietary model endpoint?

Microsoft Research frames Fara1.5 as a family of computer use agent models for the browser. The company says the models are designed to remain “practical to deploy on modest hardware,” with three sizes to trade off cost and performance.

“We are releasing three model sizes: 4B, 9B and 27B, to accommodate different constraints on cost and performance,” Microsoft Research said in its Fara1.5 post.

That flexibility matters because not every automation task needs the largest model. A basic monitoring workflow may be fine on a smaller model. A long, multi-page process involving comparison, memory, and user clarification may need the larger one.

Readers tracking Google’s own AI product push can compare this with “Google Sparks Search Revolution with Gemini 3.5 Flash AI” and “Google Sparks AI Race with Gemini 3.5 Flash’s Breakthrough Speed,” though Fara1.5’s claim here is narrower: browser task execution.

How is Fara1.5 different from a chatbot that answers questions?

A chatbot can tell you how to book an event. A browser agent tries to go to the site, search for the event, compare options, fill fields, and move the task forward.

That is the core distinction. Fara1.5 is built for action in web environments, not just text generation. Microsoft says the models can handle tasks such as comparing products, filling out forms, booking events, and comparison shopping across sites.

So what happens inside the agent?

Microsoft describes an observe-think-act loop. At each step, Fara1.5 takes in the conversation history and the three most recent screenshots from the browser. It then produces reasoning and predicts the next single-step action. Those actions can include mouse-and-keyboard inputs, web search, memorizing information for later, or asking the user a question.
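
The loop described above can be sketched in a few lines of Python. This is an illustration only, not Microsoft's implementation: `Action`, `AgentState`, `run_agent`, and the `policy`, `take_screenshot`, `execute`, and `ask_user` callables are hypothetical names, and the action labels are invented for the example. The only details taken from the description are the conversation history, the window of three recent screenshots, and the one-action-per-step structure.

```python
from collections import deque
from dataclasses import dataclass, field
from typing import Callable, Deque, List, Tuple

MAX_SCREENSHOTS = 3  # per the description: only the three most recent screenshots


@dataclass
class Action:
    # Hypothetical labels: "click", "type", "web_search", "memorize", "ask_user", "done"
    kind: str
    argument: str = ""


@dataclass
class AgentState:
    history: List[str] = field(default_factory=list)
    screenshots: Deque[bytes] = field(
        default_factory=lambda: deque(maxlen=MAX_SCREENSHOTS)
    )
    memory: List[str] = field(default_factory=list)


def run_agent(
    task: str,
    policy: Callable[[List[str], List[bytes]], Tuple[str, Action]],
    take_screenshot: Callable[[], bytes],
    execute: Callable[[Action], None],
    ask_user: Callable[[str], str],
    max_steps: int = 20,
) -> AgentState:
    state = AgentState(history=[f"user: {task}"])
    for _ in range(max_steps):
        # Observe: the deque silently drops the oldest screenshot past three.
        state.screenshots.append(take_screenshot())
        # Think: the policy (a stand-in for the model) returns reasoning
        # plus exactly one next action.
        reasoning, action = policy(state.history, list(state.screenshots))
        state.history.append(f"agent: {reasoning} -> {action.kind}")
        # Act: dispatch on the single predicted action.
        if action.kind == "done":
            break
        if action.kind == "memorize":
            state.memory.append(action.argument)
        elif action.kind == "ask_user":
            state.history.append("user: " + ask_user(action.argument))
        else:
            execute(action)  # mouse, keyboard, or search actions
    return state
```

Passing the browser and the user in as callables keeps the sketch testable without a real browser; in practice those would wrap whatever sandbox and interface a deployment uses.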

Why browser work is hard

Websites are hostile terrain for automation. Layouts change. Pop-ups interrupt flows. Buttons may be visual rather than clearly labeled. Authentication walls block progress. Some actions, such as purchases or account changes, carry real consequences.

That is why Microsoft pairs the agent loop with safety boundaries. CryptoBriefing notes that Fara1.5 uses MagenticLite, a sandboxed browser interface, and includes a human-in-the-loop safeguard. The agent pauses before critical actions such as purchases or account changes and asks for user confirmation.
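
A confirmation gate of this kind is simple to express. The sketch below is a generic pattern under stated assumptions, not the MagenticLite interface: `CRITICAL_KINDS`, `gated_execute`, and the `execute` and `confirm` callables are hypothetical names chosen for illustration.

```python
from typing import Callable

# Hypothetical labels for actions that must never run without approval.
CRITICAL_KINDS = {"purchase", "account_change"}


def gated_execute(
    kind: str,
    detail: str,
    execute: Callable[[str, str], None],
    confirm: Callable[[str], bool],
) -> bool:
    """Run an action, pausing for user confirmation when it is critical.

    Returns True if the action ran, False if the user declined.
    """
    if kind in CRITICAL_KINDS:
        prompt = f"Agent wants to perform {kind}: {detail}. Proceed?"
        if not confirm(prompt):
            return False  # declined: nothing is executed
    execute(kind, detail)
    return True
```

The design choice worth noting is that the check sits in front of execution rather than inside the model: even if the model mislabels its own intent elsewhere, a critical action type cannot reach `execute` without an explicit yes.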

For finance and crypto users, that pause is not cosmetic. A mistaken click in a normal web form may be annoying. A mistaken approval in a financial workflow can be expensive.

Where does Fara1.5 beat OpenAI and Google, and where should readers be cautious?

The headline benchmark is Online-Mind2Web, which tests browser agents across 300 tasks on 136 live websites.

Model/system | Online-Mind2Web score | Source detail
Fara1.5-27B | 72% | Flagship Microsoft model
Fara1.5-9B | 63.4% (Microsoft cites 63%) | Smaller open-weight model
OpenAI Operator | 58.3% | Proprietary system
Google Gemini 2.5 Computer Use | 57.3% | Proprietary system
Fara-7B | 34.1% | Microsoft’s previous model

The jump from Fara-7B to Fara1.5-27B is the clearest signal. Microsoft more than doubled the benchmark result in about six months, from 34.1% to 72%.

The 9B result may be even more strategically interesting. CryptoBriefing reports that Fara1.5-9B scored 63.4%, ahead of OpenAI Operator and Google Gemini 2.5 Computer Use on this benchmark despite being much smaller than the flagship 27B model. Microsoft Research also says the 9B model outperforms similarly sized models and cites GUI-Owl-1.5-8B at 49%.
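
The comparisons above reduce to simple arithmetic on the reported scores. The short sketch below uses only the figures listed in the article; the helper names `gap` and `ratio` are illustrative.

```python
# Online-Mind2Web scores as reported in the article (percent).
scores = {
    "Fara1.5-27B": 72.0,
    "Fara1.5-9B": 63.4,
    "OpenAI Operator": 58.3,
    "Google Gemini 2.5 Computer Use": 57.3,
    "Fara-7B": 34.1,
}


def gap(a: str, b: str) -> float:
    """Percentage-point lead of system a over system b."""
    return round(scores[a] - scores[b], 1)


def ratio(a: str, b: str) -> float:
    """How many times larger a's score is than b's."""
    return round(scores[a] / scores[b], 2)


print(gap("Fara1.5-27B", "OpenAI Operator"))                 # 13.7
print(gap("Fara1.5-27B", "Google Gemini 2.5 Computer Use"))  # 14.7
print(gap("Fara1.5-9B", "OpenAI Operator"))                  # 5.1
print(ratio("Fara1.5-27B", "Fara-7B"))                       # 2.11
```

These are percentage-point gaps on one benchmark, so they say nothing about how the systems would rank under different tasks or test conditions.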

But benchmark leadership is not universal superiority. Results can shift with task design, prompt style, latency, tool access, authentication, real-world site changes, and deployment setup. A model that wins Online-Mind2Web may still fail in a company’s internal portal or a regulated workflow with strict audit requirements.

How could finance or crypto teams use a Fara1.5-style agent?

A realistic use case is not “let the AI trade.” It is controlled information gathering.

Picture a crypto research team that monitors token project websites, governance forums, exchange notices, and regulatory pages. A Fara1.5-style browser agent could visit approved sources, identify new announcements, compare them with prior records, draft a morning summary, and route the output to an analyst.

The useful version of that workflow has hard boundaries:

  • Source control: The agent only visits approved websites.
  • Change detection: It flags new pages, edits, or announcements for review.
  • Human review: An analyst checks the summary before publication or action.
  • No autonomous execution: The agent does not trade, sign transactions, submit filings, or publish market calls.
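
The first two boundaries, plus the hand-off to a human, can be sketched with the Python standard library. This is a minimal sketch under stated assumptions: `APPROVED_HOSTS`, `is_approved`, `detect_change`, and `review_queue` are hypothetical names, the hosts are placeholders, and the page text is assumed to have been fetched already by whatever agent or crawler a team uses.

```python
import hashlib
from typing import Dict, List
from urllib.parse import urlsplit

# Hypothetical allowlist; a real deployment would load this from reviewed config.
APPROVED_HOSTS = {"example.com", "forum.example.org"}


def is_approved(url: str) -> bool:
    """Source control: accept only http(s) URLs whose host is allowlisted."""
    parts = urlsplit(url)
    return parts.scheme in {"http", "https"} and parts.hostname in APPROVED_HOSTS


def detect_change(url: str, page_text: str, seen: Dict[str, str]) -> bool:
    """Change detection: True when a page is new or its content hash differs."""
    digest = hashlib.sha256(page_text.encode("utf-8")).hexdigest()
    changed = seen.get(url) != digest
    seen[url] = digest  # remember the latest version for the next run
    return changed


def review_queue(pages: Dict[str, str], seen: Dict[str, str]) -> List[str]:
    """Human review: list changed, approved pages for an analyst.

    The function only reports; it never trades, signs, files, or publishes.
    """
    return [
        url
        for url, text in pages.items()
        if is_approved(url) and detect_change(url, text, seen)
    ]
```

Hashing the whole page is the crudest form of change detection and will flag cosmetic edits too; teams would likely hash only the extracted announcement text, but that extraction step is outside what the article describes.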

CryptoBriefing makes the DeFi connection directly but carefully: Microsoft did not build Fara1.5 for crypto, and there are no direct integrations with blockchain protocols, DeFi applications, or Web3 projects in the supplied source material.

Still, DeFi interfaces are web applications. Token swaps, vault management, bridge flows, and governance actions involve forms, confirmations, and multi-step browser interactions. That overlaps with the kind of task structure Fara1.5 was trained to handle.

The human-in-the-loop design is the key constraint. In DeFi, transactions are irreversible. An agent that pauses before a critical action is more useful than one that races ahead.

What could slow adoption of Fara1.5 web agents?

Reliability is the first barrier. Web agents can click the wrong button, misunderstand a user goal, miss a pop-up, or declare success after completing only part of a task. The benchmark scores show progress, not perfection.

Security is the second. Automated browsing can expose credentials, personal data, internal documents, or sensitive financial information if teams deploy agents carelessly. A sandbox helps, but it does not replace access controls, logging, permissioning, and review.

Compliance is the third. Some websites restrict automated access. Some workflows require audit trails. Regulated financial activity cannot be handed to a model just because it can operate a browser.

Microsoft has made the 9B model available on Microsoft Foundry, with the 4B and 27B versions expected to follow, according to CryptoBriefing. The practical adoption test now moves from benchmark tables to controlled deployments.

The watch item is simple: if open-weight browser agents keep closing the gap with proprietary systems while adding stronger controls, advanced web automation becomes less of a premium feature and more of a buildable layer for teams that know exactly where humans must stay in the loop.

The Bottom Line

  • Microsoft’s open-weight Fara1.5-27B beat major closed competitors on live-web automation tasks.
  • Developers may gain more control over browser agents without relying entirely on proprietary API endpoints.
  • The release signals faster competition in AI agents that can search, click, fill forms, and complete web workflows.

Live-Web Task Benchmark Comparison

Model | Benchmark Score | Key Context
Microsoft Fara1.5-27B | 72% | Open-weight browser agent model
OpenAI Operator | 58.3% | Closed system competitor
Google Gemini 2.5 Computer Use | 57.3% | Closed system competitor

[Chart: Live-Web Task Benchmark Scores (%). Microsoft Fara1.5-27B: 72; OpenAI Operator: 58.3; Google Gemini 2.5 Computer Use: 57.3]
