Radio
Now Playing
Quickyla Radio โ€” Click to play
Open โ†’
3 min left
Back to News

Googleโ€™s Android coding tests reveal an unexpected Gemini 3.5 Flash weakness

Affiliate links on Android Authority may earn us a commission. Learn more. Google has just refreshed its Android Bench rankings, and the results present developers with a puzzling picture. Googleโ€™s new Gemini 3.5 Flash is actively falling behind its predecessor while charging yo

Googleโ€™s Android coding tests reveal an unexpected Gemini 3.5 Flash weakness
Android Authority โ€” 15 June 2026
Text:
3 0 0

Affiliate links on Android Authority may earn us a commission. Learn more.

Google has just refreshed its Android Bench rankings, and the results present developers with a puzzling picture. Googleโ€™s new Gemini 3.5 Flash is actively falling behind its predecessor while charging you three times the price to use it.

The latest Android coding leaderboard , a benchmark that evaluates how well different AI models can perform Android development tasks, introduced Gemini 3.5 Flash for the first time, but the newcomer didnโ€™t make it into the top five. Topping the list was OpenAIโ€™s GPT 5.5 , which scored 74, followed by GPT 5.4 and an older Google model, Gemini 3.1 Pro Preview, both with 72.4. The new Claude Opus models also outperformed the Flash variant.

Gemini 3.5 Flash scored 63.7, placing sixth overall. What was more surprising, though, was its efficiency. The model averaged 355.9 total tokens, a big jump compared to other systems, according to Googleโ€™s benchmark data. That came to an average cost of $147.1, making it the most expensive model on the entire list even with slower performance than a number of rivals.

For context, Googleโ€™s Flash branding has always been about speed and cheaper prices. At Google I/O 2026, the company announced the most powerful Flash model it had ever built , Gemini 3.5 Flash, which it claimed had more robust coding capabilities and better support for AI agents and complex workflows. Google also said the model outperformed Gemini 3.1 Pro in a number of internal benchmarks and produced output up to four times faster than competing frontier models.

However, the Android benchmark tells a different story. Gemini 3.5 Flash might shine in the broader evaluations of agentic and coding tasks run by Google, but its performance on actual Android development tasks seems less than stellar. For example, Gemini 3.1 Pro Preview delivered a significantly better score while costing about one-third as much, as noted by 9to5Google .

The bigger question now is whether Google can improve Gemini 3.5 Flash with updates or whether the upcoming Gemini 3.5 Pro will better deliver on the companyโ€™s performance promises. For now, Googleโ€™s own numbers suggest that newer isnโ€™t always better.

Thank you for being part of our community. Read our Comment Policy before posting.

Advertisement
React:
Sponsored

More to Read

You can now beat ChatGPT Codex rate limits, if you have friโ€ฆ
๐Ÿ’ป Technology
You can now beat ChatGPT Codex rate limits, if you have friends
Android Authority ยท 3 days ago
Meta is reportedly developing an AI pendant
๐Ÿ’ป Technology
Meta is reportedly developing an AI pendant
TechCrunch ยท 16 days ago
Cash App made a magic wand for contactless payments
๐Ÿ’ป Technology
Cash App made a magic wand for contactless payments
The Verge ยท 11 days ago
CBS News insiders worry how 60 Minutes will endure after fiโ€ฆ
๐Ÿ’ฐ Business
CBS News insiders worry how 60 Minutes will endure after firings: โ€˜What are they going toโ€ฆ
Guardian Business ยท 11 days ago
'Astonishing': James Webb telescope spots the most chemicalโ€ฆ
๐Ÿ”ฌ Science
'Astonishing': James Webb telescope spots the most chemically primitive galaxy in the ancโ€ฆ
Live Science ยท 15 days ago
Sam Altman says OpenAI's top token spender uses 100 billionโ€ฆ
๐Ÿ“ˆ Markets & Finance
Sam Altman says OpenAI's top token spender uses 100 billion tokens a month โ€” and they're โ€ฆ
Business Insider Mkt ยท 12 days ago
Full view