How JPMorgan Chase Is Building The AI-Powered Bank Of The Future

JPMorgan Chase is giving the business world an early look at what happens when AI agents move beyond experiments and start reshaping workflows, jobs & customer experience

Samsung Confirms Galaxy Z Fold 8 Wide Smartphone In Juicy Tease

New folding phones are coming and Samsung has now confirmed that the biggest rumor, a wide-format model, has truth in it.

Apple iOS 26.5.2 New iPhone Software: Should You Upgrade?

The latest iPhone update is a surprise, featuring security fixes which were due to be released with iOS 26.6 but brought forward.

Galaxy Z Fold 8: Images Show How Oddly Small Samsung’s New Phone Is

New leaked cases and a size comparison show just how different the Galaxy Z Fold 8 Wide really is. Here’s what’s changed, and why it caught me off guard.

Anthropic launches Claude Sonnet 5 at a steep discount to its top model as the company races toward a blockbuster IPO

Anthropic today released Claude Sonnet 5, a new AI model that the company says delivers near-flagship performance at mid-tier prices — a move designed to give cost-conscious enterprise developers access to powerful agentic capabilities just as the San Francisco-based AI lab barrels toward an initial public offering that will test whether the private market’s staggering AI valuations can survive public scrutiny.

The release, which Anthropic describes as “the most agentic Sonnet model yet,” makes Sonnet 5 the default model for users on Anthropic’s Free and Pro plans, while also making it available to Max, Team, and Enterprise customers. Introductory API pricing is set at $2 per million input tokens and $10 per million output tokens through August 31, after which it rises to $3 and $15 respectively — still well below the $5 input and $25 output pricing of Anthropic’s top-of-the-line Opus 4.8.

The strategic logic is unmistakable: Anthropic is trying to democratize access to capabilities that until very recently only its most expensive models could deliver, while building the kind of broad-based developer adoption that will look attractive in an S-1 filing.

Sonnet 5 benchmarks show the mid-tier model closing in on Anthropic’s flagship Opus

Sonnet 5 posts major gains over its predecessor, Sonnet 4.6, across every evaluation Anthropic disclosed. On SWE-bench Pro, an agentic coding benchmark, Sonnet 5 scores 63.2% compared with Sonnet 4.6’s 58.1% — a jump that brings it within striking distance of Opus 4.8’s 69.2%. On Terminal-Bench 2.1, another coding evaluation, the gap narrows further: 80.4% for Sonnet 5 versus 67.0% for Sonnet 4.6 and 82.7% for Opus 4.8.

In multidisciplinary reasoning, as measured by Humanity’s Last Exam, Sonnet 5 scores 43.2% without tools and 57.4% with tools — the latter figure essentially matching Opus 4.8’s 57.9%. On computer use tasks evaluated through OSWorld-Verified, Sonnet 5 reaches 81.2%, up from 78.5%. And on GDPval-AA v2, a knowledge-work benchmark, it scores 1,618 — surpassing Opus 4.8’s 1,615 and far exceeding Sonnet 4.6’s 1,395.

The pattern across these evaluations tells a consistent story: Sonnet 5 doesn’t merely inch forward from its predecessor. It vaults into a performance tier that overlaps substantially with Anthropic’s flagship model, while costing roughly 60% less per token at standard pricing and even less during the introductory period.

Enterprise partners say Sonnet 5’s agentic AI capabilities finish jobs that previous models abandoned

The emphasis on agentic capabilities — the ability to plan, use tools like browsers and terminals, and execute multi-step workflows autonomously — reflects where the AI industry’s center of gravity has shifted in 2026. Enterprises are no longer simply asking chatbots questions; they are deploying AI systems that can navigate complex software environments, execute multi-step coding tasks, and operate with minimal human supervision.

Early access partners painted a picture of a model that doesn’t just start tasks but finishes them. Sualeh Asif, co-founder of Cursor, the AI-powered code editor that has become a bellwether for developer tool adoption, said that “with Claude Sonnet 5, agents stay on plan, follow our conventions, and ship clean multi-step changes, all at an efficient cost.” Daniel Shepard, a senior engineer at Zapier, described handing the model a two-part automation job — updating Salesforce account tiers and sending a launch announcement — that “used to stall halfway” with previous models but now completes end to end.

These testimonials matter because they describe exactly the kind of reliability gap that has kept many enterprises from moving agentic AI from pilot programs to production deployments. A model that gets 80% of the way through a complex task before stalling creates more problems than it solves; one that reliably completes the full workflow changes the economics of automation. Anthropic also introduced cost-performance curves showing that developers can now adjust effort levels across Sonnet 5 and Opus 4.8 to find the optimal balance of cost and accuracy for their specific use case — a granularity that reflects growing sophistication in how enterprises consume AI services.

An updated tokenizer boosts Sonnet 5 performance but could quietly raise costs for some workloads

One technical detail buried in the announcement’s footnotes deserves attention: Sonnet 5 uses an updated tokenizer that changes how the model processes text, similar to the change Anthropic introduced with Opus 4.7.

The tradeoff is that the same input can map to roughly 1.0 to 1.35 times as many tokens depending on content type. Anthropic says the introductory pricing is calibrated to make the transition “roughly cost-neutral,” but enterprise customers running high-volume workloads will want to benchmark their specific use cases carefully before assuming their bills won’t change.

Anthropic says Sonnet 5 is safer than its predecessor, but its most capable models still lead on alignment

Anthropic’s safety disclosures reveal a nuanced picture. The company reports that Sonnet 5 shows lower rates of hallucination and sycophancy than Sonnet 4.6, is better at refusing malicious requests, and is more resistant to prompt injection attacks in agentic contexts. On Anthropic’s automated behavioral audit — which tests for a wide range of misaligned behaviors including cooperation with misuse and deception — Sonnet 5 scored lower (meaning safer) overall than Sonnet 4.6.

However, Sonnet 5 showed “somewhat higher rates of misaligned behavior” compared with the more capable Opus 4.8 and Anthropic’s Claude Mythos Preview, the company’s powerful but tightly restricted cybersecurity-focused model. On a Firefox 147 exploit development evaluation created in collaboration with Mozilla, neither Sonnet model could develop a working exploit — both scored 0.0% — though Sonnet 5 showed a slightly higher partial success rate (13.2%) than Sonnet 4.6 (8.8%). Both remain far below Opus 4.8 (68.8% working exploits) and Mythos 5 (88.4%).

Because of these incremental gains in cyber-adjacent capabilities, Anthropic launched Sonnet 5 with cyber safeguards enabled by default — real-time systems that detect and block dangerous cybersecurity usage. The safeguards mirror those on Opus 4.7 and 4.8 but are less restrictive than those applied to Fable 5, the latest Mythos-class model that Bloomberg reported on June 10 is “blocked from responding to queries related to cybersecurity and biology.” Organizations enrolled in Anthropic’s Cyber Verification Program automatically receive the same access on Sonnet 5 without needing to reapply.

From $14 billion to $47 billion in revenue: Sonnet 5 arrives as Anthropic’s IPO narrative takes shape

The Sonnet 5 launch arrives at what may be the most consequential moment in Anthropic’s short history. The company confidentially filed its IPO prospectus with the SEC in early June, setting up what CNBC has described as “the most scrutinized public offering in tech history.”

The financial trajectory has been extraordinary. In February, Anthropic raised $30 billion at a $380 billion valuation, with the company reporting $14 billion in annualized revenue that had “grown more than tenfold in each of the past three years,” as The Guardian reported

By late May, Anthropic had closed a $65 billion Series H round at a $965 billion post-money valuation — co-led by Altimeter Capital, Sequoia Capital, and others — with a revenue run rate that had crossed $47 billion. Harrison Rolfes, an analyst at PitchBook, told CNBC that the number that will “either validate or collapse the entire narrative the private markets have been pricing for three years” won’t be the valuation or revenue, but gross margin — a figure no outside observer has yet seen.

In this context, Sonnet 5 serves a dual purpose. For developers, it offers genuine capability improvements at competitive prices. For Anthropic’s IPO narrative, it demonstrates the company can deliver a compelling product at a price tier that could drive the kind of broad adoption Wall Street rewards — high-volume, recurring API revenue from thousands of enterprise customers.

Government deals and growing competition define the market Sonnet 5 enters

The timing also aligns with Anthropic’s aggressive push into institutional contracts. Just yesterday, California Governor Gavin Newsom announced a first-of-its-kind partnership providing Claude to all state agencies at a 50% discount, with free workforce training.

Kate Jensen, Anthropic’s Head of Americas, called it an effort to “put Claude to work for the people who keep this state running.” The deal — which extends to California’s cities and counties — represents exactly the kind of durable, recurring adoption that could anchor revenue well beyond the developer community.

But Anthropic’s release lands in an increasingly crowded field. OpenAI, which raised a $122 billion round in March at an $852 billion valuation, is pursuing its own IPO. Elon Musk’s SpaceX, which merged with xAI, priced its IPO at $135 per share with a $1.77 trillion valuation. Google, Meta, and a growing wave of well-funded competitors — including Asian AI startups that, as the Wall Street Journal has reported, are developing Mythos-like cybersecurity capabilities — are all vying for the same enterprise market.

Gil Luria, head of technology research at D.A. Davidson, told CNBC that while Anthropic “appears to have the lead” in frontier AI models, “much of their current usage is for trials and experimentation and that may not sustain.” That observation cuts to the heart of the challenge facing every frontier AI lab: converting experimental developer usage into durable, production-grade revenue.

The real test for Sonnet 5 isn’t benchmarks — it’s whether cheaper AI can sustain a trillion-dollar story

Sonnet 5’s positioning — offering near-Opus performance at Sonnet prices — is a direct play for that conversion. Enterprise customers experimenting with expensive Opus-class models may find that Sonnet 5 delivers sufficient quality for production workloads at a price point that finance teams can approve at scale. If it works, it could accelerate the shift from experimentation to deployment that every AI company needs to justify its valuation.

Three things will determine whether Sonnet 5 matters beyond the initial benchmark charts. Real-world agentic reliability is the first: benchmarks measure capability, but production deployments measure consistency, and the true test will come when thousands of developers push the model through messy, unpredictable workflows at scale.

The tokenizer economics are the second: the updated tokenizer’s 1.0 to 1.35x token expansion could quietly erode the pricing advantage for certain workloads, and enterprise customers should run their own cost analyses rather than relying on headline per-token prices. The third is the IPO narrative itself: when Anthropic’s S-1 eventually becomes public, investors will scrutinize whether the Sonnet tier — cheaper but high-volume — or the Opus tier — expensive but high-margin — drives the bulk of revenue and, critically, gross profit.

As PitchBook’s Rolfes told CNBC, the 2026 IPO window “either becomes the most consequential IPO cycle since the dot-com era or the most expensive lesson in narrative-versus-fundamentals that public markets have ever taught.”

Anthropic is betting that a model good enough to rival its flagship and cheap enough to run at scale is the product that closes the gap between those two outcomes. The public markets will soon decide whether they agree.

Practical, Cost-Effective Ways Businesses Can Reduce E-Waste

Companies can often reduce e-waste—and reap business benefits—through smarter operational decisions and more intentional technology management.

Google’s Gemini Omni Flash hits the API, turning enterprise video production into a conversation

For most enterprises, a 90-second training video or a product explainer has never been an easy ask. It means a well planned brief, an internal film crew or an outside vendor, a shoot, an edit, and a round of revisions. Change one line of on-screen text due to a legal review and the whole chain runs again. The cost and the long time lines are why so much internal video never gets made.

That equation is what Google is aiming to rewrite with Gemini Omni Flash, the first model in its new “Omni” family, now rolling out to developers and enterprise customers through an API after debuting to consumers at I/O 2026. Google frames the family’s ambition as creating anything “from any input,” starting with video. But the headline interaction isn’t just a sharper text-to-video prompt. It’s the ability to edit a finished clip through conversation.

When the model launched in May, VentureBeat’s enterprise analysis flagged the catch: with no programmatic interface, Omni was a consumer and prosumer tool, not a production one. This API rollout changes that. It puts conversational editing in front of the marketing and learning-and-development teams that make the most videos in an organization.

The pitch: a five-tool pipeline collapses into a single conversation

Until now, many teams have been assembling AI videos the hard way, bolting together an LLM for a script, a text-to-image model, an image-to-video model, a separate lip-sync tool and a voice generator, each with its own contract, billing and data path.

Omni’s enterprise argument is unification: one model that takes text, images and video and returns a finished clip with synced audio.

That simplicity factor is the part decision-makers should weigh first. Collapsing several point tools into one model means fewer vendors and a single place to monitor output and enforce data-handling rules. For an organization that has avoided generative video because stitching the tools together wasn’t worth the overhead, the equation shifts.

With conversational editing each instruction builds on the last, so a marketer can relight a product shot, reframe it, or change the wardrobe without regenerating from scratch and losing the parts that already worked. It is the difference between booking a reshoot and sending a note.

Multimodal references and a physics engine for brand assets

Omni accepts far more than a text prompt. Alongside the words describing what you want, you can feed it multiple reference images, and existing video clips, and it carries those specifics into the result. Hand it a photograph of a particular object, ask the model to place that object into a scene, and it reproduces the real thing’s coloring and rough shape instead of inventing a generic stand-in. While the match might not be pixel-perfect, it is close enough to be recognizable. That reference-driven control is what makes the feature commercially interesting: a product photo, a brand logo, or a specific location can be dropped in as an ingredient rather than described in a prompt and hoped for.

Two of Google’s four highlighted strengths speak directly to enterprise work. The first is a world model, the system’s grasp of how physical scenes behave. Add light rain and puddles to an existing shot and it renders reflections of the people and objects in the wet pavement, the sort of physical consistency that separates real footage from obvious AI video. 

The second is text and logo insertion. Point it at a scene full of signage and you can have it rewrite those signs in another language, or for a brand of your choosing, and even drop in a company’s logo. The results aren’t flawless: in testing, sign tracking in complex scenes weren’t always perfect and some text slipped back to the original language between frames. For training videos that need on-screen labels, or ads that need a logo placed in-scene, it is a capability worth a close look, and a reminder that the output still needs a human review before it ships.

The interactions API and where the limits still bite

Under the hood, this runs on Google’s new interactions API, a stateful interface built for multi-turn tasks rather than open-ended chat. Each turn carries the previous video and its references forward, which is what lets edits accumulate coherently. Developers can chain generations. They can produce a clip, edit the cat into a puma kitten, restyle a video into 8-bit retro and then into a watercolor look, and store each version to branch from later.

The constraints are real and worth budgeting around. Clips currently cap at 10 seconds, per the model’s published model card. To make something longer, you generate chunks and edit them together. Uploaded footage can be edited too, as long as it runs 10 seconds or under and the user holds the rights to it. Google’s own model card is candid that holding consistency across edits and rendering accurate text remain open problems.

Guardrails, watermarking and the line Google won’t cross

For a CISO, the demos matter less than the provenance work shipping alongside the model. Every Omni clip carries Google’s SynthID watermark, Google is extending C2PA Content Credentials across its generative tools, and it has launched an AI Content Detection API that flags AI-generated media, both Google’s and other vendors’.

Google has also drawn a deliberate line. The model won’t take a still photo of a person plus an audio clip and lip-sync them into speech, an explicit move to limit deepfakes. It will, however, take a recording of someone talking and translate it into another language, a useful path for localizing global training content. For regulated enterprises, those constraints and the baked-in provenance are features rather than friction.

The numbers: cheap, 720p-only, and (preliminarily) ranked first

The pricing landed alongside the API, and it is aggressive. Omni Flash costs $0.10 per second of generated 720p video, which puts a ten-second clip at roughly a dollar. That matches Veo 3.1 Fast at the same resolution, runs double Veo 3.1 Lite, and undercuts standard Veo 3.1 by three-quarters.

Per second (USD)

Gemini Omni Flash

Veo 3.1 Lite

Veo 3.1 Fast

Veo 3.1

720p

$0.10

$0.05

$0.10

$0.40

1080p

n/a

$0.08

$0.12

$0.40

4K

n/a

n/a

$0.30

$0.60

The table also exposes the catch though. Omni Flash only generates 720p. There is no 1080p or 4K option, while the Veo tiers scale up to 4K. For internal training and most social video, 720p is fine. For premium brand work meant for a large screen, it is a real ceiling, and the reason Veo 3.1 still has a job

Clips run 3 to 10 seconds at 720p native, in landscape (16:9) or portrait (9:16). As reference inputs the model accepts up to seven images and up to three video clips of three seconds or less. It does not take audio as an input yet, though it generates audio alongside the video it produces. Output is standard MP4, and every clip ships with SynthID watermarking and C2PA credentials baked in.

On quality, the early signal is strong. In LMArena’s Text-to-Video Arena, a leaderboard where people vote on head-to-head outputs from competing models, Omni Flash sat at number one with a score of 1527. 

What it means for budgets, and what’s still missing

With real pricing in hand, the iteration story gets concrete. Every conversational edit is a fresh generation you pay for, so an edit-heavy session still adds up, roughly a dollar for each ten-second pass at 720p. What the stateful model changes isn’t the cost of an edit, it’s the number of wasted ones: because context carries across turns, those generations go toward refining a take that mostly works instead of restarting from a blank prompt and hoping the next attempt lands.

Omni isn’t alone in this field. Veo 3.1 remains Google’s production-grade option when you need higher resolution, and rivals from Bytedance, Alibaba and OpenAI are all chasing the same budgets. What Omni adds is the editing capability itself: the ability to treat a video as a living document instead of a one-shot render.

Google unveils Nano Banana 2 Lite aka Gemini 3.1 Flash-Lite for low cost, 4-second fast enterprise image generations

Google is upgrading its AI image generation capabilities today with the debut of Nano Banana 2 (NB2) Lite, an optimized model built for rapid execution and tight infrastructure budgets.

Technically designated as Gemini 3.1 Flash-Lite Image on Google’s application programming interface (API), NB2 Lite is positioned as the fastest and most cost-effective option within Google’s creative model family, capable of generating images in 4 seconds at a flat rate of $0.034 per 1,000 images.

It’s available immediately to enterprise developers through Google AI Studio, the Gemini API, and the Gemini Enterprise Agent Platform (GEAP).

It’s not quite as fast or customizable as startup Krea’s new, partially open licensed Krea 2 Turbo (which allows for open modification and commercial usage by small enterprises), but the big selling point here is the low price and bundling with Google’s larger Workplace and AI offerings.

This release lands alongside the public preview of Gemini Omni Flash, a multimodal conversational video generation and editing model.

However, while Omni Flash represents Google’s long-term bet on agentic video manipulation, Nano Banana 2 Lite is the immediate infrastructure workhorse, tailored specifically for high-throughput commercial application, rapid programmatic prototyping, and automated asset generation workflows.

The technology of speed

At its core, Nano Banana 2 Lite is built directly upon the Gemini 3.1 Flash Lite architecture, engineered to solve the persistent tension between computational latency and operational overhead.

In high-velocity enterprise frameworks, traditional large-scale image models introduce significant friction due to multi-second processing delays and high per-token costs. Google’s new lightweight model circumvents these bottlenecks by generating a standard 1k resolution image in under four seconds.

This represents a stark performance optimization over its legacy predecessor, Nano Banana (Gemini 2.5 Flash Image), achieved through targeted enhancements in core baseline capabilities.

According to internal documentation, the model features upgraded world knowledge for drafting rough data visualizations and contextual layouts, enhanced character consistency to preserve identity across continuous image streams, and localized typographic rendering capabilities.

The trade-offs inherent to this “Lite” designation are transparently outlined in Google’s technical data sheets.

Unlike the broader standard Nano Banana 2 (NB2) and Nano Banana Pro (NB Pro) lines, which support versatile multi-resolution scaling across 1k, 2k, and 4k outputs, Nano Banana 2 Lite restricts its resolution support exclusively to a 1k canvas. Yet, within this specialized operational boundary, the architectural tuning yields surprising competitive efficiencies. In standardized internal benchmarks, Nano Banana 2 Lite achieved a Text to Image arena Elo score of 1251. This score comfortably eclipses the legacy NB1 score of 1151 and remarkably edges out the bulkier, more expensive NB Pro, which sits at 1245 in the same text-to-image track. For specialized editing tasks, the model maintains a single-image editing Elo score of 1308 and a multiple-image editing score of 1294, providing a highly optimized sweet spot for real-time applications.

A boost to rapid prototyping and marketing research

From a product implementation perspective, Google is marketing Nano Banana 2 Lite not as an artistic engine, but as an invisible, high-throughput utility layer for automated workflows. T

he target demographic spans software engineers, programmatic ad platforms, and digital commerce applications where rapid iteration is crucial.

Think real-time A/B testing for thousands of targeted advertising variations or immediate layout adjustments on localized storefronts. Google highlights three specific production environments where the model excels.

First, its world knowledge allows systems to instantly draft accurate contextual scenes or location-specific mockups.

Second, its character consistency handles the rigorous demands of storyboarding tools and digital fashion try-ons, where keeping object fidelity static across sequential generations is historically difficult.

Finally, its text rendering improvements mean legible copy can be embedded directly into rapid ad generations, allowing teams to verify layout compatibility across various languages on the fly.

Developers should note, however, that while native image generation operates with lowest-latency profiles, conditional image editing tasks may experience marginally higher response times due to the secondary processing layers required to rewrite existing pixels.

Licensing and acess

The deployment mechanism of Nano Banana 2 Lite via proprietary APIs underscores an enterprise-first commercial licensing strategy.

Unlike open-weights models that developers can pull down to run locally under open-source frameworks like Apache 2.0 or modified OpenRAIL licenses, Google’s latest models remain tightly integrated into its managed cloud stack.

For enterprises, this eliminates the operational complexity of hosting hardware but binds usage strictly to Google’s metered pricing terms.Financially, this commercial strategy is highly aggressive.

At $0.034 per 1,000 images across both AI Studio and GEAP channels, the model undercuts the older, less capable NB1 model ($0.039) and slashes costs dramatically compared to standard NB2 ($0.067) and NB Pro ($0.134) tiers. Internal notes indicate that the model delivers roughly 60–70% of the general capability of NB2 and NB Pro while executing at significantly higher speeds and a fraction of the cost.

By lowering the fiscal barrier to high-frequency image generation, Google is making a direct play to lock enterprise developers into its commercial platform ecosystem.

Google App Update: A Stunning Gemini AI Upgrade For Mac Users

Google’s new Gemini AI upgrade will let Android users remotely control their Mac workflows.

Esports Company BLAST Reports Record Growth Following US Expansion

After a major expansion into the US, esports tournament operator BLAST has reported record growth and revenue over the past year.