Apple Quietly Surrenders To A Compromise On The New Siri, Leaning On NVIDIA’s B200 GPU Encryption To Prevent Google From Siphoning Off User Data

Jun 4, 2026 at 10:47am EDT
Futuristic interior view of a modern glass and steel structure with symmetrical architecture.

Apple's new Siri, empowered by a custom Google Gemini model in the cloud, was supposed to run on Apple silicon, or so the maker of iPhones had assured not too long ago.

Yet, Apple has struggled to accommodate Google's behemoth of a model on its own servers, forcing the Cupertino-based tech giant to resort to a NVIDIA GPU-based band-aid of sorts to safeguard at least a shred of its privacy-related credentials, all the while hosting the Siri-enabling Gemini model on Google's servers.

Related Story Apple’s AR Glasses To Replace The Vision Pro Lineup For Its Mass Market Appeal, But Display-Equipped Spectacles Still Several Years Away

We already know that the upcoming chatbot-style Siri will reportedly leverage a much more advanced version of Google's Gemini model, known internally as Apple Foundation Models version 11. According to Gurman, "the model is expected to be competitive with Gemini 3 and significantly more capable" than the one supporting the revamped Siri.

Meanwhile, Apple is also training a host of smaller on-device models via a technique called distillation, which imbues these student models with some of the same capabilities as those possessed by their teacher model, which in this case is the licensed Google Gemini model.

However, given the fact that Google's custom Gemini model has trillions of parameters, Apple has been struggling to accommodate it within its bespoke server network, called Private Cloud Compute. Accordingly, some user requests for the new Siri will be processed directly by the licensed Gemini model in Google Cloud to ensure optimal inference.

Now, The Information has come out with an interesting report, indicating that Apple is leaning towards deploying NVIDIA's B200 GPUs within Google's servers, especially as these GPUs come with a built-in encryption feature that enrypts data as it is being processed.

NVIDIA proclaims that the feature "preserves the confidentiality and integrity of AI models deployed on Rubin, Blackwell, and Hopper GPUs," while enabling "sensitive AI workloads to run securely at scale with near-native performance, even in shared or cloud environments."

This step should help Apple reassure its users that their data can't be siphoned off by Google, constituting the best possible compromise under the prevailing ground realities.

About the author: Writing is my one incontrovertible passion. Over the past six years, he has authored over 2,200 distinct articles on financial and tech-related topics, spanning nearly 1 million words. And he has been a member of Wcctech mobile team since 2025. As an alumnus of the University of Toronto, Rotman Commerce Program, I bring nuance, in-depth knowledge, and a unique perspective to every topic that I cover. When I'm not writing, I'm traveling the world, exploring hidden confectionaries and restaurants as an aspiring food connoisseur.

Follow Wccftech on Google to get more of our news coverage in your feeds.