How Deep Does The Rabbit Hole Go? Intel’s 14th & 13th Gen CPU Instability Issues Analysis: GamersNexus Analyzes Potential “Oxidation” Issues With Silicon, No Thermal Failure & More

Jul 22, 2024 at 12:25pm EDT
Unreal Engine Supervisor Discloses 50% Failure Rate With Intel's Core i9-14900K & 13900K CPUs, Switches To AMD For "Reliability" 1

Intel's 14th & 13th Gen CPU instability issues have existed for more than a year but while Intel has yet to give a solid reasoning for the problems, others have come up with potential causes of the silicon degradation and crashing issues associated with these chips.

Intel's Instability Issues Have Outgrown From Being "Software-Limited" To Possible Silicon Defects, Official Answer Is To "WAIT"

The crashes, instability, and performance issues present in Intel's 14th Gen & 13th Gen CPUs are bothering several consumers out there to the point where it is unbearable, and the community is now determined to switch towards alternatives, such as offerings from AMD.

Related Story Valve Confirms It’s Working With NVIDIA to Bring SteamOS To Even More PC Gamers

So far, here's the timeline of Intel's 14th & 13th Gen Instability issues:

We have seen game studios such as Alderon Games and Epic Games raising the issue on their respective platform, along with tech content creators such as Wendell from Level1Techs, providing their audience awareness about the issue.

Despite the problem spreading into mainstream media, Team Blue hasn't managed to address the root cause. The company has worked with AIBs & board partners to mitigate the issues while discovering several other such as the eTVB bug but outside of that, there's been no proper communication from the blue team which tells us two things, either the firm is worried about a heated backlash from its customer base (clients, partners, OEMS) or they want to drag the issue as much as possible until something new comes out and people simply forget about it.

Now, GamersNexus has compiled on-ground statistics on how the issue is affecting Intel's 14th & 13th Gen consumers, and according to one "unnamed" Intel customer, they have witnessed 600,000 to 2 million CPUs facing instability issues. This only includes the 13th Gen units and information surrounding the 14th Gen SKUs is currently unavailable. Interestingly, one Intel customer disclosed that the affected units have production dates from March 2023 to April 2024, spanning more than 12 months of retail SKUs revolving in the markets that are facing the problem.

From what we have heard, 1/3rd of all Intel Raptor Lake CPUs that have been shipped are Core i9-13900K or 14900K units so that's roughly around 40-60 million units (estimates from Mike Bruzzone). If that's the case, then Intel might be facing a huge recall, one that would end up being a major disaster for the company and that might be a potential reason why they are taking time with the appropriate response to the community.

Our editor, Hassan, reported in a post on X a while back that he started facing these issues in early 2023, just a few months after the release of the 14th Gen Desktop CPUs. While the BIOS mitigation has made things a little stable for him, one can easily say that applying the current "power limit" fix will reduce the performance of your chip versus what you originally had.

Onto the more intriguing bits, GamersNexus has compiled the possible reasons behind the instability issues based on Intel's internal documents and information from the customers. Newly surfaced information claims that Team Blue might have faced a "fabrication" issue with the affected 13th and 14th Gen chips, where the "anti-oxidation" on the SKUs wasn't applied sufficiently, causing disruptions in the electrical connections of the processors.

Image Credits: GamersNexus

Well, this reasoning does make some sense, considering that limiting power levels didn't solve the issue at all despite Intel releasing relevant microcodes. While we won't go into how oxidation has affected the functionalities of the CPUs (check out GamersNexus video below for detailed info), to sum it up, it might have affected individual layers, which is why the solution doesn't lie in any sort of software-level mitigation.

It's important that the 'power limiter' issue doesn't lead this story. It's not a 'power limit on the board' issue. It's a chip issue and always has been. The power limit issue was fixed with microcode. We have no idea if it affects Meteor Lake or not yet. The current possible affected processors are [about] 8 million shipped that we know of.

If you disable Turbo Boost, you can get 'stability' until the corrosion/contamination make the CPU fault. We have reports of some CPUs that will not even boot without blue screening because the contamination/corrosion is so bad.

- Large Intel Customer to GN

Well, what's next, then? According to GamersNexus, vendors are finding intermediary solutions, with some moving towards limiting the clock speeds to 5.3 - 5.5 GHz at an OEM level while others are waiting for Intel to come up with a solution. Intel has started to work with vendors, hinting them towards providing refunds for affected CPUs, and there are rumors of a potential "large-scale" callback as well, but nothing is certain for now.

Furthermore, below are failure rates segregated based on individual Intel SKUs. interestingly, there are no thermal-based failures, which hints that the instability issue is likely something more complex.

Meanwhile, leaker @Jaykihn also states that the reports of oxidation being the issue behind the Intel 13th and 14th Gen CPU issues seem unlikely. He also states that the report doesn't add up since he has statistics of Intel 7 (process node) chips tested as of June 2024.

Ian Cutress for More Than Moore also weighs in on potential reasons behind the issues:

We have pushed out extensive guidelines on how to solve the issue based on Intel's guidance along with suggestions from third-party sources; hence, you can check them out if you haven't implemented the solutions mentioned. Furthermore, we have talked with various board partners about this and they have said that they are taking extra caution and spending more time testing & evaluating the clock and power behavior, not only for existing chips but also the upcoming CPUs such as Arrow Lake.

For now, we have to wait and see how Team Blue moves with the situation. Given that Arrow Lake-S desktop CPUs are right around the corner, the whole fiasco is starting to become interesting yet unfortunate at the same time.

About the author: Muhammad Zuhair is a hardware and technology reporter for Wccftech, specializing in the semiconductor industry and the complex interplay between technology, manufacturing, and geopolitics. His coverage focuses on the corporate strategies and technological roadmaps of industry giants like TSMC, NVIDIA, Samsung, and Intel. Zuhair's expertise lies in deconstructing complex topics such as fabrication nodes (e.g., 2nm process), the economic impact of policies like the CHIPS Act, and the strategic development of AI infrastructure from NVIDIA, AMD and Intel.

Follow Wccftech on Google to get more of our news coverage in your feeds.

Deal of the Day