The Great NVIDIA Switcheroo | GPU Shrinkflation


Chart – Proportion of CUDA Cores Relative to Generational Flagship

We’ve already talked about pricing and availability, but what we’re doing now is revisiting a topic that we ran about 2 years ago in a video called the RTX 4080 Problem, in which we explored why no one was buying 4080s (watch our review) at the time. It wasn’t just the money, but the relationship of what you got for the money. We’re taking the concepts where we broke out the pricing, the components, the die area, etc. and applying them to the 50 series and, in short, it has not gotten better.

We just have 2 main charts to go through in this article, but they’re really interesting. Now that NVIDIA has shipped everything except for the 60-class card, we’ve got a good amount to look at. The real goal of this is to explore the relationship between the money and what you get for it, but we’re also going to compare some of the cards against prior generations and do some inflation adjustments.

We started working on this piece for the 5080 launch and then realized it was going to get worse. So, we waited for the 5070, which is now here (unfortunately). Let’s get into the data for this.

This chart compares the percentage of the Flagship CUDA core count that each configuration is. Due to architectural changes, we’re not interested in the raw count of CUDA cores, but the percentage occupancy of the maximum config for the flagship die.

Our chart plot tracks each same-named GPU class across NVIDIA generations on a percentage scale representing how many CUDA cores each has relative to a larger configuration. The GPUs are all shown relative to the CUDA core count of that generation’s top gaming, non-Titan GPU – the 5090, 4090, 2080 Ti (watch our review), 1080 Ti (read our revisit), and so on – which we’re calling the Flagship-class. If you see 100% anywhere, that means it’s equal in CUDA core count to the flagship.

We went with the 3090 (watch our review) for the 30 series rather than the late-arriving, cash-grab, full die 3090 Ti (watch our review). We had to make a judgment call.

The Flagship-class is plotted relative to the largest die’s maximum possible core count. The GTX 780 Ti (watch our revisit) is one of the few exceptions where NVIDIA made a flagship with the full non-cut-down die.

The GTX 780 (watch our revisit) had 80% of the CUDA cores that the flagship 780 Ti did. The RTX 2080 (watch our review) brought that down to 68% of full CUDA count, then just 59% for the 4080, but it gets worse. The 5080 is a mockery of an 80-class card with only 49% of the flagship-class CUDA count configuration. We don’t care if the die is different or not for this chart, just the config.
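The 80-class percentages quoted here are plain ratios of CUDA core counts, so they are easy to sanity-check. The sketch below recomputes them. Note that the core counts are assumptions taken from public spec sheets rather than from this article, and the function and variable names are purely illustrative.

```python
# CUDA core counts are ASSUMED from public spec sheets, not from this article.
CUDA_CORES = {
    "GTX 780": 2304, "GTX 780 Ti": 2880,
    "RTX 2080": 2944, "RTX 2080 Ti": 4352,
    "RTX 4080": 9728, "RTX 4090": 16384,
    "RTX 5080": 10752, "RTX 5090": 21760,
}

# (80-class card, that generation's flagship) pairs discussed in the text.
PAIRS = [
    ("GTX 780", "GTX 780 Ti"),
    ("RTX 2080", "RTX 2080 Ti"),
    ("RTX 4080", "RTX 4090"),
    ("RTX 5080", "RTX 5090"),
]


def share_of_flagship(card, flagship):
    """Return the card's CUDA core count as a percentage of the flagship's."""
    return 100.0 * CUDA_CORES[card] / CUDA_CORES[flagship]


for card, flagship in PAIRS:
    print(f"{card}: {share_of_flagship(card, flagship):.0f}% of {flagship}")
```

Under those assumed core counts, the rounded results line up with the 80%, 68%, 59%, and 49% figures in the chart.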

The 3080 temporarily bucked the trend at 83%, which was great. This correlates with its extremely good value and performance at launch and its positive reviews. Back in our review of the 3080, we said, “The card performance overall is impressive. It’s a big recovery from the 20 series when we reviewed it and called 3 of the cards a waste of our time because they were 1080 Tis and then complained for 55 days about how there was no RTX and the cards were named RTX. So this was a big turnaround for NVIDIA.”

That also, however, aligns with the reviews we and others gave to the 3090 and 3090 Ti. For example, in our 3090 Ti review, we stated, “For us, hard pass on this. 8-12% for $2,200 is insane.”

And with an overclock back then, we were able to nearly match the 3090’s performance with the OC 3080. That’s how close they were.

The odd 80 Ti/Super class from the 20 series to 40 series occupies the space between the 80 class and the flagships. There’ll likely be another between the 5080 and 5090.

The Super refreshes really should be called the “oopsies” edition GPUs. NVIDIA rolls these out when they make an “oopsies” on price and public sentiment, using Supers to meet halfway on price. Our hope is that the 5080 and 5090 gap ends up again as an “oops, let’s fix this” rally from NVIDIA with a mix of the 2080 Super’s (watch our review) or 4080 Super’s relatively sane pricing along with the 3080 Ti’s (watch our review) aggressive configuration. That might start to help fix this a little bit.

The 70 Ti/Super class drops hard. The 1070 Ti (watch our review) had a 68% CUDA core configuration for this class, falling to just 41% for the 5070 Ti (read our review). By this logic, the 1070 Ti offered far more GPU relative to the 1080 Ti than the 5070 Ti does relative to the 5090.

Next, we’ll expose NVIDIA’s grand switcheroo between the 70 and 80 class GPUs.

From the 770 to the 3070 (watch our review), the CUDA core count of the 70-class cards reliably sat between 53% and 59% of the flagship’s CUDA core count.

Then the 4070 bore only 36% of the CUDA cores the 4090 had, and the card falling in the 50-60% range was now the 4080.

Moving into the present, the RTX 5070 has an anemic 28% of the flagship’s configuration. If you were to extend the 70-class line out on its previous trend, you’d arrive around the same place as where the 80-class is now. Strictly speaking in proportions, and if we want to do funny percent math, the 3070’s (watch our review) core allocation relative to its respective flagship was 100% higher than the 5070’s.
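The “funny percent math” is a relative difference between two shares, which is not the same thing as the gap in percentage points. Here is a minimal sketch using the article’s rounded shares of 56% for the 3070 and 28% for the 5070. The function name is made up for illustration.

```python
def relative_increase(larger_pct, smaller_pct):
    """How much bigger larger_pct is than smaller_pct, as a percent of smaller_pct."""
    return 100.0 * (larger_pct - smaller_pct) / smaller_pct


share_3070 = 56.0  # rounded share of its flagship's CUDA cores, per the article
share_5070 = 28.0  # rounded share of its flagship's CUDA cores, per the article

point_gap = share_3070 - share_5070
relative_gap = relative_increase(share_3070, share_5070)

print(f"Gap in percentage points: {point_gap:.0f}")
print(f"Relative difference: {relative_gap:.0f}% higher")
```

The gap is 28 percentage points, but because the 5070’s share is the baseline, that works out to a 100% relative difference.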

The 60-class is where it gets really bad.

That 28% figure for the 5070 is lower than almost every 60 class configuration. The 60 class traditionally occupied the 30-40% range with a high outlier in the 20 series at 44%. This tracks with the fact that we were softly positive on the 2060 at its launch – more positive than on the 2080. The 3060 returned to the low 30% range, but the 4060 got slashed to 19%. Here’s what we said of the 4060’s worse cousin, the 4060 Ti: “The RTX 4060 Ti 8GB is one of the worst GPU launches from NVIDIA that we’ve ever covered.”

And that brings us to the 50-class. The 19% on the 4060 is where the 50-class has sat multiple times. NVIDIA covered this segment during the 20 series with the 16 series GPUs, which we didn’t plot for the sake of simplicity. Moving forward, the 3050 was a 24% configuration, and it’s no wonder why the 4050 got canned – it would be like scraping the bottom of the barrel so hard that you just get splinters.

But what’s crazy is that the 5070 barely clears the 27% config of the old GTX 950. That’s just sad.

Now that we’ve established the trends, let’s keep all of that in mind and analyze pricing in the same way.

Chart – Inflation Adjusted Prices

This line plot tracks the launch price of all the same GPUs in each class, adjusted for inflation from the month of each GPU’s launch to January 2025.

Immediately, we see that the flagship class has changed massively. The 780 Ti, 980 Ti (watch our revisit), and legendary 1080 Ti fall within a consistent $100 spread. The 980 Ti was slightly cheaper at a $650 launch price, which is $865 after the inflation adjustment. The 1080 Ti sits at $912, in stark contrast to the massive jump of the 2080 Ti at $1,510 adjusted. That’s a 66% cost increase gen-on-gen for the customer. The 2080 Ti was technically available for $1,000, but in very limited quantities, and the vast majority went for $1,200, which is what we adjusted from.
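To make the arithmetic behind these figures explicit, the sketch below backs out the cumulative inflation factor implied by each launch price and adjusted price pair, then recomputes the gen-on-gen jump. It uses only numbers quoted in this article (the $700 launch price for the 1080 Ti is the one cited in the conclusion). The function names are illustrative, and this is not necessarily how the chart itself was produced.

```python
def implied_inflation_factor(launch_price, adjusted_price):
    """Cumulative inflation multiplier implied by a launch/adjusted price pair."""
    return adjusted_price / launch_price


def percent_increase(old_price, new_price):
    """Percent change going from old_price to new_price."""
    return 100.0 * (new_price - old_price) / old_price


# (launch price, January 2025 adjusted price), as quoted in the article.
FLAGSHIPS = {
    "980 Ti": (650, 865),
    "1080 Ti": (700, 912),
    "2080 Ti": (1200, 1510),  # adjusted from the common $1,200 street price
}

for name, (launch, adjusted) in FLAGSHIPS.items():
    factor = implied_inflation_factor(launch, adjusted)
    print(f"{name}: ${launch} -> ${adjusted} (x{factor:.2f})")

jump = percent_increase(FLAGSHIPS["1080 Ti"][1], FLAGSHIPS["2080 Ti"][1])
print(f"1080 Ti -> 2080 Ti, inflation adjusted: +{jump:.0f}%")
```

Comparing the adjusted prices rather than the launch prices keeps the comparison in the same January 2025 dollars, which is where the 66% figure comes from.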

The price went up again for the 3090, with slight relief in the 4090, before jumping again to the $2,000 mark with the 5090. It undoubtedly costs more for NVIDIA to make a 5090 than it did to make a 1080 Ti, but there’s no argument that more than double the retail price is painful for a consumer.

The 80 class has also risen, though not to the same extreme degree as the flagship class. The GTX era 80s had inflation-adjusted prices between $734 and $886. There was a slight bump to just over $1,000 in the 20 series, followed by relief to the mid-$800s on the 3080, before rising steeply again with the 4080.

When taken alongside the CUDA core configurations, all of this underscores both just how good the 3080 was and how terrible the 4080 was. The 3080 had a spike in core allocation and a return to “normal” pricing, whereas the 4080 fell off a core config cliff and the price went up at the same time. Coming to the present, the 5080 is back at the same relative price as the 2080, but at a much worse relative CUDA core count.

The 80 Ti/Super class is an oddball – as if NVIDIA hasn’t been able to decide whether it’s better as a later, better value 80 class alternative like in the 20 and 40 series, or if it should be a weirdly positioned, poor value cash-grab like the 3080 Ti.

The 70 Ti/Super class has risen in price across the generations that it’s existed, from roughly $500 at its introduction in the 10 series to $849 in the 40 series. AMD Radeon GPUs were competitive in this price bracket back in the 10 and 20 series days, which is likely the reason why we see this aggressive pricing during that time period. From the 30 series onward, NVIDIA’s dominance has allowed this class of card, specifically, to sit comfortably between the 80 and 70 classes.

The 70 class has managed to stay relatively flat from one end of the chart to the other. The all-time low price was in the GTX 900 series at $440, and the high point was the 20 series at about $750. That’s a significant swing, but it’s stayed relatively flat since then.

The 60 class paints a similar picture. The inflation adjusted price line is generally flat overall with a slight downward trajectory since the 20 series, but in that same time the core config has gone into the dumpster. We don’t know anything about a theoretical future 5060, but we’d bet it won’t be a pleasant addition to this data set.

Finally, the 50 class hasn’t seen much movement, but it hasn’t seen many releases lately – probably because the 4060 took its actual place. Judging by the 3050, NVIDIA is probably unwilling to launch a GPU for under $250 again, let alone the $145 mark of the 1050 (watch our review).

Further Segmentation

Over the years, the means of product segmentation have migrated. Product segmentation isn’t inherently an evil thing, especially in the world of silicon where the costs to make any of these products are enormous, but it can be used in ways which just don’t feel good as a consumer. Segmenting the 1080 Ti at 11GB versus the Titan cards at 12GB didn’t feel particularly bad. It was obvious what they were doing, but the affected user base was much smaller.

Some of the other ways NVIDIA has historically segmented its products include splitting double precision out into only the highest-end cards, which at one point included Titans. Another is forcing users over to Quadro for verified drivers as an additional layer of liability reduction for big organizations.

Neither of these two segmented features is noticeable to the vast majority of end users, so it doesn’t feel as bad to the consumer. Over time, the segmentation has increasingly drifted to VRAM, which now means there’s a new, growing class of affected users.

For the gaming audience, we get situations where a $750 video card can end up with unplayable stuttering and latency nearing 800 ms PCL due to VRAM overload and swapping.

Joining the scientific user base that once needed double precision, or now might need various machine learning capabilities, there’s now the segmented customer base of so-called “creators.” Not just YouTubers, but anyone making 3D art, games, or similar media.

These users are being pushed into the 90-class, which is further diminishing the capabilities of the highest-end gaming cards or pushing those high-end gaming users into the price categories of professionals who use their GPUs to make money. For those professionals, it’s easier to shrug off the price knowing the card will earn it back over time, even if it’s still unpleasant.

“Arbitrary” Naming

Back in our RTX 4080 Problem video, we talked about how all of this is predicated on the assumption that the names mean anything. Like Whose Line Is It Anyway, sometimes it feels like the names are made up and the prices don’t matter.

We’ve been open about our opinions on this changing over time: At one point, it did feel like names were somewhat arbitrary. It’s just a name, and it’s ultimately the specs and price that matter. But the shift came over the last couple of generations, where we came to realize that what’s in a name is important.

NVIDIA has used the 80-class cards to establish an expectation in customers, and regardless of whether NVIDIA intends it to still be perceived as the high-end versus some mid-range card (which it is now), the fact is that its users do perceive the 80 name as meant to be high-end.

This is sort of a death of the author situation, but then NVIDIA doesn’t want to name a $1,000 video card a “5070.” That creates new problems.

NVIDIA has died as the author, and the consumer is now in control of what these names mean. To quote someone in the industry, it’s the “perception of reality” versus the reality.

If NVIDIA wants to establish a reality where an 80-class card is half of a 90-class card, they can do that; however, if the end users perceive that a 5080 should be a true high-end device, that’s all that actually matters. NVIDIA is also responsible for this. The company spent a decade establishing the 80-class cards as the top-of-the-line, behind only the Ti class cards. It has now bifurcated these two lines and created a large gulf between them.

And this is getting worse with 5070 cards that are now more similar to older 50-class cards.

And so while the name itself is technically arbitrary as compared to the specs, the name matters. It defines an expectation.

Let’s explore that philosophy a bit more. If Toyota suddenly starts shipping rebadged Yugos that it calls Camrys, that’s going to cause problems with the customer base. That’s what NVIDIA is doing. If the AMC Gremlin is sold under literally any name, it’s going to cause problems.

The point is, the RTX 5080 is a Yugo. Or a Gremlin. Or a Ford Pinto. And NVIDIA has spent a decade branding it as a supercar (and it was at one point a supercar).

Conclusion

Zooming back out, we think the overall picture is clear. NVIDIA has downsized essentially all of its gaming GPUs in terms of relative configuration compared to each generation’s flagship. All of the lines go down. The chart from earlier had a lot of words to say one thing: Line go down = bad. We don’t want the line to go down. We want the line to stay the same or go up.

The 80 class is now in line with former 70 class GPUs and the 70 Ti/Super class is now in former 60 Ti class territory. The last 60 class card was configured like a 50-class of yore.

Some might argue that the 4090 and 5090 being such monsters skews the comparisons, but we think that’s more of a perception issue based on NVIDIA’s success at pushing the cost of the high-end higher. NVIDIA’s flagship GPUs have been very large pieces of silicon since the 20 series, and at that point the CUDA config cutting had barely begun and the MSRP wasn’t as high as it is now.

The price of NVIDIA’s GPUs has generally gone up over time, even accounting for inflation that does things like turn the old $700 1080 Ti into a $912 GPU in today’s money. But then you look at $900 GPUs in today’s money and that’s a 9070 XT. And the 9070 XT isn’t positioned where the 1080 Ti was. The closest GPU might be the 5080 at $1,000 and that also doesn’t feel like a 1080 Ti by value. Flagships, however, are the worst, rising from that level to $2,000 with the 5090. Non-flagships haven’t risen quite as much, but it’s still significant. In this case, line go up = bad. For the consumer, anyway.

The 70-series here is one of the most textbook examples of shrinkflation. While the price point has stayed fairly consistent for a few generations, remember that the relative CUDA core configuration has dropped by a huge amount during that time. It’s gone from $610 with a 56% configuration in the 30 series, down to $550 with an embarrassing 28% core configuration in the 50 series.

NVIDIA is giving you a half-size slice of the GPU pie with the 5070 compared to the 3070, but it’s charging you basically the same amount of money for the privilege.
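One rough way to put a number on that shrinkflation is to divide each card’s inflation-adjusted price by its share of the flagship configuration. The sketch below does that with the article’s figures ($610 at 56% for the 3070, $550 at 28% for the 5070). The “dollars per percentage point” metric and the function name are our own illustration, not something plotted in the charts.

```python
def dollars_per_share_point(adjusted_price, share_pct):
    """Inflation-adjusted dollars paid per percentage point of flagship CUDA config."""
    return adjusted_price / share_pct


cost_3070 = dollars_per_share_point(610, 56)  # 30 series, per the article
cost_5070 = dollars_per_share_point(550, 28)  # 50 series, per the article
ratio = cost_5070 / cost_3070

print(f"3070: ${cost_3070:.2f} per point of flagship config")
print(f"5070: ${cost_5070:.2f} per point of flagship config")
print(f"The 5070 costs about {ratio:.1f}x as much per point")
```

By this measure, the slightly lower sticker price does not come close to offsetting the halved configuration. The metric says nothing about absolute performance, only about each card’s position relative to its own flagship.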

All of the GPUs are victims of the configuration cutting we talked about. Even the technically-cheaper-than-they-used-to-be 70 and 60 class cards are providing less of a percentage of the capabilities of their respective flagships than they used to.

And AMD isn’t immune to this, of course. We have an entire article dedicated to the company’s fake MSRPs that delves into this. NVIDIA, however, holds 90% of the market, and it’s important for you to understand how your money is disproportionately losing value when it’s spent with NVIDIA versus a few years ago.

We don’t have an answer for this. It’s sort of too big, but it’s important to know about and to start thinking about. Maybe enough people will pay attention to this that it will help them make informed purchasing decisions.

