Nvidia: The Geforce GTX 1080 Graphics Card Can Do Asynchronous Compute

Usman Pirzada • May 8, 2016 at 11:07am EDT

The Async Compute problem is probably one of the most controversial issues surrounding the older generation of Geforce graphics cards from Nvidia. Something very interesting, however, is present in the press release that they sent out to, well, the press. According to the official statement, the GTX 1080 is fully capable of performing Async Compute. If this turns out to be true, then this will give negate a major edge that Radeon graphics cards from AMD have enjoyed this past year.

Nvidia: GTX 1080 is capable of Async Compute

Keep in mind however, that even Maxwell featured Asynchronous Compute on paper. Unfortunately, due to the fact that expensive software based context switching had to be employed before it could be used, (since it did not have a dedicated hardware scheduler like AMD's GCN) resulted in lowered performance on Maxwell based graphics cards. Nvidia's style had been a technique called preemption, which it had perfected to an impressive degree. The reviews are going to be out on May 27th and if independent reviews confirm this fact, than it will be a huge win for the green camp.

Asynchronous compute has been a deal sweetener for Radeon buyers ever since the DirectX 12 API hit the stage. AMD currently leads in Hitman and AOTS which utilize their Asynchronous shader technology developed around DirectX 12 API. Interestingly, Nvidia GPUs historically perform much better without ASync turned on. This is probably due to the fact that Nvidia had apparently disabled ASync from their driver suite. The rationale given for that move is that its GPUs cannot process ASync concurrently on the hardware level, rather they need context switching which is expensive in terms of frame rate.

The Async Compute Story Distilled Down To Its Core

Async Compute has been a hot subject of debate ever since gamers became aware of its very existence. We dove deep a couple of months ago into this peculiar DirectX 12 feature in our two thousand word analysis piece dubbed “AMD’s Secret DirectX 12 Weapon That Nvidia Had To Trade Off – Demystifying Async Compute“. We explained the inherent architectural differences between Nvidia and AMD graphics cards and distilled the key reasons as to why they deal and perform so differently with asynchronous game code. We’d highly recommend giving it a read if you’re looking to wrap your head around this topic and get down to the core of the issue before proceeding.

The following is the relevant extract from the press release:

Five Marvels of Pascal: NVIDIA engineered the Pascal architecture to handle the massive computing demands of technologies like VR. It incorporates five transformational technologies:

Next-Gen GPU Architecture. Pascal is optimized for performance per watt. The GTX 1080 is 3x more power efficient than the Maxwell Architecture.

16nm FinFET Process. The GTX 1080 is the first gaming GPUs designed for the 16nm FinFET process, which uses smaller, faster transistors that can be packed together more densely. Its 7.2 billion transistors deliver a dramatic increase in performance and efficiency.

Advanced Memory. Pascal-based GPUs are the first to harness the power of 8GB of Micron's GDDR5X memory. The 256-bit memory interface runs at 10Gb/sec., helping to drive 1.7x higher effective memory bandwidth than that delivered by regular GDDR5.

Superb Craftsmanship. Increases in bandwidth and power efficiency allow the GTX 1080 to run at clock speeds never before possible -- over 1700 MHz -- while consuming only 180 watts of power. "New asynchronous compute advances improve efficiency and gaming performance." And new GPU Boost™ 3 technology supports advanced overclocking functionality.

Groundbreaking Gaming Technology. NVIDIA is changing the face of gaming from development to play to sharing. New NVIDIA VRWorks™ software features let game developers bring unprecedented immersiveness to gaming environments. NVIDIA's Ansel™ technology lets gamers share their gaming experiences and explore gaming worlds in new ways.

Async Compute on the GTX 1080 will allow developers to execute some tasks that would otherwise be allocated to the CPU, on the GPU. This means that if a game is being CPU-bound (that is to say the CPU is the bottlenec present), it will drastically increase frame rates. It may even improve performance in games that are GPU bound, by allowing full use of the GPUs resources. The thing we have to keep in mind however that Preemption and Asynchronous compute are both different approaches to achieve the same end result: maximizing the utilization of a GPU. And while AMD will have you believe Async is drastically superior choice, badly implemented Async will fare much worse than properly implemented preemption.

Due to the pressure exerted by the industry to make Async compatible graphics cards however, Nvidia has been working actively to implement Async in their GPUs but were held back due to the fact that this was something that had to be implemented at a hardware level. Chip design usually takes a lot of time (in the lieu of many years) and if Nvidia has actually managed to properly implement Async in the Pascal based GTX 1080 - that would be quite an accomplishment. Something their CEO stated a few weeks back (regarding the P100 being capable of advanced preemption) made us think that Pascal might stick with the Preemption approach for now, but the press release from Nvidia states otherwise. So consider us pleasantly surprised!

Nvidia Geforce 'Pascal' GP100 Compute Specifications

GPU	Kepler GK110	Maxwell GM200	Pascal GP100
Compute Capability	3.5	5.3	6.0
Threads / Warp	32	32	32
Max Warps / Multiprocessor	64	64	64
Max Threads / Multiprocessor	2048	2048	2048
Max Thread Blocks / Multiprocessor	16	32	32
Max 32-bit Registers / SM	65536	65536	65536
Max Registers / Block	65536	32768	65536
Max Registers / Thread	255	255	255
Max Thread Block Size	1024	1024	1024
CUDA Cores / SM	192	128	64
Shared Memory Size / SM Configurations (bytes)	16K/32K/48K	96K	64K

About the author: PC Hardware and Technology Enthusiast, Blood of Silicon (1 nm),

Follow Wccftech on Google to get more of our news coverage in your feeds.

Read all comments on Nvidia: The Geforce GTX 1080 Graphics Card Can Do Asynchronous Compute

Nvidia: The Geforce GTX 1080 Graphics Card Can Do Asynchronous Compute

Nvidia: GTX 1080 is capable of Async Compute

The Async Compute Story Distilled Down To Its Core

Nvidia Geforce 'Pascal' GP100 Compute Specifications

Trending Stories

NVIDIA’s GeForce RTX 5070 Ti SUPER – Specs, Performance, And Price, Everything We Know So Far

Trump Mobile Wants To Entice You To Buy The “Yellow Plastic” T1 Phone By Offering A Free Charging Brick

Cygames Revives Project Awakening a Decade After Reveal, Ditching Its Own Engine for Unreal Engine 5

MRDIMM’s Allow DDR5 Memory To Keep Up With Next-Gen Servers, Achieving DDR6-Class Bandwidth & No Pin-Change

Intel EMIB-T Breaks Past Existing AI & HPC Scaling Limits, Enabling Ultra-Large Die Complexes With Over 10x Reticle Dies & 12 Gb/s+ HBM4e DRAM

Popular Discussions

AMD Prepares For Zen 6 EPYC CPUs Launch For July 22nd-23rd, Confirms AMD’s Mark Papermaster

Intel’s Shot At Fabricating Apple’s A20 Chip For The Base iPhone 18 Collapses As A Credible Leaker Calls The Original Source A ‘Blowhard’

AMD’s Next-Gen Medusa Point “10-Core” CPU Beats Strix “10-Core” By 29% In Single-Core & 22% In Multi-Core While Running At Just 2.0 GHz

NVIDIA’s RTX 3060 12 GB Graphics Card Comeback Proves Just How Bad Things Are For The PC Gaming Market

AMD Ryzen Becomes The Top CPU Choice While Radeon Powers 1 In Every 3 Desktop Gaming GPUs Sold at Microcenter

Nvidia: The Geforce GTX 1080 Graphics Card Can Do Asynchronous Compute

Nvidia: GTX 1080 is capable of Async Compute

The Async Compute Story Distilled Down To Its Core

Nvidia Geforce 'Pascal' GP100 Compute Specifications

Further Reading

Trending Stories

Popular Discussions