Thursday , 21 November 2024

Supercharging Turing? RTX Goes SUPER! – NVIDIA RTX 2070 SUPER & RTX 2060 SUPER Review

TU-104 GPU Block Diagram

Here we have it: the same GPU that powers the RTX 2080 is now fitted to the new RTX 2070 SUPER. This, of course, is not a full-blown TU-104, as the RTX 2070 SUPER has 40 SMs active while the RTX 2080 FE has 46. So, we should look at the RTX 2070 SUPER as a cut-down RTX 2080. One advantage of the RTX 2070 SUPER jumping up to the TU-104 PCB and GPU is that it now gains NVLink, which matters because some users were upset to see the XX70 series lose multi-GPU support.

 

TU-106 GPU Block Diagram (RTX 2060 SUPER)

The same theme follows on the RTX 2060 SUPER, which is now equipped with 34 SMs versus the 30 it had previously. The RTX 2070 still has 2 more SMs at 36, but the RTX 2060 SUPER also gets a nice boost beyond the extra SMs: 33% more memory, up to 8GB from 6GB. With the memory increase, the memory bus also widens to 256-bit from 192-bit.

New Streaming Multiprocessor (SM)

Here you can see the die layout of the new SM for the TU-102, TU-104, and TU-106. I am not sure if this will carry over to any future spins or GPU launches, but as of this launch, this is the layout showing the full build of the SM.

Each SM is made up of

  • 64 CUDA Cores
  • 8 Tensor Cores
  • 256KB register file
  • 4 texture units
  • 96KB of L1/shared memory
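
To put the per-SM figures in context, here is a minimal Python sketch that multiplies them by the active SM counts quoted earlier in this review. The class and function names (`SMResources`, `gpu_totals`) are made up for illustration, and the sketch only scales the per-SM numbers listed above; it does not model anything else on the die.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SMResources:
    """Per-SM resources as listed in this review (hypothetical container)."""
    cuda_cores: int = 64
    tensor_cores: int = 8
    register_file_kb: int = 256
    texture_units: int = 4
    l1_shared_kb: int = 96


# Active SM counts quoted in the text of this review.
ACTIVE_SMS = {
    "RTX 2060": 30,
    "RTX 2060 SUPER": 34,
    "RTX 2070": 36,
    "RTX 2070 SUPER": 40,
    "RTX 2080 FE": 46,
}


def gpu_totals(sm_count: int, sm: SMResources = SMResources()) -> dict:
    """Scale the per-SM resources by the number of active SMs."""
    return {
        "cuda_cores": sm_count * sm.cuda_cores,
        "tensor_cores": sm_count * sm.tensor_cores,
        "texture_units": sm_count * sm.texture_units,
    }


if __name__ == "__main__":
    for name, sms in ACTIVE_SMS.items():
        t = gpu_totals(sms)
        print(f"{name}: {sms} SMs, {t['cuda_cores']} CUDA cores, "
              f"{t['tensor_cores']} Tensor cores, {t['texture_units']} texture units")
```

Going by these figures, the 40 active SMs on the RTX 2070 SUPER work out to 2560 CUDA cores, against 2944 for the 46 SMs on the RTX 2080 FE.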

 

Deep Learning Super Sampling

DLSS builds on Nvidia NGX, Nvidia’s deep neural network framework, to accelerate graphics rendering by running deep-learning-based operations on the Turing Tensor Cores. The Tensor Cores accelerate Nvidia’s stored neural network data, including stored supersampling data, to offer effects similar to high levels of AA, which would normally load the GPU heavily and reduce framerates. Instead, DLSS takes a lower-quality, lower-resolution image and uses the Turing Tensor Cores to build a very high AA image from it, with much lower overhead on the shaders.

Here is a list from Nvidia of the upcoming games which will support DLSS or be updated to support it.

  1. Ark: Survival Evolved from Studio Wildcard
  2. Atomic Heart from Mundfish
  3. Dauntless from Phoenix Labs
  4. Final Fantasy XV from Square Enix
  5. Fractured Lands from Unbroken Studios
  6. Hitman 2 from IO Interactive/Warner Bros.
  7. Islands of Nyne: Battle Royale from Define Human Studios
  8. Justice (Ni Shui Han) from NetEase
  9. JX3 from Kingsoft
  10. Mechwarrior 5: Mercenaries from Piranha Games
  11. PlayerUnknown’s Battlegrounds from PUBG Corp.
  12. Remnant: From the Ashes from Gunfire Games/Perfect World Entertainment
  13. Serious Sam 4: Planet Badass from Croteam/Devolver Digital
  14. Shadow of the Tomb Raider from Square Enix/Eidos-Montréal/Crystal Dynamics/Nixxes
  15. The Forge Arena from Freezing Raccoon Studios
  16. We Happy Few from Compulsion Games / Gearbox
  17. Darksiders 3 by Gunfire Games/THQ Nordic
  18. Deliver Us The Moon: Fortuna by KeokeN Interactive
  19. Fear the Wolves by Vostok Games / Focus Home Interactive
  20. Hellblade: Senua’s Sacrifice by Ninja Theory
  21. KINETIK by Hero Machine Studios
  22. Outpost Zero by Symmetric Games / tinyBuild Games
  23. Overkill’s The Walking Dead by Overkill Software / Starbreeze Studios
  24. SCUM by Gamepires / Devolver Digital
  25. Stormdivers by Housemarque

Ray Tracing (RTX)

With Turing, as you saw above in the die map, the die is skirted by RT cores, which enable a world first: real-time ray tracing, something that was hinted to still be 10 years away just a short time ago. The cores are only part of the package, as real-time ray tracing also requires Nvidia’s RTX technology along with support for DirectX Raytracing (DXR), Nvidia OptiX, and Vulkan ray tracing, so that no matter the game engine there is likely to be a ray tracing opportunity to give a more immersive gaming environment.

Here is a list of the confirmed upcoming games which will be RTX enabled.

  1. Assetto Corsa Competizione from Kunos Simulazioni/505 Games
  2. Atomic Heart from Mundfish
  3. Battlefield V from EA/DICE
  4. Control from Remedy Entertainment/505 Games
  5. Enlisted from Gaijin Entertainment/Darkflow Software
  6. MechWarrior 5: Mercenaries from Piranha Games
  7. Metro Exodus from 4A Games
  8. Shadow of the Tomb Raider from Square Enix/Eidos-Montréal/Crystal Dynamics/Nixxes
  9. Justice (Ni Shui Han) from NetEase
  10. JX3 from Kingsoft
  11. Project DH by Nexon

Hybrid Rendering (RTX-OPS)

With Turing, we have now seen the introduction of not just your normal SM but also RT cores for ray tracing and Tensor cores for AI. This enables a new hybrid rendering method where, as mentioned previously, RT cores are used for lighting workloads and Tensor cores are used for AI calculations that accelerate rendering (along with other features I’m sure are to come), all alongside traditional rasterization.

Obviously, not all of these will be in use all the time, so Nvidia ran some very deep mathematical calculations to show how RTX-OPS are calculated. Since I am quite sure most of you would not care about that, I’m not going to dig too deep into it, but I will add it below for your reference, or for those like me who geek out on that kind of stuff.

The above is a visual representation of the hybrid rendering (RTX-OPS) calculation you will find below.

To compute RTX-OPS, the peak operations of each type are derated based on how often that type is used. In particular:
– Tensor operations are used 20% of the time
– CUDA cores are used 80% of the time
– RT cores are used 40% of the time (half of 80%)
– INT32 pipes are used 28% of the time (35% of 80%)

As a formula: RTX-OPS = TENSOR * 20% + FP32 * 80% + RTOPS * 40% + INT32 * 28%

The above is an illustration of the peak operations of each type for the RTX 2080 Ti. Plugging in those peak operation counts results in a total RTX-OPS number of 78: 14 * 80% + 14 * 28% + 100 * 40% + 114 * 20% = 11.2 + 3.92 + 40 + 22.8 = 77.92, which rounds to 78.
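
For those who want to play with the numbers, the weighted sum above can be sketched in a few lines of Python. The function name `rtx_ops` and the `WEIGHTS` table are my own naming for illustration; the weights and the 2080 Ti peak figures are simply the ones quoted in this section.

```python
# Usage weights quoted above: how often each unit type is assumed to be busy.
WEIGHTS = {
    "tensor": 0.20,
    "fp32": 0.80,
    "rt": 0.40,     # half of the 80% FP32 time
    "int32": 0.28,  # 35% of the 80% FP32 time
}


def rtx_ops(fp32: float, int32: float, rt: float, tensor: float) -> float:
    """Derate each peak figure by its usage weight and sum the results."""
    return (
        tensor * WEIGHTS["tensor"]
        + fp32 * WEIGHTS["fp32"]
        + rt * WEIGHTS["rt"]
        + int32 * WEIGHTS["int32"]
    )


if __name__ == "__main__":
    # Peak figures quoted above for the 2080 Ti.
    total = rtx_ops(fp32=14, int32=14, rt=100, tensor=114)
    print(f"RTX-OPS: {total:.2f} (rounds to {round(total)})")
```

Swapping in the peak figures for another card gives its RTX-OPS rating under the same assumed usage mix.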

GPU Boost 4

GPU Boost 3

This is how a typical previous-gen GPU Boost 3 implementation would work. As you can see, the adjustment takes you straight across based on a power target/limit which you set within the 3rd-party app (Precision/Afterburner/etc.) of your choice. However, the control was quite limited in terms of granularity. This is because the GPU Boost 3 implementation, while a good solution, was mostly hidden in the driver, away from the user’s ability to really adjust it, with the exception of the target sliders.

GPU Boost 4

GPU Boost 4 is a completely different animal, as it allows you to set steps: instead of dropping straight down to base clock when thermal limits are hit, the card settles at a lower boost clock plateau, giving it a chance to cool off at a higher speed than base clock. This, in turn, means more consistent control of the performance, thermals, and acoustic characteristics of your GeForce card.
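
To illustrate the difference, here is a toy Python model of the two behaviors as described above. This is only a sketch and not Nvidia’s actual algorithm: the function names, clock speeds, and temperature thresholds are all hypothetical numbers picked for the example.

```python
def boost3_clock(temp_c: float, base_mhz: int, boost_mhz: int, limit_c: float) -> int:
    """Toy Boost 3 model: full boost until the limit, then straight to base clock."""
    return boost_mhz if temp_c < limit_c else base_mhz


def boost4_clock(temp_c: float, base_mhz: int, boost_mhz: int,
                 steps: list[tuple[float, int]]) -> int:
    """Toy Boost 4 model: step down through user-defined plateaus.

    `steps` is a list of (temperature threshold, clock) pairs sorted by
    ascending threshold; the highest threshold reached decides the clock.
    """
    clock = boost_mhz
    for threshold_c, step_mhz in steps:
        if temp_c >= threshold_c:
            clock = step_mhz
    return max(clock, base_mhz)  # never fall below base clock


if __name__ == "__main__":
    BASE, BOOST = 1500, 1800                       # hypothetical clocks in MHz
    STEPS = [(75, 1700), (83, 1600), (90, 1500)]   # hypothetical plateaus
    for temp in (60, 78, 85, 92):
        b3 = boost3_clock(temp, BASE, BOOST, limit_c=75)
        b4 = boost4_clock(temp, BASE, BOOST, STEPS)
        print(f"{temp} C: Boost 3 -> {b3} MHz, Boost 4 -> {b4} MHz")
```

In this toy model, at 78 C the Boost 3 style card is already back at base clock, while the Boost 4 style card is holding its first plateau.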
