Pictures and performance of NVIDIA’s upcoming RTX 3060 Ti graphics card have leaked out from Videocardz (who else). NVIDIA’s RTX 3060 Ti graphics card is going to feature 4864 CUDA cores and is going to be faster than the older RTX 2080 SUPER and priced in at just $399 USD. The card is expected to launch in early December and should make NVIDIA’s mid-spectrum lineup killer.
NVIDIA RTX 3060 Ti Pictures And Performance Leaked, landing on December 2, 2020
The NVIDIA GeForce RTX 3060 Ti will feature GA104-200 GPU with 4864 CUDA cores and 8GB of GDDR6 memory clocked at 14 Gbps. The bus width will be 256-bits for a total bandwidth of 448 GB/s of bandwidth. Since the board will require less power, the TBP should be around 200W with the reference design clocking in at around 180W. This is a 5W increase over the RTX 2070 at 175W and in line with how we expect the Samsung node to fair in terms of power efficiency when compared to TSMC’s nodes. It is expected to have an MSRP of $399 and will launch on December 2.
The Founder’s edition variant features the now-familiar metallic black aesthetic of the RTX 3080 and RTX 3090 but both fans are on the top of the card with a small cutout in the backplate visible in the back. The overall shape and form of the founder’s edition is very consistent with the RTX 3000 series. The fan towards the end of the card is going to be pushing the air through the heatsink and out the bottom – which is an interesting design philosophy when compared to graphics cards from earlier in this decade.
As far as benchmarks go, the RTX 3060 Ti is faster than the RTX 2080 SUPER and should trade blows with the RTX 2080 Ti. It is faster not only in shader performance but rendering (compute) and ray-tracing capabilities as well and completely demolishes the older RTX 2060 SUPER it replaces. Of course as with any benchmark comparison, always keep in mind that you should wait for third-party results before making any decision.
Keep in mind that the reason NVIDIA’s RTX 3090 and RTX 3080 cards are not scaling linearly in performance is due to bottlenecking by game engines and binaries that are not designed to handle this much amount of cores. We know that the hardware and driver stack scales linearly because of the benchmark performance of these cards in software like vRAY and Octane – which are designed to handle a huge amount of graphics power. This means that as you scale down cores, performance will not scale down linearly. In fact, I fully expect the RTX 3060 Ti to beat the RTX 2080 clock for clock! Assuming slightly less clocks than the RTX 2080 Ti, it should easily trade blows with the former flagship.
Yields for the initial few batches of the Samsung 8nm node are lower than expected and expensive. The company expects the yields (and therefore prices for wafers) to significantly improve in the coming months. To protect its shareholders, the company has been cautious about its orders of the first few batches and has placed significantly higher volume orders for later batches. This problem was further exacerbated by bots scraping up the entire volume from the get-go. We expect the bulk orders to start hitting by the end of the year (assuming, once again, that miners don’t gobble it all up of course).
NVIDIA GeForce RTX 30 Series ‘Ampere’ Graphics Card Specifications:
Graphics Card Name | NVIDIA GeForce RTX 3050 | NVIDIA GeForce RTX 3050 Ti | NVIDIA GeForce RTX 3060 | NVIDIA GeForce RTX 3060 Ti | NVIDIA GeForce RTX 3070 | NVIDIA GeForce RTX 3070 Ti? | NVIDIA GeForce RTX 3080 | NVIDIA GeForce RTX 3080 Ti? | NVIDIA GeForce RTX 3090 |
---|---|---|---|---|---|---|---|---|---|
GPU Name | Ampere GA107 | Ampere GA106? | Ampere GA106? | Ampere GA104-200 | Ampere GA104-300 | Ampere GA102-150 | Ampere GA102-200 | Ampere GA102-250 | Ampere GA102-300 |
Process Node | Samsung 8nm | Samsung 8nm | Samsung 8nm | Samsung 8nm | Samsung 8nm | Samsung 8nm | Samsung 8nm | Samsung 8nm | Samsung 8nm |
Die Size | TBA | TBA | TBA | 395.2mm2 | 395.2mm2 | 628.4mm2 | 628.4mm2 | 628.4mm2 | 628.4mm2 |
Transistors | TBA | TBA | TBA | 17.4 Billion | 17.4 Billion | 28 Billion | 28 Billion | 28 Billion | 28 Billion |
CUDA Cores | 2304 | 3584 | 3840 | 4864 | 5888 | 7424 | 8704 | 10496 | 10496 |
TMUs / ROPs | TBA | TBA | TBA | 152 / 80 | 184 / 96 | 232 / 80 | 272 / 96 | 328 / 112 | 328 / 112 |
Tensor / RT Cores | TBA | TBA | TBA | 152 / 38 | 184 / 46 | 232 / 58 | 272 / 68 | 328 / 82 | 328 / 82 |
Base Clock | TBA | TBA | TBA | 1410 MHz | 1500 MHz | TBA | 1440 MHz | TBA | 1400 MHz |
Boost Clock | TBA | TBA | TBA | 1665 MHz | 1730 MHz | TBA | 1710 MHz | TBA | 1700 MHz |
FP32 Compute | TBA | TBA | TBA | 16.2 TFLOPs | 20 TFLOPs | TBA | 30 TFLOPs | TBA | 36 TFLOPs |
RT TFLOPs | TBA | TBA | TBA | 32.4 TFLOPs | 40 TFLOPs | TBA | 58 TFLOPs | TBA | 69 TFLOPs |
Tensor-TOPs | TBA | TBA | TBA | TBA | 163 TOPs | TBA | 238 TOPs | TBA | 285 TOPs |
Memory Capacity | 4 GB GDDR6? | 6 GB GDDR6? | 6 GB GDDR6? | 8 GB GDDR6 | 8 GB GDDR6 | 10 GB GDDR6X? | 10 GB GDDR6X | 20 GB GDDR6X | 24 GB GDDR6X |
Memory Bus | 128-bit | 192-bit? | 192-bit? | 256-bit | 256-bit | 320-bit | 320-bit | 320-bit | 384-bit |
Memory Speed | TBA | TBA | TBA | 14 Gbps | 14 Gbps | TBA | 19 Gbps | 19 Gbps | 19.5 Gbps |
Bandwidth | TBA | TBA | TBA | 448 Gbps | 448 Gbps | TBA | 760 Gbps | 760 Gbps | 936 Gbps |
TGP | 90W? | TBA | TBA | 180W? | 220W | 320W? | 320W | 320W | 350W |
Price (MSRP / FE) | $149? | $199? | $299? | $399 US? | $499 US | $599 US? | $699 US | $899 US? | $1499 US |
Launch (Availability) | 2021? | 2021? | 2021? | November 2020? | 29th October | Q4 2020? | 17th September | January 2021? | 24th September |