Vik


Cortex-M7 Launches: Embedded, IoT and Wearables

Last week, I had the distinct pleasure of visiting ARM’s Austin Texas campus for a meeting with Vice President of CPU Product Marketing Nandan Nayampally. The topic of discussion: ARM’s next Cortex-M processor, codename Pelican, is offic…

Cortex-M7 Launches: Embedded, IoT and Wearables

Last week, I had the distinct pleasure of visiting ARM’s Austin Texas campus for a meeting with Vice President of CPU Product Marketing Nandan Nayampally. The topic of discussion: ARM’s next Cortex-M processor, codename Pelican, is offic…

NVIDIA Announces SLI LED Bridges

NVIDIA Announces SLI LED Bridges

For those that just can’t get enough bling in their cases, NVIDIA today announced the availability of SLI LED bridges, available for the time being exclusively through the NVIDIA Store. Besides the obvious addition of LED lighting to the bridges, it’s also worth noting that these are rigid bridges so the cards need to be in specific slots. There are three bridges available, two for 2-way SLI, one for systems where the cards are two slots apart and one for systems where the cards are three slots apart, with a third bridge for 3-way SLI with the cards spaced two slots apart.

The new LED SLI bridges were recently on display at NVIDIA’s Game24 event, but this is the first time the bridges have been made available to the public. There are some caveats, unfortunately: for the time being the LED bridges are only available in the US/Canada at the NVIDIA Store. I suspect there will be some enterprising individuals that buy the bridges and then put them up on eBay or similar sites, but the other drawback is that the cost of the bridges is rather high already, so paying extra for non-North American markets will simply add to the price.

Speaking of which, the 2-way SLI LED bridges are priced at $29.99 while the 3-way SLI LED bridge will cost $39.99. That’s not really very different from EVGA’s existing illuminated 2-way SLI bridge and there’s now a 3-way LED offering, but it’s over four times the price of a normal 2-way or 3-way SLI bridge. Such is the price of bling.

As far as compatibility, the bridges are designed to work with all modern GTX cards with SLI support. NVIDIA specifically mentions the following products as being supported: GeForce GTX 770, GTX 780, GTX 780 Ti, GTX TITAN, GTX TITAN Black, GTX 970 and GTX 980. These are for NVIDIA reference designs, so the bridges may not work on cards with custom cooling solutions. GeForce Experience 1.7 or later is also required for the LED Visualizer, which allows control of the LEDs.

NVIDIA Announces SLI LED Bridges

NVIDIA Announces SLI LED Bridges

For those that just can’t get enough bling in their cases, NVIDIA today announced the availability of SLI LED bridges, available for the time being exclusively through the NVIDIA Store. Besides the obvious addition of LED lighting to the bridges, it’s also worth noting that these are rigid bridges so the cards need to be in specific slots. There are three bridges available, two for 2-way SLI, one for systems where the cards are two slots apart and one for systems where the cards are three slots apart, with a third bridge for 3-way SLI with the cards spaced two slots apart.

The new LED SLI bridges were recently on display at NVIDIA’s Game24 event, but this is the first time the bridges have been made available to the public. There are some caveats, unfortunately: for the time being the LED bridges are only available in the US/Canada at the NVIDIA Store. I suspect there will be some enterprising individuals that buy the bridges and then put them up on eBay or similar sites, but the other drawback is that the cost of the bridges is rather high already, so paying extra for non-North American markets will simply add to the price.

Speaking of which, the 2-way SLI LED bridges are priced at $29.99 while the 3-way SLI LED bridge will cost $39.99. That’s not really very different from EVGA’s existing illuminated 2-way SLI bridge and there’s now a 3-way LED offering, but it’s over four times the price of a normal 2-way or 3-way SLI bridge. Such is the price of bling.

As far as compatibility, the bridges are designed to work with all modern GTX cards with SLI support. NVIDIA specifically mentions the following products as being supported: GeForce GTX 770, GTX 780, GTX 780 Ti, GTX TITAN, GTX TITAN Black, GTX 970 and GTX 980. These are for NVIDIA reference designs, so the bridges may not work on cards with custom cooling solutions. GeForce Experience 1.7 or later is also required for the LED Visualizer, which allows control of the LEDs.

Chipworks Disassembles Apple's A8 SoC: GX6450, 4MB L3 Cache & More

Chipworks Disassembles Apple’s A8 SoC: GX6450, 4MB L3 Cache & More

One of the more enjoyable rituals with Apple’s annual iPhone launch is the decapping, deconstruction, and photographing of the processor die at the heart of Apple’s newest SoC.  While we can learn a lot about the SoC from software, for some things there’s just no replacement for looking at the hardware itself and counting the functional blocks present. And this year, as in past years, the honor of being the first to tear apart the SoC goes to Chipworks.

For determining the layout of A8, Chipworks reached out to us to solicit our input on their die shot, and after some rounds of going back and forth we believe we’ve come to a solid determination of some of A8’s features and how it has been configured. So let’s dive in.

First and foremost we’ll start with A8’s GPU, as this was one of the hardest elements to analyze in software. Based on Apple’s 50% performance improvement we had previously speculated that A8 contained an Imagination PowerVR GX6650. However as we noted back then, a die shot would reveal all, and right on schedule it has.

A close analysis of the die shot makes it clear that there are only 4 GPU cores available and not 6, which immediately rules out the 6 core GX6650 we were previously expecting. Instead with 4 cores present this is conclusive proof that Apple is using the smaller 4 core GX6450 on A8, the direct successor to the G6430 used on the A7. GX6450 induces some performance optimizations along with some feature updates – including ASTC support, which Apple’s documentation has already confirmed is present – so its inclusion here is a natural progression for Apple.

On A8 and its 20nm process this measures at 19.1mm2, versus A7’s 22.1mm2 G6430. As a result Apple is saving some die space compared to A7, but this is being partially offset by the greater complexity of GX6450 and possibly additional SRAM for larger caches on the GPU. Meanwhile looking at the symmetry of the block, it’s interesting that the blocks of texturing resources that every pair of GPU cores share is so visible and so large. With these resources being so big relative to the GPU cores themselves, you can see why Imagination would want to share them as opposed to building them 1:1 with the GPU cores.

Meanwhile opposite the GPU we have the CPU block. Unlike the GPU the CPU block has seen some significant shrinking, which Chipworks estimates is down from 17.1mm2 in A7 to 12.2mm2 in A8. In A7 Cyclone did not lend itself to easily picking apart the individual CPU cores, and neither does the CPU here in A8. We’ll be looking at the new CPU’s architecture in-depth in our iPhone 6 review, but for now it’s safe to say that while this is definitely derived from Cyclone, Apple has added a few tweaks over the last year that make it an even more potent CPU than the first Cyclone. Meanwhile based on this die shot Chipworks believes that the L2 cache has been reorganized to a per-core design, as there is no obvious single block of L2 on A8 like there was A7.


A8 With PoP DRAM Removed

The final major identifiable block on A8 is once again the SRAM cache memory. On A7 we discovered that this block was 4MB and was responsible for servicing the GPU and CPU. On A8 this block is similarly present and serving the same role. This 4MB of SRAM ends up being quite big despite the shrink from 28nm to 20nm, and while at first glance it seems like it should be larger than 4MB given the relative size, in practice what has happened is that the individual SRAM cells have not shrunk by a full 50%. Chipworks estimates the cell size to now be about 0.08µm2, versus 0.12µm2 on A7, which is closer to a 33% shrink that a 50% shrink. As a result the SRAM cache still takes up a fair bit of space, but the value of being able to serve larger memory requests without having to go off-die continues to be immense.

Apple A8 vs A7 SoCs
  Apple A8 (2014) Apple A7 (2013)
Manufacturing Process TSMC 20nm HKMG Samsung 28nm HKMG
Die Size 89mm2 104mm2
Transistor Count ~2B “Over 1B”
CPU 2 x Apple Enhanced Cyclone ARMv8 64-bit cores 2 x Apple Cyclone ARMv8 64-bit cores
GPU IMG PowerVR GX6450 IMG PowerVR G6430

Overall, Chipworks’ analysis points to A8 being fabbed on TSMC’s 20nm process. This makes A8 among the first SoCs to receive the 20nm treatment. Thanks to this smaller node Apple has been able to build in additional features to the SoC while simultaneously shaving off around 15% of their die size. Chipworks estimates the final die size of A8 to stand at 89mm2, versus the 104mm2 for the Samsung 28nm based A7. Chipworks notes that if this were a straight shrink that one would expect the A8 to be closer to 50% the size of A7 (though not all logic can shrink quite that well), which indicates that Apple has spent quite a bit of die size on improving performance through more complex CPU and GPU architectures and miscellaneous feature additions.

Wrapping things up, we’ll be back later this month with our review of the iPhone 6 family and our full analysis of the A8 SoC. So until then stay tuned.