Page 2 of 5

"Southern Islands"

First, a little about AMD's labeling of its latest products. The manufacturer has divided them into three levels according to performance. The code name "Cape Verde" refers to the Radeon HD 7700. The name "Pitcairn" hides today's participants Radeon testing HD 7870 and HD 7850. High performance products are called "Tahiti" or Radeon HD 7900. This is shown more clearly below.

  • Entry level = Cape Verde = Radeon HD 7700 series;
  • Mainstream = Pitcairn = Radeon HD 7800 series;
  • High performance products = Tahiti = Radeon HD 7900 series.

That is, at the moment, AMD has covered all market segments with its 28 nm graphics chips. Only the release of a dual-core video card based on Tahiti chips is expected. Tentative name Radeon HD 7990.

Series Features AMD Radeon HD 7800

The Radeon HD 7800 graphics processor (Pitcairn) has about 2.8 billion transistors and Graphic Core Next microarchitecture. As mentioned above, the Radeon HD 7850 chip (Pitcairn Pro) has 16 computing units, and its maximum TDP is 130 watts. For the Radeon HD 7870 (Pitcairn XT), these figures are 20 and 175, respectively.

The slide below shows the main specifications of the Radeon HD 7850 and HD 7870 video cards.

2GB of GDDR5 memory is already becoming the standard for most mid-range and high-end models. Thanks to 256-bit. bus and a high clock frequency of 1200 MHz (4800 MHz effective), the bandwidth is 154 GB / s. This will have a positive effect on performance in games with high resolution and picture quality.

PCI Express 3 interface

In the second half of 2011, almost all motherboard manufacturers presented their models of motherboards with PCI Express 3rd generation interface. With the release of the Radeon HD 7000 series, video cards with this interface also appeared. PCI Express 3 has twice the bandwidth (32 Gb/s) of the previous generation PCI Express. Compared to PCIe 2, the throughput per lane has been doubled from 500 Mb/s to 1 Gb/s.

Naturally, to take advantage of the new PCIe 3, you need to have not only a video card and motherboard with this interface, but also support from the processor (not all models from the Ivy Bridge family will support PCIe 3).

Eyefinity 2.0

AMD has gone further with their Eyefinity technology, which is designed to display images across multiple monitors. Thanks to the high processing power of the HD 7000 series and support for Eyefinity 2.0, it is now possible to display an image on multiple monitors with a total resolution of 16000 x 16000. This allows you to display the image on 5 displays with a resolution of 2560x1600 set in landscape orientation. To work with such resolutions, a record 3 GB GDDR5 (HD 7970 and HD 7950) is installed on the older models of the family.

AMD Catalyst drivers will support custom resolution starting in February. That is, you can set the required resolution depending on the configuration of the displays in Eyefinity. As of Catalyst 12.2, there is an option to set the Start menu to a display that is convenient for you, instead of the far left as it used to be. In addition, Eyefinity 2 supports stereo HD3D output. It supports the combination of three monitors that work in 3D mode.

Improved tessellation

AMD's Radeon HD 7000 family graphics cards feature the ninth generation tessellator and have seen significant performance improvements in geometry processing in today's games. The GCN core still includes two Graphics Engines, but if they previously contained blocks for tessellation and rasterization, now they consist of an arbitrary number of pipelines designed to process geometry and pixels.

AMD Radeon HD 7800 graphics cards support the HDMI 1.4a interface, which allows you to output a picture of 120 Hz (60 Hz for each eye), which allows you to display a 3D image. With earlier versions of HDMI, this was not possible. Starting in December, AMD enabled HD3D and Eyefinity to work together in their drivers.

DirectX 11.1

Video cards of the Radeon 7000 family will support the upcoming DirectX 11.1. What this will give in practice is too early to say, since DX 11.1 will be released along with Windows 8. The main advantages of the new API are indicated as follows:

  • Independent rasterization;
  • Flexible combination of graphics computing and video processing;
  • Native Stereo 3D support.

AMD Unified Video Decoder

Represents the hardware GPUs AMD, which is responsible for decoding the video stream. In the Radeon 7000 series, UVF received some improvements. In general, UVD retained all the features of its predecessors, namely support for H.264/AVCHD, MPEG-2, MPEG-4/DivX, VC-1/WMV profile D, Multi-View Codec (MVC), Video Codec Engine ( VCE), AMD Steady Video 2.0. Added support for Dual Stream HD+HD format.

Problems when registering on the site? CLICK HERE ! Do not pass by a very interesting section of our site - visitor projects. There you will always find the latest news, jokes, weather forecast (in ADSL newspaper), TV program of on-air and ADSL-TV channels, the latest and most interesting news from the world of high technologies, the most original and amazing pictures from the Internet, a large archive of magazines in recent years , appetizing recipes in pictures , informative . The section is updated daily. Always up-to-date versions of the best free programs for everyday use in the Essential programs section. There is almost everything that is required for daily work. Start to gradually abandon pirated versions in favor of more convenient and functional free counterparts. If you still do not use our chat, we strongly advise you to get acquainted with it. You will find many new friends there. It is also the fastest and most efficient way to contact project administrators. The Antivirus Updates section continues to work - always up-to-date free updates for Dr Web and NOD. Didn't have time to read something? The full content of the ticker can be found at this link.

AMD Radeon HD 7870 video card review. Inexpensive alternative to flagships

Most recently, we told you about the top video cards of the new seven thousandth line of video cards from AMD - Radeon HD 7970 and Radeon HD 7950. But, as you know, you won't succeed with tops alone. Therefore, it's time to talk about the AMD Radeon 7800 Series - a less expensive, but still productive line.

Introduction. Estimated cost

The AMD Radeon HD 7870 video card should be sold in stores at a price starting from 350 US dollars (in our case, it must be converted into rubles at the Central Bank rate). The recommended price for the "younger brother", AMD Radeon HD 7850, is $250. It must be understood that in Russia the price will always be slightly higher (according to price.ru, about Quote Request: AMD Radeon HD 7870 0903 3 for 7870 and around Quote Request: AMD Radeon HD 7850 0903 3 for 7850). So, assuming I buy one of these graphics cards, I would expect to spend about $50-$80 more each time. But even if we do not take into account the markups of the Russian market, the cost of each of the video cards is very high, and for that kind of money, buyers will expect very high performance. Let's find out what we get by buying one of these cards.

Architecture 7800 Series

The 7800 subline is based on a new GPU codenamed Pitcairn.

This GPU is very attractive price / performance ratio. For most buyers of a 7900 line video card, its performance will be excessive, and not everyone wants to overpay more money if they don't see the difference. In turn, HD 7700 video cards may be too weak for a comfortable game at maximum settings in the popular 1920x1080 resolution.

This is a bit like the history of the AMD Radeon HD 5850 graphics card.

Remember this one? Despite the fact that it could not be turned into a HD 5870 by simply flashing the BIOS, it won the love of buyers due to its optimal parameters, which allowed it to run modern games at maximum settings while maintaining an adequate price. The current generation of HD 7800 video cards may well fill this niche.

The new GPU carries 20 GCN (Graphic Core Next) Compute Units. Recall that in the Tahiti GPU, which is equipped with the top-end Radeon HD 7970 video card, there were 32 of them. Each computing unit carries four vector units, and each vector unit, in turn, has 16 scalar.

Thus, the total number of computing modules in the new GPU is 1280 pieces. The number of rasterization units (ROPs) has not changed and remains equal to 32. The GPU clock frequency is 1000 MHz. The amount of video memory is 2048 MB GDDR5. Clock frequency 1200 (4800 effective) MHz. Memory bus 256 bits. All these characteristics are valid for the older video card of the subseries - HD 7870.

The younger brother of the Radeon HD 7850 is equipped with the same GPU, only slightly trimmed. The computing units were cut, their number was reduced to 16. Thus, the junior video card carries only 1024 computing modules on board, i.e. exactly half of their number in the Tahiti GPU. Accordingly, there are also fewer texture processing units, there are 64 of them left. The clock frequency of the GPU is 860 MHz. Other characteristics remained unchanged (relative to the Radeon HD 7870).

So, as mentioned above, the Pitcairn GPU is based on the Graphics Core Next (GCN) architecture. This means that he inherited all solutions from Tahiti (7900 Series). The new GPU is built on a 28nm process. The video card is equipped with PCI-E 3.0 (Gen3). The energy-saving technologies AMD PowerTune and AMD ZeroCore Power have not gone away either. AMD Eyefinity 2.0 technology will allow us to connect multiple displays with a maximum overall resolution of 16384x16384. 3D support is present and implemented by AMD HD3D Technology. AMD also continues to develop technologies for increasing the performance of the entire system using GPUs. In this case, we are offered to use the Video Codec Engine (VCE) technology, which accelerates video encoding by the hardware of the video card, thereby greatly helping the central processor and reducing the amount of time required for data processing.

The slide above confirms that the Pitcairn GPU has completely inherited all the goodies from its older brothers. Moreover, the 7700 line is also not deprived of functions, but more on that in the next article.

Judging by AMD's slides, the new video cards have modest appetites. The older HD 7870 consumes no more than 175 watts in load. The TDP of the younger HD 7850 card is 130 watts. When the display is turned off in idle mode, graphics cards consume less than three watts.

The graphs on the slide look convincing. AMD promises almost a twofold increase in performance. However, this is not very surprising: the CrossFireX mode has always scaled well. Drivers play a huge role here. I'm glad that at least here everything is in order. On this we turn to the study of the structure of video cards.

AMD Radeon HD 7870

As usual, we will start by examining the exterior, then move on to dismantling the cooling system and studying printed circuit board and elements on it.

The appearance of the video card is absolutely traditional and recognizable. As you may have guessed, we received a reference engineering sample in the laboratory. The length of the board is 245 mm. So far, only one thing can be said about the cooling system: hot air is blown out of the case using a turbine. You can also see that the switch between BIOS chips has disappeared. In fact, looking ahead, I will say that the second BIOS chip also disappeared. From now on, when flashing a video card, there will be a certain risk of not starting it again. Most will think that this is a minus. However, how often do you flash video cards? Especially if the cores are not unlocked at the same time? Well, I don't think it's very often. And for most users, this is a plus, since the absence of such a function leads to a decrease in the cost of the product.

You can also see only one CrossFireX connector. This means that we can combine only two video cards in tandem. The situation is similar to the one we discussed above with the chip and BIOS switch. How many users bundle three or more video cards? I think that such people are an absolute minority, especially among those who use not the most top-end video adapters. At the same time, its absence again reduces the final cost of the product.

There is nothing remarkable on the reverse side, and we move on.

Under the grille for blowing hot air out of the case, in the usual place, there are four video outputs - one DVI, one HDMI and two miniDP.

The video card is equipped with two additional power connectors. This can help with overclocking by increasing the voltage applied to the GPU. At such times, energy consumption increases dramatically.

For better heat transfer, the place of contact between the heatsink and the GPU and the adjacent area are made of copper. Three copper heat pipes aid in heat dissipation.

If someone remembers what the board of the reference HD 6870 looks like, then they will understand me in the feeling that I have already seen it somewhere. Indeed, a board with a very similar arrangement of elements was used in the reference HD 6970.

In the center of the PCB is the Pitcairn XT GPU itself, rotated 45 degrees relative to the substrate.


Around the GPU are eight Hynix video memory chips marked T2C, which means that these chips can operate at frequencies up to 5000 MHz.


The heart of the power subsystem is the CHIL CHL 8225G chip. The GPU power subsystem is five-phase. Pay attention to the location of the power phases. There is one phase per video memory power subsystem. The printed circuit board looks thoughtfully and neatly, there is no feeling that they did it at the last moment “on the knee”.

This is where we finish our acquaintance with the Radeon HD 7870 board and move on to the study of the Radeon HD 7850.

Radeon HD 7850

Let me remind you that this is the younger brother of the Radeon HD 7850 video card.

Externally, the video card is no different from its older brother.

And there are no differences here, except for the only additional power connector. Since the HD 7850 consumes only 130 watts, versus 175 watts for the HD 7870, a single six-pin connector will suffice. The main thing is that this does not leave an imprint on the overclocking potential of the card.

There are also no differences from the HD 7850. We have one DVI, one HDMI and two miniDP at our disposal.

The cooling system does not differ from that of the Radeon HD 7870. We are offered a copper base at the point of contact between the GPU and the cooling system, and three copper heat pipes for efficient heat dissipation.

As expected, the printed circuit boards of the two video cards are also completely identical. No differences in soldered elements are observed. The only difference is the "cut" phase from the GPU power subsystem. For stable operation at nominal value, this is definitely enough, but how the GPU will show itself in overclocking and how stable it will be is another question.


And here is the very heart of the HD 7850 - AMDPitcairnPRO GPU. It is made in China and carries 1024 computing modules. Recall: the video card Radeon HD 6850 carried 960 computing modules, i.е. the gap is not so great, if you do not take into account the new architecture and look only at the numbers.


On the printed circuit board of this card, 8 video memory chips with a total capacity of 2 GB are soldered. The density of one chip, respectively, is 256 MB. The marking has not changed and reads "T2C", which corresponds to the nominal frequency of 5000 MHz. I would very much like this bar to be overcome in overclocking.

Alternative versions of Radeon HD 7800 Series

As always, most manufacturers will want to move away from reference cooling systems and offer something of their own. Let's see what exactly.

ASUS HD 7870 and HD 7850

Video cards manufactured by ASUS with the well-known DirectCU II cooling system differ very little in appearance, but this is not the main thing. The main thing is that the manufacturer promises up to 20% increased performance compared to the reference version.

Club3D HD 7870 and HD 7850

Club3D decided not to go far from stock. The Radeon HD 7870 looks no different from the reference. The younger brother, apparently, got a heatsink personally for the GPU with one heat pipe and a fan. If at the same time the cooling of the remaining elements did not suffer, then everything is fine.

Gigabyte HD 7870 and HD 7850

Gigabyte decided to equip the 7800 series with its proprietary cooling systems, as well as increase the clock speeds. The HD 7850 video card is content with a cooling system with two fans, while the Radeon HD 7870 got a cooling system with three fans. Outwardly, there is nothing to complain about, except for the well-known problem with the removal of hot air outside the case.

HIS HD 7870 and HD 7850

HIS has equipped the new products with its proprietary IceQ X cooling systems. These COs have always been famous for their good efficiency. The younger video card got only two heat pipes, while the older video card received all four.

MSI HD 7870 and HD 7850

The TwinFrozr III cooling system is considered highly efficient, but it always raises the price tag. It often happens that it is easier to take the graphics card with the next highest performance. But no one canceled the advantages in the form of high efficiency and low noise level.

PowerColor HD 7870 and HD 7850

PowerColor introduced two versions of the Radeon HD 7870 and one Radeon HD 7850. The youngest HD 7850 will be content with a standard cooling system, as far as it looks. This is not scary, since the HD 7850 has a very low power consumption, and, accordingly, heat dissipation. For the HD 7870, two versions are prepared: one reference, and one with a PCS + cooling system.

Sapphire HD 7870 and HD 7850

Video cards manufactured by Sapphire will be equipped with cooling systems very similar to the proprietary FleX. There is no reason to doubt the effectiveness of these COs, since we have already tested similar video cards, and, even with much hotter tempers, the temperatures remained within reasonable limits.

XFXHD 7870

XFX has presented only the older video card Radeon HD 7870 at the moment. It will be equipped with two fans. We have already seen a similar design in the 7900 series video cards from this manufacturer.

Specifications table

Process technology, nm

Stream processors, pcs

Rasterization units (ROPs), pcs

GPU frequency, MHz

Video memory frequency, MHz

Video memory size, MB

Video memory bus, bit

Supported version of DirectX

Cost, rub*

Request a Quote: HD 7870 0903 3

Quote request: 7850 0903 3

Quote request: 6970 0903 3

Quote request: 7950 0903 3

*For prices www.price.ru for Moscow

Overclocking and temperatures

Let's start with the youngest video card, the Radeon HD 7850


The nominal clock speeds are 860 MHz for the GPU and 1200 (4800 effective) MHz for the video memory. The cooling systems in the references inspire confidence, so you can safely count on successful overclocking.


Without increasing the voltage, the video card was overclocked to 1025 MHz for the GPU and 1375 (5500) MHz for the video memory. These are good results. The fan speed was 40%. Higher values ​​made work next to the stand impossible. Still, a series of seven thousand video cards cannot be compared even with vacuum cleaners, they are quieter. Of course, at face value everything is fine, video cards are not audible in idle. But if you disperse it and unscrew the speed manually, then that's it, put out the light.


The nominal frequencies for the older HD 7870 video card are 1000 MHz for the video processor and 1200 (4800) MHz for the video memory.


As in the case with the younger video card, we turned the cooler speed up to 40%, which helped to lower the temperature. The video processor was able to function stably at a frequency of 1120 MHz. But the video memory is very upset. Unfortunately, she could not work stably at higher frequencies at all. Most likely, this is a special case and we were just unlucky. Probably one of the video memory chips was not in the best shape.


The operating temperatures of the two new products practically do not differ from each other in nominal terms. The older video card HD 7870, as expected, loses a few degrees under load, this is normal. During overclocking, temperatures are lower due to increased cooler speeds up to 40%. At the same time, the HD 7950 turns out to be a much hotter video card. But no one can compete with the HD 6970 in terms of heat dissipation. As befits the previous generation of video cards, it is the hottest video adapter among all test participants.

Competitors

To compare performance, we took two video cards: AMD Radeon HD 7950 and AMD Radeon HD 6970.

AMD Radeon HD 7950

The second most powerful video card from AMD today is against single-chip ones. We have already told you about this video card. I wonder how new items will behave against its background. On the side of the HD 7950 video card, the amount of memory is three gigabytes and the number of stream processors is equal to 1792 pcs.

AMD Radeon HD 6970

This video card is the TOP of the previous generation. It has high performance sufficient for modern games. Of course, at a resolution of 2560 x 1600 it has nothing special to catch, but in such modes more powerful video cards are also bent. The amount of video memory is 2048 MB. The number of stream processors is 1536 pieces, they operate at a frequency of 880 MHz.

test bench

  • Processor - Intel Core i7 3960X
  • Motherboard - ASUS P9X79 Deluxe
  • RAM – Corsair XMS3 1600MHz 9CL 4x4GB
  • Hard drive - Intel SSD 160 GB
  • Power supply - Corsair HX850W

Test results AMD Radeon HD 78xx test results in Dirt 3





The new line shows good performance. In most cases, the overclocked Radeon HD 7850 is very close to the stock Radeon HD 7870. The latter, in overclocking, in most cases successfully competes with the nominal Radeon HD 7950. Everything is natural. The performance of the HD 6970 jumps a lot from application to application. In some places it outperforms the HD 7870, and in some places it loses to the HD 7850. In general, the performance of the new line is very pleasing.

Conclusion

The AMD Radeon HD 7800 series turned out to be very successful. At a lower cost, the Radeon HD 7870 in overclocking often catches up with a video card from the older top line - HD 7950. This will save money for those who want to get more performance for less money.

I really liked the temperature regimes of the new video cards. They are really cold. Under load at stock frequencies, the HD 7870 heated up to 68 degrees, and the HD 7850 - up to 66 degrees. This is a very good result for such performance. It becomes very noticeable if you start comparing all the results with the HD 6970 video card.

I really didn't like the noise of the video cards, if you set at least 50% of the cooler RPM. I can’t even imagine who will be able to work near the computer if you set the rotation speed to more than 50%.

As soon as prices are finally established, new items will be an excellent investment. This process usually takes a month or two. Most importantly, the price / performance parameter has not shifted towards higher prices.

Modern games every year require more and more powerful video cards for graphics processing. One of the budget solutions for gamers will be the AMD Radeon HD 7800 Series. Consider the technical characteristics of this series, as well as its features and performance in games.

Consider the characteristics of the AMD Radeon HD 7800 Series in the form of a table:

Process technology 28 nm
GPU Pitcairn
Frequency (min. max. on models) 800-1000MHz
RAM Type of GDDR5
Volume 2GB
Frequency 800-1200 MHz
Bandwidth 153.6 GB/s
Interfaces Memory capacity 256-bit
Tire type PCI Express 3.0
Architecture GCN
Stream processors per block from 64 to 80
Number of blocks from 16 to 20
Total Stream Processors 1024-1280
Cores for geometry processing 2 pcs.
Cores for asynchronous computing 2 pcs.
Connectors HDMI

Display Port 1.2

Supported technologies and software DirectX 11 Yes
OpenGL 4.2 Yes
Eyefinity (Monitor Linking) up to 6 pcs.
Zero Core Power Sleeping mode
Catalyst Branded Drivers and setting
App Acceleration Improving the quality of video playback
AMDHD3D 3D graphics processing
power tune Dynamic power setting

The line was produced in March 2012. Based on it, the following models were released:

  • HD7850;
  • HD7870;
  • HD7890.

The range is currently no longer in production. At the start of sales, the average price in stores was $249 and $349.

Review

At AMD, after the release of a new process technology, it was decided to divide the general production series into subgroups. Therefore, on the basis of the 28nm process technology, 4 lines were formed, which are presented in the table:


The HD 7800 with "Pitcairn" GPU uses the Graphic Core Next microarchitecture. The series was launched in March 2012 and is currently out of print.

At one time, video cards from Pitcairn were quite popular and showed an excellent combination of price / quality. For 2018, the current series is not popular and it is extremely difficult to find a device in a new condition. Despite the fact that the graphics core is already outdated, in an assembly with a powerful processor, a PC can pull various games on medium and high settings.

What games will run on AMD Radeon HD 7800 Series

Video cards were released back in 2012, but they can still be used on modern toys. Tests in games AMD Radeon HD 7800 Series were made with the following hardware:

  • Processor: Core I5 ​​6500 3.2 GHz.
  • RAM: 16 GB DDR4 2133 Dual.
  • Hard drive: Hitachi 1TB.
  • Motherboard: Asus H170M-Plus.
  • Resolution: 1920x1080px.

The results are as follows:

Name of the game

Graphics quality
Assassin's Creed Syndicate Hight 31
War Thunder Cinema (Ultra) 55-65
Quantum Break Average 30-42
Assassin's Creed Unity Average 30
Shadow Warrior 2 high 35-45
Dying Light high 40-50
Fallout 4 Ultra 38-43
gta 5 Above average 45-50
DOOM high 40
rise of The Tomb Raider high 30-40
warfare high 90-100
The Witcher 3: Blood and Wine high 25-35
World Of Tanks high 60-80

Overall performance largely depends on the right combination of processor and graphics card. If you take a powerful processor of the current generation like Ryzen or Core I5, then they will be able to show high FPS in most modern games, even with an old video card.

After analyzing the technical characteristics and tests in games, we come to the following conclusions: it is not recommended to buy for powerful games in 2018, it is better to choose newer models.

The performance will be enough for comfortable homework and for running multiplayer games like CS:GO, World Of Tanks.

How to overclock a video card

To achieve maximum performance, you can overclock the AMD Radeon HD 7800. To do this, you will need to install the driver and configure it.

The main changes should be made in the "Games" section. If you use an up-to-date driver, profiles can be configured individually for each video game.

Navigate to the desired profile to the Frame Rate Control setting. By default, the video card squeezes the maximum FPS and spends all resources on it.

For a comfortable game of shooters, 60 frames per second is enough. For CS:GO, WarFace, WarThunder it is enough to set the limit to 70 FPS.

The OverDrive setting allows you to adjust the operating parameters: the frequency of the GPU and memory, the efficiency of the fans and the level of power consumption. These settings must be configured individually for each PC build.

Download drivers for AMD Radeon HD 7800 Series

To download drivers for the Radeon HD 7800 Series, use the proprietary auto-search program. You can download it on the official website of the manufacturer. There you can also find drivers for each version of the OS: Windows 7, Windows 10, etc.

7800 Series with different specifications. The chip, built on the Graphic Core Next microarchitecture, occupies a space equal to 2.8 billion transistors. Like most cards from Radeon, there is Eyefinity technology here, which allows you to connect up to six monitors at the same time. They can work independently of each other, or they can form one large monitor. It all depends on what settings will be exposed.

Radeon 7850

This AMD 7800 Series graphics card has 800 megahertz of processor frequency. High performance and bandwidth (153 gigabits per second) provides a bus size of 256 bits. The computing system processes data equal to 1.76 teraflops. Computing units are present in the amount of 16 pieces, and texture units - in the amount of 64 pieces. There are two cores for computing processes.

The memory format corresponds to the GDDR5 marking, and support for DirectX version 11 will help speed up interaction with operating system applications. To better optimize the performance of the card, you need to follow the driver updates, as only they are able to fully unlock all the capabilities of the GPU and provide access to necessary settings. Basic drivers, which designate the video card in the system, come with the card, and updated version can be viewed on the AMD website.

This AMD Radeon HD 7800 Series graphics processor is equipped with the latest in-built technologies that allow you to enjoy high-quality and smooth images at 60 frames, while the resolution can reach up to 4096 x 2160 pixels. The same applies to the audio stream, which meets all modern requirements, giving out high-quality sound.

Radeon 7870

This AMD Radeon HD 7800 Series graphics card is a powerful successor of the previous card in terms of characteristics. She has a whole gigahertz to work with the graphics processor. Performance for computing operations is much higher than in the previous version - 2.56 teraflops. There are 20 compute units and 80 texture units.

Since this is the flagship of the 7800 series, it surpasses its brother in many respects. Support for tessellation technology has been introduced into the video cards of this manufacturer for a long time, but in this version it is brought to the limit. Now you can enjoy a three-dimensional image, striking in its realism and detail. And improved anti-aliasing will help to achieve a smooth and pleasant picture.

In other parameters, this representative of the AMD Radeon HD 7800 Series is completely identical in terms of characteristics to the previous video card. Both cards are capable of supporting 3D technology in both video and games. It is also possible to connect several cards in order to increase the performance, but this parameter may also depend on the capabilities of the motherboard.

  • Chip codename: "Tahiti"
  • 4.3 billion transistors (more than 60% more than Cayman and exactly twice as much as Cypress)
  • 384-bit memory bus: six 64-bit wide controllers with GDDR5 memory support
  • Core clock: up to 925 MHz (for Radeon HD 7970)
  • 32 GCN Compute Units with 128 SIMD cores for a total of 2048 floating point ALUs (Integer and Float formats, IEEE 754 FP32 and FP64 precision support)
  • 128 texture units, with support for trilinear and anisotropic filtering for all texture formats
  • 32 ROPs with support for anti-aliasing modes with the possibility of programmable sampling of more than 16 samples per pixel, including with FP16 or FP32 framebuffer format. Peak performance up to 32 samples per clock, and in colorless mode (Z only) - 128 samples per clock
  • Integrated support for six monitors including HDMI 1.4a and DisplayPort 1.2

Radeon HD 7970 Graphics Specifications

  • Core clock: 925 MHz
  • Number of universal processors: 2048
  • Number of texture units: 128, blending units: 32
  • Effective memory frequency: 5500 MHz (4×1375 MHz)
  • Memory type: GDDR5
  • Memory capacity: 3 gigabytes
  • Memory bandwidth: 264 gigabytes per second
  • theoretical maximum speed fills: 29.6 gigapixels per second
  • Theoretical texture sampling rate: 118.4 gigatexels per second
  • Two CrossFire connectors
  • PCI Express 3.0 bus
  • Power consumption: 3 to 250 W
  • One 8-pin and one 6-pin power connector
  • Dual slot design
  • US MSRP: $549

Radeon HD 7950 Graphics Specifications

  • Core frequency: 800 MHz
  • Number of universal processors: 1792
  • Number of texture units: 112, blending units: 32
  • Effective memory frequency: 5000 MHz (4×1250 MHz)
  • Memory type: GDDR5
  • Memory capacity: 3 gigabytes
  • Memory bandwidth: 240 gigabytes per second
  • Theoretical maximum fill rate: 25.6 gigapixels per second.
  • Theoretical texture sampling rate: 89.6 gigatexels per second
  • Two CrossFire connectors
  • PCI Express 3.0 bus
  • Connectors: DVI Dual Link, HDMI 1.4, two Mini-DisplayPort 1.2
  • Power consumption: 3 to 200 W
  • Dual slot design
  • US MSRP: $449

The high complexity of the new chip draws attention - 4.3 billion transistors, which is more than half the number of transistors in the previous top-end graphics processor. The ability to make such a complex crystal was made possible by the use of a modern 28-nanometer process technology, and the new chip turned out to be even slightly smaller size Cayman. And its practical characteristics that affect performance have been noticeably improved: the number of ALUs, TMUs, memory bus. Only the number of ROPs did not increase, and the frequency of GDDR5 video memory remained at the same level.

The principle of naming video cards of the company remained the same. The Radeon HD 7970 is the most productive single-chip solution from the company, after some time the younger model HD 7950 was released, which was announced a little later. Initially, the HD 7970 had no competitors in the market and did not replace any specific video card from the AMD line, but rather moved it down. As for the comparison with the competitor, NVIDIA released its 28nm solution much later.

The same GDDR5 memory is installed on the new AMD video card, but its volume, instead of 2 gigabytes in the previous generation, has grown to 3 gigabytes. This happened due to the expansion of the memory bus from 256-bit to 384-bit. And now you can put either 1.5 GB or 3 GB on the new board. Naturally, from a marketing point of view, installing a smaller volume would be a clear disadvantage, so the decision was made to put 3 GB, although today this is a bit too much. Only above high resolutions Yes, with MSAA 16x, 1.5-2 GB is not enough. However, AMD also has Eyefinity, and for games on three or more monitors, the screen buffer will just take up a very large amount.

So, let's take a look at the Radeon HD 7970. The new video card of the upper price range has a two-slot cooling system, which is covered with a plastic casing, which is familiar to all modern AMD motherboards, along the entire length of the card. Only the design of this casing has changed a little, although rear end still goes beyond the printed circuit board. But the design of the bar with pins was changed - to improve the cooling of the video card, one of the two slots (half of the bar) was occupied exclusively by a ventilation hole for heat dissipation.

But users shouldn't suffer from the reduction in the number of DVI connectors soldered right on the board. For their convenience, a special HDMI-DVI adapter will be included in the package, which will allow you to connect two monitors with DVI connectors. By the way, power consumption new card not lower than the Radeon HD 6970, so it had to install a set of one 8-pin and one 6-pin power connectors.

But in the new Radeon HD 7970, the cooling system has changed for the better. A new generation of evaporator chamber and a new larger cooler are used, with a modified blade shape and increased performance (greater airflow is provided). The result is an increase in cooler efficiency while reducing noise.

The Dual BIOS firmware switch, which we wrote about in the Radeon HD 6900 description, has not disappeared from the board. Both users and AMD liked this convenient solution so much that AMD decided to continue to complete top-end solutions with it.

We can only say hello this decision, which really helps in various cases related to both unexpected problems during flashing (turning off the power in the process, for example), and allows you to fearlessly conduct various experiments with BIOS images. Unsurprisingly, AMD hints again and again at the excellent overclocking capabilities of the new graphics card:

As you can see, overclocking to a frequency of 1 GHz and higher is practically promised, if you do not take into account the small inscription (which was not included in the screenshot) that the guarantee ceases to apply even if the video card fails as a result of an experiment with raising the frequency from the settings video driver.

Architectural features Radeon HD 7970

To assess the relevance of architectural modifications in the Southern Islands, first consider the development of the GPU over the past few years in the AMD view. Prior to 2002, graphics chips were specific hardware capable of purely graphics computing. Video chips of that time had limited functionality, they could only apply and filter textures, process geometry, engage in primitive rasterization, and therefore were not at all suitable for universal computing tasks.

Over the next few years, basic programmability was added to the GPU, but also focused exclusively on graphics tasks. This was the time of support for DirectX 8 and 9, limited functionality shader programs with floating point capability. Video chips of that time had specialized ALUs for vertex and pixel processing, as well as dedicated caches for pixels, textures, and other data. Universality was still not even close.

And only in 2007, AMD got a unified shader architecture DirectX 10, as well as the ability to program the GPU using special tools: CAL, Brook, ATI Stream. GPUs of the time already had advanced caching and support for local and global shared data. Architecturally, the chips were based on the VLIW5 and VLIW4 blocks, flexible enough for some basic non-graphical computing, but still focused on graphics algorithms.

And now it's time for a new architecture, even better suited for universal computing - Graphics Core Next (GCN). For AMD, this is a new architectural era, which is why the name was chosen. The new GPUs offer excellent graphics processing capabilities and performance, but the architectural changes made are intended primarily to improve the position in non-graphics computing - increasing performance and efficiency in complex general-purpose tasks. The new GPU design is designed for so-called heterogeneous computing - a mixture of graphics and general-purpose in a multitasking environment. The GCN architecture has become more flexible and should be even better suited for energy efficient execution of various tasks.

The basic block in the new architecture is the GCN block. It is on these "bricks" that all new GPUs of the Southern Islands series are based. The architecture for the first time for AMD graphics chips uses a non-VLIW design, it uses vector and scalar blocks, and one of the most important changes is that each of the GCN computing blocks has its own scheduler and can execute instructions from various programs (kernel).

The new computing architecture is designed for high efficiency loading of computing units in a multitasking environment. The GCN computational unit is divided into four subsections, each of which works on its own instruction stream every clock cycle. Threads can also use the scalar block found in GCN for flow control or pointer operations. The combination of vector and scalar blocks offers a very simple programming model. For example, function pointers and stack pointers are much easier to program, and the task of the compiler is now greatly simplified, since the execution units are scalar.

Each GCN block has a dedicated 64 KB local data store for data exchange or local stack extension for registers. Also, the block includes both a first-level cache memory with the ability to read and write, and a full-fledged texture pipeline (sampling and filtering blocks). Therefore, the new computing unit is able to work independently, without a central scheduler, which in previous architectures was responsible for distributing work among blocks. Now each of the GCN blocks is capable of scheduling and distributing commands on its own, one computing unit can execute up to 32 different command streams, which can be from different virtual address spaces in memory and are completely protected and independent of each other.

Previous AMD GPU architectures used the VLIW4 and VLIW5 architecture models, and although they are good enough for graphics tasks, they are not efficient enough for general-purpose computing, since it is very difficult to load all the execution units with work in such conditions. The new GCN architecture offers a similarly large number of execution units, but in a scalar execution that removes the limitations and dependencies of registers and instructions. The transition from the VLIW architecture to the scalar execution provides a noticeable simplification of code optimization tasks.

When executing instructions on the previous VLIW4 architecture, the compiler has to deal with register conflict resolution, complex distribution of instructions to execution units at the code compilation stage, etc. At the same time, to achieve high performance, non-trivial optimization is often required, which is suitable for most graphics tasks and much less flexible for other calculations. The new architecture offers a significant simplification of development and support, simplified creation, analysis and catching errors in low-level code, stable and predictable performance.

Memory caching subsystem

Bandwidth and memory and caches are never enough, and there is always a need and methods to increase them. AMD's new GPUs use a full two-level read/write cache. Each computing unit has 16 kilobytes of the first level cache, and the total volume of the second level cache is 768 kilobytes (in total, the chip gets 512 KB L1 and 768 KB L2), which is 50% more than in the previous chip, which does not have the ability to write at all to L2 cache.

In terms of performance, each GCN computational unit can receive or write 64 bytes of data from / to the L1 cache or global memory in one cycle, which serves to exchange data between command streams. The same amount of data is capable of transmitting and receiving each section of the L2 cache memory. As a result, the company's top-end GPU achieves 2 terabytes / s for L1 and 700 GB / s for L2, which is 50% more than the previous top AMD solution.

Tahiti GPU

Now that we've looked at the low-level architectural changes to the new Southern Islands series, it's time to move on to the details of the line's most powerful solution, the Radeon HD 7900, which includes two models. First of all, let's just note the huge complexity of the new GPU, because it includes more than 4.3 billion transistors, which is twice as much as was in the chip on which the Radeon HD 5870 is based! Naturally, such a powerful chip became possible only thanks to the use of a new 28 nm process technology. So what does he have inside?

The number of geometric blocks has not changed, compared to the Cayman, there are still two of them, but the efficiency of their work has been significantly increased - we will dwell on this in more detail a little later. On the graphics processor diagram, we see 32 GCN architecture compute units available on the Radeon HD 7970, and in the case of the junior solution, some of them will be disabled. If we consider the peak computing performance of the solution, then it is almost 3.8 teraflops (the number of floating point operations per second), which is an absolute record for a GPU today.

Each GCN block has 16 texture units, which gives a total figure of 128 TMUs per chip, or more than 118 gigatexels / sec - and this is another record at the time of release, and it will not be the last. But the number of ROP blocks has not changed, there are still 32 of them in 8 enlarged RBE blocks. Another interesting architectural change is that now ROP blocks are “attached” not to memory channels, as it was before, but to GCN blocks.

Although the theoretical framebuffer write speed has not changed much, and the maximum possible are the same 32 color values ​​and 128 depth values ​​per clock, the practical fill rate (fill rate) in real applications has increased significantly due to the increased memory bandwidth. AMD measured the Cayman at only 23 pixels per clock, while the new Tahiti approached the theoretical 32 pixels per clock.

This is understandable, because the new AMD video chip has a 384-bit memory bus - six 64-bit channels, just like the current top-end competitor's solution. It is this 1.5-fold increase in memory bandwidth that makes it possible to increase the actual speed of texture fetches and writing to the framebuffer. The bandwidth of 264 GB / s should help to squeeze out close to theoretical figures of 118 gigatexels / sec and 30 gigapixels / sec, and in the practical part we will check this.

In the case of the “stripped down” Radeon HD 7950 graphics processor, Tahiti includes 28 active computing units of the GCN architecture out of 32 physically available on the chip. In the case of the junior solution of the Radeon HD 7970 series, it was decided to disable four of them. Since each GCN has 16 texture units, the total TMU figure for the new model is 112 TMUs, which gives a throughput of almost 90 gigatexels/sec.

But the number of ROPs and memory controllers in the HD 7950 has not changed, they decided not to cut them down and keep the same 32 and 6 pieces, respectively. Therefore, the Tahiti Pro video chip has the same 384-bit memory bus, assembled from six 64-bit channels, as the top AMD solution. Apparently, it is computing functional devices that suffer the most from marriage during production, and they decided not to cut everything else.

Tessellation and Geometry Processing

From an architectural point of view, nothing special has changed in the geometric blocks of the Tahiti since the Cayman. It still uses two blocks for processing (setting vertices and tessellation) of geometric data and rasterization, and the scheme is very similar to the one we saw earlier, except that the tessellators are already called the 9th generation:

Despite the schematic similarities, the latest generation of these blocks is capable of significantly better tessellation and geometry processing performance, since the blocks have undergone significant modifications. Although peak performance has grown only to almost two billion vertices and primitives per second (925 MHz and two vertices per clock), real performance grew more. This was achieved by increasing the size of caches, improving buffering of geometry data, and reusing vertex data.

As a result, tessellation performance is improved at all triangle split ratios by up to four times over the previous generation Radeon HD 6970. But four times is not achieved in all cases, even on the diagram from AMD itself:

The chart shows a comparison of the tessellation performance of the Radeon HD 7970 compared to the HD 6970 at split ratios from 1 to 32. And as you can see, the performance difference is from 1.7 to 4 times. But this is naked synthetics. And in order to get closer to reality, we will give more data on the tessellation speed already in gaming applications:

As you can see, AMD's synthetic numbers are well supported by gaming ones - performance in real applications with "heavy" tessellation has grown significantly. This is a very good result, which we will definitely check in the practical part, using the example of synthetics and gaming applications.

Non-graphic calculations

From the point of view of heterogeneous and non-graphical computing tasks, two asynchronous computing engines (Asynchronous Compute Engines - ACE) have appeared. They are designed to schedule and distribute work between execution units for efficient multitasking and work in conjunction with a graphic command processor (Command Processor).

The Radeon HD 7900 has two independent compute engines and one graphics engine. In total, this gives three programmable blocks and three instruction streams, completely separated from each other. And in addition to issuing commands asynchronously for fast switching context, the new GPU also features two bi-directional direct memory access (DMA) controllers introduced in the Cayman. These two controllers are required to take full advantage of the new PCI Express 3.0 bus.

As we know, from the point of view of serious calculations, not only the speed of performing floating point operations with single precision, but also double precision (double precision floating point) is important. And AMD's new architecture does a pretty good job of that. At the moment, it is assumed that there are two versions of GCN computing units with different execution rates for FP64 instructions. For the older GPU, the execution rate is 1/4 of the FP32 speed, and for the younger chips, the execution rate is 1/16, which is quite enough to maintain compatibility, but does not complicate inexpensive solutions too much. As a result, the Radeon HD 7970 is capable of 947 billion double-precision operations per second (oh, they just barely reached the teraflop!) - there is another highest achievement of the new AMD chip.

Moreover, these are not the same gigaflops as in the case of previous architectures, but more “fat” ones. After all, the efficiency of the new GPU in complex computing tasks should seriously increase. First, the memory and caching subsystem has been improved. Secondly, each GCN compute unit has its own scheduler, which should improve branching code execution and overall efficiency. And thirdly, we note the scalar execution, which does not require complex optimizations from the compiler, as a result of which the computing units will be idle much less often. And as a result, in any tasks, it will be easier for the new chip to show high performance and ALU loading.

Among other innovations related to computing capabilities, we note the full support for ECC for DRAM and SRAM. On the software side, it is important that Tahiti is the first GPU with full support for new API versions: OpenCL 1.2, DirectCompute 11.1 and C++ AMP and their capabilities. For example, OpenCL 1.2 allows you to combine the capabilities of several computing devices into one, and AMD has already released support for this in the form of AMD APP SDK 2.6 and the Catalyst 11.12 driver.

Architectural Performance and Efficiency

After reviewing all the architectural innovations on the example of the top chip of the Southern Island series, it's time to talk about the effectiveness of all these changes. It is clear that the performance of the new chips is much higher than that of the previous ones, the opposite would be quite surprising. The question is how much faster. In various tasks, figures are obtained from 40-50% (minimum!) To a fivefold difference. Improvements in the architecture make it possible to exceed the theoretical 1.4-fold difference in dumb gigaflops. Let's look at this with examples:

The diagram compares the new top solution and the previous single-chip solution: Radeon HD 7970 and HD 6970, which is quite fair. Various performance tests are chosen: SmallptGPU and LuxMark are ray tracing on OpenCL, SHA256 is a secure hashing algorithm, and AES256 is a symmetric encryption algorithm. Well, Mandelbrot is a well-known problem calculated with double precision calculations.

The vertical broken line in the graph indicates the theoretical difference in performance, but the speed data shows that in three out of five tasks the speed of the new GPU was significantly higher. This is due to all the changes aimed at increasing efficiency: moving away from VLIW, the presence of a scheduler in each computing unit, improved caching, etc.

Changes in rendering quality

Actually, this part could well have been skipped, since there are no special claims to image quality lately and cannot be - for various reasons. For example, the quality of full-screen anti-aliasing for video cards different manufacturers very close, especially considering the wide distribution of software anti-aliasing methods using post-processing filters, which are performed on all GPUs in exactly the same way.

The same applies to texture filtering - now its quality is such that it's very difficult to distinguish between AMD and NVIDIA solutions even if you do a pixel-by-pixel comparison. The Radeon HD 6900 - the previous generation of the company - has improved anisotropic filtering a little more, and now even a "microscope" will not help to find any significant flaws there. The only note is that in motion the Radeon video cards were slightly inferior to the GeForce due to more noticeable specific artifacts, such as "noise" or "sand".

With the release of the new generation of video chips, the texel weights in the texture filter were revised once again, modified in such a way as to reduce such artifacts, sometimes visible on the Radeon HD 6900 in the presence of textures of a certain type (“high-frequency”, with sharp transitions from dark to light, for example). The change in quality is so hard to show with examples that AMD doesn't provide HD 7900 versus HD 6900 comparison pictures, but simply compares the quality of the "hardware" algorithm with a purely software algorithm running on GPU stream processors, and therefore ideal:

On such a small screenshot, the difference in quality is not visible, but AMD assures that all the changes made did not bring any performance drop and did not worsen the image quality in any of the aspects - it still does not depend on the angle and the filtering quality is close to ideal. In one of the future practical materials, we will definitely check this.

Partially Resident Textures

The idea of ​​Partially Resident Textures (PRT) is to use the hardware capability of the presented GPU - virtual memory. Surely many users have already seen id Software's RAGE game, which uses virtual texturing technology, the so-called megatexturing ("MegaTexture"), which provides the ability to use huge amounts of texture data and swap (streaming) them into video memory.

Using virtual video memory, it is very easy to get effective hardware support for such algorithms, which allow using up to 32 terabytes of textures in an application, which makes it possible to make unique locations in games, without repeating pieces of textures, while total absence problems loading texture data. True, AMD's illustrative example is too strange, from which nothing is particularly clear:

PRT allows you to achieve high picture quality and helps to increase the efficiency of video memory usage. Similar algorithms are already used in the id Software engine, and are expected to appear in many next generation engines. Games of the future need to work with huge amounts of data, and the advantage of the new GPU is that local graphics memory in PRT algorithms works like hardware cache memory, and textures are loaded into it when necessary. GPUs from the Southern Islands family support "mega-textures" up to 32 terabytes (resolution up to 16384×16384) and, most importantly, hardware texture filtering for them, which is not available on earlier video chips.

Virtual textures are divided into pieces of 64 kilobytes (kilobytes, not texels) and this piece size is fixed. And only those that are needed when rendering the current frame are loaded into the local memory of the video card. The technology works regardless of the texture format, just the sizes of the pieces in texels will be different. For example, for a regular uncompressed texture with 32 bits per color, the chunk size will be 128x128 texels, and for a DXT3-compressed texture it will be 256x256 texels.

The technology also involves the use of mip-levels of textures (smaller copies used in texture filtering). When rendering and filtering, they need to be accessed multiple times. Consider the operation of the algorithm on an example.

This figure highlights four different pieces from different mip levels required for rendering. When the shader program requests data from them, some of the pieces are already in local memory and this data is immediately sent to the shader for further calculations. But some pieces are missing from the table, and the application must decide what to do next on such a miss. For example, you can request data from a lower resolution mip-level, then the image will be fuzzy, but at least it will look like the truth and will be drawn without delay. And by the time the next frame is rendered, it can already be loaded into the cache - local video memory. Those who played RAGE will understand us.

This is a powerful algorithm that allows you to use huge textures that are unique to each of the objects. Similar algorithms have long been used in offline rendering, except for the need for real-time calculations. AMD even made a demo using the Per-Face Texture Mapping technique developed by Walt Disney Animation Studios for their animated films. Unfortunately, the demo isn't ready yet, and we've only seen low-res screenshots.

The essence of this texture mapping technique is to assign a certain piece of texture to each polygon, without the need to use UV-transformation (finding a correspondence between the coordinates of the surface of a three-dimensional object and the coordinates on a two-dimensional texture). This approach solves some of the problems with creating tessellated content by making the displacement mapping algorithm very simple. And PRT in this method is used for efficient storage and access to texture data.

Media Handling Instructions

An interesting innovation in Southern Islands seems to be support for specialized instructions used in image processing, both static and dynamic. For example, a widely used instruction called "sum of absolute differences", better known as SAD (Sum of Absolute Differences), has been improved. The speed of its execution is a very performance-critical bottleneck in many image and video data processing algorithms, such as motion detection, gesture recognition, image search, computer vision, and many others.

But in our review of the ancient video card Radeon HD 5870, we already wrote about SAD support. Now, in addition to the usual SAD (4 × 1), Southern Islands has new instruction- QSAD (quadruple SAD), which combines SAD with shift operators to increase performance and power efficiency, as well as the "mask" instruction MQSAD, which ignores background pixels and is used to isolate objects moving in the frame from the background.

The new GPUs can process up to 256 pixels per GCN compute unit per clock, which in the case of the AMD Radeon HD 7970 means that it can process up to 7.6 trillion pixels per second in the case of 8-bit integer color values. Although this is a theoretical figure, the visual processing capabilities of the new GPUs are quite impressive - many video processing tasks can be performed in real time.

PCI Express 3.0

We couldn't pass by the support of the third version of PCI Express by the whole line of new graphic solutions from Southern Islands. This support was quite expected, since the specifications of the third version of PCI Express were finally approved in the fall of 2010, but there were still no hardware solutions with its support, although motherboards are already appearing, video cards were released at the end of 2011, and central processors there are corresponding ones.

The updated interface has a transfer rate of 8 gigatransactions per second instead of 5 GT / s for version 2.0, and its bandwidth has once again doubled (up to 32 Gb / s) compared to the PCI Express 2.0 standard. The new bus uses a different coding scheme for data sent over the bus, but compatibility with previous versions of PCI Express has been preserved.

First motherboards with PCI support Express 3.0 was introduced in the summer of 2011, mainly based on the Intel Z68 chipset, and they appeared on the general market only in the autumn of the same year. So video cards arrived in time, and AMD again became ahead of the rest in terms of the speed of release of new graphics processors with support for the most advanced technologies. But it's too early to judge whether PCI-E 3.0 will be of any practical use.

AMD PowerTune technology

One of the most interesting innovations in the Cayman was PowerTune advanced power management technology. Flexible GPU power management has been used for a long time, but before the Radeon HD 6900, all these technologies were rather primitive and mostly software methods and changed the frequency and voltage in steps, not being able to turn off large parts of the video chips.

Even in the Radeon HD 5000 family, a performance limiter appeared when a certain consumption level was exceeded, and in the Radeon HD 6900 the system moved to a qualitatively different level. To do this, special sensors were included in the chip in all blocks that monitor boot parameters. The GPU constantly measures the load and power consumption and does not allow the latter to go beyond a certain threshold, automatically adjusting the frequency and voltage so that the parameters remain within the specified thermal package.

Unlike early power management technologies, PowerTune provides direct control over GPU power consumption, as opposed to indirect control by changing frequencies and voltages. This technology helps to set high GPU frequencies, get high performance in games, and not be afraid that consumption may go beyond safe limits. After all, most games and regular applications that use GPU computing have significantly lower power requirements and do not approach dangerous power consumption limits, unlike stability tests like Furmark and OCCT.

Even the heaviest games do not require maximum power consumption, and if you limit consumption by frequency, testing video cards with extreme tests, then in the case of 3D games, there will be quite a lot of unused performance and power opportunities. In the event that the video card has not reached the limit of the safe consumption level, the GPU will operate at the frequency set at the factory, and in the FurMark and OCCT tests, the GPU frequency will decrease to stay within the consumption limits.

Thus, PowerTune helps to set higher factory frequencies and tune the system for the most efficient use of GPU resources at the maximum power level set. In the example shown above, the HD 5870 does not use PowerTune and due to the GPU frequency limitation of high consumption in endurance tests does not use its full potential. While the maximum TDP is set for the Radeon HD 7970, and the video chip resets the frequencies only when it is exceeded, getting the highest possible performance in any application.

This is clearly shown in the following diagram. In the case of gaming applications, TDP can be achieved by increasing the frequency of the GPU, and for peak loads, endurance tests reduce the frequency to a safe level of power consumption. Without PowerTune, you would have to choose - either to get the probability of a video card failure when FurMark and OCCT are running for a long time, or to cut down on the potential performance in games. New technology solves these issues as efficiently as possible.

AMD PowerTune features fast responsiveness to changing conditions (microseconds) as it is a hardware technology. It is also distinguished by flexible frequency tuning, and not stepped, as it was in previous chips. All measurements are independent of the driver, but can be adjusted by the user using the video card settings.

The difference between PowerTune and the previously generally accepted approach is that in other cases, thermal throttling is used, which puts the GPU in a significantly reduced consumption mode, and PowerTune simply smoothly reduces its frequency, bringing the GPU consumption to the set limiter. This results in higher clock speeds and higher performance.

AMD ZeroCore Technology

AMD did not limit itself to using power management technology already known from previous solutions. In the first chips of the Southern Islands family, it introduces AMD ZeroCore technology, which helps to achieve even greater energy efficiency in the "deep idle" (or "sleep") mode with a disabled display device, which is supported by all operating systems.

After all, almost any system, even a gaming one, spends most of the time in low-load mode on the graphics processor. And the video card should not consume much power in this mode. And even more so, not to mention the mode with the monitor turned off - in this case, it is advisable to turn off the GPU altogether. That's what AMD did. Thanks to ZeroCore, the new GPU consumes less than 5% of the power in full mode when in deep idle, disabling most of the functional blocks in this mode.

AMD provides a schematic comparison with its own Radeon HD 5870, which did not support this technology. ZeroCore is a Southern Islands-exclusive innovation in desktop solutions from mobile GPUs designed for laptops. By the way, the advantages of this technology are associated not only with reduced consumption. In addition, in the long idle mode, when the display is turned off, the video card also completely turns off the fan on the video card cooler!

This is exactly what many users have been waiting for a long time. The most interesting thing is that according to our data, laboratory tests of solutions like PowerTune and ZeroCore took place several generations of video cards ago. Some of the engineering samples of video cards from AMD's series that have long since left the market worked exactly like this, completely turning off the cooler in idle time.

But it's not just single-GPU users who benefit from noise reduction and power consumption with AMD's new ZeroCore-enabled graphics cards. Similar improvements await happy owners of CrossFire systems based on two, three, and even four GPUs. Is it logical, after all, that in the mode of rendering the two-dimensional interface of the operating system, all video cards, except for the main one, should not work at all? But that's how they work now!

In the case of CrossFire systems on video cards with ZeroCore support in 2D mode, all secondary video cards are immersed in a deep sleep with minimal power consumption and a disabled cooler. This mode works both for several single-chip video cards and for two-chip solutions. In addition, the primary CrossFire graphics card will also enter this mode in the event of a long idle time configured in Windows. Visually, the difference in work looks like this:

By the way, the technology is not as simple as it might seem. AMD engineers had to solve a lot of issues related to the operation of the operating system in idle mode. For example, they found that Windows tries to update information on the screen even when the monitor is turned off. Which, of course, does not allow you to disable the GPU at all. Therefore, the company's programmers had to take a workaround, ignoring all screen drawing commands when the monitor was turned off in sleep mode.

AMD Eyefinity 2.0 Technology

Naturally, in the new architecture there was a place for improvements to the proven technology for displaying images on multiple monitors - AMD Eyefinity, now in version 2.0. It has received new features, higher resolutions, support for more displays and increased flexibility.

This technology is quite interesting, although an extremely small number of users will find space in the room and muster up the courage in front of the family to install more than two monitors. But it is better to have the opportunity to always be able to use it than not to have it at all. Moreover, the prices for monitors of large diagonals almost do not decrease, but mid-range solutions are constantly getting cheaper.

Indeed, now it is more profitable to buy three monitors with a screen diagonal of 24″ than one 30-inch one. AMD gives just such an example, where a 30″ 2560x1600 monitor costs over $1,000, while three 24″ FullHD monitors can be bought for half that price:

But how to spend your money and space in the room is a personal matter for each user. The main thing is that there is such an opportunity. Plus, Eyefinity 2.0 now supports image output in HD3D stereo mode - something that was lacking in previous solutions, which were inferior to competing ones in this parameter. Combining AMD Eyefinity and HD3D technologies, the Radeon HD 7970 is the first single-chip solution to support three monitors in stereo mode.

High resolution stereo rendering requires a very fast data transfer interface. And with previous versions of the HDMI outputs, the possibilities were limited to 24Hz per eye, which is just enough for watching Blu-ray 3D movies, but clearly not enough for gamers.

For such tasks, they began to use the frame packing format, when the frames for the left and right eyes are combined into one, and AMD Radeon HD 7970 supports the HDMI 1.4a frame packing format for stereo output. This is the first video card to support 3 GHz HDMI with frame packing, where each eye has a FullHD picture at 60 Hz (120 Hz in total):

Another interesting novelty seems to us to be Discrete Digital Multi-Point Audio (DDMA) multi-channel audio output technology, which works together with Eyefinity. All previous GPUs are capable of outputting via HDMI and DisplayPort only one audio stream. That is, even if three monitors located in different rooms are connected to the PC via HDMI, then sound channel only one is transmitted. But the AMD Radeon HD 7900 received support for the simultaneous output of several independent audio channels at once, which may well come in handy in some multi-monitor configurations.

The same feature will be very useful for video conferencing applications with the output of several interlocutors on separate screens, as well as multitasking applications such as playing on three monitors with game audio and watching news on a separate screen with an independent audio stream. Previously, for all this it was necessary to use several separate audio systems, but now everything works as conveniently as possible.

The software support of Eyefinity is also not forgotten, almost every month the technology is updated - new opportunities appear. So, back in October, support for resolutions up to 16384 × 16384 and new multi-monitor configurations appeared: horizontal and vertical 5 × 1, as well as based on six monitors in 3 × 2 mode.

The AMD Catalyst video driver update in December brings Eyefinity and HD3D to work together, and in February, support for custom resolutions, taskbar placement tweaks, and improvements to preset management are announced.

Output to six monitors can be achieved using two DisplayPort 1.2 ports and two MST hubs (which we wrote about earlier), while three or even four monitors require only one port and the corresponding hub. These hubs offer flexible display configurations, support up to four FullHD devices per DisplayPort 1.2 connector, and should be available by summer 2012.

Speaking of permission. High resolution or even ultra-high - Ultra High Resolution. Current devices with a resolution of 4000 pixels on the larger side require connection using several cables at once: two DP 1.1 or four DVI. Monitors of this resolution of the next generation will be connected using only one cable: DP 1.2 HBR2 or HDMI 1.4a 3 GHz. And AMD's new video card is already ready for such monitors, again it became the first in the world.

Video encoding and decoding

It is quite natural that the AMD Radeon HD 7970 includes the same UVD block for decoding video data, which appeared in the previous generation of the company's video chips. It simply doesn't need any modifications, supporting the MVC multi-stream codec, MPEG-2/MPEG-4 (DivX), VC-1 and H.264 decoding, as well as decoding two FullHD streams in all supported formats.

AMD solutions provide maximum quality video stream decoding, use dozens of special quality improvement algorithms and provide maximum results in quality tests like HQV. Among the supported features, we note: color and tone adjustment, noise reduction, sharpening, high-quality scaling, dynamic contrast, advanced deinterlacing, and inverse telecine. Here is an example of on-the-fly contrast enhancement:

But with decoding, all video chips have been more or less in order for a long time. All new GPUs provide decent quality and performance when viewing video data. But video encoding on the GPU is still in its infancy, and the main complaints from users are directed to the low quality of the resulting compressed image.

Perhaps the new Radeon HD 7000 series can help with this, because all the graphics processors in the series have a Video Codec Engine (VCE) video encoding unit. The Radeon HD 7970 was the first video card to support hardware-accelerated encoding and video compression using a specialized block (earlier, stream processors took part in encoding).

Quality and performance should be clearly better than before, with support for 1080p at 60fps encoding, even faster than real-time. It’s hard to say something about quality without tests, but we are promised different levels encoder optimizations for video data and games, as well as variable compression quality (the ability to choose between quality or performance improvement).

So far, there is no place to try VCE - there are simply no applications with support for it, but AMD is working with partners such as ArcSoft to provide support for VCE in relevant software products. In the future, we plan to release a software library for accelerating video encoding, which will make it easier for developers to support next-generation AMD products.

Encoding can be done in two modes: full and hybrid (using the capabilities of GPU stream processors). Full mode is designed for applications that require maximum energy efficiency and a consistent level of performance. Full mode encoding on VCE is faster than real time and provides low latency. But there is also a hybrid mode:

In this mode, GPU math blocks work together with VCE. All highly parallelizable stages, which are circled in yellow in the diagram, can use the power of the GCN computational units, and the dedicated VCE unit is engaged in efficient hardware entropy coding. This mode is well suited for video cards with great mathematical power, such as the Radeon HD 7970. There are still questions about the quality of these two modes, but this requires a thorough analysis in a separate material.

AMD Steady Video

In addition to encoding and decoding video data, there is another area where the power of AMD's new graphics can be used - improving poor-quality handheld videos without the use of a tripod or other similar image stabilization tools. The video stabilization technology is called AMD Steady Video, and its second version has already been released.

The algorithm of the software stabilizer is quite simple: based on the video stream, statistics are collected about the movement of the camera (shift, rotation, zoom) and this movement is compensated in the current frame, relative to the previous ones - the image is shifted, rotated and scaled so that the picture does not jump much and remains stable.

As simple as it is in words, it is just as difficult to implement. Simply because there are two million pixels on the screen, and up to 30 or even 60 frames per second. Imagine how many calculations you need to do to track all possible frame shifts. We have already written above about the QSAD function used in video processing, and it is also used in Steady Video 2.0 to speed up the motion detection algorithm. So the GPU must process random shifts with an amplitude of up to 32 pixels in any direction, and this requires a performance corresponding to more than 500 billion SAD operations per second (for 1920x1080 at 60 FPS).

By supporting the new QSAD instructions in the Radeon HD 7970, its advantage over powerful CPUs in the motion detection algorithm exceeds 10x! That is, high-quality video will now be provided to us, and not only when processing home videos in video editors, but also when watching other people's online videos, shot by no one knows what and no one knows how.

Details: Radeon HD 7800 series

  • Chip codename: "Pitcairn"
  • Production technology: 28 nm
  • 2.8 billion transistors (slightly more than the Cayman, which is the basis of the Radeon HD 6900 series)
  • Unified architecture with an array of common processors for streaming processing of multiple types of data: vertices, pixels, and more.
  • Hardware support for DirectX 11.1, including shader model Shader Model 5.0
  • 256-bit memory bus: four 64-bit wide controllers with GDDR5 memory support
  • Core clock: up to 1000 MHz (for Radeon HD 7870)
  • 20 GCN Compute Units with 80 SIMD cores for a total of 1280 floating point ALUs (Integer and Float formats, IEEE 754 FP32 and FP64 precision support)
  • 80 texture units, with support for trilinear and anisotropic filtering for all texture formats
  • 32 ROPs with support for anti-aliasing modes with the possibility of programmable sampling of more than 16 samples per pixel, including with FP16 or FP32 framebuffer format. Peak performance up to 32 samples per clock, and in colorless mode (Z only) - 128 samples per clock

Radeon HD 7870 Graphics Specifications

  • Core frequency: 1000 MHz
  • Number of universal processors: 1280
  • Number of texture units: 80, blending units: 32
  • Memory type: GDDR5
  • Memory capacity: 2 gigabytes
  • Theoretical maximum fill rate: 32.0 gigapixels per second.
  • Theoretical texture sampling rate: 80.0 gigatexels per second.
  • One CrossFire connector
  • PCI Express 3.0 bus
  • Connectors: DVI Dual Link, HDMI 1.4, two Mini-DisplayPort 1.2
  • Power consumption: 3 to 175 W
  • Two 6-pin power connectors
  • Dual slot design
  • US MSRP: $349

Radeon HD 7850 Graphics Specifications

  • Core frequency: 860 MHz
  • Number of universal processors: 1024
  • Number of texture units: 64, blending units: 32
  • Effective memory frequency: 4800 MHz (4×1200 MHz)
  • Memory type: GDDR5
  • Memory capacity: 2 gigabytes
  • Memory bandwidth: 153.6 gigabytes per second
  • Theoretical maximum fill rate: 27.5 gigapixels per second.
  • Theoretical texture sampling rate: 55.0 gigatexels per second.
  • One CrossFire connector
  • PCI Express 3.0 bus
  • Connectors: DVI Dual Link, HDMI 1.4, two Mini-DisplayPort 1.2
  • Power consumption: 3 to 130 W
  • Dual slot design
  • US MSRP: $249

And this time, the principle of naming the company's products was not changed and the trends of the previous series were continued. The mid-budget series of video cards based on the GCN architecture differs from the top and budget lines in the second digit in the index: instead of 7 and 9, the number 8 is set, which is quite logical. Since AMD has taken the psychological threshold of 1000 MHz for the frequency of the GPU, the Radeon HD 7870 received the addition of "GHz Edition" to the name, indicating the adoption of this frequency.

From the name it is clear that the Radeon HD 7800 is more productive than the HD 7700, but has a lower speed compared to the older models - HD 7900. As for the comparison with NVIDIA solutions, the older HD 7870 released at the time of release competes with the GeForce GTX video card 570, and the younger one is aimed at fighting the GTX 560 Ti, and NVIDIA has not yet released new 28 nm mid-range chips.

Both models of video cards from AMD have GDDR5 memory of the same amount of 2 gigabytes. They both use a 256-bit memory bus, and so you could put 1, 2, or 4 GB on them. 1 gigabyte is too small, and 4 GB is too expensive for this price segment. Therefore, we can say that the ideal amount of 2 GB of video memory has been chosen, which is quite sufficient for the vast majority of games even at high resolutions, and not too costly in terms of cost.

In other respects, from the point of view of the consumer, the HD 7850 and HD 7870 models are still different. The older Radeon HD 7870 has a higher power consumption, so it needs two additional 6-pin power connectors, and the HD 7850 is content with only one of them. Both boards have a two-slot cooling system design, but most manufacturers produce boards with their own design of at least a cooler, and even a PCB.

Architectural features of the Radeon HD 7800 family

Above, we have carefully described all the features of the new Graphics Core Next (GCN) architecture, so we will repeat only the most important. All of the company's new GPUs offer excellent features and performance not only in graphics processing, but also in non-graphical computing, including a mixture of different types of calculations. Also, the new GCN architecture offers a serious simplification of code optimization tasks, simplification of development and support, as well as stable and predictable performance and, in general, quite high efficiency.

The base block of the new architecture is the GCN block, and all GPUs of the Southern Islands series are assembled from them. Consider the block diagram of the Pitcairn chip:

The diagram shows the Radeon HD 7870 GPU (the "simplified" HD 7850 differs from it by several disconnected blocks), we see 20 computing units of the GCN architecture. In the case of the junior solution of the Radeon HD 7800 series, four of them were disabled, and the number of active blocks in it is 16. This corresponds to 1280 and 1024 stream processors, respectively (just like in the case of the HD 7700 family, only there are exactly twice as many blocks) . Since each GCN unit has four texture units, the total number of TMUs for the older model is 80 TMUs, and for the younger one - 64 TMUs.

But the number of ROPs and memory controllers in the HD 7870 and HD 7850 is also the same as in the solutions of the youngest line. The number of ROP blocks was left quite high - 32 pieces for both models. The memory bus for boards based on Pitcairn has been cut down to 256-bit, it is assembled from four 64-bit channels. This is not bad for a solution of this level, although it is one and a half times less than in the top line, because the memory bus is traditionally cut down first. It's good that the use of fast GDDR5 memory gave a relatively high bandwidth of 153 GB / s.

Like the rest of the GCN architecture chips, Pitcairn incorporates a 9th generation tesselator block, featuring numerous buffering and caching optimizations, which can significantly improve geometry processing performance. Here is a comparison of the new AMD board with the solution of the previous generation in a synthetic problem, according to which we can assume an increase in the tessellation speed up to four times:

A lot of AMD technologies that have been introduced and improved in the new Radeon HD 7000 video chips are also supported in the same way. Here is an incomplete list of them: PowerTune, ZeroCore, Eyefinity 2.0, HD3D, Steady Video, texture filtering quality improvements, etc. All this is described in more detail above. To add to the list, the Radeon HD 7800 fully supports both the improved MLAA 2.0 anti-aliasing algorithm and supersampling anti-aliasing (SSAA).

As far as gaming performance is concerned, the Radeon HD 7870 is significantly faster than its direct competitor GeForce GTX 570, especially given the latter's 1.25 GB of VRAM (compared to 2 GB for the solutions in question) observed in modern games at high rendering resolutions . The younger Radeon HD 7850 can be compared with the GeForce GTX 560 Ti, and here it can no longer boast of the amount of memory. However, according to AMD's measurements, their new solution is still faster than the competition in most games.

Details: Radeon HD 7700 Series

  • Chip codename: "Cape Verde"
  • Production technology: 28 nm
  • 1.5 billion transistors (less than Barts, which is the basis of the Radeon HD 6800 series)
  • Unified architecture with an array of common processors for streaming processing of multiple types of data: vertices, pixels, and more.
  • Hardware support for DirectX 11.1, including shader model Shader Model 5.0
  • Core clock: up to 1000 MHz (for Radeon HD 7770)
  • 10 GCN Compute Units with 40 SIMD cores for a total of 640 floating point ALUs (Integer and Float formats, IEEE 754 FP32 and FP64 precision support)
  • 40 texture units, with support for trilinear and anisotropic filtering for all texture formats
  • Integrated support for up to six monitors including HDMI 1.4a and DisplayPort 1.2

Radeon HD 7770 Graphics Specifications

  • Core frequency: 1000 MHz
  • Number of universal processors: 640
  • Number of texture units: 40, blending units: 16
  • Memory type: GDDR5
  • Memory capacity: 1 gigabyte
  • Theoretical texture sampling rate: 40.0 gigatexels per second.
  • One CrossFire connector
  • PCI Express 3.0 bus
  • Connectors: DVI Dual Link, HDMI 1.4, two Mini-DisplayPort 1.2
  • Power consumption: 3 to 80 W
  • One 6-pin power connector
  • Dual slot design
  • US MSRP: $159

Radeon HD 7750 Graphics Specifications

  • Core frequency: 800 MHz
  • Number of universal processors: 512
  • Number of texture units: 32, blending units: 16
  • Effective memory frequency: 4500 MHz (4×1125 MHz)
  • Memory type: GDDR5
  • Memory capacity: 1 gigabyte
  • Memory bandwidth: 72 gigabytes per second
  • Theoretical maximum fill rate: 12.8 gigapixels per second.
  • Theoretical texture sampling rate: 25.6 gigatexels per second
  • PCI Express 3.0 bus
  • Connectors: DVI Dual Link, HDMI 1.4, one DisplayPort 1.2
  • Power consumption: 3 to 55 W
  • Does not require additional power
  • Single slot design
  • US MSRP: $109

The low-cost series of video cards based on the GCN architecture differs from the top and middle lines in the second digit in the index: the 9th place was taken by the number 7, as it was before. The Radeon HD 7770 is a more productive solution, but there is also a younger model - the HD 7750. The older board had no direct competitors on the market at the time of release, being located somewhere between the GeForce GTX 560 and GTX 550 Ti, and the younger one is aimed at fighting the GTX 550 Ti. For the HD 7770, a competitor was later announced in the face of the GeForce GTX 560 SE (all NVIDIA solutions are based on older GPUs).

Both considered models of AMD video cards have GDDR5 memory of the same amount of 1 gigabyte. Due to the use of a 128-bit memory bus, 2 GB of memory could be put on them, but this amount of GDDR5 memory will cost too much for their price segment. Therefore, models with such a volume have so far been released, although options with 2 GB of video memory may be released in the future. In the meantime, we decided to leave this volume for the HD 7800.

In terms of other consumer characteristics, the HD 7750 and HD 7770 models are quite different. If the older Radeon HD 7770 has a two-slot design of the cooling system and its cooler is covered with a plastic casing like in older solutions, then the younger HD 7750 looks noticeably simpler, occupying one slot and having a simple cooler. However, most manufacturers still produce boards with their own designs. The power consumption of the new models in this price range is also different, the older one has one 6-pin auxiliary power connector, and the younger one is powered by PCI Express.

Architectural features Radeon HD 7700

The base block of the new architecture is the GCN block, and all GPUs of the series are assembled from them. Each of the available GCN blocks is capable of scheduling and distributing instructions on its own, and one computing unit can execute up to 32 independent instruction streams. Let's look at the block diagram of the Cape Verde chip:

The diagram shows the Radeon HD 7770 GPU (the “stripped down” HD 7750 features a few disconnected units), we see 10 compute units of the GCN architecture. In the case of the junior solution of the Radeon HD 7700 series, it was decided to disable two of them, and the number of blocks became 8. This corresponds to 640 and 512 stream processors. And since each GCN unit has 4 texture units in its composition, the final figure for the number of TMUs for the older model is 40 TMUs, and for the younger one - 32 TMUs.

The number of ROPs and memory controllers in the HD 7770 and HD 7750 is the same, and we decided not to cut the ROPs too much, leaving them at 16 each. But the memory bus at Cape Verde is cut down to 128-bit, which is assembled from two 64-bit channels. In general, this is three times less than in the top series, and we saw another confirmation that the memory bus is traditionally cut down in inexpensive chips first of all. Although the use of fast GDDR5 memory made it possible to leave a relatively high (for such inexpensive solutions) bandwidth of 72 GB / s.

It remains for us to note a rather large amount of L2 cache - as much as 512 kilobytes (compared to 768 KB for a top-end chip - apparently, the L2 cache does not take up too much space on the chip), as well as improvements in geometric performance. Like the top-of-the-line chip, Cape Verde features a 9th generation tesselator featuring multiple buffering and caching optimizations to deliver a notable improvement in geometry processing performance over the Radeon HD 6000 series.

In general, we will not repeat all the information about AMD technologies that have been implemented and improved in the new Radeon HD 7000 video chips (here is a partial list: PowerTune, ZeroCore, Eyefinity 2.0, HD3D, Steady Video, texture filtering quality improvements, etc. .p.), all this is described in more detail above. The HD 7700 series supports all the features listed there, including AMD Eyefinity 2.0 with six monitors and stereo rendering, as well as an improved video decoding and encoding unit.

But what about the most important thing - performance in games? The first estimates of rendering speed can always be made from the manufacturer's presentations. AMD believes that the Radeon HD 7770 is somewhere in the middle between the GeForce GTX 560 and GeForce GTX 550 Ti, respectively, and compares it in its materials with the second competitor model.

But they do not compare the Radeon HD 7750 with anything, simply noting that most modern games are playable on this model at maximum settings in FullHD resolution. However, this is not surprising, since in recent years there have been practically no PC-exclusives, and multi-platform games are much less demanding. So the Radeon HD 7700 series boards are perfect for undemanding users.

Details: Model Radeon HD 7790

  • Chip codename: "Bonaire"
  • Production technology: 28 nm
  • 2.08 billion transistors (more than Cape Verde in the Radeon HD 7700, but less than Pitcairn in the Radeon HD 7800)
  • Unified architecture with an array of common processors for streaming processing of multiple types of data: vertices, pixels, and more.
  • Hardware support for DirectX 11.1, including shader model Shader Model 5.0
  • 128-bit memory bus: two 64-bit wide controllers with GDDR5 memory support
  • Core frequency: 1000 MHz
  • 14 GCN Compute Units of 56 SIMD Cores with a total of 896 Floating Point ALUs (Integer and Float formats, supports IEEE 754 FP32 and FP64 precision)
  • 56 texture units, with support for trilinear and anisotropic filtering for all texture formats
  • 16 ROPs with support for anti-aliasing modes with the possibility of programmable sampling of more than 16 samples per pixel, including with FP16 or FP32 framebuffer format. Peak performance up to 16 samples per clock, and in colorless mode (Z only) - 64 samples per clock

Radeon HD 7790 Graphics Specifications

  • Core frequency: 1000 MHz
  • Number of universal processors: 896
  • Number of texture units: 56, blending units: 16
  • Memory type: GDDR5
  • Memory capacity: 1 gigabyte
  • Memory bandwidth: 96 gigabytes per second
  • Theoretical maximum fill rate: 16.0 gigapixels per second.
  • Theoretical texture sampling rate: 56.0 gigatexels per second.
  • One CrossFire connector
  • PCI Express 3.0 bus
  • Connectors: DVI Dual Link, HDMI 1.4, two Mini-DisplayPort 1.2
  • Power consumption: 3 to 85 W
  • One 6-pin power connector
  • Dual slot design
  • US MSRP: $149

An inexpensive video card model based on a new mid-budget chip differs from the previous top model of the HD 7700 subfamily by the third digit in the index: instead of 7, they put the number 9, which indicates an increase in performance. At the same time, the Radeon HD 7790 index clearly indicates that this is a less productive video card compared to the one step higher line - HD 7800.

However, everything is not so simple here either - it will certainly be able to argue with the younger HD 7850. But the recommended price for the Radeon HD 7790 is set at $149, that is, approximately in the middle between the prices for the HD 7770 and HD 7850. As for the competitor's solutions from the same price segment, the release of the HD 7790 was clearly intended to have something to fight with NVIDIA The GeForce GTX 650 Ti, based on the GK106 chip, sits right between the HD 7770 and HD 7850 in terms of price and speed. But NVIDIA immediately responded to the release of the new board by AMD by launching an overclocked version of the GeForce GTX 650 Ti Boost, which is characterized by greater performance.

This AMD graphics card model has GDDR5 memory with a capacity of only 1 gigabyte. The GPU has a 128-bit memory bus, and theoretically 2 GB could be supplied, but this amount of fast GDDR5 memory is still too expensive for this price segment, and AMD has released a model with a smaller memory, although it may not be enough for some modern games even at low settings and resolutions. However, video cards from partners with 2 GB of video memory are also possible.

Like the models standing next to it in the line, the Radeon HD 7790 has a dual-slot design of the cooling system, which is covered with a plastic casing. Although most manufacturers still release boards with their own cooler design, so the reference one is not that important. Interestingly, the new model's power consumption hasn't increased much compared to the HD 7770, but the improvement in power efficiency was to be expected. By the way, that's why the novelty also has only one 6-pin auxiliary power connector.

architectural features

The new Bonaire GPU, on which the released Radeon HD 7790 is based, belongs to the same Graphics Core Next (GCN) architecture that we have known for a year and a half, but AMD calls it GCN 1.1, hinting at minor changes. In fact, the chip is architecturally almost the same as the previous ones, although there are indeed some minor changes. For example, the new architecture introduced instructions that are useful for heterogeneous architecture (Heterogeneous System Architecture - HSA), support for more simultaneously executing threads, as well as a new version of AMD PowerTune technology, which we will talk about later. But all these changes cannot be called significant, because there is nothing new in the basic blocks and improving their efficiency.

Therefore, we can safely refer to, which carefully describes all the features of the new Graphics Core Next (GCN) architecture, and here we will only repeat the most important characteristics and features of a particular product. All of AMD's latest GPUs offer excellent features and performance in both graphics and non-graphics processing, including mixtures of the two. The new GCN architecture also provides a major simplification of optimization and software development tasks, while maintaining high efficiency.

As you know, the basic block of the architecture is the GCN block, from which all the GPUs of the Southern Islands series are assembled. The GCN computing unit is divided into subsections, each of which works on its own instruction stream. GCN blocks have a dedicated 64 KB local data storage for data exchange or local register stack expansion. Also, the block has a first-level cache memory with the ability to read and write and a full-fledged texture pipeline with sampling and filtering blocks. Each of the existing GCN units is capable of scheduling and distributing commands on its own, and one computing unit can execute several independent instruction streams. Let's look at the block diagram of the new chip:

The Bonaire scheme confirms the goal of the new solution to offer performance between Cape Verde, which has 10 GCN compute units, and Pitcairn, which has 20 GCN units. These two GPUs, released in 2012, differ from each other by almost exactly half, so there was a rather large performance gap in the middle between them, which Bonaire has now filled.

The diagram shows the graphics processor in the form of a Radeon HD 7790, which is a complete solution without cutting any blocks. The chip includes 14 computing units of the GCN architecture, which corresponds to 896 stream processors. Since each GCN has 4 texture units, the total number of TMUs for the new model is 56 TMUs. That is, Bonaire is exactly 1.4 times faster than the Cape Verde chip in terms of the speed of mathematical calculations and texture fetches, provided that the frequency is equal.

But the number of ROP units and memory controllers in Bonaire and Radeon HD 7790 is similar to what we saw in Cape Verde and Radeon HD 7770 - it was decided to leave 16 ROP units, and the memory bus of the new chip is 128-bit, assembled from two 64- bit channels. The small number of ROPs can be the “Achilles heel” of the solution, since the use of fast GDDR5 memory made it possible to provide a relatively high throughput of 96 GB / s, but nothing can be done about ROP performance.

But in the new GPU there are improvements in geometric performance and tessellation speed. Yes, Cape Verde also has a 9th generation tesselator, but Bonaire also doubled the number of geometry blocks, rasterizers and command processors (shown as ACE in the diagram) - now there are two of them. This improvement gives Bonaire the ability to process up to two geometric primitives per clock - just like the more powerful Pitcairn and Tahiti.

As you remember, it was in the Radeon HD 7770 that AMD first took the important psychological threshold of the GPU clock frequency, equal to 1 GHz. So, the HD 7790 also has exactly the same reference frequency of 1 GHz, so the increase in performance compared to the HD 7770 will be justified solely by architectural changes and an increase in the number of execution units.

But the frequency of the video memory of the novelty is much higher. If the HD 7770 had a relatively low memory frequency of 4.5 GHz, then the HD 7790 is equipped with a fast GDDR5 memory operating at 6 GHz, which provides a third more bandwidth. The 33% increase in video memory bandwidth compared to the Radeon HD 7700 sub-family resulted in a clear increase in gaming performance. AMD provides this chart, comparing HD 7790 frame rates with memory running at 4.5 and 6.0 GHz:

The maximum speedup from the increase in memory bandwidth was achieved in games such as StarCraft II and Crysis 2. And on average, a 33% increase in memory bandwidth gives somewhere around 10% increase in average frame rate in a set of modern games. Not a bad indicator, showing that memory bandwidth is quite important in our time, although it is not the only emphasis on performance. Although it is quite possible that with more ROP, Bonaire's speed would be even higher...

It is clear that the average power consumption has increased slightly compared to the HD 7770. If for the old model this value is 80 W, then for the HD 7790 it is 85 W - this is a very small price to pay for a theoretical performance increase of 33-40%! Architectural improvements (PowerTune), the design of a new GPU based on the experience of previous ones, as well as the continuous improvement of the technical process at TSMC - all this led to a small increase in consumption with a significant improvement in speed characteristics.

As for the area of ​​the chip and the number of transistors in Bonaire, the new chip is clearly larger than Cape Verde, but the addition of computational, texture and geometric units could not go unnoticed. According to these parameters, Bonaire is also located approximately in the middle between Cape Verde and Pitcairn. Bonaire contains 2.08 billion transistors in a 160 mm 2 chip, for Cape Verde these figures are 1.5 billion and 123 mm 2 , respectively, and for Pitcairn - 2.8 billion transistors and 212 mm 2 chip area.

Naturally, the new chip supports all AMD technologies that have been introduced and improved in the new Radeon HD 7000 family (their incomplete list: PowerTune, ZeroCore, Eyefinity, HD3D, Steady Video, texture filtering quality improvements, etc.), both all this is described in detail in the article AMD Radeon HD 7970: New single-processor leader. The HD 7790 model supports all the features listed there, including AMD Eyefinity 2.0 with six monitors and stereo rendering, and also has an improved video decoding and encoding unit.

Enhanced PowerTune Technology

Back in 2010, AMD introduced PowerTune technology in its Cayman chip (AMD Radeon HD 6900 series). This GPU was the first to feature dynamic power management called PowerTune. It allowed us to increase the maximum clock speeds for typical applications, while avoiding too much power consumption in specialized stability tests like FurMark. Then the technology was applied to the dual-chip model AMD Radeon HD 6990, which needed it even more for obvious reasons.

The technology received a major update in mid-2012, when automatic frequency increase - Boost - was added to AMD PowerTune. In the AMD Radeon HD 7970 GHz Edition, this algorithm allowed even more performance improvements compared to the regular version of the video card. The PowerTune operation algorithm in video cards without automatic overclocking uses three states: idle (idle), light load mode (low-3D) and full speed. In the HD 7970 GHz, the Boost overclocking mode was added to them. PowerTune serves to stay within the required consumption by switching to a lower load mode when necessary. In this case, the technology sharply reduces the clock frequency values. In practice, such jumps are rare - due to the large gap between the two active modes.

Reducing the GPU clock speed reduces power consumption, but for more effective management voltage needs to be reduced. This is exactly what the Radeon HD 7790 does. The new Bonaire graphics chip has eight states with different frequency and voltage settings, allowing for higher clock speeds than before, while ensuring that the GPU is always running at optimal voltage and frequency. Switching between states is based on GPU load as well as current GPU power consumption.

In the new algorithm, PowerTune does not have to abruptly drop the frequency when the consumption level is exceeded, and along with the frequency, the voltage also decreases. State transitions must be as fast as possible so as not to exceed the consumption limit even for a short time, so Bonaire switches PowerTune states every 10 ms, that is, in every second, the state of the chip changes 100 times.

With such constant change frequencies, third-party applications like MSI Afterburner and GPU-Z will show not instantaneous clock rates, but averages over a period of time - the so-called "effective" frequency. Another interesting innovation is that AMD is opening up new PowerTune settings for third party applications. Partners can also set their own PowerTune settings to help create factory overclocked graphics cards and provide more features beyond AMD's reference values. Truth, different settings PowerTune can lead to the fact that video cards of the same model from different manufacturers will not only have different clock frequencies, but also the algorithm for changing them over time, which makes it difficult to compare under the same conditions.

Sales of video cards of the Radeon HD 7790 model began on the market at the very beginning of April 2013. AMD, together with its partners, organized the release of both motherboards with reference frequencies and factory overclocked solutions. And now, both manufacturers are launching new video cards to the market in much the same way, with various options quickly available from their partners. In fact, partners have released almost more overclocked versions of the HD 7790 than conventional ones, and the graphics chips in them operate at frequencies of about 1075 MHz.

Details: Radeon HD 7990 Model

  • Codename "Malta"
  • Production technology: 28 nm
  • 2 chips with 4.3 billion transistors each
  • Unified architecture with an array of common processors for streaming processing of multiple types of data: vertices, pixels, and more.
  • Hardware support for DirectX 11.1, including shader model Shader Model 5.0
  • Dual 384-bit memory bus: twice six 64-bit wide controllers with GDDR5 memory support
  • GPU frequency: 1000 MHz
  • Twice 32 GCN Compute Units, including 128 SIMD cores, consisting of a total of 4096 floating point ALUs (integer and floating formats, support for FP32 and FP64 precision within the IEEE 754 standard)
  • 2x128 texture units, with support for trilinear and anisotropic filtering for all texture formats
  • 2x32 ROPs with support for anti-aliasing modes with the possibility of programmable sampling of more than 16 samples per pixel, including with FP16 or FP32 framebuffer format. Peak performance up to 64 samples per clock, and in colorless mode (Z only) - 256 samples per clock
  • Integrated support for up to six monitors via HDMI 1.4a and DisplayPort 1.2

Radeon HD 7990 Graphics Specifications

  • Core frequency: 1000 MHz
  • Number of universal processors: 4096
  • Number of texture units: 2x128, blending units: 2x32
  • Effective memory frequency: 6000 MHz (4×1500 MHz)
  • Memory type: GDDR5
  • Memory capacity: 2x3 gigabytes
  • Memory bandwidth: 2x288 gigabytes per second
  • Theoretical maximum fill rate: 64 gigapixels per second.
  • Theoretical texture sampling rate: 256 gigatexels per second.
  • One CrossFire connector
  • PCI Express 3.0 bus
  • Connectors: DVI Dual Link, four Mini-DisplayPort 1.2
  • Power consumption up to 375 W
  • Two 8-pin auxiliary power connectors
  • Dual slot design
  • The recommended price for Russia is 32,999 rubles. (for the USA - $999).

Already in the second generation of AMD video cards, the naming principle for dual-chip models remains unchanged. The top solution based on two most powerful video chips differs from the corresponding class of the previous generation model by the first digit in the index: instead of 6, it received the number 7, indicating a new series. The announced video card differs from the single-chip solution by the third digit, indicating the maximum performance within the generation.

As for the comparison with competitors, for the Radeon HD 7990 model announced today, the main rival is GeForce graphics card GTX 690, released almost a year ago, and it is these two-chip solutions that will have to fight each other. True, NVIDIA also has one more powerful solution, but already based on a single GPU - GeForce GTX Titan, which can also be considered a competitor to the AMD board in question.

The new dual-chip Radeon video card is equipped with GDDR5 memory with a capacity of 3 gigabytes per GPU, which is due to the 384-bit memory bus of the Tahiti chips. Such a volume is quite justified for a product of such a high level, since in some modern gaming applications with maximum settings, enabled anti-aliasing and high resolutions, a smaller amount of memory (2 gigabytes per chip or less) may no longer be enough. And even more so this applies to rendering in stereo mode or on multiple monitors in Eyefinity mode.

It is clear that such a powerful dual-chip graphics card has a massive dual-slot cooling system that is different from traditional AMD card coolers. It features a massive heatsink hidden under a shroud with three large fans running at relatively low speeds. The power consumption of the dual-GPU card is quite high for obvious reasons, and it has two 8-pin power connectors, but at least it's not three, as was the case with non-reference samples based on two Tahiti chips.

Architecture

Since the video card codenamed “Malta” is based on two “Tahiti” GPUs from the Southern Islands family, you can simply refer to, which thoroughly describes all the features of the current Graphics Core Next (GCN) architecture. In the base materials, we repeat only the most important characteristics and features of specific products.

The basic block of the architecture is the GCN block, from which all the GPUs of the series are assembled. The computational unit is divided into subsections, each of which works on its own stream of commands, it has a dedicated local storage for data, a first-level cache memory with the ability to read and write, and a full-fledged texture pipeline with sampling and filtering units. Each of the GCN blocks is capable of scheduling and distributing commands on its own, and one computing block can execute several independent command streams. The Radeon HD 7990 uses two Tahiti chips already known to us:

The graphics processor diagram (there are two of them in the Radeon HD 7990) shows 32 computing units of the GCN architecture, and all of them are active. Previously, it was assumed that for a two-chip solution, some of them would have to be turned off, and even lowered the frequency in order to enter the power consumption of 375 W, but AMD engineers managed to successfully solve this difficult task. Perhaps a special new revision of the Tahiti with lower power consumption has been released, or the chips just pass a very strict selection.

Since each GCN unit has 16 texture units, the number of TMUs is 128 units per chip, which gives a total performance of 256 gigatexels per second, which is very good for a GeForce GTX 690 competitor. The number of ROP units and memory controllers in HD 7990 also did not change compared to its single-chip counterpart, they were left in the amount of 32 and 6 pieces per GPU, respectively. The Radeon HD 7990 has a dual 384-bit memory bus made up of twelve 64-bit channels for a total memory bandwidth of 576 GB/s, another record.

In other respects, the new board supports all modern AMD technologies that have been introduced and improved in the new Radeon HD 7000 video chips: PowerTune, ZeroCore, Eyefinity 2.0, HD3D, Steady Video, improved texture filtering quality, etc. All this is described in detail above in the description of the Radeon HD 7970, and there is simply no point in repeating it.

Cooling system and power consumption

In the case of such serious dual-chip boards, a highly efficient cooling system becomes especially important. If in the case of solutions from partners based on two Tahiti, three-slot solutions were used, and in the case of ASUS ARES II even water cooling, in this case it was necessary to manage with less effort, so a cooler was designed that has a very massive heatsink and three fans with improved acoustic characteristics.

The noise of the cooling system and the provided temperature for GPUs are one of the most important consumer characteristics for any video card, including the top solution designed for enthusiasts. Too loud or inefficient cooling system will be regarded by buyers as a less profitable purchase, other things (roughly) being equal. So AMD took this issue very seriously with the Radeon HD 7990 when compared to other top solutions on the market. Consider the acoustic characteristics of the new system:

The diagram shows the noise level from three different video cards: the Radeon HD 7990 and two competitors: the dual-chip GeForce GTX 690 and the single-chip GTX Titan from NVIDIA. Moreover, the noise was measured in different conditions - in idle mode (System Idle) and at maximum load using Furmark. According to AMD's figures, even the single-chip Titan falls short of their new product in terms of cooler noise, not to mention dual-chip GTX 690, which is the loudest in this comparison.

But isn't it to the detriment of the GPU temperature that such impressive acoustic performance was achieved? The following chart shows GPU temperatures measured on AMD's Radeon HD 7990 and the same two competitors. This time, AMD used only the high load mode when testing at Furmark.

And again, the “cunning” coordinate axis is used with the origin not from zero value. The real difference between 80 and 82 degrees for the Radeon HD 7990 and GTX Titan will be virtually imperceptible, although the 87 degrees for the GTX 690 clearly stand out for the worse. Again, we note that all of these tests were carried out by the interested party and are subject to independent verification.

In terms of power consumption, the dual-chip solution is nothing new, but support for the previously announced ZeroCore Power technology is also here. This technology helps to achieve significantly lower power consumption in "deep idle" (or "sleep") mode with the display device turned off. In this mode, the idle GPU is almost completely disabled, and consumes less than 5% of the power of the full mode, turning off most of the functional blocks. And in the case of a two-chip board, it is even more important that in the CrossFire system, when drawing a two-dimensional interface of the operating system, all GPUs, except for the main one, do not work at all. That is, in the case of the Radeon HD 7990, one of the chips in 2D mode will be immersed in a deep sleep with minimal power consumption, and the second one can “fall asleep” in the PC deep idle mode.