search technology reviews, news, features, group tests
Popular Searches:   video , dvd , windows
 |  Register
 |  Newsletters  | 
Sitemap  |  RSS
RSS
Sunday November 22, 2009 11:45 AM AEST
Skip Navigation LinksPC Authority > News > An analysis of CPUs and GPUs
An analysis of CPUs and GPUs
NEWS

An analysis of CPUs and GPUs

by The Inquirer  on Nov 3, 2009
Tags: cpu | gpu | hardware | intel | amd
"This article is SO BADLY WRITTEN it should be pulled and redone immediately. Here I go: Para 1. The "sharing the die 'til we die" quip is irrelevant, and confusing. To me, it hints at a future ..."
 
Will AMD blow its lead again in graphics? An analysis of the current state of play in the race for processor and graphics superiority

Earlier, when discussing the possible integration of CPUs and GPUs on the system level, at least for high end systems, before the chip-level "sharing the die 'til we die" CPU-GPU on-die marriage happens, we looked into the most efficient ways of executing that integration.

Obviously, PCI Express, even in its v2 and upcoming v3 versions, has too much latency and protocol baggage to most efficiently link CPUs and GPUs for working together on common tasks. Merely seamless coherent memory access is usually a big problem.

Now, AMD as a company does have one huge - albeit very temporary - advantage here right now: all the pieces of the puzzle are in place for quite a while. Namely, a reasonably fast (but could be faster, please) CPU core in those 6-core Istanbul Opterons and their coming desktop equivalents, complemented by a very fast HyperTransport 3 low latency, high speed - up to 25.6GBps per link for version 3.1 - interconnect protocol between CPUs and I/O, as well as performance-leading GPUs in the ATI Radeon R800 family. Both the CPU core and HyperTransport are stable, proven old-timers, with stuff like FPGA accelerators and ultrafast supercomputing cluster network links already being hosted on the Opteron HyperTransport combo for years. There's even the HTX slot spec for I/O cards hitting directly into system memory.

As mentioned before, Nvidia lacks both the CPU and interconnect portions here. If it had an Alpha license that would have solved both issues, and as a bonus, propelled it to the top of the CPU performance pack fairly quickly, assuming it had a reliable CPU fab partner. A hint - IBM Microelectronics and Global Foundries did share technology that was once used by IBM Micro to fab the last batch of Alphas some 8 years ago.

amdatigpu

Now, if AMD, for instance, released a HyperTransport flavour of the R800 family chip that adds an MMU or at least a very fast DMA (Direct Memory Access) mechanism to access the main memory and CPU resources directly via HTX, and maybe had another bunch of HyperTransport links to add other GPUs the same way, it could - on top of the existing fast local GDDR5 memory - have far faster, nearly seamless access to the rest of the system resources and not just system memory. Even wrapping it as a kind of co-processor to the CPU could work, in the same fashion as the old 80287 was to the 80286 some 25 years ago.

With the HTX scalability and a sufficient number of links, each CPU could drive a bunch of GPUs, scalable only by the total system bandwidth required. Remember, with each new GPU here, you add to the total interconnect bandwidth and total memory bandwidth, as you can see on the graph. Due to less contention between them, not only CrossFire gaming framerates would scale better, but so would OpenGL engineering apps as well as, of course, the computational codes.

All this looks beautiful, but you may ask, why temporary? Well, Intel did work hard to use the base of the former Alpha EV7 / EV8 interconnect to create its Quick Path Interconnect. While it is still far from HyperTransport's level of maturity, and even without any slot spec for card expansion, QPI has Intel and its market share behind it.

Then, Intel will - one day soon, hopefully - have Larrabee out there, up and running. If the original early Larrabee focus on workstation and computational graphics is kept, then so much so Intel has more drive to enable even the initial Larrabees to talk to their CPU brethren directly via QPI, not just over the slower PCI. On top of that, since Larrabee has an X86 ISA front-end anyway, it'll be that much easier to treat it as a co-processor right away, and offload the code portions that it can run on its own.

I think that Intel will have all these running, one way or another, in about a year's time. Imagine the same graph as above, but with Intel Xeons and Larrabees instead. Whether Intel decides to go with the QPI link on Larrabee from the start or later, it's Intel's decision, but nothing prevents it technically. In that sense, before it risks being pushed into the corner again later, AMD should use the chance while it has it. When Intel gets it, it surely will.

 

theinquirer.net (c) 2009 Incisive Media
Email a Friend Email this
Print Page Print this
Tweet This Tweet this
Feedback Send us your tips


Ads by Google

Comments: 1
Thoughts on this article? Add a comment below.
totoaus
Nov 3, 2009 8:19 PM
This article is SO BADLY WRITTEN it should be pulled and redone immediately. Here I go:
Para 1. The "sharing the die 'til we die" quip is irrelevant, and confusing. To me, it hints at a future technology where IC chips will no longer be on a die. But it even starts badly with "earlier...", what came before?
para 2. No it isn't obvious, latency can be concelaed, for example with enough GPU's comms sould be occuring with one while set up for another happens. The final sentence is meaniningless. "Merely coherent memory...?"
para 3. I'll concede a pass on this one, but barely.
para 4. I doubt many people know much about Alpha, there is certainly no explanation of what architectural feature of this chip is relevant to the article. I knew the Alpha well and cannot see the point.
para 5. "amdatigpu"? A heading gone wrong perhaps, or a failure to speak English.
para 6 & 7. needs graphics and a less hasty attempt to explain what the author has in mind. SLOW DOWN, use the words necessary to express your point and think before you write.
para 8 & 9. A dramatic shift from AMD to Intel's future technology, again needing graphics and more depth, plus probably a heading to handle this segue. Or do yu know what a segue is, I only learned last year, so you can be forgiven that, but only that.
This could be a very good article, but for now it I would wipe my butt with the paper it was written on.




Comment made about the PC Authority article:
An analysis of CPUs and GPUs?
Will AMD blow its lead again in graphics? An analysis of the current state of play in the race for processor and graphics superiority

What do you think? Join the discussion.
Login or register to submit a comment.
 

Top Stories

Box battle: Telstra takes on TiVo and Foxtel with T-Box trial in Melbourne
It's not quite Foxtel IQ and it's isn't TiVo either. The T-Box lets Telstra users watch movies and TV from the Bigpond site, as well as record and watch digital TV
 
5 More Free Linux Apps You Can't Do Without
More digital Swiss Army knife software, including Linux utilities and tools that are so useful you won't know how you ever did without them
 
Microsoft delivers Office 2010 public beta
Vendor details editions for Office 2010 along with application virtualisation for testing.
 


 
Intel
 
 
LogMeIn
 
 
Amazing Dell Coupons now available
 
Discover Apple