ABXZone Computer  Forums



Welcome to the ABXZone Computer Forums forums.

You are currently viewing our boards as a guest which gives you limited access to view most discussions and access our other features. By joining our free community you will have access to post topics, communicate privately with other members (PM), respond to polls, upload content and access many other special features. Registration is fast, simple and absolutely free so please, join our community today!

If you have any problems with the registration process or your account login, please contact contact us.

Reply
 
LinkBack Thread Tools Display Modes
Old 05-10-2007, 04:42 PM   #1
Registered User
 
shaihulud's Avatar
 
Join Date: Jan 2005
Posts: 1,239
AMD's Optimization Guide for 10h


I have been reading on the new processor from AMD. One of my biggest questions have been answered, it seems. I was curious if the SSE4 instruction set was to be supported and apparently it is. In the optimization guide it is referenced as SSE4a. I am wondering if this is Penryn's set of SSE4 instructions, and SSE4b will be the latter to come as we all know from Intel's new architecture?

Other questions that I already had answers to, such as the L3 cache being a victim cache, was true also. However, this was easy due to the fact of Athlon XP/64 cache architecture. This actually disappoints me a bit, and even with some of the improvements I do not think AMD will ever meet the ability Intel's cache architecture. But there are some new direct load to L1 features for 10h that makes me all so curious.

Anyways here is the link to the processor's optimization guide: http://www.amd.com/us-en/assets/cont...docs/40546.pdf

(Offline)   Reply With Quote
Old 05-10-2007, 08:50 PM   #2
Remembering TQ
 
k0NG0's Avatar
 
Join Date: Mar 2001
Location: Sweden
Posts: 13,623
Don't they use the L2 as a victim cache for the L1 (or at least L1D) in K8, too?

Intel still uses an inclusive cache hierarchy, from what I know, which has different characteristics when it comes to misses. I believe AMD has traditionally favored an exclusive hierarchy, although my own memory certainly could use refreshing, being DRAM and all.
__________________

Use Firefox - "the one that blocks all the schmutz"
Feeling multicore elation? Remember this correlation: Amdahl's Law.
(Offline)   Reply With Quote
Old 05-10-2007, 11:13 PM   #3
Registered User
 
shaihulud's Avatar
 
Join Date: Jan 2005
Posts: 1,239
For the K8 and XP, the victim buffer is before L2. This way if an evict is sent out of L1 it is not fully lost if a miss occurs in L1. Without the victim buffer the total cycles will still be of L1+L2. It checks the victim buffer first if present then it will move, in a fast manner, back to L1. If not in victim then it will most likely be present in L2, and then sent to L1. If not in L2 then it will be a fetch, and many cycles would have been wasted.

So now, with the new processor, there is a victim buffer between the individual cores L1 and L2, and then an L3 victim buffer that all the cores victim and copy backs from L2 go to.
(Offline)   Reply With Quote
Reply


Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On



Powered by vBulletin® Copyright ©2000 - 2008, Jelsoft Enterprises Ltd.
Search Engine Optimization by vBSEO 3.0.1
vBulletin Skin developed by: vBStyles.com