Re: [Gems-users] State transitions per cycle and Instruction Profiling

Mailing List Archives Public Access	UW Madison Computer Sciences Department Computer Systems Lab

Date:	Tue, 28 Aug 2007 11:19:14 -0500 (CDT)
From:	Mike Marty <mikem@xxxxxxxxxxx>
Subject:	Re: [Gems-users] State transitions per cycle and Instruction Profiling

zz_recycle* is used as a crude hack so that the entire incoming requestqueue does not block when one request for a given cache block isoutstanding (in a transient state). What recycle does is remove theblocked request from the incoming queue and re-enqueue it towards the end.This allows other requests that are not blocked to be handled. If youwant to model more realistic hardware to handle this functionality, youcould do so (i.e., have a seperate holding buffer for blocked requests).The reason why zz_recycle* and *TRANSITIONS_PER_RUBY_CYCLE don't play wellis that real hardware can have more efficient wakeup logic on blockedrequests.

In the past, I have limited the number of snoops/cycle by adding thenotion of "busy banks". There are many ways to do this. One way is tobuild some kind of TimerTable structure used by each controller instance(one controller instance per cache bank). When a snoop occurs, you dosomething so that getState() returns a BUSY state for _all_ addresses.Then, when the bank is no longer busy X cycles later, a wakeup occurswhich clears the global BUSY state.

Yes, it is a good idea to add your own profiling logic especially whenplaying with enabling/disabling fast path. For a fast-path hit, arequest will not reach the mandatory queue.


--Mike

Hi,

I have a couple of questions about Ruby:

I've seen that the variable Lx_CACHE_TRANSITIONS_PER_RUBY_CYCLE is set to32 and that is recommended to set it higher if it's a protocol that useszz_recycle... actions. I would like to set it to a more realistic value(smaller) to limit the number of snoops/cycle but I don't see why this isnot desirable if I use the zz_recycle... actions.

The second question is about profiling instructions. I already have thevalues for the data cache with REMOVE_SINGLE_CYCLE_DCACHE_FAST_PATH=truebut it seems that for instructions the profiler only takes into accountthe misses and I allways get a 100% miss rate for the instruction cache.That's why I added a call to the counter in the doRequest function of theSequencer. The call is activated when (hit && request.getType() ==CacheRequestType_IFETCH) == true. If I understood well this requests nevergo to the mandatory queue. Do you think it's correct?


Thank you for your help!

Enric






____________________________________________________________________________________

Sé un Mejor Amante del Cine¿Quieres saber cómo? ¡Deja que otras personas te ayuden!

http://advision.webevents.yahoo.com/reto/entretenimiento.html

[← Prev in Thread]	Current Thread	[Next in Thread→]
[Gems-users] State transitions per cycle and Instruction Profiling, Enric Herrero Re: [Gems-users] State transitions per cycle and Instruction Profiling, Mike Marty <=

Previous by Date:	Re: [Gems-users] A question on event scheduling, Mike Marty
Next by Date:	Re: [Gems-users] A question on event scheduling, Niket Agarwal (niketa@xxxxxxxxxxxxx)
Previous by Thread:	[Gems-users] State transitions per cycle and Instruction Profiling, Enric Herrero
Next by Thread:	[Gems-users] The distribution of "total_misses" in ruby stats output, Lide Duan
Indexes:	[Date] [Thread]

Mailing List Archives

Public Access

Re: [Gems-users] State transitions per cycle and Instruction Profiling