Re: [Gems-users] About memory latency for CMP


Date: Sat, 29 Jan 2011 19:52:24 -0600
From: Byn Choi <bynchoi1@xxxxxxxxxxxx>
Subject: Re: [Gems-users] About memory latency for CMP

On Jan 29, 2011, at 6:05 PM, junli gu wrote:

Hey all:

I am simulating a 16-core CMP using Simics+Ruby. First, my understanding is that the latency values are all in Ruby cycles, where 1 Ruby cycle equals 2 CPU cycles. I am simulating the 16-core CMP with the following default values:

That depends on the SIMICS_RUBY_MULTIPLIER parameter, which defaults to 4, meaning Simics is advanced 4 processor cycles for every Ruby cycle. I personally use 1, since the processor modeled by Simics is a very simple single-issue, in-order, 5-stage processor.
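
For concreteness, here is a tiny Python sketch of the cycle conversion; the parameter name comes from the Ruby config, and the arithmetic is simply a multiplication by the multiplier:

    # Relationship between Ruby cycles and Simics (processor) cycles.
    # SIMICS_RUBY_MULTIPLIER is how many processor cycles Simics is
    # advanced per Ruby cycle; 4 is the GEMS default mentioned above.
    SIMICS_RUBY_MULTIPLIER = 4

    def ruby_to_cpu_cycles(ruby_cycles, multiplier=SIMICS_RUBY_MULTIPLIER):
        """Convert a latency in Ruby cycles to processor cycles."""
        return ruby_cycles * multiplier

    print(ruby_to_cpu_cycles(35))      # 140 with the default multiplier of 4
    print(ruby_to_cpu_cycles(35, 1))   # 35 when the multiplier is set to 1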


NULL_LATENCY: 1                      ; Shortest possible latency
ISSUE_LATENCY: 2                     ; Latency to send out a request to the interconnect
CACHE_LATENCY: 1                     ; Latency to source data from a cache to the interconnect
MEMORY_LATENCY: 35                   ; Latency to source data from a memory module to the interconnect
DIRECTORY_LATENCY: 1                 ; Latency of directory lookup
NETWORK_LINK_LATENCY: 1              ; Latency for a single node-to-node hop in the interconnect
SEQUENCER_TO_CONTROLLER_LATENCY: 8   ; Latency added by sequencer to requests to cache controller
TRANSITIONS_PER_RUBY_CYCLE: 32       ; Maximum transitions per cycle for all SLICC state machines
SEQUENCER_OUTSTANDING_REQUESTS: 20   ; Number of outstanding requests per sequencer

My questions are:

A) I want to confirm the L2 cache latency and the memory latency. They are supposed to be 10 and 35 Ruby cycles, which would mean 20 and 70 CPU cycles. Am I right?

This depends on how far the L2 bank is located with respect to the requestor. The latency will vary with the number of hops and the number of routers that the request has to go through.
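
To make that concrete, here is a rough back-of-the-envelope model in Python that adds up the main components of an L2 hit using the parameter values above. It is only a sketch of the dominant terms; the real protocol has additional stages and overlap that are not modeled here:

    # Back-of-the-envelope estimate of an L2 hit latency in Ruby cycles,
    # using the parameter values from the config above.
    SEQUENCER_TO_CONTROLLER_LATENCY = 8
    ISSUE_LATENCY = 2
    NETWORK_LINK_LATENCY = 1
    CACHE_LATENCY = 1

    def l2_hit_latency(hops_to_bank):
        """Request to a remote L2 bank plus the data response back."""
        request  = (SEQUENCER_TO_CONTROLLER_LATENCY + ISSUE_LATENCY
                    + hops_to_bank * NETWORK_LINK_LATENCY)
        response = CACHE_LATENCY + hops_to_bank * NETWORK_LINK_LATENCY
        return request + response

    # A bank one hop away vs. a bank on the far corner of a 4x4 mesh (6 hops):
    print(l2_hit_latency(1))   # 13 Ruby cycles
    print(l2_hit_latency(6))   # 23 Ruby cycles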


B) Are these numbers realistic? That is, do they match the ones in real products?

The following paper has detailed latency numbers from the Intel Nehalem and AMD Shanghai chips.

Comparing Cache Architectures and Coherency Protocols on x86-64 Multicore SMP Systems (MICRO'09)


C) For larger core counts like 16 or even 32 cores, how should these numbers change? I guess that with more cores, the interconnect latency and memory latency will also increase? Also, I am not sure whether NETWORK_LINK_LATENCY: 1 is too small.

The per-hop interconnect latency and the memory latency (memory lookup time) should remain unchanged here. Again, as mentioned in A), the overall (average) latency would increase because the interconnect diameter grows with the number of cores.
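
As a rough illustration, the Python sketch below computes the average hop count in a k x k mesh, assuming dimension-ordered (XY) routing so the hop count equals the Manhattan distance. The mesh shape and routing policy are assumptions for the example, not something Ruby fixes; the point is only that average hop count grows with core count even though NETWORK_LINK_LATENCY per hop stays the same:

    from itertools import product

    # Average hop count between uniformly chosen (source, destination)
    # tile pairs in a k x k mesh, with hop count = Manhattan distance.
    def average_hops(k):
        tiles = list(product(range(k), repeat=2))
        dists = [abs(x1 - x2) + abs(y1 - y2)
                 for (x1, y1), (x2, y2) in product(tiles, repeat=2)]
        return sum(dists) / len(dists)

    print(average_hops(4))   # 2.5 average hops for a 16-tile (4x4) mesh
    print(average_hops(8))   # 5.25 average hops for a 64-tile (8x8) mesh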

Byn


Thank you in advance!

--
************************************************
Junli Gu--谷俊丽
Coordinated Science Lab
University of Illinois at Urbana-Champaign
************************************************
_______________________________________________
Gems-users mailing list
Gems-users@xxxxxxxxxxx
https://lists.cs.wisc.edu/mailman/listinfo/gems-users
Use Google to search the GEMS Users mailing list by adding "site:https://lists.cs.wisc.edu/archive/gems-users/" to your search.


---
Byn Choi
Ph.D. Candidate in Computer Science
University of Illinois, Urbana-Champaign
