@ Phoenix
1) Ideal vs. Mesh
The CPI gap between ideal and mesh is not so big.
I’m afraid even if we optimize mapreduce on the mesh network,
It may not represent much to the performance.
2) miss rate of each phase
I used 64K 4way L1 cache and 1M 4way L2 cache(shared) for each node.
Garnet does not show the actual miss hit/rate, so we should compare the misses per 1000 inst.
3) injection rate of each node with time.
The result files are attached.
[msg.png] represents the injection rate of coherence messages to the processing time.
[data.png] represents the injection rate of coherence data.
We can see the diagonal lines as expected,
But, at the same time, some nodes are heavily used than other.
@ Booksim
1) injection voq
There was a little problem at the last result.
I fixed the program and the new result is attached.
[3000cycle.xlsx]
Voq is 3~4 times better than no-voq when the injection rate is very high.
2) test on 64 nodes
I also tested 8x8 mesh, 64 ring, and 4ary 3 fly.
For fly topology, voq is 8~9 times better than no-voq.
for mesh and torus, the simulation cannot be done to the high injection rate,
we cannot see the difference between voq and no-voq.
3) A New Scalable and Cost-Effective Congestion Management Strategy for Lossless Multistage Interconnection Networks (HPCA 05)
Now I am trying to re-implement the system in the paper.
Thanks.
minjeong
'Lab.work' 카테고리의 다른 글
Meeting 12월 29일 (0) | 2009.12.29 |
---|---|
11월 11일 Meeting (0) | 2009.11.11 |
11월 4일 Meeting (0) | 2009.11.04 |
Weekly Report (0) | 2009.11.04 |
10월 28일 Meeting (0) | 2009.10.28 |