Fully Buffered DIMM

Papers

Patent

Fully buffered DIMM variable read latency

Muthukumar P. Swaminathan¹, Tessil Thomas, Pete Vogt

30 Jun 2005

TL;DR: A memory controller may include read latency logic that identifies received read data and matches it with read commands to the memory devices, based on values indicative of the read latency for the devices.

Abstract: Memory control that accesses memory devices having different read latencies is described. In one embodiment, a memory controller may include read latency logic to identify and match received read data with read commands to the memory devices based on values indicative of the read latency for the memory devices. In another embodiment, the memories may include read delay control to insert an amount of delay into the time a memory device takes in responding to a read command.

106 citations

Proceedings Article • DOI: 10.1109/HPCA.2007.346190

Fully-Buffered DIMM Memory Architectures: Understanding Mechanisms, Overheads and Scaling

Brinda Ganesh¹, Aamer Jaleel², David T. Wang¹, Bruce Jacob¹

University of Maryland, College Park¹, Intel²

10 Feb 2007

TL;DR: In general, FBDIMM performance is similar to that of DDRx systems, and provides better performance characteristics at higher utilization, making it a relatively inexpensive mechanism for scaling capacity at higher bandwidth requirements, and the mechanism is also largely insensitive to scheduling policies, provided certain ground rules are obeyed.

Abstract: Performance gains in memory have traditionally been obtained by increasing memory bus widths and speeds. The diminishing returns of such techniques have led to the proposal of an alternate architecture, the fully-buffered DIMM. This new standard replaces the conventional memory bus with a narrow, high-speed interface between the memory controller and the DIMMs. This paper examines how traditional DDRx based memory controller policies for scheduling and row buffer management perform on a fully-buffered DIMM memory architecture. The split-bus architecture used by FBDIMM systems results in an average improvement of 7% in latency and 10% in bandwidth at higher utilizations. On the other hand, at lower utilizations, the increased cost of serialization results in a degradation in latency and bandwidth of 25% and 10%, respectively. The split-bus architecture also makes the system performance sensitive to the ratio of read and write traffic in the workload. In larger configurations, we found that the FBDIMM system performance was more sensitive to usage of the FBDIMM links than to DRAM bank availability. In general, FBDIMM performance is similar to that of DDRx systems, and provides better performance characteristics at higher utilization, making it a relatively inexpensive mechanism for scaling capacity at higher bandwidth requirements. The mechanism is also largely insensitive to scheduling policies, provided certain ground rules are obeyed.

102 citations

Proceedings Article • DOI: 10.1145/1250662.1250701

Thermal modeling and management of DRAM memory systems

Jiang Lin¹, Zheng Hongzhong², Zhichun Zhu², Howard S. David³, Zhao Zhang¹

Iowa State University¹, University of Illinois at Chicago², Intel³

9 Jun 2007

TL;DR: This paper investigates a new approach that controls memory thermal issues at the source of memory activity, the processor. Compared with shutting down memory abruptly, it smooths program execution and therefore improves overall system performance and power efficiency.

Abstract: With increasing speed and power density, high-performance memories, including FB-DIMM (Fully Buffered DIMM) and DDR2 DRAM, now begin to require dynamic thermal management (DTM), as processors and hard drives did. The DTM of memories, nevertheless, is different in that it should take the processor performance and power consumption into consideration. Existing schemes have ignored that. In this study, we investigate a new approach that controls the memory thermal issues from the source generating memory activities: the processor. It will smooth the program execution when compared with shutting down memory abruptly, and therefore improve the overall system performance and power efficiency. For multicore systems, we propose two schemes called adaptive core gating and coordinated DVFS. The first scheme activates clock gating on selected processor cores, and the second one scales down the frequency and voltage levels of processor cores when the memory is about to overheat. They can successfully control the memory activities and handle thermal emergencies. More importantly, they improve performance significantly under the given thermal envelope. Our simulation results show that adaptive core gating improves performance by up to 23.3% (16.3% on average) on a four-core system with FB-DIMM when compared with DRAM thermal shutdown, and coordinated DVFS with control-theoretic methods improves the performance by up to 18.5% (8.3% on average).

75 citations

Fully-Buffered DIMM Technology Moves Enterprise Platforms to the Next Level

Jon Haas

1 Jan 2005

62 citations

Proceedings Article • DOI: 10.1109/ISPASS.2007.363740

DRAM-Level Prefetching for Fully-Buffered DIMM: Design, Performance and Power Saving

Jiang Lin¹, Zheng Hongzhong², Zhichun Zhu², Zhao Zhang¹, Howard S. David³

Iowa State University¹, University of Illinois at Chicago², Intel³

25 Apr 2007

TL;DR: An AMB prefetching method is proposed that prefetches memory blocks from DRAM chips to AMBs but does not consume the crucial channel bandwidth, and it is found that the performance gain comes from the reduction of idle memory latency and the improvement of channel bandwidth utilization.

Abstract: We have studied DRAM-level prefetching for the fully buffered DIMM (FB-DIMM) designed for multi-core processors. FB-DIMM has a unique two-level interconnect structure, with FB-DIMM channels at the first level connecting the memory controller and advanced memory buffers (AMBs), and DDR2 buses at the second level connecting the AMBs with DRAM chips. We propose an AMB prefetching method that prefetches memory blocks from DRAM chips to AMBs. It utilizes the redundant bandwidth between the DRAM chips and AMBs but does not consume the crucial channel bandwidth. The proposed method fetches K memory blocks of L2 cache block size around the demanded block, where K is a small value ranging from two to eight. The method may also reduce the DRAM power consumption by merging some DRAM precharges and activations. Our cycle-accurate simulation shows that the average performance improvement is 16% for single-core and multi-core workloads constructed from memory-intensive SPEC2000 programs with software cache prefetching enabled, and no workload has negative speedup. We have found that the performance gain comes from the reduction of idle memory latency and the improvement of channel bandwidth utilization. We have also found that there is only a small overlap between the performance gains from the AMB prefetching and the software cache prefetching. The average estimated power saving is 15%.

33 citations

Year	Papers
2016	1
2011	1
2009	1
2008	5
2007	5
2006	2
