Open AccessJournal Article
Improving Bandwidth Utilization using Eager Writeback
TL;DR: This paper presents a technique called Eager Writeback, which avoids performance loss due to clustered memory tra‐c patterns found in streaming and graphics applications by speculatively "cleaning" dirty cache lines prior to their eviction.
read more
Abstract: Cache memories have been incorporated into almost all modern, general-purpose microprocessors. To maintain data consistency between cache structures and the rest of the memory systems, most of thesecachesemployeithera writebackora write-throughstrategytodealwithstoreoperations. Writethroughcachespropagatedatatomoredistantmemorylevelsatthetimeeachstoreoccurs,producinga signiflcantbustra‐coverheadtomaintainconsistencybetweenthememoryhierarchylevels. Writeback caches can signiflcantly reduce the bandwidth requirements between caches and memory by marking cachelinesas dirtywhenstoresareprocessedandwritingthoselinestothememorysystemonlywhen thatdirtylineisevicted. Writebackcachesworkwellformanyapplications;however,forapplications that experience signiflcant numbers of cache misses over a very short interval due to streaming data, writebackcachedesignscandegradeoverallsystemperformancebyclusteringbusactivitywhendirty lines contend with data being fetched into the cache. In this paper we present a technique called Eager Writeback, which avoids performance loss due to clustered memory tra‐c patterns found in streaming and graphics applications by speculatively "cleaning" dirty cache lines prior to their eviction. Eager Writeback can be viewed as a compromise betweenwrite-throughandwritebackpolicies,inwhichdirtylinesarewrittenlaterthanwrite-through, butpriortowriteback. Wewillshowthatthisapproachcanefiectivelyavoidtheperformancedegradation caused by clustering bus tra‐c in a writeback approach, while incurring very minimal additional memory tra‐c.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Patent
Method and apparatus for memory management
Yasuko Eckert,Syed Ali Jafri,Srilatha Manne,Gabriel H. Loh +3 more
- 28 Aug 2013
TL;DR: In this paper, a method of managing cache memory includes identifying a group of cache lines in a cache memory, based on a correlation between the cache lines, and tracking evictions of cache line in the group from the cache memory.
19
Patent
Early write-back of modified data in a cache memory
Syed Ali Jafri,Yasuko Eckert,Srilatha Manne +2 more
- 27 Aug 2013
TL;DR: In this paper, a set of cache lines with an index associated with the modified data is identified, and modified data are stored in the set in a cache line with an eviction priority that is at least as high as an eviction prioritization.
10
•Dissertation
Soft error mitigation techniques for future chip multiprocessors
Gaurang Upasani
- 01 Feb 2016
TL;DR: This work proposes a lightweight and scalable soft error mitigation solution that protects the whole processor (logic, flip-flop, latches and memory arrays) incurring minimum overheads and shows how to use acoustic wave detectors for detecting and locating particle strikes.
7
Patent
Multi-level system memory with near memory scrubbing based on predicted far memory idle time
Zhe Wang,Christopher B. Wilkerson,Zeshan A. Chishti +2 more
- 27 Jun 2016
TL;DR: In this article, an apparatus is described that includes a memory controller to interface to a multi-level system memory and an idle time predictor circuitry to predict idle times of a lower level of the system memory.
4
Patent
Writing data to system memory in a data processing system in which cache line states are tracked
Brian C. Grayson,Wichaya T. Changwatchai +1 more
- 10 Jun 2011
TL;DR: In this article, the cache line is one of a plurality of state indicators associated with a corresponding cache line, and if a threshold is exceeded, data stored in a selected cache line are written to the associated section of the region of system memory, and a clean indication is stored in the state indicator corresponding to the cache lines.
4
References
•Book
Computer Architecture: A Quantitative Approach
John L. Hennessy,David A. Patterson +1 more
- 01 Dec 1989
TL;DR: This best-selling title, considered for over a decade to be essential reading for every serious student and practitioner of computer design, has been updated throughout to address the most important trends facing computer designers today.
12.6K
The microarchitecture of the Pentium 4 processor
G. Hinton
- 01 Jan 2001
TL;DR: The main features and functions of the NetBurst microarchitecture of Intel’s new flagship Pentium 4 processor are described, including its new form of instruction cache called the Execution Trace Cache.
671
Implementing streaming SIMD extensions on the Pentium III processor
TL;DR: The streaming SIMD extensions (SSE) provides a rich set of instructions to meet the requirements of demanding multimedia and Internet applications and makes a number of design trade-offs to satisfy tight die size constraints and attain frequency goals.
209
Instruction issue logic for high-performance, interruptable pipelined processors
Gurindar S. Sohi,Sriram Vajapeyam +1 more
- 01 Jun 1987
TL;DR: This paper presents a design for instruction issue logic that resolves dependencies dynamically and, at the same time, guarantees a precise state of the machine, without a significant hardware overhead.
Design issues and tradeoffs for write buffers
Kevin Skadron,Douglas W. Clark +1 more
- 01 Feb 1997
TL;DR: This paper uses instruction level simulation of SPEC92 benchmarks to investigate how different write buffer depths, retirement policies, and load-hazard policies affect these three types of write-buffer stalls.
78