Efficient trace-driven simulation method for cache performance analysis

doi:10.1145/98457.98497

Proceedings Article10.1145/98457.98497

Efficient trace-driven simulation method for cache performance analysis

Wen-Hann Wang, +1 more

- 01 Apr 1990

- Vol. 18, Iss: 1, pp 27-36

49

TL;DR: This work reduces the program traces to the extent that exact performance can still be obtained from the reduced traces and devise an algorithm that can produce performance results for a variety of metrics for a large number of set-association write-back caches in just a single simulation run.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1145/254180.254184

Trace-driven memory simulation: a survey

Richard Uhlig, +1 more

- 01 Jun 1997

- ACM Computing Surveys

TL;DR: A survey and analysis of trace-driven memory simulation tools can be found in this article, where the authors discuss the strengths and weaknesses of different approaches and show that no single method is best when all criteria, including accuracy, speed, memory, flexibility, portability, expense, and ease of use are considered.

...read moreread less

332

Proceedings Article•10.1145/1024393.1024415

Dynamic tracking of page miss ratio curve for memory management

Pin Zhou, +5 more

- 07 Oct 2004

TL;DR: The real system experiments on Linux with applications including Apache Web Server show that the MRC-directed memory allocation can speed up the applications' execution/response time by up to a factor of 5.86 and reduce the number of page faults byUp to 63.1%.

...read moreread less

269

•Journal Article•10.1109/12.286300

A comparison of trace-sampling techniques for multi-megabyte caches

R. E. Kessler, +2 more

- 01 Jun 1994

- IEEE Transactions on Computers

TL;DR: The paper compares the trace-sampling techniques of set sampling and time sampling using the multi-billion reference traces of A.A. Borg et al. (1990) and applies both techniques to multi-megabyte caches, where sampling is most valuable, to find that set sampling meets the 10% sampling goal, while time sampling does not.

...read moreread less

151

Journal Article•10.1016/J.JSS.2006.07.021

Quantifying software performance, reliability and security

Vibhu Saujanya Sharma, +1 more

- 01 Apr 2007

- Journal of Systems and Software

TL;DR: In this paper, an architecture-based unified hierarchical model for software performance, reliability, security and cache behavior prediction is proposed, which employs discrete time Markov chains (DTMCs) to model software systems and provides expressions for predicting the overall behavior of the system based on its architecture as well as the characteristics of individual components.

...read moreread less

112

Proceedings Article•10.1145/1006209.1006221

PB-LRU: a self-tuning power aware storage cache replacement algorithm for conserving disk energy

Qingbo Zhu, +2 more

- 26 Jun 2004

TL;DR: Results show that PB-LRU without any parameter tuning provides similar or even better performance and energy savings than the previous power-aware algorithm with the best parameter setting for each workload.

...read moreread less

103

...

Expand

References

Journal Article•10.1147/SJ.92.0078

Evaluation techniques for storage hierarchies

R. L. Mattson, +3 more

- 01 Jun 1970

- Ibm Systems Journal

TL;DR: A new and efficient method of determining, in one pass of an address trace, performance measures for a large class of demand-paged, multilevel storage systems utilizing a variety of mapping schemes and replacement algorithms.

...read moreread less

1.4K

Journal Article•10.1145/68182.68207

Available instruction-level parallelism for superscalar and superpipelined machines

Norman P. Jouppi, +1 more

- 01 Apr 1989

TL;DR: A parameterizable code reorganization and simulation system was developed and used to measure instruction-level parallelism and the average degree of superpipelining metric is introduced, suggesting that this metric is already high for many machines.

...read moreread less

368

•Journal Article•10.1145/17356.17404

A class of compatible cache consistency protocols and their support by the IEEE futurebus

P. Sweazey, +1 more

- 01 May 1986

TL;DR: This paper defines a class of compatible consistency protocols supported by the current IEEE Futurebus design, referred to as the MOESI class of protocols, which has the property that any system component can select (dynamically) any action permitted by any protocol in the class, and be assured that consistency is maintained throughout the system.

...read moreread less

314

•Journal Article•10.1109/2.16187

A case for direct-mapped caches

Mark D. Hill

- 01 Dec 1988

- IEEE Computer

TL;DR: Direct-mapped caches are defined, and it is shown that trends toward larger cache sizes and faster hit times favor their use.

...read moreread less

273

Journal Article•10.1145/633625.52422

Multiprocessor cache analysis using ATUM

R. L. Sites, +1 more

- 17 May 1988

TL;DR: The multiprocessor extension of ATUM, a scheme to get reliable operating system and multiprogramming traces on single processors, is described and the resulting traces are used to analyze physical versus virtual addressing of large caches, process-identifier hashing in virtual caches, cache interference between multiple processes, cache interfered between multiple CPUs, process affinity, and semaphore usage in writeback caches.

...read moreread less

98

Efficient trace-driven simulation method for cache performance analysis

Chat with Paper

AI Agents for this Paper

Citations

Trace-driven memory simulation: a survey

Dynamic tracking of page miss ratio curve for memory management

A comparison of trace-sampling techniques for multi-megabyte caches

Quantifying software performance, reliability and security

PB-LRU: a self-tuning power aware storage cache replacement algorithm for conserving disk energy

References

Evaluation techniques for storage hierarchies

Available instruction-level parallelism for superscalar and superpipelined machines

A class of compatible cache consistency protocols and their support by the IEEE futurebus

A case for direct-mapped caches

Multiprocessor cache analysis using ATUM

Related Papers (5)

Evaluation techniques for storage hierarchies

Efficient trace-driven simulation methods for cache performance analysis

Computer Architecture: A Quantitative Approach

Iterative cache simulation of embedded CPUs with trace stripping

Generation and analysis of very long address traces