Efficient trace-driven simulation methods for cache performance analysis

doi:10.1145/128738.128740

Journal Article10.1145/128738.128740

Efficient trace-driven simulation methods for cache performance analysis

Wen-Hann Wang, +1 more

- 01 Aug 1991

- ACM Transactions on Computer Systems

- Vol. 9, Iss: 3, pp 222-241

90

TL;DR: This work reduces the program traces to the extent that exact performance can still be obtained from the reduced traces and devise an algorithm that can produce performance results for a variety of metrics for a large number of set-associative write-back caches in just a single simulation run.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1145/254180.254184

Trace-driven memory simulation: a survey

Richard Uhlig, +1 more

- 01 Jun 1997

- ACM Computing Surveys

TL;DR: A survey and analysis of trace-driven memory simulation tools can be found in this article, where the authors discuss the strengths and weaknesses of different approaches and show that no single method is best when all criteria, including accuracy, speed, memory, flexibility, portability, expense, and ease of use are considered.

...read moreread less

332

Proceedings Article•10.1145/1024393.1024415

Dynamic tracking of page miss ratio curve for memory management

Pin Zhou, +5 more

- 07 Oct 2004

TL;DR: The real system experiments on Linux with applications including Apache Web Server show that the MRC-directed memory allocation can speed up the applications' execution/response time by up to a factor of 5.86 and reduce the number of page faults byUp to 63.1%.

...read moreread less

269

•Journal Article•10.1109/12.286300

A comparison of trace-sampling techniques for multi-megabyte caches

R. E. Kessler, +2 more

- 01 Jun 1994

- IEEE Transactions on Computers

TL;DR: The paper compares the trace-sampling techniques of set sampling and time sampling using the multi-billion reference traces of A.A. Borg et al. (1990) and applies both techniques to multi-megabyte caches, where sampling is most valuable, to find that set sampling meets the 10% sampling goal, while time sampling does not.

...read moreread less

151

Journal Article•10.1016/J.JSS.2006.07.021

Quantifying software performance, reliability and security

Vibhu Saujanya Sharma, +1 more

- 01 Apr 2007

- Journal of Systems and Software

TL;DR: In this paper, an architecture-based unified hierarchical model for software performance, reliability, security and cache behavior prediction is proposed, which employs discrete time Markov chains (DTMCs) to model software systems and provides expressions for predicting the overall behavior of the system based on its architecture as well as the characteristics of individual components.

...read moreread less

112

Journal Article•10.1109/2.675632

Performance analysis and its impact on design

Pradip Bose, +1 more

- 01 May 1998

- IEEE Computer

TL;DR: This work focuses on architectural performance, typically measured in cycles per instruction, and covers some of the advances in dealing with modern problems in performance analysis.

...read moreread less

109

...

Expand

References

Journal Article•10.1147/SJ.92.0078

Evaluation techniques for storage hierarchies

R. L. Mattson, +3 more

- 01 Jun 1970

- Ibm Systems Journal

TL;DR: A new and efficient method of determining, in one pass of an address trace, performance measures for a large class of demand-paged, multilevel storage systems utilizing a variety of mapping schemes and replacement algorithms.

...read moreread less

1.4K

Journal Article•10.1145/68182.68207

Available instruction-level parallelism for superscalar and superpipelined machines

Norman P. Jouppi, +1 more

- 01 Apr 1989

TL;DR: A parameterizable code reorganization and simulation system was developed and used to measure instruction-level parallelism and the average degree of superpipelining metric is introduced, suggesting that this metric is already high for many machines.

...read moreread less

368

•Journal Article•10.1145/17356.17404

A class of compatible cache consistency protocols and their support by the IEEE futurebus

P. Sweazey, +1 more

- 01 May 1986

TL;DR: This paper defines a class of compatible consistency protocols supported by the current IEEE Futurebus design, referred to as the MOESI class of protocols, which has the property that any system component can select (dynamically) any action permitted by any protocol in the class, and be assured that consistency is maintained throughout the system.

...read moreread less

314

•Book

Available instruction-level parallelism for superscalar and superpipelined machines

Norman P. Jouppi, +1 more

- 01 Mar 1995

TL;DR: A parameterizable code reorganization and simulation system was developed and used to measure instruction-level parallelism and the average degree of superpipelining metric is introduced, suggesting that this metric is already high for many machines.

...read moreread less

285

•Journal Article•10.1109/2.16187

A case for direct-mapped caches

Mark D. Hill

- 01 Dec 1988

- IEEE Computer

TL;DR: Direct-mapped caches are defined, and it is shown that trends toward larger cache sizes and faster hit times favor their use.

...read moreread less

273

...

Expand

Efficient trace-driven simulation methods for cache performance analysis

Chat with Paper

AI Agents for this Paper

Citations

Trace-driven memory simulation: a survey

Dynamic tracking of page miss ratio curve for memory management

A comparison of trace-sampling techniques for multi-megabyte caches

Quantifying software performance, reliability and security

Performance analysis and its impact on design

References

Evaluation techniques for storage hierarchies

Available instruction-level parallelism for superscalar and superpipelined machines

A class of compatible cache consistency protocols and their support by the IEEE futurebus

Available instruction-level parallelism for superscalar and superpipelined machines

A case for direct-mapped caches

Related Papers (5)

Evaluation techniques for storage hierarchies

Efficient (stack) algorithms for analysis of write-back and sector memories

Computer Architecture: A Quantitative Approach

Efficient trace-driven simulation method for cache performance analysis

Cache Memories