|
|
| 290 | AEGIS: architecture for tamper-evident and tamper-resistant processing |
| 247 | Conserving disk energy in network servers |
| 234 | High performance RDMA-based MPI implementation over InfiniBand |
| 147 | Predictive dynamic thermal management for multimedia applications |
| 97 | A performance analysis of the Berkeley UPC compiler |
| 76 | Estimating cache misses and locality using stack distances |
| 63 | Reducing register ports using delayed write-back queues and operand pre-fetch |
| 55 | Collective operations in application-level fault-tolerant MPI |
| 49 | Enhancing scalability of parallel structured AMR calculations |
| 48 | Partitioned first-level cache design for clustered microarchitectures |
| 47 | Enhancing memory level parallelism via recovery-free value prediction |
| 42 | PowerHerd: dynamic satisfaction of peak power constraints in interconnection networks |
| 38 | Automatic fence insertion for shared memory multiprocessing |
| 31 | Result checking in global computing systems |
| 28 | A compiler approach for reducing data cache energy |
| 24 | Profile-guided I/O partitioning |
| 23 | A high performance multi-perspective vision studio |
| 21 | Predicate prediction for efficient out-of-order execution |
| 20 | Roccom: an object-oriented, data-centric software integration framework for multiphysics simulations |
| 18 | A GSA-based compiler infrastructure to extract parallelism from complex loops |
| 16 | Performance characteristics of openMP constructs, and application benchmarks on a large symmetric multiprocessor |
| 15 | A fast approximate interprocedural analysis for speculative multithreading compilers |
| 13 | Evaluation of the memory page migration influence in the system performance: the case of the SGI O2000 |
| 13 | The impact of data dependence analysis on compilation and program parallelization |
| 12 | Inferential queueing and speculative push for reducing critical communication latencies |
| 10 | Selecting long atomic traces for high coverage |
| 10 | miNI: reducing network interface memory requirements with dynamic handle lookup |
| 9 | Recycling waste: exploiting wrong-path execution to improve branch prediction |
| 9 | Compiler support for efficient processing of XML datasets |
| 7 | Is there anything more to learn about high performance processors? |
| 6 | Modeling and optimization of non-blocking checkpointing for optimistic simulation on myrinet clusters |
| 6 | Placement of I/O servers to improve parallel I/O performance on switch-based clusters |
| 5 | Inter-procedural stacked register allocation for itaniumŪ like architecture |
| 3 | A framework for incremental extensible compiler construction |
| 2 | Dynamic memory instruction bypassing |
| 0 | Wireless networks... what does the future have in store? |
| 0 | A new speculation technique to optimize floating-point performance while preserving bit-by-bit reproducibility |