|
|
| 182 | The End-to-End Performance Effects of Parallel TCP Sockets on a Lossy Wide-Area Network |
| 158 | Communication Characteristics of Large-Scale Scientific Applications for Contemporary Cluster Architectures |
| 138 | Bandwidth-Centric Allocation of Independent Tasks on Heterogeneous Platforms |
| 126 | Predicting the Performance of Wide Area Data Transfers |
| 117 | Finding Good Peers in Peer-to-Peer Networks |
| 68 | Models and Scheduling Mechanisms for Global Computing Applications |
| 59 | A Prediction-Based Real-Time Scheduling Advisor |
| 57 | Load Balancing in Distributed Systems: An Approach Using Cooperative Games |
| 54 | Optimizing Graph Algorithms for Improved Cache Performance |
| 51 | A New Approach to Fault-Tolerant Wormhole Routing for Mesh-Connected Parallel Computers |
| 50 | On Reliable and Scalable Peer-to-Peer Web Document Sharing |
| 48 | The R-LRPD Test: Speculative Parallelization of Partially Parallel Loops |
| 46 | Fine-Grain Access Control for Securing Shared Resources in Computational Grids |
| 45 | Graph Partitioning for Parallel Applications in Heterogeneous Grid Environments |
| 44 | A SIMD Vectorizing Compiler for Digital Signal Processing Algorithms |
| 44 | Cluster Load Balancing for Fine-Grain Network Services |
| 43 | nfsp: A Distributed NFS Server for Clusters of Workstations |
| 41 | Parallel JPEG2000 Image Coding on Multiprocessors |
| 36 | A Metric and Mixed-Integer-Programming-Based Approach for Resource Allocation in Dynamic Real-Time Systems |
| 34 | Memory-Intensive Benchmarks: IRAM vs. Cache-Based Machines |
| 34 | Compile/Run-Time Support for Thread Migration |
| 31 | Can User-Level Protocols Take Advantage of Multi-CPU NICs? |
| 31 | Faster Collective Output through Active Buffering |
| 31 | Task Allocation for Distributed Multimedia Processing on Wirelessly Networked Handheld Devices |
| 30 | Static Mapping Heuristics for Tasks with Dependencies, Priorities, Deadlines, and Multiple Versions in Heterogeneous Environments |
| 25 | Enforcing Resource Sharing Agreements among Distributed Server Clusters |
| 24 | Preemptive Multiprocessor Scheduling Anomalies |
| 24 | Virtual Machine Based Heterogeneous Checkpointing |
| 24 | Detecting Temporal Logic Predicates on the Happened-Before Model |
| 23 | Parallel Wavelet Transform for Large Scale Image Processing |
| 22 | Parallel and Distributed Computing Issues in Pricing Financial Derivatives through Quasi Monte Carlo |
| 22 | Modeling and Evaluating Peer-to-Peer Storage Architectures |
| 22 | Design and Implementation of a Pluggable Fault Tolerant CORBA Infrastructure |
| 22 | On QoS-Based Scheduling of a Meta-Task with Multiple QoS Demands in Heterogeneous Computing |
| 21 | A Single Phase Distributed Commit Protocol for Main Memory Database Systems |
| 21 | A New Clustering Algorithm for Large Communication Delays |
| 19 | A Strategy to Compute the InfiniBand Arbitration Tables |
| 19 | Hierarchical Interconnects for On-Chip Clustering |
| 19 | Accountable Web-Computing |
| 18 | Scheduling Multiple Data Visualization Query Workloads on a Shared Memory Machine |
| 18 | Twiddle-Factor-Based FFT Algorithm with Reduced Memory Access |
| 17 | GNBD/VIA: A Network Block Device over Virtual Interface Architecture on Linux |
| 17 | Efficient Pipelining of Nested Loops: Unroll-and-Squash |
| 17 | High-Performance Parallel and Distributed Computing for the BMI Eigenvalue Problem |
| 14 | Improving the Performance of Distributed CORBA Applications |
| 14 | A New Model for Static Mapping of Parallel Applications with Task and Data Parallelism |
| 13 | An Out-of-Core Sorting Algorithm for Clusters with Processors at Different Speed |
| 12 | A Novel Approach to Reduce L2 Miss Latency in Shared-Memory Multiprocessors |
| 11 | Parallel Incremental 2D-Discretization on Dynamic Datasets |
| 11 | Routing Permutations in Partitioned Optical Passive Star Networks |
| 11 | Fault-Tolerant Broadcasting in Wormhole-Routed Torus Networks |
| 10 | Performance Characterization of a Molecular Dynamics Code on PC Clusters: Is There Any Easy Parallelism in CHARMM? |
| 10 | Minimum Average Transmission Power Routing in CDMA Ad Hoc Networks Utilizing the Blind Multiuser Detection |
| 9 | A Parallel Ultra-High Resolution MPEG-2 Video Decoder for PC Cluster Based Tiled Display Systems |
| 9 | Adaptive Scheduling under Memory Pressure on Multiprogrammed SMPs |
| 9 | Optimal Remapping in Dynamic Bulk Synchronous Computations via a Stochastic Control Approach |
| 9 | Buckets Strike Back: Improved Parallel Shortest Paths |
| 8 | Short Cut Eulerian Routing of Datagrams in All Optical Point-to-Point Networks |
| 8 | Enhancing Data Migration Performance via Parallel Data Compression |
| 8 | Parallel Out-of-Core Matrix Inversion |
| 7 | Using Programmable NICs for Time-Warp Optimization |
| 7 | An Adaptive Hash Join Algorithm on a Network of Workstations |
| 7 | Achieving Scalability in Parallel Tabled Logic Programs |
| 7 | A Multithreaded Concurrent Garbage Collector Parallelizing the New Instruction in Java |
| 7 | A Parallel Two-Level Hybrid Method for Diagonal Dominant Tridiagonal Systems |
| 6 | Compiler and Runtime Support for Irregular Reductions on a Multithreaded Architecture |
| 6 | Java Mirrors: Building Blocks for Remote Interaction |
| 6 | Capturing Causality by Compressed Vector Clock in Real-Time Group Editors |
| 6 | Toward Optimal Diffusion Matrices |
| 6 | Barrier Synchronization on a Loaded SMP Using Two-Phase Waiting Algorithms |
| 5 | Model-Based Fault Detection in Powerline Networking |
| 5 | Generalized Multipartitioning for Multi-Dimensional Arrays |
| 5 | A Parallel Cloth Simulator Using Multilevel Algorithms |
| 5 | Architecture of the Entropia Distributed Computing System |
| 4 | Compression-Domain Parallel Rendering |
| 4 | Compiling Several Classes of Communication Patterns on a Multithreaded Architecture |
| 4 | Quantifying and Resolving Remote Memory Access Contention on Hardware DSM Multiprocessors |
| 4 | A Performance Model for k-Ary n-Cube Networks with Self-Similar Traffic |
| 4 | The Least Choice First Scheduling Method for High-Speed Network Switche |
| 4 | Clustering Algorithm for Scheduling Parallel Programs on NOWs with Synchronization Requirements at the Application Level |
| 3 | A Recursion-Based Broadcast Paradigm in Wormhole Routed Mesh/Torus Networks |
| 3 | Fast Inductance Extraction of Large VLSI Circuits |
| 3 | Peppermint and Sled: Tools for Evaluating SMP Systems Based on IA-64 (IPF) Processors |
| 3 | A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms |
| 2 | Incorporating Quality-of-Service in the Virtual Interface Architecture |
| 2 | Compiler-Directed I/O Optimization |
| 2 | Comparing the Memory System Performance of DSS Workloads on the HP V-Class and SGI Origin 2000 |
| 2 | Issues Concerning Linux Clustering: Cluster Management and Application Porting |
| 1 | An Efficient Technique for Corner-Turn in SAR Image Reconstruction by Improving Cache Access |
| 1 | Distribution Sweeping on Clustered Machines with Hierarchical Memories |
| 1 | Blending of Composite Panel Designs Using Genetic Algorithms |
| 1 | Fault Recovery for a Distributed SP-Based Delay Constrained Multicast Routing Algorithm |
| 1 | Efficient Support for Two-Dimensional Data Distributions in Distributed Shared Memory Systems |
| 1 | A Feasibility Study of Hierarchical Multithreading |
| 1 | Deleting Keys of B-trees in Parallel |
| 0 | Average-Case Scalability Analysis of Parallel Computations on k-ary d-cubes |
| 0 | A Parallel Numerical Algorithm for Boundary-Value FIDES on a PC Cluster |
| 0 | A High Performance Algorithm for Incompressible Flows Using Local Solenoidal Functions |
| 0 | Creating a National Lab Shared Storage Infrastructure |
| 0 | Hybrid Predication Model for Instruction Level Parallelism |
| 0 | DEM-1: A Particle Simulation Machine for Efficient Short-Range Interaction Computations |
| 0 | Determination of the Topology of a Directed Network |