

184  Characterizing and Evaluating Desktop Grids: An Empirical Study 
97  Cycloid: A ConstantDegree and LookupEfficient P2P Overlay Network 
91  BigSim: A Parallel Simulator for Performance Prediction of Extremely Large Parallel Machines 
90  A Trust Brokering System and Its Application to Resource Management in PublicResource Grids 
86  Building a Scalable Bipartite P2P Overlay Network 
82  Scalable and Modular Algorithms for FloatingPoint Matrix Multiplication on FPGAs 
72  Replication Under Scalable Hashing: A Family of Algorithms for Scalable Decentralized Data Distribution 
69  MobilitySensitive Topology Control in Mobile Ad Hoc Networks 
67  Policy Based Scheduling for Simple Quality of Service in Grid Computing 
67  FaultAware Job Scheduling for BlueGene/L Systems 
67  A Game Theory Based Pricing Strategy for Job Allocation in Mobile Grids 
66  Distributed Adaptive Task Allocation in Heterogeneous Computing Environments to Maximize Throughput 
64  A Fast, Parallel Spanning Tree Algorithm for Symmetric Multiprocessors 
58  Design and Implementation of MPICH2 over InfiniBand with RDMA Support 
58  Pipelining Broadcasts on Heterogeneous Platforms 
57  Performance Characteristics of the MultiZone NAS Parallel Benchmarks 
56  A Multiple LID Routing Scheme for FatTreeBased InfiniBand Networks 
55  Towards Efficient MultiLevel Threading of H.264 Encoder on Intel HyperThreading Architectures 
53  Architecture of LAMPI, A NetworkFaultTolerant MPI 
52  Malicious Node Detection in Wireless Sensor Networks 
52  SATMatch: A SelfAdaptive Topology Matching Method to Achieve Low Lookup Latency in Structured P2P Overlay Networks 
51  Fast SharedMemory Algorithms for Computing the Minimum Spanning Forest of Sparse Graphs 
45  Designing WDM Optical Interconnects with Full Connectivity by Using Limited Wavelength Conversion 
44  On the Feasibility of Incremental Checkpointing for Scientific Computing 
44  Queue Scheduling and Advance Reservations with COSY 
42  HostAssisted ZeroCopy Remote Memory Access Communication on InfiniBand 
41  Nemos: A Framework for Axiomatic and Executable Specifications of Memory Consistency Models 
40  Fast and Scalable MPILevel Broadcast Using InfiniBand?s Hardware Multicast Support 
38  Performance Comparison of Pure MPI vs Hybrid MPIOpenMP Parallelization Models on SMP Clusters 
36  DCache Warn: An IFetch Policy to Increase SMT Efficiency 
35  Parallelization and Performance of Interactive Multiplayer Game Servers 
34  Clustered Multithreaded Architectures  Pursuing both IPC and Cycle Time 
32  Towards Efficient Load Balancing in Structured P2P Systems 
32  A New Algorithm for Relative Localization in Wireless Sensor Networks 
28  Integrating Remote Invocation and Distributed Shared State 
28  SRUMMA: A Matrix Multiplication Algorithm Suitable for Clusters and Scalable Shared Memory Systems 
26  High Performance Implementation of MPI Derived Datatype Communication over InfiniBand 
26  How to Run Experiments with Large PeertoPeer Data Structure 
26  A Flexible IO Scheme for Grid Workflows 
25  Performance Measurement and Modeling of Component Applications in a High Performance Computing Environment: A Case Study 
24  EnergyEfficient Caching and Prefetching with Data Consistency in Mobile Distributed Systems 
23  A Multiprocessor Implementation of the Total Bandwidth Server 
22  Single SignOn in InVIGO: RoleBased Access via Delegation Mechanisms Using ShortLived User Identities 
22  Unobtrusiveness and Efficiency in Idle Cycle Stealing for PC Grids 
22  A General Model for Detecting Distributed Termination in Dynamic Systems 
21  Running OpenMP Applications Efficiently on an EverythingShared SDSM 
21  Optimising Static Workload Allocation in Multiclusters 
20  Two Fast and Efficient Message Scheduling Algorithms for Data Redistribution through a Backbone 
20  Employing Nested OpenMP for the Parallelization of MultiZone Computational Fluid Dynamics Applications 
20  SAMOA: Framework for Synchronisation Augmented Microprotocol Approach 
18  Almost WaitFree Resizable Hashtable 
18  Processing Rate Allocation for Proportional Slowdown Differentiation on Internet Servers 
17  A Parallel ObjectOriented Application for 3D Electromagnetism 
17  Scaling and Parallelizing a Scientific Feature Mining Application Using a Cluster Middleware 
15  Optimal MultiChannel Data Allocation with Flat Broadcast Per Channel 
15  Efficient Synthesis of OutofCore Algorithms Using a Nonlinear Optimization Solver 
14  On Constructing the Minimum Orthogonal Convex Polygon in 2D Faulty Meshes 
14  Parallel Brutus: The First Distributed, FPGA Accelerated Chess Program 
13  BLACKBUS: A New DataTransfer Technique Using Local Address on NetworksonChips 
13  StarColoring of Graphs for ConflictFree Access to Parallel Memory Systems 
12  Solving Large Sparse Linear Systems in Endtoend Accelerator Structure Simulations 
12  A SNAPBased Community Resource Broker Using a ThreePhase Commit Protocol 
12  Scalable Highlevel Caching for Parallel I/O 
11  The UPC Memory Model: Problems and Prospects 
11  A Hierarchical Parallel Scheme for Global Parameter Estimation in Systems Biology 
11  Randomized Smoothing Networks 
11  Hierarchical Gather/Scatter Algorithms with Graceful Degradation 
11  LessLog: A Logless File Replication Algorithm for PeertoPeer Distributed Systems 
10  Load Balancing: Dimension Exchange on Product Graphs 
10  An Efficient Distributed Mutual Exclusion Algorithm Based on Relative Consensus Voting 
10  A Novel Method for Adding Multiprocessor Support to a Large and Complex Uniprocessor Kernel 
9  Parallel Maximum Weight Bipartite Matching Algorithms for Scheduling in InputQueued Switches 
9  Optimization of the POLCOMS Hydrodynamic Code for Terascale HighPerformance Computers 
9  A Scalable Architecture for Distributed Shared Memory Multiprocessors Using Optical Interconnects 
9  Translating Submachine Locality into Locality of Reference 
9  Taking Advantage of the Overlay Geometrical Structures for Mobile Agent Communications 
9  MemoryBased Scheduling for a Parallel Multifrontal Solver 
9  Speculation Control for Simultaneous Multithreading 
9  ABC: A ClusterBased Protocol for Resource Location in PeertoPeer Systems 
8  A Novel Static Task Scheduling Algorithm in Distributed Computing Environments 
8  LORE  Local Reconfiguration for Fault Management in Irregular Interconnects 
8  Adaptive Memory Paging for Efficient Gang Scheduling of Parallel Applications 
8  A Distributed Hash Table for Computational Grids 
8  Exploiting Client Cache: A Scalable and Efficient Approach to Build Large Web Cache 
8  Compiler Support for Parallel Code Generation through Kernel Recognition 
8  Parallel Mining of Association Rules from Text Databases on a Cluster of Workstations 
7  A Large Scale Monte Carlo Simulator for Cellular Microphysiology 
7  ApplicationPerceived Multicast Push Performance 
7  Diagnostics for Causes of Packet Loss in a High Performance Data Transfer System 
7  Highly Efficient Synchronization Based on Active Memory Operations 
7  Packet Probing as Network Load Detection for Scientific Applications at RunTime 
7  Finding Satisfying Global States: All for One and One for All 
7  SPEAR: A Hybrid Model for Speculative PreExecution 
7  ProcessorEmbedded Distributed MEMSBased Storage Systems for HighPerformance I/O 
6  A Cluster Oriented Model for Dynamically Balanced DHTs 
6  Specification and Architecture Supports for Component Adaptations on Distributed Environments 
6  A Neural Network Based Approach for Overlay Multicast in Media Streaming Systems 
6  Multithreaded HomeBased Lazy Release Consistency over VIA 
6  Pareto Approximations for the Bicriteria Scheduling Problem 
6  Dynamic Adjustment of Execution Order in RealTime Databases 
6  Improving Response Time in ClusterBased Web Servers through Coscheduling 
5  Scheduling of Query Execution Plans in Symmetric Multiprocessor Database Systems 
5  Dynamic versus Static Locking in RealTime Parallel Database Systems 
5  Bounded Service Time and Memory Space Optimal SelfStabilizing Token Circulation Protocol on Unidirectional Rings 
5  Secure and Reliable Decentralized PeertoPeer Web Cache 
5  PredictionBased Routing through Least Cost Delay Constraint 
5  Adapting to Memory Pressure from within Scientific Applications on Multiprogrammed COWs 
5  coNCePTuaL: A Network Correctness and Performance Testing Languag 
4  Hierarchical Routing with SoftState Replicas in TerraDir 
4  Transaction Based Dynamic Partial Replication in Mobile Environments 
4  Evaluation of Elementary Functions Using Multimedia Features 
3  Integrating Java and Matlab Components into the Same Parallel and Distributed Application Using JavaPorts 
3  On the IP Routing Tables Minimization with Addresses Reassignment 
3  Distributed Embedded Systems for Low Power: A Case Study 
3  Architecture and Protocol for Reliable Event Delivery to Clients of a HighAvailability Cluster 
3  Utilizing Home Node Prediction to Improve the Performance of Software Distributed Shared Memory 
3  ReArchitecting Flow Control Adaptation for Grid Environments 
3  Network Communications in Grid Computing: At a Crossroads between Parallel and Distributed Worlds 
2  AverageCase Performance Analysis and Validation of Online Scheduling of Independent Parallel Tasks 
2  Random Broadcasting in Product Networks and Symmetric Networks 
2  Bundling: Reducing the Overhead of Multiprocessor Prefetchers 
2  Using Speculation to Simplify Multiprocessor Design 
2  Facilitating Interactive Distributed Data Stream Processing and Mining 
2  Optimal Layout for Fast Fourier Transform in Multilayer VLSI 
1  Flecc: A Flexible Cache Coherence Protocol for Dynamic ComponentBased Systems 
1  Fast and Scalable Parallel Algorithms for Euclidean Distance Transform on LARPBS 
1  Parallel Simulation of Fluid Slip in a Microchannel 
1  Isocoupling: Reusing Kernel Coupling Values to Predict the Performance of Parallel Applications 
1  A SelfAdapting Distributed Memory Package for Fast Signal Transforms 
1  Integrating Program Component Executables on Distributed Memory Architectures via MPH 
1  Assignment and Scheduling of Realtime DSP Applications for Heterogeneous Functional Units 
1  An Optimal Protocol for Causally Consistent Distributed Shared Memory Systems 
1  Exploiting Memory Bank Locality in Multiprocessor SoC Architectures 
1  Supercompilers, the AMD Opteron, and Your Cell Phone 
0  Assignment of Shortest Paths Spanning Trees in Meshes 
0  Distributed Correction of Proximity Effect in Electron Beam Lithography on a Heterogeneous Cluster 
0  An Algorithm for Geometric Load Balancing with Two Constraints 
0  Survivable Systems Based on an Adaptive NMR Algorithm 
0  An Efficient Parallel Algorithm for Isosurface Visualization from LargeScale Scientific Simulation Data 
0  Distributed Algorithms for Partially Clairvoyant Dispatchers 
0  Ouroboros: A Tool for Building Generic, Hybrid, Divide and Conquer Algorithms 
0  Sparse Matrix Transpose Unit 