

67  Lattice Boltzmann simulation optimization on leading multicore platforms 
62  Overcoming scaling challenges in biomolecular simulations across multiple platforms 
56  Allpairs: An abstraction for dataintensive cloud computing 
46  Receiverinitiated message passing over RDMA Networks 
41  Online scheduling in grids 
35  Performance characterization and optimization of parallel I/O on the Cray XT 
32  Massively parallel cosmological simulations with ChaNGa 
30  Data gathering in wireless sensor networks with mobile collectors 
29  Accelerating ReedSolomon coding in RAID systems with GPUs 
26  Financial modeling on the cell broadband engine 
25  Portioned staticpriority scheduling on multiprocessors 
24  DiCoCMP: Efficient cache coherency in tiled CMP architectures 
23  Efficient resource management using advance reservations for heterogeneous Grids 
23  MVAPICHAptus: Scalable highperformance multitransport MPI over InfiniBand 
21  Massive supercomputing coping with heterogeneity of modern accelerators 
20  I/O performance on a massively parallel Cray XT3/XT4 
20  Intermediate checkpointing with conflicting access prediction in transactional memory systems 
20  A new diffusionbased multilevel algorithm for computing graph partitions of very high quality 
20  A helper thread based EDP reduction scheme for adapting application execution in CMPs 
20  An optimal checkpoint/restart model for a large scale high performance computing system 
18  Parallel biological sequence alignments on the Cell Broadband Engine 
17  On the representation and multiplication of hypersparse matrices 
17  A plugandplay model for evaluating wavefront computations on parallel architectures 
17  Highspeed string searching against large dictionaries on the Cell/B.E. Processor 
16  Avoiding communication in sparse matrix computations 
16  DHTassisted probabilistic exhaustive search in unstructured P2P networks 
16  A dynamic scheduling approach for coordinated widearea data transfers using GridFTP 
15  Lightweight process migration and memory prefetching in openMosix 
15  High performance MPEG2 software decoder on the cell broadband engine 
14  Analysis of double buffering on two different multicore architectures: Quadcore Opteron and the CellBE 
14  An efficient hybrid peertopeer system for distributed data sharing 
14  Junction tree decomposition for parallel exact inference 
13  Optimizations in financial engineering: The LeastSquares Monte Carlo method of Longstaff and Schwartz 
13  Scalable groupbased checkpoint/restart for largescale messagepassing systems 
13  Parallel IP lookup using multiple SRAMbased pipelines 
13  Balancing HPC applications through smart allocation of resources in MT processors 
13  Approximating maxmin linear programs with local algorithms 
12  Efficient automated marshaling of C++ data structures for MPI applications 
12  Picking up the Pieces: SelfHealing in reconfigurable networks 
11  Simultaneous transducers for dataparallel XML parsing 
10  Modeling and predicting application performance on parallel computers using HPC challenge benchmarks 
10  SLAbased resource allocation in cluster computing systems 
10  Scalable methods for monitoring and detecting behavioral equivalence classes in scientific codes 
10  Efficient and robust sensor data aggregation using linear counting sketches 
9  ContinuStreaming: Achieving high playback continuity of Gossipbased PeertoPeer streaming 
9  Towards a decentralized architecture for optimization 
9  On performance bottleneck of anonymous communication networks 
9  Random choices for churn resilient load balancing in peertopeer networks 
9  Waitfree Programming for General Purpose Computations on Graphics Processors 
9  Energy efficient sleep scheduling based on moving directions in target tracking sensor network 
8  Efficient MPI Bcast across different process arrival patterns 
8  Providing flow based performance guarantees for buffered crossbar switches 
8  DVS based energy minimization algorithm for parallel machines 
8  Performance adaptive UDP for highspeed bulk data transfer over dedicated links 
8  A deterministic multiway rendezvous library for haskell 
8  CoSL: A coordinated statistical learning approach to measuring the capacity of multitier websites 
8  Data throttling for dataintensive workflows 
7  A parallel software toolkit for statistical 3D virus reconstructions from cryo electron microscopy images using computer clusters with multicore sharedmemory nodes 
7  Sacrificing Reliability for Energy Saving: Is it worthwhile for disk arrays? 
7  Supporting faulttolerance in streaming grid applications 
7  Modelbased fault localization in largescale computing systems 
7  Epochbased reconfiguration: Fast, simple, and effective dynamic network reconfiguration 
6  Distributed asymmetric verification in computational grids 
6  SenCast: Scalable multicast in wireless sensor networks 
5  Scheduling with storage constraints 
5  Usercentric data migration in networked storage systems 
5  Energy efficient media streaming in wireless hybrid peertopeer systems 
5  Gametheoretic scalable peertopeer media streaming 
5  Designing passive synchronization for MPI2 onesided communication to maximize overlap 
5  Understanding tuning complexity in multithreaded and hybrid web servers 
5  A transparent noninvasive file data model for algorithmic skeletons 
5  An adaptive parallel pipeline pattern for grids 
4  HelperCoreDB: Exploiting multicore technology to improve database performance 
4  Parallel mining of closed quasicliques 
4  Achieving 100% throughput in inputbuffered WDM optical packet interconnects 
4  Parallelizing irregular C codes assisted by interprocedural shape analysis 
4  Sweep coverage with mobile sensors 
3  On utilization of contributory storage in desktop grids 
3  A game theoretical data replication technique for mobile ad hoc networks 
3  Evaluating the role of scratchpad memories in chip multiprocessors for sparse matrix computations 
3  Decentralized marketbased resource allocation in a heterogeneous computing system 
3  Low power/area branch prediction using complementary branch predictors 
3  Result reuse in design space exploration: A study in system support for interactive parallel computing 
2  PROD: Relayed file retrieving in overlay networks 
2  Optimizing XML processing for grid applications using an emulation framework 
2  Efficient resources assignment schemes for clustered multithreaded processors 
2  Towards reliable and efficient data dissemination in heterogeneous peertopeer systems 
2  Continuous answering holistic queries over sensor networks 
1  Heterogenous dating service with application to rumor spreading 
1  DCSIMD : Dynamic communication for SIMD processors 
1  Optimal replication transition strategy in distributed hierarchical systems 
1  An effective pointer replication algorithm in P2P networks 
1  A unified model of pollution in P2P networks 
1  Selfstabilizing algorithms for sorting and heapification 
0  A space and timeefficient hash table hierarchically indexed by Bloom filters 
0  An interconnectaware power efficient cache coherence protocol for CMPs 
0  Selfoptimizing distributed trees 
0  Selfstabilizing population of mobile agents 
0  A predicatebased approach to dynamic protocol update in group communication 
0  SNAP, Smallworld Network Analysis and Partitioning: An opensource parallel graph framework for the exploration of largescale networks 
0  A softwarehardware hybrid steering mechanism for clustered microarchitectures 
0  The impact of outoforder commit in coarsegrain, finegrain and simultaneous multithreaded architectures 
0  Fault tolerance with shortest paths in regular and irregular networks 
0  Computational monitoring and steering using networkoptimized visualization and Ajax web server 
0  Scalable data dissemination using hybrid methods 