Edgar Solomonik, Devin Matthews, Jeff Hammond, and James Demmel; Cyclops Tensor Framework: reducing communication and eliminating load
imbalance in massively parallel contractions;
IEEE International Parallel and Distributed Processing Symposium (IPDPS),
Boston, MA, May 2013.
(text)
Edgar Solomonik, Abhinav Bhatele, and James Demmel; Improving
communication performance in dense linear algebra via topology
aware collectives; ACM/IEEE Supercomputing Conference 2011, Seattle, WA, November 2011.
(text)
Edgar Solomonik, James Demmel; Communication-optimal parallel 2.5D matrix
multiplication and LU factorization algorithms; Lecture Notes in Computer Science,
Euro-Par, Bordeaux, France, August 2011. "Distinguished Paper"(text)
Edgar Solomonik and Laxmikant V. Kale; Highly Scalable Parallel Sorting;
IEEE International Parallel and Distributed Processing Symposium (IPDPS),
Atlanta, GA, April 2010.
(text)
Abhinav Bhatele, Lukasz Wesolowski, Eric Bohm, Edgar Solomonik, and Laxmikant V. Kale;
Understanding Application Performance via Micro-benchmarks on Three Large Supercomputers:
Intrepid, Ranger and Jaguar; International Journal of High Performance
Computing Applications (IJHPCA); November 2010.
(text)