# Faculty Publications - James Demmel

## Books

- J. Demmel,
*Applied Numerical Linear Algebra*, Philadelphia, PA: Society for Industrial and Applied Mathematics, 1997. [abstract]

## Book chapters or sections

- J. Demmel, J. Dongarra, B. N. Parlett, W. M. Kahan, M. Gu, D. Bindel, Y. Hida, X. Li, O. Marques, E. J. Riedy, C. Vomel, J. Langou, P. Luszczek, J. Kurzak, A. Buttari, J. Langou, and S. Tomov, "Prospectus for the next LAPACK and ScaLAPACK libraries," in
*Applied Parallel Computing: State of the Art in Scientific Computing. Proc. 8th Intl. Workshop (PARA 2006). Revised Selected Papers*, B. Kagstrom, E. Elmroth, J. Dongarra, and J. Wasniewski, Eds., Lecture Notes in Computer Science, Vol. 4699, Berlin, Germany: Springer-Verlag, 2007, pp. 11-23. - D. S. Bindel, Z. Bai, and J. Demmel, "Model reduction for RF MEMS simulation," in
*Applied Parallel Computing: State of the Art in Scientific Computing. Proc. 7th Intl. Workshop (PARA 2004): Revised Selected Papers*, J. Dongarra, K. Madsen, and J. Wasniewski, Eds., Lecture Notes in Computer Science, Vol. 3732, Berlin, Germany: Springer-Verlag, 2006, pp. 286-295. - E. J. Im, I. Bustany, C. Ashcraft, J. Demmel, and K. A. Yelick, "Performance tuning of matrix triple products based on matrix structure," in
*Applied Parallel Computing: State of the Art in Scientific Computing. Proc. 7th Intl. Workshop (PARA 2004): Revised Selected Papers*, J. Dongarra, K. Madsen, and J. Wasniewski, Eds., Lecture Notes in Computer Science, Vol. 3732, Berlin, Germany: Springer-Verlag, 2006, pp. 740-746. - J. Nie and J. Demmel, "Shape Optimization of Transfer Functions," in
*Multiscale Optimization Methods and Applications*, W. W. Hager, S. J. Huang, P. M. Pardalos, and O. A. Prokopyev, Eds., Nonconvex Optimization and Its Applications, Vol. 82, Berlin, Germany: Springer-Verlag, 2006, pp. 313-326. - D. S. Bindel, J. Demmel, M. J. Friedman, W. J. F. Govaerts, and Y. A. Kuznetsov, "Bifurcation analysis of large equilibrium systems in MATLAB," in
*Computational Science: Proc. 5th Intl. Conf. (ICCS 2005)*, V. S. Sunderam, G. D. van Albada, P. M. A. Sloot, and J. J. Dongarra, Eds., Lecture Notes in Computer Science, Vol. 3514, Berlin, Germany: Springer-Verlag, 2005, pp. 50-57. - E. Mizutani and J. Demmel, "Iterative scaled trust-region learning in Krylov subspaces via Peralmutter's implicit sparse Hessian-vector multiply," in
*Advances in Neural Information Processing Systems 16: Proc. 17th Annual Conf. (NIPS 2003)*, S. Thrun, L. K. Saul, and B. Scholkopf, Eds., Advances in Neural Information Processing, Vol. 16, Cambridge, MA: MIT Press, 2004, pp. 209-216. - R. Vuduc, A. Gyulassy, J. Demmel, and K. A. Yelick, "Memory hierarchy optimizations and performance bounds for Sparse {A sup T Ax}," in
*Computational Science: Proc. Intl. Conf. on Computational Science (ICCS 2003)*, P. M. A. Sloot, D. Abramson, A. V. Bogdanov, J. J. Dongarra, A. Y. Zomaya, and Y. E. Gorbachev, Eds., Lecture Notes in Computer Science, Vol. 2659, Berlin, Germany: Springer-Verlag, 2003, pp. 705-714. - L. A. Drummond, J. Demmel, C. R. Mechoso, H. Robinson, K. Sklower, and J. A. Spahr, "A data broker for distributed computing environments," in
*Computational Science -- Part I: Proc. Intl. Conf. (ICCS 2001)*, V. N. Alexandrov, J. Dongarra, B. A. Juliano, R. S. Renner, and C. J. K. Tan, Eds., Lecture Notes in Computer Science, Vol. 2073, Berlin, Germany: Springer-Verlag, 2001, pp. 31-40. - R. Vuduc, J. Demmel, and J. Bilmes, "Statistical models for automatic performance tuning," in
*Computational Science: Proc. Intl. Conf. on Computational Science (ICCS 2001)*, V. N. Alexandrov, J. J. Dongarra, B. A. Juliano, R. S. Renner, and C. J. K. Tan, Eds., Lecture Notes in Computer Science, Vol. 2073, Berlin, Germany: Springer-Verlag, 2001, pp. 117-126. - E. Mizutani and J. Demmel, "On iterative Krylov-dogleg trust-region steps for solving neural networks nonlinear least squares problems," in
*Advances in Neural Information Processing Systems 13: Proc. 14th Annual Conf. (NIPS 2000)*, T. K. Leen, T. G. Dietterreich, and V. Tresp, Eds., Advances in Neural Information Processing Systems, Vol. 13, Cambridge, MA: MIT Press, 2000, pp. 605-611. - R. Vuduc and J. Demmel, "Code generators for automatic tuning of numerical kernels: Experiences with FFTW," in
*Semantics, Applications, and Implementation of Program Generation: Proc. 2000 Intl. Workshop (SAIG 2000)*, W. Taha, Ed., Lecture Notes in Computer Science, Vol. 1924, Berlin, Germany: Springer-Verlag, 2000, pp. 190-211. - D. T. Blackston, J. Demmel, A. R. Neureuther, and B. Wu, "Integration of an adaptive parallel N-body solver into a particle by particle electron-beam interaction simulator," in
*Proc. SPIE: Charged Particle Optics IV*, E. Munro, Ed., Vol. 3777, Bellingham, WA: SPIE -- The International Society for Optical Engineering, 1999, pp. 228-239. - L. S. Blackford, A. J. Cleary, J. Demmel, I. S. Dhillon, J. Dongarra, S. Hammarling, A. Petitet, H. Ren, K. Stanley, and R. C. Whaley, "Practical experience in the dangers of heterogeneous computing," in
*Applied Parallel Computing: Industrial Computation and Optimization. Proc. 3rd Intl. Workshop (PARA '96)*, J. Wasniewski, J. Dongarra, K. Madsen, and D. Olesen, Eds., Lecture Notes in Computer Science, Vol. 1184, Berlin, Germany: Springer-Verlag, 1996, pp. 57-64. - J. Choi, J. Demmel, I. S. Dhillon, J. Dongarra, S. Ostrouchov, A. Petitet, K. Stanley, D. W. Walker, and R. C. Whaley, "ScaLAPACK, a portable linear algebra library for distributed memory computers -- Design issues and performance," in
*Applied Parallel Computing: Computations in Physics, Chemistry, and Engineering Science. Proc. 2nd Intl. Workshop (PARA '95)*, J. Dongarra, K. Madsen, and J. Wasniewski, Eds., Lecture Notes in Computer Science, Vol. 1041, Berlin, Germany: Springer-Verlag, 1996, pp. 95-106. - Z. Bai, D. Day, J. Demmel, M. Gu, J. Dongarra, A. Ruhe, and H. van der Vorst, "Templates for Linear Algebra Problems (Invited Paper)," in
*Computer Science Today: Recent Trends and Developments*, J. van Leeuwen, Ed., Lecture Notes in Computer Science, Vol. 1000, Belin, Germany: Springer-Verlag, 1995, pp. 115-140. - J. Demmel, J. Dongarra, and W. M. Kahan, "On designing portable high performance numerical libraries," in
*Numerical Analysis 1991: Proc. 14th Dundee Conf. on Numerical Analysis*, D. F. Griffiths and G. A. Watson, Eds., Pitman Research Notes in Mathematics, Essex, UK: Longman Scientific & Technical, 1992, pp. 69-84. - J. Demmel, "The condition number of equivalence transformations that block diagonalize matrix pencils," in
*Matrix Pencils: Conf. Proc.*, B. Kagstrom and A. Ruhe, Eds., Lecture Notes in Mathatics, Vol. 973, Berlin, Germany: Springer-Verlag, 1983, pp. 2-16.

## Articles in journals or magazines

- K. AsanoviÄ‡, R. Bodik, J. Demmel, T. Keaveny, K. Keutzer, N. Morgan, D. A. Patterson, K. Sen, J. Wawrzynek, D. Wessel, and K. A. Yelick, "A View of the Parallel Computing Landscape,"
*Communications of the ACM*, vol. 52, no. 10, pp. 56-67, Oct. 2009. [abstract] - O. A. Marques, C. Vomel, J. Demmel, and B. N. Parlett, "Algorithm 880: A testing infrastructure for symmetric tridiagonal eigensolvers,"
*ACM Trans. Mathematical Software*, vol. 35, no. 1, pp. 8:1-13, July 2008. - G. W. Howell, J. Demmel, C. T. Fulton, S. hammarling, and K. Marmol, "Cache efficient bidiagonalization using BLAS 2.5 operators,"
*ACM Trans. Mathematical Software*, vol. 34, no. 3, pp. 14:1-33, May 2008. - J. Demmel, I. Dumitriu, O. Holtz, and P. Koev, "Accurate and efficient expression evaluation and linear algebra,"
*Acta Numerica*, vol. 17, pp. 87-145, May 2008. - J. Nie, J. Demmel, and M. Gu, "Global minimization of rational functions and the nearest GCDs,"
*J. Global Optimization*, vol. 40, no. 4, pp. 697-718, April 2008. - J. Demmel, O. A. Marques, B. N. Parlett, and C. Vomel, "Performance and accuracy of LAPACK's symmetric tridiagonal eigensolvers,"
*SIAM J. Scientific Computing*, vol. 30, no. 3, pp. 1508-1526, April 2008. - D. Bindel, J. Demmel, and M. Friedman, "Continuation of invariant subspaces in large bifurcation problems,"
*SIAM J. Scientific Computing*, vol. 30, no. 2, pp. 637-656, Feb. 2008. - J. Demmel, I. Dumitriu, and O. Holtz, "Fast linear algebra is stable,"
*Numerische Mathematik*, vol. 108, no. 1, pp. 59-91, Oct. 2007. - L. Grigori, J. Demmel, and X. S. Li, "Parallel symbolic factorization for sparse LU with static pivoting,"
*SIAM J. Scientific Computing*, vol. 29, no. 3, pp. 1289-1314, May 2007. - R. Nishtala, R. W. Vuduc, J. Demmel, and K. A. Yelick, "When cache blocking of sparse matrix vector multiply works and why,"
*Applicable Algebra in Engineering, Communication and Computing*, vol. 18, no. 3, pp. 297-311, May 2007. - H. Choo, D. Garmire, J. Demmel, and R. S. Muller, "Simple fabrication process for self-aligned, high-performance microscanners--Demonstrated use to generate a 2-D ablation pattern,"
*J. Microelectromechanical Systems*, vol. 16, no. 2, pp. 260-268, April 2007. - J. Demmel, I. Dumitriu, O. Holtz, and R. Kleinberg, "Fast matrix multiplication is stable,"
*Numerische Mathematik*, vol. 106, no. 2, pp. 199-224, March 2007. - J. Demmel, Y. Hida, W. M. Kahan, X. S. Li, S. Mukherjee, and E. J. Riedy, "Error bounds from extra-precise iterative refinement,"
*ACM Trans. Mathematical Software*, vol. 32, no. 2, pp. 325-351, June 2006. - J. Nie, J. Demmel, and B. Sturmfels, "Minimizing polynomials via sum of squares over the gradient ideal,"
*Mathematical Programming*, vol. 106, no. 3, pp. 587-606, May 2006. - J. Demmel and P. Koev, "Accurate and efficient evaluation of Schur and Jack functions,"
*Mathematics of Computation*, vol. 75, no. 253, pp. 223-239, Jan. 2006. - J. Demmel and P. Koev, "The accurate and efficient solution of a totally positive generalized Vandermonde linear system,"
*SIAM J. Matrix Analysis and Applications*, vol. 27, no. 1, pp. 142-152, 2005. - R. Vuduc, J. Demmel, and K. A. Yelick, "OSKI: A library of automatically tuned sparse matrix kernels,"
*J. Physics: Conference Series*, vol. 16, no. 1, pp. 521-530, 2005. - J. Nie and J. Demmel, "Minimum ellipsoid bounds for solutions of polynomial systems via sum of squares,"
*J. Global Optimization*, vol. 33, no. 4, pp. 511-525, Dec. 2005. - J. Demmel, J. Dongarra, V. Eijkhout, E. Fuentes, A. Petitet, R. Vuduc, R. C. Whaley, and K. A. Yelick, "Self-adapting linear algebra algorithms and software,"
*Proc. IEEE*, vol. 93, no. 2, pp. 293-312, Feb. 2005. - R. Vuduc, J. Demmel, and J. A. Bilmes, "Statistical models for empirical search-based performance tuning,"
*Intl. J. High Performance Computing Applications*, vol. 18, no. 1, pp. 65-94, 2004. - J. Demmel and Y. Hida, "Fast and accurate floating point summation with application to computational geometry,"
*Numerical Algorithms*, vol. 37, no. 1-4, pp. 101-112, Dec. 2004. - J. Demmel and P. Koev, "Accurate SVDs of weakly diagonally dominant M-matrices,"
*Numerische Mathematik*, vol. 98, no. 1, pp. 99-104, July 2004. - J. Demmel and Y. Hida, "Accurate and efficient floating point summation,"
*SIAM J. Scientific Computation*, vol. 25, no. 4, pp. 1214-1248, 2003. - X. S. Li and J. Demmel, "SuperLU_DIST: A scalable distributed-memory sparse direct solver for unsymmetric linear systems,"
*ACM Trans. Mathematical Software*, vol. 29, no. 2, pp. 110-140, June 2003. - E. Mizutani and J. Demmel, "On structure-exploiting trust-region regularized nonlinear least squares algorithms for neural-network learning,"
*Neural Networks: Special Issue on Advances in Neural Networks Research (IJCNN '03)*, vol. 16, no. 5-6, pp. 745-753, June 2003. - X. S. Li, J. Demmel, D. H. Bailey, G. Henry, Y. Hida, J. Iskandar, W. M. Kahan, S. Y. Kang, A. Kapur, M. C. Martin, B. J. Thompson, T. Tung, and D. J. Yoo, "Design, implementation and testing of Extended and Mixed Precision BLAS,"
*ACM Trans. Mathematical Software*, vol. 28, no. 2, pp. 152-205, June 2002. - L. S. Blackford, J. Demmel, J. Dongarra, I. Duff, S. Hammarling, G. Henry, M. Heroux, L. Kaufman, A. Lumsdaine, A. Petitet, R. Pozo, K. Remington, and R. C. Whaley, "An updated set of Basic Linear Algebra Subprograms (BLAS),"
*ACM Trans. Mathematical Software*, vol. 28, no. 2, pp. 135-151, June 2002. - D. Bindel, J. Demmel, W. M. Kahan, and O. Marques, "On computing Givens rotations reliably and efficiently,"
*ACM Trans. Mathematical Software*, vol. 28, no. 2, pp. 206-238, June 2002. - J. Demmel, B. Diament, and G. Malajovich, "On the complexity of computing error bounds,"
*Foundations of Computational Mathematics*, vol. 1, no. 1, pp. 101-125, Sep. 2001. - J. Demmel, L. Dieci, and M. J. Friedman, "Computing connecting orbits via an improved algorithm for continuing invariant subspaces,"
*SIAM J. Scientific Computing*, vol. 22, no. 1, pp. 81-94, 2000. - J. Demmel, "Accurate singular value decompositions of structured matrices,"
*SIAM J. Matrix Analysis and Applications*, vol. 21, no. 2, pp. 562-580, Oct. 2000. - J. Demmel, J. R. Gilbert, and X. S. Li, "An asynchronous parallel supernodal algorithm for sparse Gaussian elimination,"
*SIAM J. Matrix Analysis and Applications*, vol. 20, no. 4, pp. 915-952, Oct. 1999. - J. Demmel, S. C. Eisenstat, J. R. Gilbert, X. S. Li, and J. W. H. Liu, "A supernodal approach to sparse partial pivoting,"
*SIAM J. Matrix Analysis and Applications*, vol. 20, no. 3, pp. 720-7551, July 1999. - B. G. Pfrommer, J. Demmel, and H. Simon, "Unconstrained energy functionals for electronic structure calculations,"
*J. Computational Physics*, vol. 150, no. 1, pp. 287-298, March 1999. - J. Saltz, A. Sussman, S. L. Graham, J. Demmel, S. Baden, and J. Dongarra, "Programming tools and environments,"
*Communications of the ACM*, vol. 41, no. 11, pp. 64-73, Nov. 1998. - Z. Bai and J. Demmel, "Using the matrix sign function to compute invariant subspaces,"
*SIAM J. Matrix Analysis and Applications*, vol. 19, no. 1, pp. 205-225, Jan. 1998. - S. Chakrabarti, J. Demmel, and K. A. Yelick, "Models and scheduling algorithms for mixed data and task parallel programs,"
*J. Parallel and Distributed Computing: Special Issue on Dynamic Load Balancing*, vol. 47, no. 1, pp. 168-184, Nov. 1997. - Z. Bai, J. Demmel, J. Dongarra, A. Petitet, H. Robinson, and K. Stanley, "The spectral decomposition of nonsymmetric matrices on distributed memory parallel computers,"
*SIAM J. Scientific Computing*, vol. 18, no. 5, pp. 1446-1461, Sep. 1997. - L. S. Blackford, A. Cleary, A. Petitet, R. C. Whaley, J. Demmel, I. Dhillon, H. Ren, K. Stanley, J. Dongarra, and S. Hammarling, "Practical experience in the numerical dangers of heterogeneous computing,"
*ACM Trans. Mathematical Software*, vol. 23, no. 2, pp. 133-147, June 1997. - Z. Bai, J. Demmel, and M. Gu, "An inverse free parallel spectral divide and conquer algorithm for nonsymmetric eigenproblems,"
*Numerische Mathematik*, vol. 76, no. 3, pp. 279-308, May 1997. - J. Choi, J. Demmel, I. S. Dhillon, J. Dongarra, S. Ostrouchov, A. Petitet, K. Stanley, D. W. Walker, and R. C. Whaley, "ScaLAPACK, a portable linear algebra library for distributed memory computers -- Design issues and performance,"
*Computer Physics Communications*, vol. 97, no. 1-2, pp. 1-15, Aug. 1996. - D. Manocha and J. Demmel, "Algorithms for intersecting parametric and algebraic curves. II. Multiple intersections,"
*Graphical Models and Image Processing*, vol. 57, no. 2, pp. 81-100, March 1995. - J. Demmel and X. Li, "Faster numerical algorithms via exception handling,"
*IEEE Trans. Computers*, vol. 43, no. 8, pp. 983-992, Aug. 1994. - D. Manocha and J. Demmel, "Algorithms for intersecting parametric and algebraic curves. I. Simple intersections,"
*ACM Trans. Graphics*, vol. 13, no. 1, pp. 73-100, Jan. 1994. - Z. Bai and J. Demmel, "Computing the generalized singular value decomposition,"
*SIAM J. Scientific Computing*, vol. 14, no. 6, pp. 1464-1486, Nov. 1993. - V. Pan and J. Demmel, "A new algorithm for the symmetric tridiagonal eigenvalue problem,"
*J. of Complexity*, vol. 9, no. 3, pp. 387-405, Sep. 1993. - J. Demmel and B. Kagstrom, "The generalized Schur decomposition of an arbitrary pencil A - lambdaB: Robust software with error bounds and applications. Part II: Software and applications,"
*ACM Trans. Mathematical Software*, vol. 19, no. 2, pp. 175-201, June 1993. - J. Demmel and B. Kagstrom, "The generalized Schur decomposition of an arbitrary pencil A - lambdaB: Robust software with error bounds and applications. Part I: Theory and algorithms,"
*ACM Trans. Mathematical Software*, vol. 19, no. 2, pp. 160-174, June 1993. - Z. Bai, J. Demmel, and A. McKenney, "On computing condition numbers for the nonsymmetric eigenproblem,"
*ACM Trans. Mathematical Software*, vol. 19, no. 2, pp. 202-223, June 1993. - J. Demmel and K. Veselic, "Jacobi's method is more accurate than QR,"
*SIAM J. Matrix Analysis and Applications*, vol. 13, no. 4, pp. 1204-1245, Oct. 1992. - J. Demmel and N. J. Higham, "Stability of block algorithms with fast level-3 BLAS,"
*ACM Trans. Mathematical Software*, vol. 18, no. 3, pp. 274-291, Sep. 1992. - J. Demmel, "LAPACK: A portable linear algebra library for high-performance computers,"
*Concurrency: Practice and Experience*, vol. 3, no. 6, pp. 655-666, Dec. 1991. - J. Dongarra and J. Demmel, "LAPACK: A portable high-performance numerical library for linear algebra,"
*Supercomputer*, vol. 8, no. 6, pp. 33-38, Nov. 1991. - J. Demmel and W. M. Kahan, "Accurate singular values of bidiagonal matrices,"
*SIAM Journal on Scientific and Statistical Computing*, vol. 11, no. 5, pp. 873-912, Sep. 1990. - K. Perlin, J. Demmel, and P. K. Wright, "Simulation software for the Utah/MIT dextrous hand,"
*Robotics and Computer-Integrated Manufacturing*, vol. 5, no. 4, pp. 281-292, 1989. - Z. Bai and J. Demmel, "On a block implementation of Hessenberg multishift QR iteration,"
*Intl. J. High Speed Computing*, vol. 1, no. 1, pp. 97-112, May 1989. - J. Demmel, G. Lafferriere, J. Schwartz, and M. Sharir, "Theoretical and experimental studies using a planar multifinger manipulator,"
*Naval Research Reviews*, vol. 40, no. 3, pp. 14-23, 1988. - J. Demmel, "The probability that a numerical analysis problem is difficult,"
*Mathematics of Computation*, vol. 50, no. 182, pp. 449-480, April 1988. - J. Demmel, "Three methods for refining estimates of invariant subspaces,"
*Computing*, vol. 38, no. 1, pp. 43-57, Dec. 1987. - J. Demmel, "The geometry of III-conditioning,"
*J. of Complexity*, vol. 3, no. 2, pp. 201-229, June 1987. - J. Demmel, "On condition numbers and the distance to the nearest ill-posed problem,"
*Numerische Mathematik*, vol. 51, no. 3, pp. 251-289, May 1987. - J. Demmel, "A counterexample for two conjectures about stability,"
*IEEE Trans. Automatic Control*, vol. AC-32, no. 4, pp. 340-342, April 1987. - J. Demmel and B. Kagstrom, "Computing stable eigendecompositions of matrix pencils,"
*Linear Algebra and Its Applications*, vol. 88-89, pp. 139-186, April 1987. - J. Demmel, "The smallest perturbation of a submatrix which lowers the rank and constrained total least squares problems,"
*SIAM J. Numerical Analysis*, vol. 24, no. 1, pp. 199-206, Feb. 1987. - J. Demmel and F. Kruckeberg, "An interval algorithm for solving systems of linear equations to prespecified accuracy,"
*Computing*, vol. 34, no. 2, pp. 117-129, June 1985. - J. Demmel, "Underflow and the reliability of numerical software,"
*SIAM J. Scientific Computing*, vol. 5, no. 4, pp. 887-919, Dec. 1984.

## Articles in conference proceedings

- B. C. Catanzaro, S. A. Kamil, Y. Lee, K. AsanoviÄ‡, J. Demmel, K. Keutzer, J. Shalf, K. A. Yelick, and A. Fox, "SEJITS: Getting productivity and performance with selective embedded JIT specialization," in
*Proceedings First Workshop on Programming Models for Emerging Architectures*, 2009. - V. Volkov and J. Demmel, "Benchmarking GPUs to tune dense linear algebra," in
*Proc. 2008 ACM/IEEE Conf. on Superconducting (SC '08)*, Piscataway, NJ: IEEE Press, 2008, pp. Art. 31:1-11. - L. Grigori, J. Demmel, and H. Xiang, "Communication avoiding Gaussian elimination," in
*Proc. 2008 ACM/IEEE Conf. on Superconducting (SC '08)*, Piscataway, NJ: IEEE Press, 2008, pp. Art. 29:1-12. - J. Demmel, M. Hoemmen, M. Hohiyuddin, and K. A. Yelick, "Avoiding communication in sparse matrix computations," in
*Proc. 22nd IEEE Intl. Parallel & Distributed Processing Symp. (IPDPS 2008)*, Piscataway, NJ: IEEE Press, 2008, pp. 12 pg. - S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. A. Yelick, and J. Demmel, "Optimization of sparse matrix-vector multiplication on emerging multicore platforms," in
*Proc. 2007 ACM/IEEE Conf. on Supercomputing (SC '07)*, New York, NY: The Association for Computing Machinery, Inc., 2007, pp. Art. 38. - D. Garmire, H. Choo, R. Kant, S. govindjee, C. H. Séquin, R. S. Muller, and J. Demmel, "Diamagnetically levitated MEMS accelerometers (Poster Paper)," in
*14th IEEE Intl. Conf. on Solid-State Sensors, Actuators and Microsystems (TRANSDUCERS 2007) Digest of Technical Papers*, Piscataway, NJ: IEEE Press, 2007, pp. 1203-1206. - S. Kim, S. Pakzad, D. E. Culler, J. Demmel, G. Fenves, S. Glaser, and M. Turon, "Health monitoring of civil infrastructures using wireless sensor networks," in
*Proc. 6th Intl. Symp. on Information Processing in Sensor Networks (IPSN 2007)*, New York, NY: The Association for Computing Machinery, Inc., 2007, pp. 254-263. - H. Gahvari, M. Hoemmen, J. Demmel, and K. A. Yelick, "Benchmarking sparse matrix-vector multiply in five minutes," in
*Proc. 2007 SPEC Benchmark Workshop*, Warrenton, VA: Standard Performance Evaluation Corporation, 2007, pp. 11 pg. - S. Kim, S. Pakzad, D. E. Culler, J. Demmel, G. Fenves, S. Glaser, and M. Turon, "Wireless sensor networks for structural health monitoring," in
*Proc. 4th Intl. Conf. on Embedded Networks Sensor Systems (SynSys '06)*, New York, NY: The Association for Computing Machinery, Inc., 2006, pp. 427-428. - J. Demmel, I. Dumitriu, and O. Holtz, "Toward accurate polynomial evaluation in rounded arithmetic (Short report)," in
*Algebraic and Numerical Algorithms and Computer-Assisted Proofs*, B. Buchberger, S. Oishi, M. Plum, and S. M. Rump, Eds., Dagsthul Seminar Proceedings, Dagstuhl, Germany: IBFI, 2006, pp. 1-15. - T. Koyama, D. S. Bindel, W. He, E. P. Quevy, S. Govindjee, J. Demmel, and R. T. Howe, "Simulation tools for damping in high frequency resonators," in
*Proc. 4th IEEE Conf. on Sensors (SENSORS 2005)*, Piscataway, NJ: IEEE Press, 2005, pp. 349-352. - S. N. Pakzad, S. Kim, G. L. Fenves, S. D. Glaser, D. E. Culler, and J. Demmel, "Multi-purpose wireless accelerometers for civil infrastructure monitoring," in
*Structural Health Monitoring 2005: Proc. 5th Intl. Workshop (IWSHM 2005)*, F. K. Chang, Ed., Lancaster, PA: DEStech Publications, Inc., 2005, pp. 125-132. - E. Mizutani, S. E. Dreyfus, and J. Demmel, "Second-order backpropagation algorithms for a stagewise-partitioned separable Hessian matrix," in
*Proc. 2005 IEEE Intl. Joint Conf. on Neural Networks (IJCNN '05)*, Vol. 2, Piscataway, NJ: IEEE Press, 2005, pp. 1027-1032. - H. Choo, D. Garmire, J. Demmel, and R. S. Muller, "A simple process to fabricate self-aligned, high-performance torsional microscanners; demonstrated use in a two-dimensional scanner," in
*IEEE/LEOS Optical MEMs 2005: Intl. Conf. on Optical MEMs and Their Applications*, Piscataway, NJ: IEEE Press, 2005, pp. 21-22. - D. Garmire, R. S. Muller, and J. Demmel, "Vision-based teleoperation of a Stroboscopic Microscopic Interferometric System for remote dynamic MEMS testing," in
*IEEE/LEOS Optical MEMs 2005: Intl. Conf. on Optical MEMs and Their Applications*, Piscataway, NJ: IEEE Press, 2005, pp. 163-164. - D. S. Bindel, E. Quevy, T. Koyama, S. Govindjee, J. Demmel, and R. T. Howe, "Anchor loss simulation in resonators," in
*18th IEEE Intl. Conf. on Micro Electro Mechanical Systems Technical Digest (MEMS 2005)*, Piscataway, NJ: IEEE Press, 2005, pp. 133-136. - B. C. Lee, R. W. Vuduc, J. Demmel, and K. A. Yelick, "Performance models for evaluation and automatic tuning of symmetric sparse matrix-vector multiply (Best Paper Prize)," in
*Proc. 2004 Intl. Conf. on Parallel Processing (ICPP 2004)*, R. Eigenmann, Ed., Vol. 1, Los Alamitos, CA: IEEE Computer Society, 2004, pp. 169-176. - E. Mizutani and J. Demmel, "On sparsity-exploiting memory-efficient trust-region regularized nonlinear least squares algorithms for neural-network learning," in
*Proc. 2003 IEEE Intl. Joint Conf. on Neural Networks (IJCNN '03)*, Vol. 1, Piscataway, NJ: IEEE Press, 2003, pp. 242-247. - R. Vuduc, J. Demmel, K. A. Yelick, S. Kamil, R. Nishtala, and B. Lee, "Performance optimizations and bounds for sparse matrix-vector multiply," in
*Proc. ACM/IEEE 2002 Conf. on Supercomputing (SC '02)*, Los Alamitos,CA: IEEE Computer Society, 2002, pp. 35 pg. - J. Demmel, "The complexity of accurate floating point computation," in
*Proc. Intl. Congress of Mathematicians*, T. Li, Ed., Vol. 3, Beijing, China: The Higher Education Press of China, 2002, pp. 697-706. - E. Mizutani and J. Demmel, "On separable nonlinear least squares algorithms for neuro-fuzzy modular network learning," in
*Proc. 2002 International Joint Conf. on Neural Networks (IJCNN '02)*, Vol. 3, Piscataway, NJ: IEEE Press, 2002, pp. 2399-2404. - J. V. Clark, D. Bindel, W. Kao, E. Zhu, A. Kuo, N. Zhou, J. Nie, J. Demmel, Z. Bai, S. Govindjee, K. Pister, M. Gu, and A. Agogino, "Addressing the needs of complex MEMS design," in
*Proc. 15th IEEE Intl. Conf. on Micro Electro Mechanical Systems*, Piscataway, NJ: IEEE Press, 2002, pp. 204-209. - X. S. Li and J. Demmel, "Making sparse Gaussian elimination scalable by static pivoting," in
*Proc. of the 1998 ACM/IEEE Supercomputing Conf.*, Washington, D.C.: IEEE Computer Society, 1998, pp. 19 pp.. - E. Anderson, Z. Bai, J. Dongarra, A. Greenbaum, A. McKenney, J. Du Croz, S. Hammarling, J. Demmel, C. H. Bischof, and D. C. Sorensen, "LAPACK: A portable linear algebra library for high-performance supercomputers," in
*Proc. Supercomputing '90*, Los Alamitos, CA: IEEE Computer Society, 1990, pp. 2-11. - J. Demmel, "On structured singular values," in
*Proc. 27th IEEE Conf. on Decision and Control*, Vol. 3, Piscataway, NJ: IEEE Press, 1988, pp. 2138-2143.

## Technical Documentation

- J. Demmel, J. R. Gilbert, and X. S. Li, "SuperLU Users' Guide," 2003.
- E. Anderson, Z. Bai, C. Bischof, S. Blackford, J. Demmel, J. Dongarra, J. Du Croz, A. Greenbaum, S. Hammarling, A. McKenney, and D. Sorensen, "LAPACK Users' Guide (3rd. ed., 407 pp., ISBN:0-89871-447-8)," 1999. [abstract]
- L. S. Blackford, J. Choi, A. Cleary, E. D'Azevedo, J. Demmel, I. Dhillon, J. Dongarra, S. Hammarling, G. Henry, A. Petitet, K. Stanley, D. Walker, and R. C. Whaley, "ScaLAPACK Users' Guide (325 pp., ISBN 0-89871-397-8)," 1997. [abstract]

## Technical Reports

- J. Demmel, L. Grigori, and S. Cayrols, "Low Rank Approximation of a Sparse Matrix Based on LU Factorization with Column and Row Tournament Pivoting," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2016-122, June 2016. [abstract]
- J. Demmel, P. Ahrens, and H. D. Nguyen, "Efficient Reproducible Floating Point Summation and BLAS," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2016-121, June 2016. [abstract]
- P. Ahrens, H. D. Nguyen, and J. Demmel, "Efficient Reproducible Floating Point Summation and BLAS," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2015-229, Dec. 2015. [abstract]
- E. Carson, J. Demmel, L. Grigori, N. Knight, P. Koanantakool, O. Schwartz, and H. V. Simhadri, "Write-Avoiding Algorithms," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2015-163, June 2015.
- Y. You, J. Demmel, K. Czechowski, L. Song, and R. Vuduc, "CA-SVM: Communication-Avoiding Parallel Support Vector Machines on Distributed Systems," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2015-9, Feb. 2015. [abstract]
- E. Carson and J. Demmel, "Accuracy of the s-step Lanczos method for the symmetric eigenproblem," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2014-165, Sep. 2014. [abstract]
- G. Ballard, J. Demmel, A. Gearhart, B. Lipshitz, O. Schwartz, and S. Toledo, "Contention Bounds for Combinations of Computation Graphs and Network Topologies," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2014-147, Aug. 2014. [abstract]
- E. Solomonik, D. Matthews, J. Hammond, J. Stanton, and J. Demmel, "A massively parallel tensor contraction framework for coupled-cluster computations," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2014-143, Aug. 2014. [abstract]
- E. Carson and J. Demmel, "Error analysis of the s-step Lanczos method in finite precision," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2014-55, May 2014. [abstract]
- E. Carson and J. Demmel, "Analysis of the finite precision s-step biconjugate gradient method," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2014-18, March 2014. [abstract]
- E. Solomonik, E. Carson, N. Knight, and J. Demmel, "Tradeoffs between synchronization, communication, and work in parallel linear algebra computations," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2014-8, Jan. 2014. [abstract]
- G. Ballard, J. Demmel, L. Grigori, M. Jacquelin, H. D. Nguyen, and E. Solomonik, "Reconstructing Householder Vectors from Tall-Skinny QR," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2013-175, Oct. 2013. [abstract]
- G. Ballard, J. Demmel, and N. Knight, "Avoiding Communication in Successive Band Reduction," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2013-131, July 2013. [abstract]
- G. Ballard, D. Becker, J. Demmel, J. Dongarra, A. Druinsky, I. Peled, O. Schwartz, S. Toledo, and I. Yamazaki, "Communication-Avoiding Symmetric-Indefinite Factorization," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2013-127, July 2013. [abstract]
- J. Demmel, "An arithmetic complexity lower bound for computing rational functions, with applications to linear algebra," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2013-126, July 2013. [abstract]
- M. Christ, J. Demmel, N. Knight, T. Scanlon, and K. A. Yelick, "Communication Lower Bounds and Optimal Algorithms for Programs That Reference Arrays - Part 1," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2013-61, May 2013. [abstract]
- N. Knight, E. Carson, and J. Demmel, "Exploiting Data Sparsity in Parallel Matrix Powers Computations," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2013-47, May 2013. [abstract]
- J. Demmel, L. Grigori, M. Gu, and H. Xiang, "Communication Avoiding Rank Revealing QR Factorization with Column Pivoting," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2013-46, May 2013. [abstract]
- G. Ballard, A. Buluc, J. Demmel, L. Grigori, B. Lipshitz, O. Schwartz, and S. Toledo, "Communication Optimal Parallel Multiplication of Sparse Random Matrices," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2013-13, Feb. 2013. [abstract]
- G. Ballard, J. Demmel, B. Lipshitz, O. Schwartz, and S. Toledo, "Communication Efficient Gaussian Elimination with Partial Pivoting using a Shape Morphing Data Layout," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2013-12, Feb. 2013. [abstract]
- E. Solomonik, D. Matthews, J. Hammond, and J. Demmel, "Cyclops Tensor Framework: reducing communication and eliminating load imbalance in massively parallel contractions," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2013-11, Feb. 2013. [abstract]
- E. Solomonik, A. Buluc, and J. Demmel, "Minimizing communication in all-pairs shortest paths," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2013-10, Feb. 2013. [abstract]
- J. Byun, R. Lin, K. A. Yelick, and J. Demmel, "Autotuning Sparse Matrix-Vector Multiplication for Multicore," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2012-215, Nov. 2012. [abstract]
- E. Solomonik, D. Matthews, J. Hammond, and J. Demmel, "Cyclops Tensor Framework: reducing communication and eliminating load imbalance in massively parallel contractions," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2012-210, Nov. 2012. [abstract]
- J. Demmel, D. Eliahu, A. Fox, S. A. Kamil, B. Lipshitz, O. Schwartz, and O. Spillinger, "Communication-Optimal Parallel Recursive Rectangular Matrix Multiplication," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2012-205, Oct. 2012. [abstract]
- E. Carson and J. Demmel, "A Residual Replacement Strategy for Improving the Maximum Attainable Accuracy of s-step Krylov Subspace Methods," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2012-197, Sep. 2012. [abstract]
- G. Ballard, J. Demmel, O. Holtz, B. Lipshitz, and O. Schwartz, "Graph Expansion Analysis for Communication Costs of Fast Rectangular Matrix Multiplication," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2012-194, Sep. 2012. [abstract]
- J. Demmel and A. Gearhart, "Instrumenting Linear Algebra Energy Consumption via On-chip Energy Counters," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2012-168, June 2012. [abstract]
- J. Demmel, A. Gearhart, O. Schwartz, and B. Lipshitz, "Perfect strong scaling using no additional energy," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2012-126, May 2012. [abstract]
- B. Lipshitz, G. Ballard, O. Schwartz, and J. Demmel, "Communication-Avoiding Parallel Strassen: Implementation and Performance," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2012-90, May 2012. [abstract]
- G. Ballard, J. Demmel, O. Holtz, and O. Schwartz, "Sequential Communication Bounds for Fast Linear Algebra," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2012-36, March 2012. [abstract]
- G. Ballard, J. Demmel, O. Holtz, B. Lipshitz, and O. Schwartz, "Communication-Optimal Parallel Algorithm for Strassen’s Matrix Multiplication," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2012-32, March 2012. [abstract]
- G. Ballard, J. Demmel, O. Holtz, B. Lipshitz, and O. Schwartz, "Strong Scaling of Matrix Multiplication Algorithms and Memory-Independent Communication Lower Bounds," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2012-31, March 2012. [abstract]
- E. Solomonik, J. Hammond, and J. Demmel, "A preliminary analysis of Cyclops Tensor Framework," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2012-29, March 2012. [abstract]
- E. Solomonik and J. Demmel, "Matrix multiplication on multidimensional torus networks," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2012-28, Feb. 2012. [abstract]
- E. Solomonik, A. Buluc, and J. Demmel, "Minimizing communication in all-pairs shortest-paths," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2012-19, Feb. 2012. [abstract]
- A. Khabou, J. Demmel, L. Grigori, and M. Gu, "LU Factorization with Panel Rank Revealing Pivoting and Its Communication Avoiding Version," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2012-15, Jan. 2012. [abstract]
- E. Carson, N. Knight, and J. Demmel, "Avoiding Communication in Two-Sided Krylov Subspace Methods," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2011-93, Aug. 2011.
- E. Solomonik, A. Bhatele, and J. Demmel, "Improving communication performance in dense linear algebra via topology aware collectives," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2011-92, Aug. 2011. [abstract]
- E. Solomonik and J. Demmel, "Communication-optimal parallel 2.5D matrix multiplication and LU factorization algorithms," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2011-72, June 2011. [abstract]
- G. Ballard, J. Demmel, O. Holtz, and O. Schwartz, "Graph Expansion and Communication Costs of Fast Matrix Multiplication," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2011-40, May 2011. [abstract]
- G. Ballard, J. Demmel, O. Holtz, and O. Schwartz, "Minimizing Communication in Numerical Linear Algebra," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2011-15, Feb. 2011. [abstract]
- G. Ballard, J. Demmel, and I. Dumitriu, "Minimizing Communication for Eigenproblems and the Singular Value Decomposition," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2011-14, Feb. 2011. [abstract]
- G. Ballard, J. Demmel, and A. Gearhart, "Communication Bounds for Heterogeneous Architectures," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2011-13, Feb. 2011. [abstract]
- E. Solomonik and J. Demmel, "Communication-optimal parallel 2.5D matrix multiplication and LU factorization algorithms," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2011-10, Feb. 2011.
- A. Kaiser, S. Williams, K. Madduri, K. Ibrahim, D. Bailey, J. Demmel, and E. Strohmaier, "TORCH Computational Reference Kernels: A Testbed for Computer Science Research," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2010-144, Dec. 2010. [abstract]
- M. Anderson, G. Ballard, J. Demmel, and K. Keutzer, "Communication-Avoiding QR Decomposition for GPUs," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2010-131, Oct. 2010. [abstract]
- J. Demmel, L. Grigori, and H. Xiang, "CALU: A Communication Optimal LU Factorization Algorithm," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2010-29, March 2010. [abstract]
- B. Catanzaro, S. A. Kamil, Y. Lee, K. AsanoviÄ‡, J. Demmel, K. Keutzer, J. Shalf, K. A. Yelick, and A. Fox, "SEJITS: Getting Productivity and Performance With Selective Embedded JIT Specialization," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2010-23, March 2010. [abstract]
- G. Ballard, J. Demmel, O. Holtz, and O. Schwartz, "Communication-optimal Parallel and Sequential Cholesky decomposition," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2009-29, Feb. 2009.
- J. Demmel, L. Grigori, M. F. Hoemmen, and J. Langou, "Communication-optimal parallel and sequential QR and LU factorizations," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2008-89, Aug. 2008. [abstract]
- J. Demmel, M. F. Hoemmen, Y. Hida, and J. Riedy, "Non-Negative Diagonals and High Performance on Low-Profile Matrices from Householder QR," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2008-76, May 2008. [abstract]
- V. Volkov and J. Demmel, "LU, QR and Cholesky Factorizations using Vector Capabilities of GPUs," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2008-49, May 2008. [abstract]
- K. Asanovic, R. Bodik, J. Demmel, T. Keaveny, K. Keutzer, J. D. Kubiatowicz, E. A. Lee, N. Morgan, G. Necula, D. A. Patterson, K. Sen, J. Wawrzynek, D. Wessel, and K. A. Yelick, "The Parallel Computing Laboratory at U.C. Berkeley: A Research Agenda Based on the Berkeley View," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2008-23, March 2008. [abstract]
- V. Volkov and J. Demmel, "Using GPUs to Accelerate the Bisection Algorithm for Finding Eigenvalues of Symmetric Tridiagonal Matrices," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2007-179, Dec. 2007. [abstract]
- J. Demmel, M. F. Hoemmen, M. Mohiyuddin, and K. A. Yelick, "Avoiding Communication in Computing Krylov Subspaces," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2007-123, Oct. 2007. [abstract]
- J. Demmel, Y. Hida, X. Li, and E. J. Riedy, "Extra-precise Iterative Refinement for Overdetermined Least Squares Problems," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2007-77, May 2007. [abstract]
- S. Kim, S. Pakzad, D. E. Culler, J. Demmel, G. Fenves, S. Glaser, and M. Turon, "Health Monitoring of Civil Infrastructures Using Wireless Sensor Networks," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2006-121, Oct. 2006. [abstract]
- D. S. Bindel, J. Demmel, and M. Friedman, "Continuation of Invariant Subspaces for Large Bifurcation Problems," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2006-13, Feb. 2006. [abstract]
- J. W. Demmel, Y. Hida, W. Kahan, X. S. Li, S. Mukherjee, and E. J. Riedy, "Error Bounds from Extra Precise Iterative Refinement," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-04-1344, March 2005. [abstract]
- R. Nishtala, R. W. Vuduc, J. W. Demmel, and K. A. Yelick, "Performance Modeling and Analysis of Cache Blocking in Sparse Matrix Vector Multiply," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-04-1335, 2004. [abstract]
- B. C. Lee, R. W. Vuduc, J. W. Demmel, K. A. Yelick, M. de Lorimier, and L. Zhong, "Performance Optimizations and Bounds for Sparse Symmetric Matrix-Multiple Vector Multiply," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-03-1297, 2003. [abstract]
- R. Vuduc, A. Gyulassy, J. Demmel, and K. A. Yelick, "Memory Hierarchy Optimizations and Performance Bounds for Sparse A^T Ax," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-03-1232, Feb. 2003. [abstract]
- J. Demmel and Y. Hida, "Accurate Floating Point Summation," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-02-1180, May 2002. [abstract]
- D. Bindel, J. Demmel, W. M. Kahan, and O. Marques, "On Computing Givens Rotations Reliably and Efficiently," University of Tennessee, Knoxville, Computer Science Department, Tech. Rep. UTK/CS-00-449, Oct. 2000.
- X. S. Li, J. Demmel, D. H. Bailey, G. Henry, Y. Hida, J. Iskandar, W. M. Kahan, S. Y. Karg, A. Kapur, M. C. Martin, B. J. Thompson, T. Tung, and D. J. Yoo, "Design, Implementation and Testing of Extended and Mixed Precision BLAS," Lawrence Berkeley National Laboratory, Tech. Rep. LBNL-00-45991, June 2000.
- J. W. Demmel, J. Gilbert, and X. S. Li, "SuperLU Users' Guide," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-97-944, May 1997. [abstract]
- J. W. Demmel, J. R. Gilbert, and X. S. Li, "An Asynchronous Parallel Supernodal Algorithm for Sparse Gaussian Elimination," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-97-943, Feb. 1997. [abstract]
- J. Demmel, M. Gu, S. Eisenstat, I. Slapnicar, K. Veselic, and Z. Drmac, "Computing the Singular Value Decomposition with High Relative Accuracy," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-97-934, Feb. 1997. [abstract]
- S. L. Smith, M. Y. Ivory, and J. Demmel, "Modeling and Identifying Bottlenecks in the EOSDIS Architecture," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-97-957, Sep. 1996. [abstract]
- J. W. Demmel, S. C. Eisenstat, J. R. Gilbert, X. S. Li, and J. W. Liu, "A Supernodal Approach to Sparse Partial Pivoting," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-95-883, July 1995. [abstract]
- J. W. Demmel, I. Dhillon, and H. Ren, "On the Correctness of Parallel Bisection in Floating Point," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-94-805, March 1994. [abstract]
- Z. Bai, J. W. Demmel, and M. Gu, "Inverse Free Parallel Spectral Divide and Conquer Algorithms for Nonsymmetric Eigenproblems," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-94-793, Feb. 1994. [abstract]
- J. W. Demmel and X. Li, "Faster Numerical Algorithms via Exception Handling," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-93-728, Feb. 1993. [abstract]
- Z. Bai and J. W. Demmel, "Design of a Parallel Nonsymmetric Eigenroutine Toolbox, Part I," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-92-718, Feb. 1993. [abstract]
- Z. Bai and J. W. Demmel, "Computing the Generalized Singular Value Decomposition," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-92-720, Dec. 1992. [abstract]
- J. W. Demmel, M. T. Heath, and H. A. van der Vorst, "Parallel Numerical Linear Algebra," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-92-703, Oct. 1992. [abstract]
- J. W. Demmel, "Trading Off Parallelism and Numerical Stability," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-92-702, Sep. 1992. [abstract]
- J. W. Demmel and A. Edelman, "The Dimension of Matrices (Matrix Pencils) with Given Jordan (Kronecker) Canonical Forms," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-92-706, Sep. 1992. [abstract]
- D. Manocha and J. W. Demmel, "Algorithms for Intersecting Parametric and Algebraic Curves," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-92-698, Aug. 1992. [abstract]
- J. Demmel and Z. Bai, "LAPACK Working Note 38: On a Direct Algorithm for Computing Invariant Subspaces with Specified Eigenvalues," University of Tennessee, Knoxville, Computer Science Department, Tech. Rep. UT-CS-91-139, Aug. 1991.
- J. Demmel, J. Dongarra, and W. M. Kahan, "LAPACK Working Note 39: On Designing Portable High Performance Numerical Libraries," University of Tennessee, Knoxville, Department of Computer Science, Tech. Rep. UTK/CS-91-141, Aug. 1991. [abstract]
- Z. Bai and J. W. Demmel, "Computing the Generalized Singular Value Decomposition," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-91-645, Aug. 1991. [abstract]
- E. Anderson, C. Bischof, J. Demmel, J. Dongarra, J. DuCroz, S. Hammarling, and W. M. Kahan, "LAPACK Working Note 26: Prospectus for an Extension to LAPACK: A Portable Linear Algebra Library for High-Performance Computers," University of Tennessee, Knoxville, Department of Computer Science, Tech. Rep. UTK/CS-90-118, Nov. 1990.
- J. W. Demmel and N. J. Higham, "Improved Error Bounds for Underdetermined System Solvers," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-90-587, Aug. 1990. [abstract]
- J. W. Demmel and N. J. Higham, "Stability of Block Algorithms with Fast Level 3 BLAS," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-90-584, July 1990. [abstract]
- J. Demmel and W. M. Kahan, "LAPACK Working Note 3: Computing Small singular Values of Bidiagonal Matrices with Guaranteed High Relative Accuracy," Argonne National Laboratory, Argonne, IL, Tech. Rep. ANL/MCS-TM-110, Feb. 1988. [abstract]
- J. W. Demmel, "Effects of Underflow on Solving Linear Systems," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-83-128, Aug. 1983. [abstract]
- J. W. Demmel, "The Condition Number of Similarities that Diagonalize Matrices," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-83-127, July 1983. [abstract]
- J. W. Demmel and F. Kruckeberg, "An Interval Algorithm for Solving Systems of Linear Equations to Prespecified Accuracy," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-83-126, July 1983. [abstract]

## Software

- E. Anderson, Z. Bai, C. Bischof, J. Demmel, J. Dongarra, J. Du Croz, A. Greenbaum, S. Hammarling, J. Langou, J. Langou, A. McKenney, S. Ostrouchov, and S. Sorenson, "LAPACK, Linear Algebra PACKage," 2006. [abstract]
- K. A. Yelick and J. Demmel, "OSKI -- Optimized Sparse Kernel Interface," 2006. [abstract]
- S. Li, J. Demmel, and J. Gilbert, "SuperLU -- High Performance Sparse Gaussian Elimination," 2006. [abstract]
- A. Agogino, D. Bindel, J. Clark, J. Demmel, S. Govindjee, M. Gu, Z. Bai, K. Pister, C. H. Séquin, N. Zhou, and J. Nie, "SUGAR -- Simulation tool for micro-electro-mechanical systems (MEMS)," 2002. [abstract]
- J. Choi, I. Dhillon, J. Demmel, J. Dongarra, A. Petitet, S. Ostrouchov, K. Stanley, D. Walker, and C. Whaley, "ScaLAPACK, Scalable Linear Algebra Package," 1997. [abstract]

## Patents

- D. Garmire, H. Choo, R. S. Muller, J. Demmel, and S. Govindjee, "Integrated MEMS metrology device using complementary measuring combs," U.S. Patent 8,079,246. Dec. 2011. [abstract]
- H. Choo, D. Garmire, R. S. Muller, and J. Demmel, "Method for fabricating vertically-offset interdigitated comb actuator device," U.S. Patent 7,573,022. Aug. 2009. [abstract]
- H. Choo, R. S. Muller, D. Garmire, J. Demmel, and R. Kant, "MEMS-based, phase-shifting interferometer," U.S. Patent 7,564,559. July 2009. [abstract]
- H. Choo, D. Garmire, R. S. Muller, and J. Demmel, "CMOS-compatible high-performance microscanners, including structures, high-yield simplified fabrication methods, and applications," U.S. Patent Application. Feb. 2007. [abstract]
- H. Choo, D. Garmire, R. S. Muller, and J. Demmel, "CMOS-compatible high-performance microscanners, including structures, high-yield simplified fabrication methods and applications," U.S. Patent Application. July 2006.
- H. Choo, R. S. Muller, D. Garmire, J. Demmel, and R. Kant, "MEMS-based, phase-shifting interferometer," U.S. Patent Application. June 2006. [abstract]

## Talks or presentations

- J. Demmel, W. M. Kahan, and B. N. Parlett, "Forsythe, Golub, and the Future of Matrix Computations," presented at Matrix Computations & Scientific Computing Seminar, 380 Soda Hall, March 2007.

## Masters Reports

- O. Spillinger, D. Eliahu, A. Fox, and J. Demmel, "Matrix Multiplication Algorithm Selection with Support Vector Machines," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2015-29, May 2015. [abstract]
- D. Eliahu, O. Spillinger, A. Fox, and J. Demmel, "FRPA: A Framework for Recursive Parallel Algorithms," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2015-28, May 2015. [abstract]
- A. Singh, "Communication-Avoiding Optimization of Geometric Multigrid on GPUs," J. Demmel, Ed., EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2012-258, Dec. 2012. [abstract]