Papers and Publications for R. Clint Whaley


Journal Publications

  1. "Scaling LAPACK Panel Operations Using Parallel Cache Assignment", by Anthony M. Castaldo, Siju Samuel and R. Clint Whaley. ACM Transactions on Mathematical Software (TOMS), Volume 39, Number 4, pp 22:1-22:30, Article 22, July 2013.

  2. "Reducing Floating Point Error in Dot Product using the Superblock Family of Algorithms", by Anthony M. Castaldo, R. Clint Whaley and Anthony T. Chronopoulos. SIAM Journal on Scientific Computing (SISC), Volume 31, Number 2, pp 1156-1174, 2008.

  3. "Achieving accurate and context-sensitive timing for code optimization", by R. Clint Whaley and Anthony M. Castaldo. Software: Practice & Experience, Volume 38, Number 15, pp 1621-1642, April, 2008.

  4. "Minimizing Development and Maintenance Costs in Supporting Persistently Optimized BLAS", by R. Clint Whaley and Antoine Petitet. Software: Practice & Experience, Volume 35, Number 2, pp 101-121, February, 2005.

  5. "Self-Adapting Linear Algebra Algorithms and Software", by J. Demmel, J. Dongarra, V. Eijkhout, E. Fuentes, A. Petitet, R. Vuduc, R. C. Whaley and K. Yelick. Proceedings of the IEEE, Volume 93, Number 2, pp 293-312, February, 2005.

  6. "An Updated Set of Basic Linear Algebra Subprograms (BLAS)", by L. Susan Blackford, James Demmel, Jack Dongarra, Iain Duff, Sven Hammarling, Greg Henry, Micheal Heroux, Linda Kaufman, Andrew Lumsdain, Antoine Petitet, Roldan Pozo, Karin Remington, and R. Clint Whaley. ACM Transactions on Mathematical Software, 28(2):135-151, June 2002.

  7. "Automated Empirical Optimization of Software and the ATLAS project" by R. Clint Whaley, Antoine Petitet and Jack Dongarra. Parallel Computing, 27(1-2):3-35, 2001.

  8. "Practical Experience in the Numerical Dangers of Heterogeneous Computing", by L. S. Blackford, A. Cleary, J. Demmel, I. Dhillon, J. Dongarra, S. Hammarling, A. Petitet, H. Ren, K. Stanley and R. C. Whaley. ACM Transactions on Mathematical Software Volume 23, Number 2, pages 133-147, June 1997.

  9. "ScaLAPACK: A Portable Linear Algebra Library for Distributed Memory Computers - Design Issues and Performance", by J. Choi, J. Demmel, I. Dhillon, J. Dongarra, S. Ostrouchov, A. Petitet, K. Stanley, D. Walker, and R. C. Whaley. Computer Physics Communications Volume 97, pages 1-15, 1996.

  10. "The Design and Implementation of ScaLAPACK LU, QR, and Cholesky", by J. Choi, J. Dongarra, S. Ostrouchov, A. Petitet, D. Walker, and R. C. Whaley. Scientific Programming Volume 5, pages 173-184, 1996.


Refereed Conference Publications

  1. "Effectively exploiting parallel scale for all problem sizes in LU factorization" by Md Rakib Hasan and R. Clint Whaley. In 28th International Parallel & Distributed Processing Symposium (IPDPS2014) Phoenix, AZ, May 19-23, 2014.

  2. "Vectorization Past Dependent Branches Through Speculation" by Majedul Haque Sujon, R. Clint Whaley and Qing Yi. In 22nd International Conference on Parallel Architectures and Compilation Techniques (PACT2013), pages 353-362, Edinburgh, Scotland, September 9-11, 2013.

  3. "Achieving Scalable Parallelization For The Hessenberg Factorization"" by Anthony M. Castaldo and R. Clint Whaley. In IEEE Cluster 2011, pages 65-73, Austin, TX, September 26-30, 2011.

  4. "Scaling LAPACK Panel Operations Using Parallel Cache Assignment" by Anthony M. Castaldo and R. Clint Whaley. In 15th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, pages 223-231, Bangalore, India, January 9-14, 2010.

  5. "Minimizing Startup Costs for Performance-Critical Threading" by Anthony M. Castaldo and R. Clint Whaley. 23rd IEEE International Parallel and Distributed Processing Symposium, pages 1-8, Rome, Italy, May 25-29, 2009.

  6. "Empirically Tuning LAPACK's Blocking Factor for Increased Performance", by R. Clint Whaley. International Multiconference on Computer Science and Information Technology, Wisla, Poland, October 20-22, 2008.

  7. "Automated Transformation for Performance-Critical Kernels", by Qing Yi and R. Clint Whaley. ACM SIGPLAN Symposium on Library-Centric Software Design, Montreal, Canada. Oct, 2007.

  8. "Tuning High Performance Kernels through Empirical Compilation" by R. Clint Whaley and David B. Whalley. The 2005 International Conference on Parallel Processing (ICPP-05), June 14-17, 2005.

  9. "Automatically Tuned Linear Algebra Software" by R. Clint Whaley and Jack Dongarra. Ninth SIAM Conference on Parallel Processing for Scientific Computing, March 22-24, 1999, CD-ROM Proceedings.

  10. "Numerical Linear Algebra Problem Solving Environment Designer's Perspective", Society for Industrial and Applied Mathematics, Philadelphia, PA, 1999.

  11. "Automatically Tuned Linear Algebra Software" by R. Clint Whaley and Jack Dongarra. Winner, best paper in systems catagory, SuperComputing 1998: High Performance Networking and Computing.

  12. "ScaLAPACK: A Linear Algebra Library for Message-passing Computers", by L. Susan Blackford, Jaeyoung Choi, Andrew J. Cleary, Eduardo F. D'Azevedo, James Demmel, Inderjit S. Dhillon, Jack Dongarra, Sven Hammerling, Greg Henry, Antoine Petitet, Ken Stanley, David Walker and R. Clint Whaley. Proceedings of 1997 SIAM Conference on Parallel Processing for Scientific Computing, March 1997.

  13. "A Proposal for a Set of Parallel Basic Linear Algebra Subprograms", by Jaeyoung Choi, J. Dongarra, S. Ostrouchov, A. Petitet, D. Walker and R. C. Whaley. Second International Workshop, PARA'95, Lyngby, Denmark, August 1995. Proceedings in Lecture Notes in Computer Science, Number 1041, pages 107-114, Springer-Verlag, Berlin - Heidenberg - New York, 1996.

  14. "Two Dimensional Basic Linear Algebra Communications Subprograms", by Jack Dongarra, Robert A. van de Geijn and R. Clint Whaley, Proceedings of the sixth SIAM Conference on Parallel Processing for Scientific Computing, SIAM Publications, pages 347-352, Norfolk, Virginia, March 22-24, 1993.


Books

  1. Software Automatic Tuning: From Concepts to State-of-the-Art Results by K. Naono, K. Teranishi, J. Cavazos, R. Suda (Eds.). Springer New York Dordrecht Heidelberg London, 2010, ISBN: 978-1-4419-6934-7.
  2. ScaLAPACK Users' Guide by L.S. Blackford, J. Choi, A. Cleary, E. D'Azevedo, J. Demmel, I. Dhillon, J. Dongarra, S. Hammarling, G. Henry, A. Petitet, K. Stanley, D. Walker, R. C. Whaley. SIAM Publications, Philadelphia, 1997, ISBN 0-89871-397-8.
  3. Handbook on Parallel and Distributed Processing, editors: J. Blazewicz, K. Ecker, B. Plateau, D. Trystram. Springer-Verlag Berlin Headelberg, 2000, ISBN: 3-540-6641-6.


Doctoral Dissertation

  1. "Automated Empirical Optimization of High Performance Floating Point Kernels" by R. Clint Whaley. Defended November 2, 2004.
    Advisor: David Whalley

Master's Thesis

  1. "Basic Linear Algebra Communication Subprograms: Analysis and Implementation Across Multiple Parallel Architectures" by R. Clint Whaley. May, 1994.
    Advisor: Jack Dongarra

Selected Workshops and Presentations

  1. "ATLAS Version 3.8 : Overview and Status" by R. Clint Whaley. International Workshop on Automatic Performance Tuning (iWAPT07), Tokyo, Japan, September 20-21, 2007. Invited speaker with paper and talk. Proceedings available.
  2. "Automatically Tuned Linear Algebra Software" by R. Clint Whaley, Workshop on Automatic Tuning for Petascale Systems, Snowbird, Utah, July 9-12 2007.
  3. "NSF CRI CNS-0551504, ATLAS Support and Development", by R. Clint Whaley, 2007 NSF/CISE CRI PI Meeting, Boston, MA, June 4-5, 2007.

UTSA Technical Reports

  1. "ATLAS Installation Guide", by R. Clint Whaley. Technical Report CS-TR-2008-002, University of Texas at San Antonio, January 2008.
  2. "Achieving accurate and context-sensitive timing for code optimization", by Anthony M. Castaldo and R. Clint Whaley. Technical Report CS-TR-2008-001, University of Texas at San Antonio, January 2008.
  3. "Automated Transformation for Performance-Critical Kernels", by Qing Yi and R. Clint Whaley. Technical Report CS-TR-2007-003, University of Texas at San Antonio, June 2007.
  4. "Error Analysis of Various Forms of Floating Point Dot Products", by Anthony M. Castaldo and R. Clint Whaley. Technical Report CS-TR-2007-002, University of Texas at San Antonio, May 2007.

User's Guides, HOWTOs, and miscellaneous.

  1. ATLAS Installation Guide.
  2. "Some notes on using assembly", by R. Clint Whaley.
  3. "A Guide to User Contribution to ATLAS", by R. Clint Whaley. Also available online as html.
  4. "A Collaborative Guide to ATLAS Development", by R. Clint Whaley and Peter Soendergaard. Also available online as html.
  5. "A User's Guide to Extract", by R. Clint Whaley. Also available online as html.
  6. "Installation Guide and Design of the HPF 1.1 interface to ScaLAPACK, SLHPF" by L. S. Blackford, J. J. Dongarra, C. A. Papadopoulos, and R. C. Whaley. August, 1998.
  7. "ScaLAPACK Evaluation and Performance at the DoD MSRCs" by L. S. Blackford and R. C. Whaley. UT-CS-98.388, April 1998.
  8. "Installation Guide for the BLACS and its Test Suite" by R. Clint Whaley.
  9. "A User's Guide to the BLACS v1.1", by J. Dongarra and R. C. Whaley". March, 1995 (last updated, May 5, 1997).
  10. "Installation Guide for ScaLAPACK" by J. Choi, J. Demmel, I. Dhillon, J. Dongarra, S. Ostrouchov, A. Petitet, K. Stanley, D. Walker, and R. C. Whaley. March, 1995.
  11. "Using BLACS and MPI in ScaLAPACK" by R. Clint Whaley.
  12. "Outstanding Issues in the MPIBLACS" by R. Clint Whaley.
  13. "Some Plebian Extensions to MPI" by R. Clint Whaley.

Back to homepage