I'm a PhD student in the EECS Department at the University of California, Berkeley. I work with the AMPLab research group. My co-advisors are Randy Katz and Marti Hearst. Please email me if you would like a copy of my curriculum vitae.

Follow me on Twitter! @salspaugh



RESEARCH


Analyzing Log Analysis: An Empirical Study of User Log Mining. [BEST STUDENT PAPER]
S. Alspaugh, B. Chen, J. Lin, A. Ganapathi, M. Hearst, and R. Katz.
Large Installation System Administration Conference (LISA). November 2014.
[paper] [slides]


Better Logging to Improve Interactive Data Analysis Tools.
S. Alspaugh, A. Ganapathi, M. Hearst, and R. Katz.
Workshop on Interactive Data Exploration and Analytics (IDEA). Co-located with KDD. August 2014.
[paper] [slides]


Building Blocks for Exploratory Data Analysis Tools.
S. Alspaugh, A. Ganapathi, M. Hearst, and R. Katz.
Workshop on Interactive Data Exploration and Analytics (IDEA). Co-located with KDD. August 2013.
[paper]


Towards a Data Analysis Recommendation System.
S. Alspaugh and A. Ganapathi.
Workshop on Managing Systems Automatically and Dynamically (MAD). Co-located with OSDI. October 2012.
[paper]


SYSTEMS FOR MIXED BATCH AND INTERACTIVE WORKLOADS

Cake: Enabling High-level SLOs on Shared Storage Systems.
A. Wang, S. Venkataraman, S. Alspaugh, R. Katz, and I. Stoica.
Symposium on Cloud Computing ( (SoCC). October 2012.
[paper] [slides] (slides by A. Wang)


Interactive Query Processing in Big Data Systems: A Cross-Industry Study of MapReduce Workloads.
Y. Chen, S. Alspaugh, and R. Katz.
International Conference on Very Large Data Bases (VLDB). August 2012.
[paper]


Sweet Storage SLOs with Frosting.
A. Wang, S. Venkataraman, S. Alspaugh, R. Katz, and I. Stoica.
Workshop on Hot Topics in Cloud Computing (HotCloud). Co-located with USENIX ATC. June 2012.
[paper] [slides] (slides by A. Wang)


POWER-PROPORTIONAL CLUSTERS

Reducing Cluster Energy Consumption through Workload Management.
S. Alspaugh.
Master's Thesis. May 2012.
[paper]


Energy Efficiency for Large-Scale MapReduce Workloads with Significant Interactive Analysis.
Y. Chen, S. Alspaugh, D. Borthakur, and R. Katz.
European Conference on Computer Systems (EuroSys). April 2012.
[paper]


Design and Evaluation of an Energy Agile Computing Cluster.
A. Krioukov, S. Alspaugh, P. Mohan, S. Dawson-Haggerty, D. Culler and R. Katz.
UC Berkeley EECS Tech Report. January 2012.
[paper]


Integrating Renewable Energy Using Data Analytics Systems: Challenges and Opportunities.
A. Krioukov, C. Goebel, S. Alspaugh, Y. Chen, D. Culler, and R. Katz.
IEEE Computer Society Technical Committee Bulletin on Data Engineering. March 2011.
[paper]


An Information-Centric Energy Infrastructure: The Berkeley View.
R. Katz, D. Culler, S. Sanders, S. Alspaugh, Y. Chen, S. Dawson-Haggerty, P. Dutta, M. He, X. Jifang, L. Keys, A. Krioukov, K. Lutz, J. Ortiz, P. Mohan, E. Reutzel, J. Taneja, J. Hsu, and S. Shankar.
Journal of Sustainable Computing. January 2011.
[paper]


Napsac: Design and Implementation of a Power-Proportional Web Cluster.
A. Krioukov, P. Mohan, S. Alspaugh, L. Keys, D. Culler, and R. Katz.
Workshop on Green Networking. Co-located with SIGCOMM. August 2010.
[paper]


UNDERGRADUATE

Policy-Driven Data Management for Distributed Scientific Collaborations Using a Rule Engine.
S. Alspaugh, A. Chervenak and E. Deelman.
ACM Student Research Competition Best Undergraduate Student Poster. Co-located with SC. November 2008.
[summary] [poster]


Policy-Driven Data Management for Distributed Scientific Collaborations Using a Rule Engine.
S. Alspaugh and A. Chervenak. CRA-W Distributed Mentor Project Final Report. September 2008.
[paper]


Efficient Time-Aware Prioritization with Knapsack Solvers.
S. Alspaugh, K. Walcott, M. Belanich, G. Kapfhammer, and M. Soffa.
Workshop on Empirical Assessment of Software Engineering Languages and Technologies (WEASELTech). Co-located with ASE. November 2007.
[paper] [slides]



INVITED TALKS

Improving data analysis systems by studying interaction data from logs.
IBM Watson Research Center. 17 October 2014. [slides]


Understanding the process of data analysis by studying interaction records.
Trifacta. 2 October 2014. [slides]


Design and analysis of an energy agile cluster computing system.
Variaya Energy Group (VEG). 30 November 2011. [slides]


TEACHING

University of California, Berkeley

University of Virginia