Dr. rer. nat. André Bauer

I am a computer scientist and, since November 2022, have been working as a postdoctoral scholar at Globus Labs, led by Prof. Ian Foster, in the Department of Computer Science at the University of Chicago. I am also the founder and elected chair of the SPEC RG Predictive Data Analytics Working Group.

In a nutshell, my research aims to expand the potential of data science in scientific computing. Specifically, it contributes to key aspects of developing intuitive, efficient, and sustainable data science solutions across disciplines and domains. My research is inherently interdisciplinary, and I apply a translational approach as I work to develop, apply, and evaluate methods and techniques in various domains. In the long term, I expect that my research will contribute to the creation of dynamic data science ecosystems that accelerate scientific computing applications.

My primary research interests integrate experience and expertise from performance engineering and data science. They include, but are not limited to, the following areas:

  • Data science: My focus is on data analytics, clustering, and imputation. I am also interested in benchmarking and developing data analytics methods. In addition, my research is inherently interdisciplinary, and I apply a translational approach to transfer methods and techniques across various domains.
  • Data science clouds: I am interested in the development, autonomous management (i.e., autonomous scaling of resources), and benchmarking of the various building blocks of such clouds, as well as in runtime prediction and scheduling of data analytics tasks.
  • Data management: The focus is on FAIR (findable, accessible, interoperable, and reusable) data management and the promotion of publicly available research data.
  • Data privacy: The idea here is to exchange data with third parties while preserving the privacy of the data. I am interested in synthetic data generation and homomorphic encryption.
  • Sustainable data science: Specifically, this involves the development of an energy efficiency benchmark for deep learning and machine learning.

Most Recent News

Feb 6, 2024 Just wrapped up an inspiring talk at the PSD Peer Mentorship Program @UChicago! I shared insights in my talk “Academic Success: Navigating your Journey”. Grateful for the opportunity to empower students in their educational pursuits.
Jan 22, 2024 I received a certificate of leadership for successfully completing the 2023 National Postdoctoral Association SmartSkills program.
Jan 18, 2024 I accepted the invitation to serve as a program committee member for the 10th International Conference on Time Series and Forecasting (ITISE).
Dec 15, 2023 I accepted the invitation to serve as a program committee member for the 15th ACM/SPEC International Conference on Performance Engineering (ICPE).
Dec 11, 2023 Our article, “The Globus Compute Dataset: An Open Function-as-a-Service Dataset From the Edge to the Cloud”, has been accepted for publication in the journal Future Generation Computer Systems. Delve into the world of federated function-as-a-service (FaaS) through our unique dataset from the Globus Compute platform. The dataset spans 31 weeks and captures 2 million task submissions, 252 users, and 580 computing endpoints. Explore intriguing observations, from short task runtimes to the diverse use of endpoints. As the first federated FaaS dataset with user workloads, it will be a goldmine for research in FaaS architecture and distributed computing.

Selected Publications

  1. The Globus Compute Dataset: An Open Function-as-a-Service Dataset From the Edge to the Cloud
    André Bauer, Haochen Pan, Ryan Chard, Yadu Babuji, Josh Bryan, Devesh Tiwari, Ian Foster, and Kyle Chard
    Future Generation Computer Systems, Apr 2024
  2. An Empirical Study of Container Image Configurations and Their Impact on Start Times
    Martin Straesser, André Bauer, Robert Leppich, Nikolas Herbst, Kyle Chard, Ian Foster, and Samuel Kounev
    In Proceedings of the 23rd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID), May 2023
  3. Methodological Principles for Reproducible Performance Evaluation in Cloud Computing
    Alessandro V. Papadopoulos, Laurens Versluis, André Bauer, Nikolas Herbst, Jóakim Kistowski, Ahmed Ali-Eldin, Cristina Abad, J. Nelson Amaral, Petr Tuma, and Alexandru Iosup
    IEEE Transactions on Software Engineering (TSE), Aug 2021
  4. Libra: A Benchmark for Time Series Forecasting Methods
    André Bauer, Marwin Züfle, Simon Eismann, Johannes Grohmann, Nikolas Herbst, and Samuel Kounev
    In Proceedings of the 12th ACM/SPEC International Conference on Performance Engineering (ICPE), Apr 2021
  5. Time Series Forecasting for Self-Aware Systems
    André Bauer, Marwin Züfle, Nikolas Herbst, Albin Zehe, Andreas Hotho, and Samuel Kounev
    Proceedings of the IEEE, Jul 2020
  6. Telescope: An Automatic Feature Extraction and Transformation Approach for Time Series Forecasting on a Level-Playing Field
    André Bauer, Marwin Züfle, Nikolas Herbst, Samuel Kounev, and Valentin Curtef
    In Proceedings of the 36th IEEE International Conference on Data Engineering (ICDE), Apr 2020
The list of all publications can be found here.