Welcome!

me.jpg

Dr. rer. nat. André Bauer

I am an Assistant Professor in the Department of Computer Science at the Illinois Institute of Technology and the founder and elected chair of the SPEC RG Predictive Data Analytics Working Group.

The overarching goal of my research is to expand the potential of data science in scientific computing by designing robust, efficient, and sustainable system solutions tailored to the evolving needs of data-driven science. As scientific progress increasingly depends on the effective use of data science ecosystems, the diversity of hardware architectures, application demands, and usage patterns poses significant challenges. My work addresses these complexities through a focus on systems and performance engineering, leveraging interdisciplinary expertise to optimize and adapt scientific computing infrastructures for emerging data science applications.

In particular, I see the following key challenges that need to be addressed:

  • Resource Optimization: Efficiently allocating and managing resources across diverse hardware platforms to accommodate changing, data-intensive workloads.
  • Adaptive Systems: Developing systems that can dynamically adapt to evolving data and model requirements.
  • Data Security and Privacy: Safeguarding sensitive data while enabling collaborative data science.
  • System-Level Optimization: Optimizing the complex interplay of components within data science ecosystems for maximum performance.
  • Sustainable Computing: Minimizing the environmental impact of data science practices.

Most Recent News

Mar 28, 2025 I accepted the invitation as program committee member at the 6th International Conference on Autonomic Computing and Self-Organizing Systems (ACSOS).
Feb 25, 2025 Our research paper, “Microservice Applications and Their Workloads on GitHub”, has been accepted for presentation at the 8th Workshop on Hot Topics in Cloud Computing Performance (HotCloudPerf). Many cloud applications use microservices for scalability and flexibility, but their distributed nature poses challenges. Performance engineers rely on representative applications for evaluation, yet existing benchmarks are limited and their industry relevance is debated. To explore alternatives, we mine GitHub for microservice applications and workloads, creating two datasets: 553 applications and 8 workload repositories. Our analysis offers a foundation for future research in microservice benchmarking.
Feb 20, 2025 All good things come in threes! The following research papers have been accepted for presentation at the 16th ACM/SPEC International Conference on Performance Engineering (ICPE):
  • “Bridging Clusters: A Comparative Look at Multicluster Networking Performance in Kubernetes” – Exploring performance trade-offs in multicluster networking.
  • “Quantifying Data Leakage in Failure Prediction Tasks” – Shedding light on hidden risks in predictive modeling.
  • “Generating Executable Microservice Applications for Performance Benchmarking” – Automating realistic benchmarking for microservices.
Dec 10, 2024 Our research paper, “Object Proxy Patterns for Accelerating Distributed Applications”, has been published in IEEE Transactions on Parallel and Distributed Systems. Distributed applications benefit from workflow and serverless frameworks, but advanced data flow optimization is often left to developers—especially as data scales. We propose three high-level proxy-based programming patterns—distributed futures, streaming, and ownership—to simplify and optimize distributed programming. Using benchmarks and scientific applications, we demonstrate significant improvements in runtime, throughput, and memory usage, showcasing the power of these patterns for complex distributed systems.
Dec 10, 2024 Two of our research papers have been accepted for publication: “Interatrial block is an independent risk factor for new-onset atrial fibrillation after cardiac surgery” in the Journal of Thoracic and Cardiovascular Surgery Open and “Preoperative interatrial block is associated with postoperative atrial fibrillation after cardiac surgery” in Interdisciplinary CardioVascular and Thoracic Surgery.

Selected Publications

  1. The Globus Compute Dataset: An Open Function-as-a-Service Dataset From the Edge to the Cloud
    André Bauer, Haochen Pan, Ryan Chard, Yadu Babuji, Josh Bryan, Devesh Tiwari, Ian Foster, and Kyle Chard
    Future Generation Computer Systems, Apr 2024
  2. An Empirical Study of Container Image Configurations and Their Impact on Start Times
    Martin Straesser, André Bauer, Robert Leppich, Nikolas Herbst, Kyle Chard, Ian Foster, and Samuel Kounev
    In Proceedings of the 23rd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID), May 2023
  3. Methodological Principles for Reproducible Performance Evaluation in Cloud Computing
    Alessandro V. Papadopoulos, Laurens Versluis, André Bauer, Nikolas Herbst, Jóakim Kistowski, Ahmed Ali-Eldin, Cristina Abad, J. Nelson Amaral, Petr Tuma, and Alexandru Iosup
    IEEE Transactions on Software Engineering (TSE), Aug 2021
  4. Libra: A Benchmark for Time Series Forecasting Methods
    André Bauer, Marwin Züfle, Simon Eismann, Johannes Grohmann, Nikolas Herbst, and Samuel Kounev
    In Proceedings of the 12th ACM/SPEC International Conference on Performance Engineering (ICPE), Apr 2021
  5. Time Series Forecasting for Self-Aware Systems
    André Bauer, Marwin Züfle, Nikolas Herbst, Albin Zehe, Andreas Hotho, and Samuel Kounev
    Proceedings of the IEEE, Jul 2020
  6. Telescope: An Automatic Feature Extraction and Transformation Approach for Time Series Forecasting on a Level-Playing Field
    André Bauer, Marwin Züfle, Nikolas Herbst, Samuel Kounev, and Valentin Curtef
    In Proceedings of the 36th IEEE International Conference on Data Engineering (ICDE), Apr 2020