Projektpartner


Karlsruher Institut für Technologie


Technische Universität München

Johannes Guttenberg Universität Mainz

Rheinisch-Westfälische Technische Hochschule Aachen


Publikationen

2017
  • Josef Weidendorfer, Dai Yang and Carsten Trinitis: LAIK: A Library for Fault Tolerant Distribution of Global Data for Parallel Applications. PARS Workshop 2017 --> Pre Print
  • Dai Yang, Josef Weidendorfer, Tilman Küstner, Carsten Trinitis and Sibylle Ziegler: Enabling Application-Integrated Proactive Fault Tolerance. Par-Co 2017, Bologna, Italy. Accepted for publication. --> Pre Print
  • Pickartz, Simon and Baude, Jonas and Lankes, Stefan and Monti, Antonello: A Locality-Aware Communication Layer for Virtualized Clusters; High Performance Computing : ISC High Performance 2017 International Workshops, DRBSD, ExaComm, HCPM, HPC-IODC, IWOPH, IXPUG, P^3MA, VHPC, Visualization at Scale, WOPSSS, Frankfurt, Germany, June 18-22, 2017
  • Pickartz, Simon and Clauss, Carsten and Lankes, Stefan and Monti, Antonello: Enabling Hierarchy-aware MPI Collectives in Dynamically Changing Topologies; [24th European MPI Users' Group Meeting, EuroMPI '17, 2017-09-25 - 2017-09-28, Chicago, Illinois, USA]
2018
  • Thomas Becker, Dai Yang, Tilman Küstner and Martin Schulz: Co-Scheduling in a Tasked-Based Programming Model. Workshop on Co-Scheduling of HPC Applications. (COSH'18) in Conjunction with HiPEAC Conference 2018. January 2018, Manchester, United Kingdom
  • Simon Pickartz, Carsten Clauss, Stefan Lankes and Antonello Monti: "Revisiting Locality-Awareness in View of Dynamically Changing Topologies. Parallel Computing Vol 77, pages 1 - 18. September 2018
  • Ramy Mohamed Aly Gad: Enhancing application checkpointing and migration in HPC. Ph.D. Thesis. University Mainz, Germany 2018
2019
  • David Jauk, Dai Yang, and Martin Schulz. Predicting Faults in High Performance Computing Systems: An In-Depth Survey of the State-of-the-Practice. In SC 19’: Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis, 2019, Denver, Colorado, United States.
  • Bengisu Elis, Dai Yang, and Martin Schulz. 2019. QMPI: A Next Generation MPI Profiling Interface for Modern HPC Platforms. In Proceedings of the 26th European MPI Users’ Group Meeting (EuroMPI’ 19), Torsten Hoefler and Jesper Larsson Träff (Eds.). ACM, New York, NY, USA, Article 4, 10 pages
  • Amir Raoofy, Dai Yang, Josef Weidendorfer, Carsten Trinitis and Martin Schulz: Enabling Malleability for Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics using LAIK. PARS Workshop 2019 (accepted for publication)
  • Thomas Becker, Nico Rudolf, Dai Yang and Wolfgang Karl: Symptom-based Fault Detection in Modern Computer Systems. PARS Workshop 2019 (accepted for publication)
  • Frank, Alvaro; Süß, Tim; Brinkmann, André: Effects and benefits of node sharing strategies in HPC batch systems. Proceedings of the 33rd IEEE International Parallel and Distributed Processing Symposium (IPDPS). 2019.
  • S. Lankes, S. Pickartz, and J. Breitbart, „HermitCore," in Operating Systems for Supercomputers and High Performance Computing, B. Gerofi, Y. Ishikawa, R. Riesen, and R. W. Wisniewski, Eds. Springer International Publishing, November 2019.
  • S. Lankes, S. Pickartz, and J. Breitbart, “Exploring Rust for Unikernel Development,” in Proceedings of the 10th Workshop on Programming Languages and Operating Systems (PLOS 2019), held in conjunction with 27th ACM Symposium on Operating Systems Principles (SOSP 2019), Huntsville, Ontario, Canada, 2019.
  • Pierre Olivier, Daniel Chiba, Stefan Lankes, Changwoo Min, and Binoy Ravindran, "A Binary-Compatible Unikernel", in The 15th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments (VEE), 2019 (Winner of the Best Paper Award)
  • Pierre Olivier, A K M Fazla Mehrab, Stefan Lankes, Mohamed Karaoui, Rob Lyerly, and Binoy Ravindran, "HEXO: Offloading HPC Compute-Intensive Workloads on Low-Cost, Low-Power Embedded Systems", in The 28th ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC), 2019
  • Becker, T., (2019). Integrating Organic Computing Mechanisms into a Task-based Runtime System for Heterogeneous Systems. In: Draude, C., Lange, M. & Sick, B. (Hrsg.), INFORMATIK 2019: 50 Jahre Gesellschaft für Informatik – Informatik für Gesellschaft (Workshop-Beiträge). Bonn: Gesellschaft für Informatik e.V.. (S. 531-544).