Computer Engineering
Department
Nuria
Losada
Publications by the researcher in collaboration with Nuria Losada (10)
2020
-
Fault tolerance of MPI applications in exascale systems: The ULFM solution
Future Generation Computer Systems, Vol. 106, pp. 467-481
2019
-
Local rollback for resilient MPI applications with application-level checkpointing and message logging
Future Generation Computer Systems, Vol. 91, pp. 450-464
2018
-
Application-level Fault Tolerance and Resilience in HPC Applications
Application-level Fault Tolerance and Resilience in HPC Applications
-
Insights into application-level solutions towards resilient MPI applications
Proceedings - 2018 International Conference on High Performance Computing and Simulation, HPCS 2018
2017
-
A portable and adaptable fault tolerance solution for heterogeneous applications
Journal of Parallel and Distributed Computing, Vol. 104, pp. 146-158
-
Assessing resilient versus stop-and-restart fault-tolerant solutions in MPI applications
Journal of Supercomputing, Vol. 73, Núm. 1, pp. 316-329
-
Resilient MPI applications using an application-level checkpointing framework and ULFM
Journal of Supercomputing, Vol. 73, Núm. 1, pp. 100-113
2016
-
Portable application-level checkpointing for hybrid MPI-OpenMP applications
Procedia Computer Science
2015
-
I/O optimization in the checkpointing of OpenMP parallel applications
Proceedings - 23rd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, PDP 2015
2014
-
Extending an application-level checkpointing tool to provide fault tolerance support to openMP applications
Journal of Universal Computer Science, Vol. 20, Núm. 9, pp. 1352-1372