Publicaciones (111) Publicaciones de Basilio Bernardo Fraguela Rodríguez

2023

  1. VENOM: A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores

    Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2023

  2. VENOM: A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores

    International Conference for High Performance Computing, Networking, Storage and Analysis, SC

2022

  1. A highly optimized skeleton for unbalanced and deep divide-and-conquer algorithms on multi-core clusters

    Journal of Supercomputing, Vol. 78, Núm. 8, pp. 10434-10454

  2. Probing the Efficacy of Hardware-Aware Weight Pruning to Optimize the SpMM routine on Ampere GPUs

    Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT

  3. The New UPC++ DepSpawn High Performance Library for Data-Flow Computing with Hybrid Parallelism

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

2020

  1. An automatic optimizer for heterogeneous devices

    Future Generation Computer Systems, Vol. 106, pp. 572-584

2019

  1. A Fast Solver for Large Tridiagonal Systems on Multi-Core Processors (Lass Library)

    IEEE Access, Vol. 7, pp. 23365-23378

  2. Analysis of interval-grouped data in weed science: The binnednp Rcpp package

    Ecology and Evolution, Vol. 9, Núm. 19, pp. 10903-10915

  3. Easy dataflow programming in clusters with UPC++ DepSpawn

    IEEE Transactions on Parallel and Distributed Systems, Vol. 30, Núm. 6, pp. 1267-1282

  4. Enhanced global optimization methods applied to complex fisheries stock assessment models

    Applied Soft Computing Journal, Vol. 77, pp. 50-66

  5. Portable and efficient FFT and DCT algorithms with the Heterogeneous Butterfly Processing Library

    Journal of Parallel and Distributed Computing, Vol. 125, pp. 135-146

2018

  1. Guiding the Optimization of Parallel Codes on Multicores Using an Analytical Cache Model

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

  2. Heterogeneous distributed computing based on high-level abstractions

    Concurrency Computation