Publications by the researcher in collaboration with Emilio López Zapata (31)

2004

  1. A compiler tool to predict memory hierarchy performance of scientific codes

    Parallel Computing, Vol. 30, No. 2, pp. 225-248

2003

  1. Probabilistic miss equations: Evaluating memory hierarchy performance

    IEEE Transactions on Computers, Vol. 52, No. 3, pp. 321-336

2001

  1. A data parallel formulation of the Barnes-Hut method for N-body simulations

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

  2. A data-parallel formulation for divide and conquer algorithms

    Computer Journal, Vol. 44, No. 4, pp. 303-320

  3. Parallelization of a recursive decoupling method for solving tridiagonal linear systems on distributed memory computer

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

1999

  1. Automatic analytical modeling for the estimation of cache misses

    Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT, pp. 221-231

  2. Direct mapped cache performance modeling for sparse matrix operations

    Proceedings of the 7th Euromicro Workshop on Parallel and Distributed Processing, PDP 1999

  3. HPF-2 support for dynamic sparse computations

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

  4. Set associative cache behavior optimization

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

1998

  1. Cache misses prediction for high performance sparse algorithms

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

  2. Cache probabilistic modeling for basic sparse algebra kernels involving matrices with a non-uniform distribution

    Proceedings - 24th EUROMICRO Conference, EURMIC 1998

  3. Modeling set associative caches behavior for irregular computations

    Performance Evaluation Review, Vol. 26, No. 1, pp. 192-201

1997

  1. A probabilistic model for best-first search B&B algorithms

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

  2. High-performance VLSI architecture for the Viterbi algorithm

    IEEE Transactions on Communications, Vol. 45, No. 2, pp. 168-176

  3. Mapping tridiagonal system algorithms onto mesh connected computers

    International Journal of High Speed Computing, Vol. 9, No. 2, pp. 101-126

1996

  1. FFTs on mesh connected computers

    Parallel Computing, Vol. 22, No. 1, pp. 19-38

  2. Implementation and experimental evaluation of the constrained ART algorithm on a multicomputer system

    Signal Processing, Vol. 51, No. 1, pp. 69-76

  3. Parallel sparse modified Gram-Schmidt QR decomposition

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

  4. Sparse Householder QR factorization on a mesh

    Proceedings of 4th Euromicro Workshop on Parallel and Distributed Processing, PDP 1996

1995

  1. Digit on-line large radix CORDIC rotator

    Proceedings of the International Conference on Application Specific Array Processors