Saturday, August 1, 2015

 

 



Soy un nuevo usuario

Olvidé mi contraseña

Entrada usuarios

Lógica Matemáticas Astronomía y Astrofísica Física Química Ciencias de la Vida
Ciencias de la Tierra y Espacio Ciencias Agrarias Ciencias Médicas Ciencias Tecnológicas Antropología Demografía
Ciencias Económicas Geografía Historia Ciencias Jurídicas y Derecho Lingüística Pedagogía
Ciencia Política Psicología Artes y Letras Sociología Ética Filosofía


Source-to-Source transformations for efficient SIMD code generation

1) La descarga del recurso depende de la página de origen
2) Para poder descargar el recurso, es necesario ser usuario
    registrado en Universia


  Descargar recurso

Detalles del recurso

Pertenece a: UPCommons - E-prints UPC Universitat Politècnica de Catalunya   

Descripción: In the last years, there has been much effort in commercial compilers to generate efficient SIMD instructions-based code sequences from conventional sequential programs. However, the small numbers of compilers that can automatically use these instructions achieve in most cases unsatisfactory results. Therefore, the code often has to be written manually in assembly language or using compiler built-in functions to achieve high performance. In this work, we present source-to-source transformations that help commercial vectorizing compilers to generate efficient SIMD code. Experimental results show that excellent performance can be achieved. In particular, for the problem of matrix product (SGEMM) we almost achieve as high performance as hand-optimized numerical libraries. Our source-tosource transformations are based on the scalar replacement and unroll and jam transformations presented by Callahan et all. In particular, we extend the use of scalar replacement to vectorial replacement and combine this transformation with unroll and jam and outer loop vectorization to fully exploit the vector register level and thus to help the compiler to generate efficient SIMD code. We will show experimentally the effectiveness of our proposal.

Autor(es): Berna Juan, Alejandro -  Jiménez Castells, Marta -  Llaberia Griñó, José M. - 

Id.: 55053379

Idioma: English  - 

Versión: 1.0

Estado: Final

Tipo:  8 p. - 

Palabras claveÀrees temàtiques de la UPC -  Informàtica -  Sistemes d'informació - 

Tipo de recurso: Conference report  - 

Tipo de Interactividad: Expositivo

Nivel de Interactividad: muy bajo

Audiencia: Estudiante  -  Profesor  -  Autor  - 

Estructura: Atomic

Coste: no

Copyright: sí

: Open Access

Formatos:  8 p. - 

Requerimientos técnicos:  Browser: Any - 

Relación: [References] http://jp2011.pcg.ull.es/downloads/jp2011/Actas_JP2011.pdf

Fecha de contribución: 05-may-2013

Contacto:

Localización:
* Berna, A.; Jimenez, M.; Llaberia, J. Source-to-Source transformations for efficient SIMD code generation. A: Jornadas de Paralelismo. "Actas de las XXII Jornadas de Paralelismo". La Laguna, Tenerife: 2011, p. 719-726.
* 978-84-694-1791-1


Otros recursos del mismo autor(es)

  1. Multilevel tiling for non-rectangular interation spaces La motivación principal de esta tesis es el desarrollo de nuevas técnicas de compilación dirigidas a...
  2. Vectorized register tiling In the last years, there has been much effort in commercial compilers (icc, gcc) to exploit efficien...
  3. Source code transformations for efficient SIMD code generation Despite the effort inverted the last years in commercial compilers to generate efficient SIMD instru...
  4. ICT4Girls: from high school to university: an approach Technological advances are improving living conditions. Oddly, there continues to be a decrease in t...
  5. Filtering directory lookups in CMPs Coherence protocols consume an important fraction of power to determine which coherence action shoul...

Otros recursos de la misma colección

  1. A local constitutive model for the discrete element method: application to geomaterials and concrete This paper presents a local constitutive model for modelling the linear and non linear behavior of s...
  2. Increasing yield of nanocrystalline cellulose preparation process by a cellulase pretreatment In this work the introduction of a cellulase treatment prior to NCC isolation was assessed. NCC was ...
  3. FUTUR. Portal for the scientific output of researchers at the Universitat Politècnica de Catalunya Postprint (author’s final draft)
  4. Les llàgrimes de Santa Eulàlia Award-winning
  5. A meshless finite point method for three dimensional analysis of compressible flow problems involving moving boundaries and adaptivity A finite point method for solving compressible flow problems involving moving boundaries and adaptiv...

Valoración de los usuarios

No hay ninguna valoración para este recurso.Sea el primero en valorar este recurso.
 

Busque un recurso