Friday, November 21, 2014

 

 



Soy un nuevo usuario

Olvidé mi contraseña

Entrada usuarios

Lógica Matemáticas Astronomía y Astrofísica Física Química Ciencias de la Vida
Ciencias de la Tierra y Espacio Ciencias Agrarias Ciencias Médicas Ciencias Tecnológicas Antropología Demografía
Ciencias Económicas Geografía Historia Ciencias Jurídicas y Derecho Lingüística Pedagogía
Ciencia Política Psicología Artes y Letras Sociología Ética Filosofía


Source-to-Source transformations for efficient SIMD code generation

1) La descarga del recurso depende de la página de origen
2) Para poder descargar el recurso, es necesario ser usuario
    registrado en Universia


  Descargar recurso

Detalles del recurso

Pertenece a: UPCommons - E-prints UPC Universitat Politècnica de Catalunya   

Descripción: In the last years, there has been much effort in commercial compilers to generate efficient SIMD instructions-based code sequences from conventional sequential programs. However, the small numbers of compilers that can automatically use these instructions achieve in most cases unsatisfactory results. Therefore, the code often has to be written manually in assembly language or using compiler built-in functions to achieve high performance. In this work, we present source-to-source transformations that help commercial vectorizing compilers to generate efficient SIMD code. Experimental results show that excellent performance can be achieved. In particular, for the problem of matrix product (SGEMM) we almost achieve as high performance as hand-optimized numerical libraries. Our source-tosource transformations are based on the scalar replacement and unroll and jam transformations presented by Callahan et all. In particular, we extend the use of scalar replacement to vectorial replacement and combine this transformation with unroll and jam and outer loop vectorization to fully exploit the vector register level and thus to help the compiler to generate efficient SIMD code. We will show experimentally the effectiveness of our proposal.

Autor(es): Berna Juan, Alejandro -  Jiménez Castells, Marta -  Llaberia Griñó, José M. - 

Id.: 55053379

Idioma: English  - 

Versión: 1.0

Estado: Final

Tipo:  8 p. - 

Palabras claveÀrees temàtiques de la UPC -  Informàtica -  Sistemes d'informació - 

Tipo de recurso: Conference report  - 

Tipo de Interactividad: Expositivo

Nivel de Interactividad: muy bajo

Audiencia: Estudiante  -  Profesor  -  Autor  - 

Estructura: Atomic

Coste: no

Copyright: sí

: Open Access

Formatos:  8 p. - 

Requerimientos técnicos:  Browser: Any - 

Relación: [References] http://jp2011.pcg.ull.es/downloads/jp2011/Actas_JP2011.pdf

Fecha de contribución: 05-may-2013

Contacto:

Localización:
* Berna, A.; Jimenez, M.; Llaberia, J. Source-to-Source transformations for efficient SIMD code generation. A: Jornadas de Paralelismo. "Actas de las XXII Jornadas de Paralelismo". La Laguna, Tenerife: 2011, p. 719-726.
* 978-84-694-1791-1


Otros recursos del mismo autor(es)

  1. Multilevel tiling for non-rectangular interation spaces La motivación principal de esta tesis es el desarrollo de nuevas técnicas de compilación dirigidas a...
  2. Vectorized register tiling In the last years, there has been much effort in commercial compilers (icc, gcc) to exploit efficien...
  3. Source code transformations for efficient SIMD code generation Despite the effort inverted the last years in commercial compilers to generate efficient SIMD instru...
  4. ICT4Girls: from high school to university: an approach Technological advances are improving living conditions. Oddly, there continues to be a decrease in t...
  5. Vectorized register tiling In the last years, there has been much effort in commercial compilers (icc, gcc) to exploit efficien...

Otros recursos de la misma colección

  1. Nonuniform Fuchsian codes for noisy channels We develop a new transmission scheme for additive white Gaussian noisy (AWGN) channels based on Fuch...
  2. Preservation of controllability-observability in expanded systems The result contributed by the article is that controllability-observability of an original continuou...
  3. Overlapping reliable control for a cable-stayed bridge benchmark The brief presents a reliable 1-out-2 reduced order control design strategy for a cable-stayed bridg...
  4. Generalized selection of complementary matrices in the inclusion principle This paper presents a strategy for choosing complementary matrices in the framework of the inclusion...
  5. Fixed subgroups in free groups: a survey This note is a survey of the main results known about fixed subgroups of endomorphisms of finitely g...

Valoración de los usuarios

No hay ninguna valoración para este recurso.Sea el primero en valorar este recurso.
 

Busque un recurso