Wednesday, September 24, 2014

 

 





Source-to-Source transformations for efficient SIMD code generation


Resource details

Belongs to: UPCommons - E-prints UPC Universitat Politècnica de Catalunya

Description: In recent years, much effort has gone into making commercial compilers generate efficient SIMD code sequences from conventional sequential programs. However, the small number of compilers that can use these instructions automatically achieve unsatisfactory results in most cases. Therefore, to achieve high performance, the code often has to be written manually, either in assembly language or with compiler built-in functions. In this work, we present source-to-source transformations that help commercial vectorizing compilers generate efficient SIMD code. Experimental results show that excellent performance can be achieved. In particular, for the matrix product (SGEMM) we achieve almost the performance of hand-optimized numerical libraries. Our source-to-source transformations are based on the scalar replacement and unroll-and-jam transformations presented by Callahan et al. In particular, we extend scalar replacement to vectorial replacement and combine it with unroll-and-jam and outer-loop vectorization to fully exploit the vector register level and thus help the compiler generate efficient SIMD code. We show experimentally the effectiveness of our proposal.
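The scalar replacement and unroll-and-jam transformations named in the description can be illustrated on the matrix product. The following is only a minimal sketch and not the authors' code: the function names `sgemm_naive` and `sgemm_uj2x2`, the 2x2 unroll factor, the row-major layout, and the requirement that `n` be even are all assumptions made for illustration. The vectorial replacement described above would presumably keep vector-register values where this sketch keeps scalars.

```c
#include <stddef.h>

/* Baseline: C += A * B for n x n row-major matrices (hypothetical helper). */
void sgemm_naive(size_t n, const float *a, const float *b, float *c)
{
    for (size_t i = 0; i < n; i++)
        for (size_t j = 0; j < n; j++)
            for (size_t k = 0; k < n; k++)
                c[i * n + j] += a[i * n + k] * b[k * n + j];
}

/*
 * Same computation after two source-level transformations:
 *  - unroll and jam: the i and j loops are unrolled by 2 (assumed factor)
 *    and the copies of the k loop are fused into a single inner loop;
 *  - scalar replacement: the 2x2 tile of C and the reused elements of A
 *    and B are held in local scalars, so the inner loop no longer loads
 *    or stores C.
 * Assumption: n is even (no remainder loops in this sketch).
 */
void sgemm_uj2x2(size_t n, const float *a, const float *b, float *c)
{
    for (size_t i = 0; i < n; i += 2) {
        for (size_t j = 0; j < n; j += 2) {
            /* Load the C tile once, before the inner loop. */
            float c00 = c[i * n + j];
            float c01 = c[i * n + j + 1];
            float c10 = c[(i + 1) * n + j];
            float c11 = c[(i + 1) * n + j + 1];

            for (size_t k = 0; k < n; k++) {
                /* Each loaded element of A and B is reused twice. */
                float a0 = a[i * n + k];
                float a1 = a[(i + 1) * n + k];
                float b0 = b[k * n + j];
                float b1 = b[k * n + j + 1];

                c00 += a0 * b0;
                c01 += a0 * b1;
                c10 += a1 * b0;
                c11 += a1 * b1;
            }

            /* Store the C tile once, after the inner loop. */
            c[i * n + j] = c00;
            c[i * n + j + 1] = c01;
            c[(i + 1) * n + j] = c10;
            c[(i + 1) * n + j + 1] = c11;
        }
    }
}
```

In this sketch the inner loop performs four loads for four multiply-adds, whereas the baseline performs two loads of A and B plus a load and a store of C for every multiply-add. Register-level reuse of this kind is what the transformations are meant to expose to the compiler.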

Author(s): Berna Juan, Alejandro - Jiménez Castells, Marta - Llaberia Griñó, José M.

Id.: 55053379

Language: English

Version: 1.0

Status: Final

Type: 8 p.

Keywords: Àrees temàtiques de la UPC - Informàtica - Sistemes d'informació

Resource type: Conference report

Interactivity type: Expositive

Interactivity level: very low

Audience: Student - Teacher - Author

Structure: Atomic

Cost: no

Copyright: yes

: Open Access

Formats: 8 p.

Technical requirements: Browser: Any

Relation: [References] http://jp2011.pcg.ull.es/downloads/jp2011/Actas_JP2011.pdf

Contribution date: 05-May-2013

Contact:

Location:
* Berna, A.; Jimenez, M.; Llaberia, J. Source-to-Source transformations for efficient SIMD code generation. In: Jornadas de Paralelismo. "Actas de las XXII Jornadas de Paralelismo". La Laguna, Tenerife: 2011, p. 719-726.
* 978-84-694-1791-1


Other resources by the same author(s)

  1. Multilevel tiling for non-rectangular iteration spaces: The main motivation of this thesis is the development of new compilation techniques aimed at...
  2. Vectorized register tiling: In the last years, there has been much effort in commercial compilers (icc, gcc) to exploit efficien...
  3. Source code transformations for efficient SIMD code generation: Despite the effort invested in recent years in commercial compilers to generate efficient SIMD instru...
  4. ICT4Girls: from high school to university: an approach: Technological advances are improving living conditions. Oddly, there continues to be a decrease in t...

