Friday, October 24, 2014

 

 



Soy un nuevo usuario

Olvidé mi contraseña

Entrada usuarios

Lógica Matemáticas Astronomía y Astrofísica Física Química Ciencias de la Vida
Ciencias de la Tierra y Espacio Ciencias Agrarias Ciencias Médicas Ciencias Tecnológicas Antropología Demografía
Ciencias Económicas Geografía Historia Ciencias Jurídicas y Derecho Lingüística Pedagogía
Ciencia Política Psicología Artes y Letras Sociología Ética Filosofía


Source-to-Source transformations for efficient SIMD code generation

1) La descarga del recurso depende de la página de origen
2) Para poder descargar el recurso, es necesario ser usuario
    registrado en Universia


  Descargar recurso

Detalles del recurso

Pertenece a: UPCommons - E-prints UPC Universitat Politècnica de Catalunya   

Descripción: In the last years, there has been much effort in commercial compilers to generate efficient SIMD instructions-based code sequences from conventional sequential programs. However, the small numbers of compilers that can automatically use these instructions achieve in most cases unsatisfactory results. Therefore, the code often has to be written manually in assembly language or using compiler built-in functions to achieve high performance. In this work, we present source-to-source transformations that help commercial vectorizing compilers to generate efficient SIMD code. Experimental results show that excellent performance can be achieved. In particular, for the problem of matrix product (SGEMM) we almost achieve as high performance as hand-optimized numerical libraries. Our source-tosource transformations are based on the scalar replacement and unroll and jam transformations presented by Callahan et all. In particular, we extend the use of scalar replacement to vectorial replacement and combine this transformation with unroll and jam and outer loop vectorization to fully exploit the vector register level and thus to help the compiler to generate efficient SIMD code. We will show experimentally the effectiveness of our proposal.

Autor(es): Berna Juan, Alejandro -  Jiménez Castells, Marta -  Llaberia Griñó, José M. - 

Id.: 55053379

Idioma: English  - 

Versión: 1.0

Estado: Final

Tipo:  8 p. - 

Palabras claveÀrees temàtiques de la UPC -  Informàtica -  Sistemes d'informació - 

Tipo de recurso: Conference report  - 

Tipo de Interactividad: Expositivo

Nivel de Interactividad: muy bajo

Audiencia: Estudiante  -  Profesor  -  Autor  - 

Estructura: Atomic

Coste: no

Copyright: sí

: Open Access

Formatos:  8 p. - 

Requerimientos técnicos:  Browser: Any - 

Relación: [References] http://jp2011.pcg.ull.es/downloads/jp2011/Actas_JP2011.pdf

Fecha de contribución: 05-may-2013

Contacto:

Localización:
* Berna, A.; Jimenez, M.; Llaberia, J. Source-to-Source transformations for efficient SIMD code generation. A: Jornadas de Paralelismo. "Actas de las XXII Jornadas de Paralelismo". La Laguna, Tenerife: 2011, p. 719-726.
* 978-84-694-1791-1


Otros recursos del mismo autor(es)

  1. Multilevel tiling for non-rectangular interation spaces La motivación principal de esta tesis es el desarrollo de nuevas técnicas de compilación dirigidas a...
  2. Vectorized register tiling In the last years, there has been much effort in commercial compilers (icc, gcc) to exploit efficien...
  3. Source code transformations for efficient SIMD code generation Despite the effort inverted the last years in commercial compilers to generate efficient SIMD instru...
  4. ICT4Girls: from high school to university: an approach Technological advances are improving living conditions. Oddly, there continues to be a decrease in t...
  5. Vectorized register tiling In the last years, there has been much effort in commercial compilers (icc, gcc) to exploit efficien...

Otros recursos de la misma colección

  1. Laser linewidth requirements for remote heterodyne OFDM based PON scenario The relationship between the FFT size, modulation format and laser linewidth is studied in a remote ...
  2. Layer 2 redesign for Metro-Access next generation PON Necessary modifications and extensions into Layer 2 of the actual PON access are necessary in order ...
  3. Performance evaluation of an all-optical OCS/OPS-Based network for intra-data center connectivity services As the size of Data Centers (DCs) and the amount of information managed by them grows, nowadays elec...
  4. Accurate geolocation of rfi sources in smos imagery based on superresolution algorithms Accurate geolocation of SMOS RFI sources is very important for effectively switching-off the illegal...
  5. Experimental validation of a real-time energy management system using multi-period gravitational search algorithm for microgrids in islanded mode Both performance optimization and scheduling of the distributed generation (DG) are relevant impleme...

Valoración de los usuarios

No hay ninguna valoración para este recurso.Sea el primero en valorar este recurso.
 

Busque un recurso