Application-level Performance Optimization: A Computer Vision Case Study on STHORM

Author(s): V. Schwambach, S. Cleyet-Merle, A. Issard, S. Mancini

Doc. Source: International Conference on Computational Science (ICCS) Workshop on Architecture, Languages, Compilation and Hardware support for Emerging ManYcore systems (ALCHEMY)

Publisher: Elsevier

Pages: 1113-1122

DOI: 10.1016/j.procs.2014.05.100

Computer vision applications are one of the key drivers for embedded many-core architectures. To exploit the full potential of such systems, a balance between computation and communication is critical, but many computer vision algorithms exhibit highly data-dependent behavior that complicates this task. To enable application performance optimization, the development environment must provide the developer with tools for fast and precise application-level performance analysis. We describe the process of porting and optimizing a face detection application on the STHORM many-core accelerator using the STHORM OpenCL SDK. We identify the main factors that limit performance and distinguish the contributions arising from the application itself, the OpenCL programming model, and the STHORM OpenCL SDK. Finally, we show how these issues can be addressed in the future to enable developers to further improve application performance.