Parallel Programming and Optimization Based on TMS320C6678
2014 ◽
Vol 615
◽
pp. 259-264
◽
Keyword(s):
The development of multi-core processors has provided a good solution to applications that require real-time processing and a large number of calculations. However, simply exploiting parallelism in software is hard to make full use of the hardware performance. This paper studies the parallel programming and optimization techniques on TMS320C6678 multicore digital signal processors. We firstly illustrate an implementation of a selected parallel image convolution algorithm by OpenMP. Then several optimization techniques such as compiler intrinsics, cache, DMA are used to further enhance the application performance and achieve a good execution time according to the test results.
2007 ◽
Vol 52
(1)
◽
pp. 143-148
Keyword(s):
Keyword(s):
2011 ◽
Vol 130-134
◽
pp. 2944-2947
Keyword(s):
Keyword(s):