When all the work-items have finished, one work-item sums the local array into an element of a global array indexed by work-group id. Not every device needs to implement each level of this hierarchy in hardware. Paraver provides a way to view and analyze these traces graphically.
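The work-group reduction described above can be sketched as a serial emulation in plain Python. This is only an illustration of the pattern, not device code: the function name `workgroup_partial_sums`, the parameter `local_size`, and the choice of local id 0 as the summing work-item are assumptions made for the example, and the loops stand in for work-items that would run in parallel on a real device.

```python
def workgroup_partial_sums(data, local_size):
    """Serially emulate a per-work-group sum (hypothetical helper)."""
    if local_size <= 0 or len(data) % local_size != 0:
        raise ValueError("global size must be a positive multiple of local_size")

    num_groups = len(data) // local_size
    partial = [0] * num_groups              # stands in for the global array

    for group_id in range(num_groups):
        local = [0] * local_size            # stands in for the local array

        # Phase 1: every work-item stores its value in its own local slot.
        for local_id in range(local_size):
            global_id = group_id * local_size + local_id
            local[local_id] = data[global_id]

        # On a real device, a work-group barrier would be needed here so
        # that all work-items have finished before the local array is read.

        # Phase 2: one work-item (assumed here to be local id 0) sums the
        # local array into the global element indexed by work-group id.
        total = 0
        for value in local:
            total += value
        partial[group_id] = total

    return partial


partial = workgroup_partial_sums(list(range(8)), 4)
```

Each entry of `partial` holds the sum for one work-group, so a final pass over that much smaller array (on the host or in a second kernel) would produce the overall total.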
nest...