Changes between Version 83 and Version 84 of DataParallel/BenchmarkStatus
- Timestamp:
- 12/05/10 17:01:06 (2 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
DataParallel/BenchmarkStatus
v83 v84 108 108 || dph.dotp.vector.seq.N4 || 68ms || 1 || 1 || || 109 109 || dph.dotp.vectorised.seq.N4 || 58ms || 1.17 || || A || 110 || dph.dotp.vectorised.par.N1 || 55ms || 1.24 || 1.24 || ||110 || dph.dotp.vectorised.par.N1 || 55ms || 1.24 || 1.24 || B || 111 111 || dph.dotp.vectorised.par.N2 || 33ms || 2.06 || 1.03 || || 112 112 || dph.dotp.vectorised.par.N4 || 25ms || 2.72 || 0.68 || || 113 113 114 A: The sequential vectorised version is faster than with Data.Vector. Why was this? 114 A: The core for the vectorised.seq version is equivalent to the vector version. We expect the backend has compiled it differently. Check this again with LLVM. 115 B: The vectorised.par version runs faster than vectorised.seq because the latter has a duplicate loop counter in the inner loop. We need a duplicate-loop-counter removal optimisation. 115 116 116 117 '''Status''': fine[[br]] 118 '''Todo''': Check again with LLVM. 117 119 118 120 [[br]]
