Changes between Version 32 and Version 33 of Status/May09
- Timestamp:
- 05/04/09 01:13:53 (4 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
Status/May09
v32 v33 74 74 The system currently works well for small programs, such as computing a dot product or the product of a sparse matrix with a dense vector. For such applications, the generated code is as close to hand written C code as GHC's current code generator enables us to be (i.e., within a factor of 2 or 3). We ran three small benchmarks on an 8-core x86 server and on an 8-core UltraSPARC T2 server, from which we derived two comparative figures: [http://justtesting.org/post/83014052/this-is-the-performance-of-a-dot-product-of-two a comparison between x86 and T2 on a memory-intensive benchmark (dot product)] and [http://justtesting.org/post/85103645/these-graphs-summarise-the-performance-of-data a summary of the speedup of three benchmarks on x86 and T2.] Overall, we achieved good absolute performance and good scalability on the hardware we tested. 75 75 76 Our next step is to scale the implementation up to properly handle larger programs. In particular, we are currently working on improving the interaction between vectorised code, the aggressively inlining array library, and GHC's standard optimisation phases with the goal of reducing excessively long compile timesdue to a temporary code explosion during optimisation. Moreover, Gabriele started to work on integrating specialised support for regular multi-dimensional arrays into the existing framework for nested data parallelism.76 Our next step is to scale the implementation up to properly handle larger programs. In particular, we are currently working on improving the interaction between vectorised code, the aggressively-inlined array library, and GHC's standard optimisation phases. The current main obstacle is excessively long compile times, due to a temporary code explosion during optimisation. Moreover, Gabriele started to work on integrating specialised support for regular multi-dimensional arrays into the existing framework for nested data parallelism. 77 77 78 78 === Type system improvements ===
