| Version 3 (modified by simonpj, 22 months ago) |
|---|
Improved dependency tracking for GHC
Many packages these days are using Template Haskell to grab information from the filesystem at compile time. I'll use Hamlet as a motivating example, though this applies to others as well.
In Hamlet, you can specify an external file as the source for your HTML template code, with something like:
{-# LANGUAGE TemplateHaskell #-}
module M where
import X
import Y
$(renderHamlet $(hamletFile "myfile.hamlet"))
Now suppose "myfile.hamlet" is updated, but there are no changes in M.hs, X.hs, Y.hs (and their imports). Then ghc --make will think that M doesn't need recompiling at all and will not recompile M (possibly saying "Recompilation is NOT required"). Which is obviously wrong.
Even with plain module-at-a-time ghc -c M.hs, GHC may wrongly fail to recompile M because it thinks its unnecessary. You have to use -fforce-recomp to make it happen.
The situation is very similar with CPP:
{-# LANGUAGE Cpp #-}
module M where
import X
import Y
#include "myfile.h"
Again ghc --make will fail to see the dependency on "myfile.h" and may therefore fail to recompile M when it should really do so.
Possible solution
The solution comes in these pieces:
- A part (different for TH and CPP) that tells GHC what files went into compiling M
- A way (shared) to record this "file-dependency info" in M's interface file, M.hi
- A new bit of the recompilation check that makes M recompile if any of its file-dependencies are out of date.
Concering (2), GHC already has a components of M.hi that records "usages": fingerprints of the things that GHC used in compiling M. This is the mi_usages field of HscTypes.ModIface. To record the extra dependency we can just add a new constructor to HscTypes.Usage for FileUsage. (Record an absolute, not relative path!)
Then (3) is relatively easy: just check the modification date of M.o against the dependened-on file.
That leaves (1). We'll need:
- A new field in TcgGblEnv to accumulate file-dependencies.
- Some way for #include directives to be recorded in that list (presumably by interpreting the #line directives that CPP outputs.
- A way for Template Haskell code to express a dependency. A possible solution is to add a new function to the template-haskell package:
addDependentFile :: FilePath -> Q ()
This in turn would require the Qusai class to support such an operation.
Note that ghc -M would not spit out these dependencies. The solution proposed here simply records what dependencies where encountered while compiling M, and hence when M should be re-compiled. It doesn't tell you (in advance) which files compiling M will depend on.
