Ticket #215 (closed task: fixed)

Opened 5 years ago

Last modified 2 years ago

Overhaul support for package's tests

Reported by: duncan Owned by: RichardG
Priority: normal Milestone: Cabal-1.10
Component: Cabal library Version:
Severity: normal Keywords:
Cc: dons@…, will@…, johan.tibell@…, peteg42@…, bos@…, michal.palka@…, pho@… Difficulty: project(> week)
GHC Version: 6.8.2 Platform:

Description

The current support for running package's testsuites is almost uniformly unused and packages instead implement ad-hoc solutions which makes it hard for users to run the testsuites and impossible to automate on hackage.

The current support is so trivial anyone who takes this task on can ignore it and start with a clean slate.

A  thread on the issue.

Here are some ideas for possible requirements:

  • Support tests even in the Simple build system. Do not require using the build-type: Custom just to be able to use tests.
  • Define a standard interface for returning test results from a testsuite to Cabal so they can be included into a build report and could be published on the hackage website or some other custom in-house test server.
  • Support for above interface in QuickCheck and HUnit.
  • Support for HPC when running testsuite programs, and gathering coverage results, again for reporting purposes.
  • We want to be able to run tests on a library that is not installed yet. This will rely on support in Cabal for registering a package inplace in a local package db, and building test code against that.
  • What about performance tests? What format should that use for reporting?

The emphasis is on automation. A lot of interesting information becomes available with enough automation. For example we could compare on hackage which packages have more comprehensive testsuites (measured by HPC coverage). Or we could track performance over time. Once the data is available there is a lot we could do with it. It's not just for a public hackage server, developers could use this privately too.

Change History

  Changed 5 years ago by dons

  • cc dons@… added

  Changed 5 years ago by duncan

Here's an idea

test
  test-is: Test
  build-depends: QuickCheck
  hs-source-dirs: tests src

So imaging a dir layout like:

foo.cabal
src/*
tests/Test.hs

where tests/Test.hs is

module Test (tests) where

tests :: SomethingYetToBeDecided
tests = ...

So a test is really like an executable or library. It can have all the ordinary buildinfo which for example allows it to depend on extra testing libs. The difference is that instead of having a single Main.main entry point or a bunch of exposed modules, it has a test entry point in one module.

That also means the interface is a Haskell one, not a process text stdin/stdout one. That's likely more portable between Haskell implementations and crucially it allows us to build the test code with additional profiling or hpc or whatever. It also means that in principle such tests could be secure if the test type does not involve IO and we use a Haskell implementation that can run untrusted pure code securely.

The tricky bit is what the type of the test entry point should be. We want to be able to gather the results of the tests. Perhaps the test stanza also needs to declare what version of the test api interface it is using since it is likely to be refined/improved over time.

  Changed 5 years ago by guest

That also means the interface is a Haskell one, not a process text stdin/stdout one. That's likely more portable between Haskell implementations and crucially it allows us to build the test code with additional profiling or hpc or whatever.

Really? Is it not possible to build with hpc style stuff if the interface is just a command line program? I'm also not overly convinced by the argument about portability.

Why not assume Test-Is: Test, means that Test is a module Main, with main in it. You run the tests (perhaps supplying test-arguments: for command line arguments), and the error code is the number of tests fail, where exitSuccess is equivalent to no failures. The log of the test run could also be displayed on hackage.

Command line programs are simple, and can be written without learning a new API. They can also be invoked from outside Cabal. For example, the tagsoup program has a tagsoup binary, which has several example programs in it. One of those example things is "tagsoup test" which runs the test properties. Changing to have Test call system "tagsoup test" would be a bit of a shame.

In general, something does need to be done, but keeping it very simple seems like a good idea. The idea of having a test program be an actual program with an error code seems to appeal to the Unix ideal of keeping things simple and self-contained. I know for certain that Colin would back me up on this :-)

I would also suggest performance testing be given a back seat. In general, its hard to do it well, and get meaningful results. A separate speed-check tool is being worked on, and that can be done separately from Cabal. If it was anyone other than Dons/Duncan having this discussion, no one would have considered performance :-)

  Changed 5 years ago by duncan

  • version 1.2.3.0 deleted
  • milestone set to Cabal-1.6

  Changed 5 years ago by duncan

  • milestone changed from Cabal-1.6 to Cabal-1.8

  Changed 4 years ago by wjt

  • cc will@… added

  Changed 4 years ago by tibbe

  • cc johan.tibell@… added

  Changed 4 years ago by peteg

  • cc peteg42@… added

  Changed 4 years ago by bos

  • cc bos@… added

  Changed 3 years ago by michal.palka

  • cc michal.palka@… added

follow-up: ↓ 13   Changed 3 years ago by RichardG

  • owner set to RichardG
  • status changed from new to assigned

I want to take this on.

I'm very much in favor of having Cabal build and run external programs that print formatted data to stdout, which is then parsed and aggregated by Cabal. This makes it easier to integrate new testing libraries and also provides more control over how testing is performed (e.g., the HUnit tests give different results with 6.12.1 when compiled with -O and without).

in reply to: ↑ 12 ; follow-up: ↓ 14   Changed 3 years ago by tibbe

Replying to RichardG:

I want to take this on. I'm very much in favor of having Cabal build and run external programs that print formatted data to stdout, which is then parsed and aggregated by Cabal. This makes it easier to integrate new testing libraries and also provides more control over how testing is performed (e.g., the HUnit tests give different results with 6.12.1 when compiled with -O and without).

Thomas Tuegel will work on this as part of Google Summer of Code (with me as his mentor):

 http://socghop.appspot.com/gsoc/student_project/show/google/gsoc2010/haskell/t127230760295

I suggest you talk to Thomas if you're interested in working on this project.

in reply to: ↑ 13   Changed 3 years ago by RichardG

Replying to tibbe:

Thomas Tuegel will work on this as part of Google Summer of Code (with me as his mentor)...

That's great news! I'll send him an email. (I did a little searching and didn't find much info on whether the project had been taken up or not.)

It may be a good idea to reassign this bug to him to avoid confusion.

  Changed 2 years ago by PHO

  • cc pho@… added

  Changed 2 years ago by duncan

  • status changed from assigned to closed
  • resolution set to fixed
  • milestone changed from Cabal-1.8 to Cabal-1.10

This is now basically done (well done Thomas!). There are some issues remaining, but those have their own tickets.

Note: See TracTickets for help on using tickets.