HipSpec is an inductive theorem prover for Haskell programs. It will (try to) conjecture essential lemmas to prove tricky properties. It supports all algebraic Haskell 98 data types and polymorphism. We assume that functions terminate on finite input and produce finite output. All quantification is only over total and finite values.
In the examples/Rotate.hs
file we see the following definitions:
rotate :: Nat -> [a] -> [a]
rotate Z xs = xs
rotate _ [] = []
rotate (S n) (x:xs) = rotate n (xs ++ [x])
prop_rotate :: [a] -> Prop [a]
prop_rotate xs = rotate (length xs) xs =:= xs
The second definition defines a property in the HipSpec property language that
we would like to prove. The =:=
operator simply means equals.
Let's run HipSpec on this file:
$ hipspec Rotate.hs --auto --cg --verbosity=30
Depth 1: 11 terms, 5 tests, 29 evaluations, 11 classes, 0 raw equations.
Depth 2: 63 terms, 100 tests, 2151 evaluations, 48 classes, 15 raw equations.
Depth 3: 1688 terms, 100 tests, 63851 evaluations, 1303 classes, 385 raw equations.
Proved xs++[] == xs by induction on xs using Z3
Proved (xs++ys)++zs == xs++(ys++zs) by induction on xs using Z3
Proved length (xs++ys) == length (ys++xs) by induction on ys,xs using Z3
Proved rotate m [] == [] without induction using Z3
Proved rotate m (rotate n xs) == rotate n (rotate m xs) by induction on n,m using Z3
Proved rotate (S m) (rotate n xs) == rotate (S n) (rotate m xs) by induction on xs using Z3
Proved rotate m (x:[]) == x:[] by induction on m using Z3
Proved rotate m xs++rotate m xs == rotate m (xs++xs) by induction on m using Z3
Proved length (rotate m xs) == length xs by induction on m using Z3
Proved rotate (length xs) (xs++ys) == ys++xs by induction on xs using Z3
Proved prop_rotate {- rotate (length xs) xs == xs -} without induction using Z3
Proved:
xs++[] == xs
(xs++ys)++zs == xs++(ys++zs)
length (xs++ys) == length (ys++xs)
rotate m (rotate n xs) == rotate n (rotate m xs)
rotate (S m) (rotate n xs) == rotate (S n) (rotate m xs)
rotate m (x:[]) == x:[]
rotate m xs++rotate m xs == rotate m (xs++xs)
length (rotate m xs) == length xs
rotate (length xs) (xs++ys) == ys++xs
prop_rotate {- rotate (length xs) xs == xs -}
The property and some lemmas conjectured by QuickSpec have been proved! Success!
HipSpec is maintained for GHC 7.8.3.
If you have an older version of GHC and cannot upgrade, try building at commit 2ba4d52c2c101a810dea4058496cf5f0da199b31.
You need to have the development version of QuickSpec. It can be obtained by cloning that repository:
git clone https://github.com/nick8325/quickspec
Assuming your directory structure looks like this:
$CWD/
hipspec/
...
quickspec/
...
You current directory is $CWD. You can then install quickspec and hipspec in one go, which makes the dependency analysis better:
cabal install hipspec/ quickspec/
The homebrew
program sometimes messes up the package ghc-paths
. If you get
the error that hipspec
cannot find the HipSpec
module, try to reinstall
this package with:
cabal install ghc-paths --reinstall
We support:
- Z3. The default.
- CVC4.
- Alt Ergo, in particular versions 0.94 and 0.95. It exists in the ubuntu repositories, and in Arch Linux' AUR.
- vampire, but your executable should be called
vampire-rel
(or changesrc/HipSpec/ATP/Provers.hs
). - E
- SPASS
See --help
to see the flags to select theorem provers.
HipSpec can generate a signature for you with the --auto
flag. If you want to
see what signature it generated for you, use the --print-auto-sig
flag.
Unfortunately, the monomorphiser is not very clever yet: say you have lists
in your program, then (:)
will only get the type A -> [A] -> [A]
and not
other presumably desireable instances such as [A] -> [[A]] -> [[A]]
and
Nat -> [Nat] -> [Nat]
.
To make HipSpec print a small, pruned theory of everything that is proved
to be correct, run the compiled binary with --explore-theory
, or -e
for
short.
Quick information about available flags can be accessed anytime by the
--help
flag to the hipspec
executable.
--call-graph
(--cg
): Order equations according to the call graph in the program.--swap-repr
(-s
): Swap equations with their representative--prepend-pruned
(-r
): Prepend nice, pruned, equations from QuickSpec before attacking all equations.--quadratic
(-q
): Add all pairs of equations from every equivalence class instead of picking a representative.--interesting-cands
(-i
): Consider properties that imply newly found theorems.
--inddepth=INT
(-d
): Induction depth, default 1.--indvars=INT
(-v
): Variables to do induction on simultaneously, default 2.--indhyps=INT
(-H
): For some data types, and with many variables and depths, the number of hypotheses become very large. This flag gives the upper bound, and just cuts of if it gets too big (compromising completeness). Default is 200.--indobligs=INT
(-O
): If the number of parts (bases and steps) gets over this threshold, this combination of variables and depths is skipped. Default is 25.
While theorem provers are still usually single-core, you can run many
of them in parallel. The --processes
or -N
flag will let you
specify this. The default is 2, but if you to use eight cores: -N=8
.
The default timeout is one second. It can be specified with the -t
or
--timeout
flag. With -t=5
, each theorem prover invocation will be 5 seconds
long. The timeout can be issued as a double, so -t=0.25
can be used to get
250 ms timeout.
Should you wish to inspect the generated theory, you can use --output
(or
-o
) to make a proving/
directory. Then all relevant files will be put
there for you to view.
The HipSpec group consists of:
-
Dan Rosén, [email protected],
-
Koen Claessen
-
Moa Johansson
-
Nicholas Smallbone