section('packages/plunit.html')

Abstract

This document describes a Prolog unit-test framework. This framework was initially developed for SWI-Prolog. The current version also runs on SICStus Prolog, providing a portable testing framework. See section 8.1.

Robert Laing said (2018-11-23T15:17:03):

I got stuck at =load_test_files(+Options)= trying to use the plt filename as the argument.

I discovered I should use =load_test_files([make(all)]).= thanks to Google leading me to a nice tutorial at http://www.paulbrownmagic.com/blog/swi_prolog_unit_testing_env

LogicalCaptain said (2020-05-17T12:09:13):

Test cases are your track bed through the swampland

You do not want to haphazardly enter various queries on the toplevel to see whether everything is working after the latest change. Of course not!

Compared to imperative programs, and even functional programs, a predicate call can perform in "several different modes" (unless you stick to functional style, which may be good practice in certain cases). There is practically no typing that can be used as guardrail, additionally a variable can appear unbound when you don't expect it and for added complication unification makes data flow bidirectionally between terms in one statement. A lot of elementary operations can be performed in a very little amount of code and you are bound to miss a number of edge cases.

With test cases, you can make sure both edge cases and standard cases are covered, lead to expected behaviour and stay covered while you add features and execution paths.

The infrastructure supports having test code in separate files terminating in `.plt` instead of `.pl` and to load them with load_test_files/1 - very convenient!

Some examples

tests_demonstrating_units_tests.pl

Suggestions (IMHO)

Use a good explanatory string to label your test case, instead of an obscure atomic label:

test("no redo, choicepoint left open") :- throw_on_redo(1).

instead of

test(one) :- throw_on_redo(1).

If the test expression gets complex, move it out of the head and build it in the body:

(Of course, sometimes you don't have such an expression, e.g. if you test var(X))

test("collect them yourself, built a test expression, test in the head", [true(T)]) :-
   bagof(X,nondeterminism(X),Bag),
   T = (Bag == [1,2]).

Usefully interwork with assertions

See https://eu.swi-prolog.org/pldoc/man?section=testassertion

You can have multiple assertions in a test body which do fail the test but do not break off computation (unlike in non-test code):

:- debug(assertion_info).
  
:- begin_tests(assertion_test).

test(0) :-
   assertion(true),
   debug(assertion_info,"Now past succeeded assertion",[]).

test(1) :-
   assertion(false),
   debug(assertion_info,"Now past single failed assertion",[]).

test(2) :-
   X = 2,
   assertion(float(X)),
   assertion(X > 3),
   debug(assertion_info,"Now past two failed assertions",[]).

:- end_tests(assertion_test).

Then

?- run_tests.
% PL-Unit: assertion_test
% Now past succeeded assertion
.
ERROR: /home/user/test_assertion.pl:9:
        test 1: assertion failed
        Assertion: false
% Now past single failed assertion
A
ERROR: /home/user/test_assertion.pl:13:
        test 2: assertion failed
        Assertion: float(2)
ERROR: /home/user/test_assertion.pl:13:
        test 2: assertion failed
        Assertion: 2>3
% Now past two failed assertions
A done
% 3 assertions failed
% 2 tests failed
% 1 tests passed
false.

Good to read: Why `==` instead of `=` is preferable

https://swi-prolog.discourse.group/t/little-testing-tip/1371

Special magic is this box

Note that this does not obey usual Prolog semantics:

test(two_all,all(M=[1,1])) :- member(M,[1,1,3]),M=1.

I suppose the right-hand side is wrapped in a findall/3 of M.

If your test fails your bindings will be trashed ... d'oh!

test(one,[error(Formal)]) :-    % this catches the exception term thrown by must_be/2
   Formal = domain_error(_,_),  % this is not thrown by the next instruction
   must_be(integer,foo).

This test succeeds, even though domain_error(_,_) is not thrown by must_be/2, and so (at first sight) is not expected to be caught.

However, the failure rolls back the binding to T.... so the catch catches anything.

Some notes on loading modules and the corresponding plunit tests

Module code can be found in `.pl` files, whereas the correspond plunit test blocks can be found in `.plt` files having the same "root filename": foo.pl and foo.plt. Here are some notes on this: load_and_test_script.pl

Possible improvements

The test identifier (1st arg to test/1 or test/2) should be available in the test body. Sometimes one wants to call debug/3 to print something and having identifier would be great. But Prolog doesn't directly cater for such a possibility...

A note on modules

"In the current system, test units are compiled into sub-modules of the module in which they appear." actually sounds wrong.

AFAIK, there are no "submodules" in the current SWI-Prolog implementation, only modules.

The test are compiled into (top level) modules named after the plunit block:

:-begin_tests(footest).
:-end_tests(footest).

Creates a new module plunit_footest. Apparently every predicate in the plunit block is exported (?)

Special consequence: If you want external meta-predicates not declared as meta-predicates to call predicates inside he plunit block by name, you must "guess the plunit module name" and qualify the predicate name so that the caller can resolve it:

% With meta_predicate declaration!

:- meta_predicate(collect_metapredicate(0)).
collect_metapredicate(Collector) :- call(Collector).

% Without meta_predicate declaration!

collect_vanilla(Collector) :- call(Collector).

% ---
% The code below will be compiled into module plunit_footest
% ---

:- begin_tests(footest).

helper(X) :-
   member(X,[1,2,3]).

% The predicate (term) given to bagof/3 does not need qualification:

test("call me, bagof/3!", true(Bag == [1,2,3])) :-
   bagof(X,helper(X),Bag).

% A term which is given to a metapredicate that has not been declared as such
% needs qualification by the (guessed) name of the plunit module:

test("call me, collect_vanilla/1", [true(X == 1),nondet]) :-
   collect_vanilla(plunit_footest:helper(X)).

% A term which is given to a metapredicate that has been properly declared
% as such (on the correct argument position of course) does not need qualification:

test("call me", [true(X == 1),nondet]) :-
   collect_metapredicate(helper(X)).

:- end_tests(footest).

load_test_files_debug(_Options) :-
  (   writeln("==> PlUnit:load_test_files_debug:"),
      source_file(File),
      file_name_extension(Base, Old, File),
      Old \== plt,
      file_name_extension(Base, plt, TestFile),
      exists_file(TestFile),
      write("\tFile to be tested: "), writeln(File),
      write("\tDir and base name: "), writeln(Base),
      write("\tIt's extension:    ."), writeln(Old),
      write("\tFile with tests:   "), writeln(TestFile),
      (   test_file_for(TestFile, File)
      ->  true
      ;   load_files(TestFile,
                     [ if(changed),
                       imports([])
                     ]),
          asserta(test_file_for(TestFile, File))
      ),
      fail ; true
  ).

The only change is the addition of 5 lines with write/writeln predicates which write to the stdout name(s) of tested `.pl file(s) and used .plt` file(s).

It helped me a lot while debugging my scripts.

(of course when new function is created it should be added to exported functions list)

Hans Nowak said (2017-12-03T21:25:50):

"In the current system, test units are compiled into sub-modules of the module in which they appear."

It is not stated here explicitly, but apparently this affects the behavior of assertz/1, retractall/1, and the lookup of any predicates defined using assert{a,z}? I ran into issues with calls to retractall/1 inside tests that (seemingly) didn't actually retract anything, and predicates defined with assertz/1 inside tests that were then not found by other predicates (defined elsewhere). Such behavior makes more sense though if you know that tests run in their own (sub)module. Which is indeed stated here, but IMHO it's not clear what it actually means in practice.

(The retractall/1 inside the test didn't remove predicates defined in the enclosing module (in this case, user); similarly, predicates defined with assertz/1 inside the test, were not found by a predicate defined in user. This did not happen when I ran the exact same code outside of the tests.)

Table of Contents

1 Introduction

2 A Unit Test box

2.1 Test Unit options

2.2 Writing the test body

2.2.1 Testing deterministic predicates

2.2.2 Testing semi-deterministic predicates

2.2.3 Testing non-deterministic predicates

2.2.4 Testing error conditions

2.2.5 One body with multiple tests using assertions

3 Using separate test files

4 Running the test-suite

4.1 Running the test suite from Prolog

4.2 Running the test suite from the command line

5 Tests and production systems

6 Controlling the test suite

7 Auto-generating tests

8 Portability of the test-suite

8.1 PlUnit on SICStus

9 Motivation of choices

Easy to understand and flexible

Index

Test cases are your track bed through the swampland

Some examples

Suggestions (IMHO)

Usefully interwork with assertions

Good to read: Why `==` instead of `=` is preferable

Special magic is this box

If your test fails your bindings will be trashed ... d'oh!

Some notes on loading modules and the corresponding plunit tests

Possible improvements

See also

A note on modules

See also

Easy to understand and flexible

Test cases are your track bed through the swampland

Some examples

Suggestions (IMHO)

Usefully interwork with assertions

Good to read: Why == instead of = is preferable

Special magic is this box

If your test fails your bindings will be trashed ... d'oh!

Some notes on loading modules and the corresponding plunit tests

Possible improvements

See also

A note on modules

See also

Good to read: Why `==` instead of `=` is preferable