1.2 Results

Important note: A number of problems in this test suite have no antiderivative in closed form. This means the antiderivative of these integrals can not be expressed in terms of elementary, special functions or Hypergeometric2F1 functions. RootSum and RootOf are not allowed.

If a CAS returns the above integral unevaluated within the time limit, then the result is counted as passed and assigned an A grade.

However, if CAS times out, then it is assigned an F grade even if the integral is not integrable, as this implies CAS could not determine that the integral is not integrable in the time limit.

If a CAS returns an antiderivative to such an integral, it is assigned an A grade automatically and this special result is listed in the introduction section of each individual test report to make it easy to identify as this can be important result to investigate.

The results given in in the table below reflects the above.

System % solved % Failed
IntegrateAlgebraic 99.40 ( 3135 ) 0.60 ( 19 )
Fricas 69.47 ( 2191 ) 30.53 ( 963 )
Mathematica 69.18 ( 2182 ) 30.82 ( 972 )
Rubi 64.49 ( 2034 ) 35.51 ( 1120 )
Maple 56.91 ( 1795 ) 43.09 ( 1359 )
Mupad 25.68 ( 810 ) 74.32 ( 2344 )
Giac 24.57 ( 775 ) 75.43 ( 2379 )
Maxima 18.07 ( 570 ) 81.93 ( 2584 )
Sympy 17.34 ( 547 ) % 82.66 ( 2607 )
Table 1.1:Percentage solved for each CAS

The table below gives additional break down of the grading of quality of the antiderivatives generated by each CAS. The grading is given using the letters A,B,C and F with A being the best quality. The grading is accomplished by comparing the antiderivative generated with the optimal antiderivatives included in the test suite. The following table describes the meaning of these grades.

grade

description

A

Integral was solved and antiderivative is optimal in quality and leaf size.

B

Integral was solved and antiderivative is optimal in quality but leaf size is larger than twice the optimal antiderivatives leaf size.

C

Integral was solved and antiderivative is non-optimal in quality. This can be due to one or more of the following reasons

  1. antiderivative contains a hypergeometric function and the optimal antiderivative does not.
  2. antiderivative contains a special function and the optimal antiderivative does not.
  3. antiderivative contains the imaginary unit and the optimal antiderivative does not.

F

Integral was not solved. Either the integral was returned unevaluated within the time limit, or it timed out, or CAS hanged or crashed or an exception was raised.

Table 1.2:Description of grading applied to integration result

Grading is implemented for all CAS systems. Based on the above, the following table summarizes the grading for this test suite.

System % A grade % B grade % C grade % F grade
IntegrateAlgebraic 98.48 0.29 0.63 0.60
Fricas 41.22 28.12 0.13 30.53
Rubi 39.38 10.88 14.24 35.51
Mathematica 29.20 6.37 33.51 30.82
Maple 18.99 9.58 28.34 43.09
Giac 18.80 5.64 0.13 75.43
Maxima 15.47 2.50 0.10 81.93
Sympy 4.60 2.63 10.11 82.66
Mupad N/A 25.68 0.00 74.32
Table 1.3:Antiderivative Grade distribution of each CAS

The following is a Bar chart illustration of the data in the above table.

pict

The figure below compares the CAS systems for each grade level.

pict

The following table shows the distribution of the different types of failure for each CAS. There are 3 types of reasons why it can fail. The first is when CAS returns back the input within the time limit, which means it could not solve it. This the typical normal failure F .

The second is due to time out. CAS could not solve the integral within the 3 minutes time limit which is assigned F(-1).

The third is due to an exception generated. Assigned F(-2). This most likely indicates an interface problem between sagemath and the CAS (applicable only to FriCAS, Maxima and Giac) or it could be an indication of an internal error in CAS. This type of error requires more investigations to determine the cause.

System

Number failed

Percentage normal failure

Percentage timeout failure

Percentage exception failure

Rubi

1120

99.20 %

0.80 %

0.00 %

Mathematica

972

98.66 %

1.34 %

0.00 %

Maple

1359

99.85 %

0.15 %

0.00 %

Fricas

963

0.21 %

85.77 %

14.02 %

IntegrateAlgebraic

19

52.63 %

47.37 %

0.00 %

Giac

2379

87.18 %

7.27 %

5.55 %

Maxima

2584

99.07 %

0.00 %

0.93 %

Sympy

2607

72.65 %

27.20 %

0.15 %

Mupad

2344

98.21 %

1.79 %

0.00 %

Table 1.4:Failure statistics for each CAS