criterion performance measurements

overview

bench/./Curry/ReplicateDie 2

	lower bound	estimate	upper bound
OLS regression	xxx	xxx	xxx
R² goodness-of-fit	xxx	xxx	xxx
Mean execution time	2.508165775029291e-2	2.547894736397135e-2	2.606210945532307e-2
Standard deviation	6.965984825148872e-4	1.0450193007930137e-3	1.5064995518359694e-3

Outlying measurements have moderate (0.14570719632426354%) effect on estimated standard deviation.

bench/./Curry/ReplicateDie 3

	lower bound	estimate	upper bound
OLS regression	xxx	xxx	xxx
R² goodness-of-fit	xxx	xxx	xxx
Mean execution time	2.5080658200985564e-2	2.5832952193163422e-2	2.7868237276572998e-2
Standard deviation	9.615370492021815e-4	2.5780474740517857e-3	4.832636903644789e-3

Outlying measurements have moderate (0.42722779074022105%) effect on estimated standard deviation.

bench/./Curry/ReplicateDie 4

	lower bound	estimate	upper bound
OLS regression	xxx	xxx	xxx
R² goodness-of-fit	xxx	xxx	xxx
Mean execution time	2.554045648809823e-2	2.6419943799268143e-2	2.7718163767097903e-2
Standard deviation	1.7379350771683597e-3	2.3316529860060715e-3	3.0087128898990075e-3

Outlying measurements have moderate (0.3540895657600912%) effect on estimated standard deviation.

bench/./Curry/ReplicateDie 5

	lower bound	estimate	upper bound
OLS regression	xxx	xxx	xxx
R² goodness-of-fit	xxx	xxx	xxx
Mean execution time	2.4771141532390538e-2	2.511100633304272e-2	2.5391188395788626e-2
Standard deviation	4.867143085117779e-4	6.960455149974617e-4	1.031179502233606e-3

Outlying measurements have slight (4.986149584487535e-2%) effect on estimated standard deviation.

bench/./Curry/ReplicateDie 10

	lower bound	estimate	upper bound
OLS regression	xxx	xxx	xxx
R² goodness-of-fit	xxx	xxx	xxx
Mean execution time	2.4876202351592196e-2	2.5159573310981e-2	2.545348529365808e-2
Standard deviation	4.6168992591860415e-4	6.333501662920411e-4	9.5761156583288e-4

Outlying measurements have slight (4.986149584487534e-2%) effect on estimated standard deviation.

bench/./Curry/ReplicateDie 15

	lower bound	estimate	upper bound
OLS regression	xxx	xxx	xxx
R² goodness-of-fit	xxx	xxx	xxx
Mean execution time	2.500839410412954e-2	2.531172529413352e-2	2.566770712450938e-2
Standard deviation	6.086066070845804e-4	7.357792835931941e-4	8.949431283543341e-4

Outlying measurements have slight (7.388911829262826e-2%) effect on estimated standard deviation.

bench/./Curry/ReplicateDie 25

	lower bound	estimate	upper bound
OLS regression	xxx	xxx	xxx
R² goodness-of-fit	xxx	xxx	xxx
Mean execution time	2.4774115263364198e-2	2.5005770096581517e-2	2.5227096485020598e-2
Standard deviation	4.1443209776581503e-4	4.997376781032426e-4	6.112244131387863e-4

Outlying measurements have slight (4.986149584487534e-2%) effect on estimated standard deviation.

bench/./Curry/ReplicateDie 50

	lower bound	estimate	upper bound
OLS regression	xxx	xxx	xxx
R² goodness-of-fit	xxx	xxx	xxx
Mean execution time	3.642744659841225e-2	3.696895474885466e-2	3.782627720734787e-2
Standard deviation	9.057335390768865e-4	1.4610799733472083e-3	2.365249775320408e-3

Outlying measurements have moderate (0.11523108262142467%) effect on estimated standard deviation.

bench/./Curry/ReplicateDie 100

	lower bound	estimate	upper bound
OLS regression	xxx	xxx	xxx
R² goodness-of-fit	xxx	xxx	xxx
Mean execution time	0.10568303056545819	0.10896648666280408	0.11476658104904115
Standard deviation	3.3755195228827063e-3	6.717739794810498e-3	1.187863141043722e-2

Outlying measurements have moderate (0.1996646669488161%) effect on estimated standard deviation.

bench/./Curry/ReplicateDie 200

	lower bound	estimate	upper bound
OLS regression	xxx	xxx	xxx
R² goodness-of-fit	xxx	xxx	xxx
Mean execution time	0.33417877661607537	0.37749019149244606	0.46214062470729306
Standard deviation	1.314931025262922e-3	8.390083019548283e-2	9.850507658862204e-2

Outlying measurements have moderate (0.48023126543061473%) effect on estimated standard deviation.

bench/python ProbLog/replicateDie.py 2

	lower bound	estimate	upper bound
OLS regression	xxx	xxx	xxx
R² goodness-of-fit	xxx	xxx	xxx
Mean execution time	0.17209205141730813	0.177056817933529	0.18280528789877684
Standard deviation	5.376172743407427e-3	7.714913761498982e-3	9.822594699721039e-3

Outlying measurements have moderate (0.13888888888888887%) effect on estimated standard deviation.

bench/python ProbLog/replicateDie.py 3

	lower bound	estimate	upper bound
OLS regression	xxx	xxx	xxx
R² goodness-of-fit	xxx	xxx	xxx
Mean execution time	0.4968080395483412	0.5630756642640335	0.6919188005122123
Standard deviation	1.6660935361869633e-3	0.1276671469428748	0.14973715834385593

Outlying measurements have moderate (0.4810034273462169%) effect on estimated standard deviation.

bench/python ProbLog/replicateDie.py 4

	lower bound	estimate	upper bound
OLS regression	xxx	xxx	xxx
R² goodness-of-fit	xxx	xxx	xxx
Mean execution time	9.74575905338861	10.270813969798231	10.766786075352382
Standard deviation	0.48664841879377607	0.5757989592817259	0.631820469902542

Outlying measurements have moderate (0.1875%) effect on estimated standard deviation.

bench/python ProbLog/replicateDie.py 5

	lower bound	estimate	upper bound
OLS regression	xxx	xxx	xxx
R² goodness-of-fit	xxx	xxx	xxx
Mean execution time	330.03310889808927	335.667910458229	341.2018055341905
Standard deviation	4.544901117611669	6.7640183178945605	7.924973705005383

Outlying measurements have moderate (0.18749999999999997%) effect on estimated standard deviation.

bench/./WebPPL/node_modules/.bin/webppl WebPPL/replicateDie.wppl 2

	lower bound	estimate	upper bound
OLS regression	xxx	xxx	xxx
R² goodness-of-fit	xxx	xxx	xxx
Mean execution time	1.722593838974717	1.841412473959887	1.934525809182863
Standard deviation	7.867169153844462e-2	0.13755580277696441	0.1931999124441112

Outlying measurements have moderate (0.20624936285731%) effect on estimated standard deviation.

bench/./WebPPL/node_modules/.bin/webppl WebPPL/replicateDie.wppl 3

	lower bound	estimate	upper bound
OLS regression	xxx	xxx	xxx
R² goodness-of-fit	xxx	xxx	xxx
Mean execution time	1.6614248967671301	1.9726969030161854	2.4612955674820114
Standard deviation	6.721651338650614e-2	0.4702339778757757	0.6157407294054976

Outlying measurements have moderate (0.48281328975753873%) effect on estimated standard deviation.

bench/./WebPPL/node_modules/.bin/webppl WebPPL/replicateDie.wppl 4

	lower bound	estimate	upper bound
OLS regression	xxx	xxx	xxx
R² goodness-of-fit	xxx	xxx	xxx
Mean execution time	1.587018496444216	1.6422711295211532	1.6806608671710515
Standard deviation	3.527048198588694e-2	5.950228161201445e-2	8.401417924239712e-2

Outlying measurements have moderate (0.1875%) effect on estimated standard deviation.

bench/./WebPPL/node_modules/.bin/webppl WebPPL/replicateDie.wppl 5

	lower bound	estimate	upper bound
OLS regression	xxx	xxx	xxx
R² goodness-of-fit	xxx	xxx	xxx
Mean execution time	1.8574789731064811	1.9180661091086222	1.9732728056551423
Standard deviation	3.1268049409845844e-2	6.922931034370765e-2	8.801261530433822e-2

Outlying measurements have moderate (0.18749999999999997%) effect on estimated standard deviation.

bench/./WebPPL/node_modules/.bin/webppl WebPPL/replicateDie.wppl 6

	lower bound	estimate	upper bound
OLS regression	xxx	xxx	xxx
R² goodness-of-fit	xxx	xxx	xxx
Mean execution time	3.681885625749904	3.779433753753741	3.8301079681744645
Standard deviation	6.994500095534303e-3	9.492237247007859e-2	0.11702708534711065

Outlying measurements have moderate (0.1875%) effect on estimated standard deviation.

bench/./WebPPL/node_modules/.bin/webppl WebPPL/replicateDie.wppl 7

	lower bound	estimate	upper bound
OLS regression	xxx	xxx	xxx
R² goodness-of-fit	xxx	xxx	xxx
Mean execution time	19.976683511777082	20.91385816804056	21.63352700312195
Standard deviation	0.41175792954163626	0.9090918928586171	1.09558161750057

Outlying measurements have moderate (0.1875%) effect on estimated standard deviation.

In this report, each function benchmarked by criterion is assigned a section of its own. The charts in each section are active; if you hover your mouse over data points and annotations, you will see more details.

The chart on the left is a kernel density estimate (also known as a KDE) of time measurements. This graphs the probability of any given time measurement occurring. A spike indicates that a measurement of a particular time occurred; its height indicates how often that measurement was repeated.
The chart on the right is the raw data from which the kernel density estimate is built. The x axis indicates the number of loop iterations, while the y axis shows measured execution time for the given number of loop iterations. The line behind the values is the linear regression prediction of execution time for a given number of iterations. Ideally, all measurements will be on (or very near) this line.

Under the charts is a small table. The first two rows are the results of a linear regression run on the measurements displayed in the right-hand chart.

OLS regression indicates the time estimated for a single loop iteration using an ordinary least-squares regression model. This number is more accurate than the mean estimate below it, as it more effectively eliminates measurement overhead and other constant factors.
R² goodness-of-fit is a measure of how accurately the linear regression model fits the observed measurements. If the measurements are not too noisy, R² should lie between 0.99 and 1, indicating an excellent fit. If the number is below 0.99, something is confounding the accuracy of the linear model.
Mean execution time and standard deviation are statistics calculated from execution time divided by number of iterations.

We use a statistical technique called the bootstrap to provide confidence intervals on our estimates. The bootstrap-derived upper and lower bounds on estimates let you see how accurate we believe those estimates to be. (Hover the mouse over the table headers to see the confidence levels.)

A noisy benchmarking environment can cause some or many measurements to fall far from the mean. These outlying measurements can have a significant inflationary effect on the estimate of the standard deviation. We calculate and display an estimate of the extent to which the standard deviation has been inflated by outliers.

criterion performance measurements

overview

bench/./Curry/ReplicateDie 2

bench/./Curry/ReplicateDie 3

bench/./Curry/ReplicateDie 4

bench/./Curry/ReplicateDie 5

bench/./Curry/ReplicateDie 10

bench/./Curry/ReplicateDie 15

bench/./Curry/ReplicateDie 25

bench/./Curry/ReplicateDie 50

bench/./Curry/ReplicateDie 100

bench/./Curry/ReplicateDie 200

bench/python ProbLog/replicateDie.py 2

bench/python ProbLog/replicateDie.py 3

bench/python ProbLog/replicateDie.py 4

bench/python ProbLog/replicateDie.py 5

bench/./WebPPL/node_modules/.bin/webppl WebPPL/replicateDie.wppl 2

bench/./WebPPL/node_modules/.bin/webppl WebPPL/replicateDie.wppl 3

bench/./WebPPL/node_modules/.bin/webppl WebPPL/replicateDie.wppl 4

bench/./WebPPL/node_modules/.bin/webppl WebPPL/replicateDie.wppl 5

bench/./WebPPL/node_modules/.bin/webppl WebPPL/replicateDie.wppl 6

bench/./WebPPL/node_modules/.bin/webppl WebPPL/replicateDie.wppl 7

understanding this report

colophon