You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/comparison-analysis.md
-4
Original file line number
Diff line number
Diff line change
@@ -68,7 +68,3 @@ The actual algorithm for determining relevance of a comparison summary may chang
68
68
* High relevance: any number of very large or large changes, a small amount of medium changes, or a large number of small or very small changes.
69
69
* Medium relevance: any number of very large or large changes, any medium change, or smaller but still substantial number of small or very small changes.
70
70
* Low relevance: if it doesn't fit into the above two categories, it ends in this category.
71
-
72
-
### "Dodgy" Test Cases
73
-
74
-
"Dodgy" test cases are test cases that tend to produce unreliable results (i.e., noise). A test case is considered "dodgy" if its significance threshold is sufficiently far enough away from 0.
Copy file name to clipboardExpand all lines: docs/glossary.md
-1
Original file line number
Diff line number
Diff line change
@@ -38,7 +38,6 @@ The following is a glossary of domain specific terminology. Although benchmarks
38
38
***significant test result comparison**: a test result comparison above the significance threshold. Significant test result comparisons can be thought of as being "statistically significant".
39
39
***relevant test result comparison**: a test result comparison can be significant but still not be relevant (i.e., worth paying attention to). Relevance is a factor of the test result comparison's significance and magnitude. Comparisons are considered relevant if they are significant and have at least a small magnitude .
40
40
***test result comparison magnitude**: how "large" the delta is between the two test result's under comparison. This is determined by the average of two factors: the absolute size of the change (i.e., a change of 5% is larger than a change of 1%) and the amount above the significance threshold (i.e., a change that is 5x the significance threshold is larger than a change 1.5x the significance threshold).
41
-
***dodgy test case**: a test case for which the significance threshold is significantly large indicating a high amount of variability in the test and thus making it necessary to be somewhat skeptical of any results too close to the significance threshold.
0 commit comments