
Statistical Significance Testing for Search Performance

This file contains advanced benchmark testing capabilities, including cross-validation, statistical testing, and performance comparison methods.

Usage

compare_strategies(
  strategy1_results,
  strategy2_results,
  gold_standard,
  test_type = "mcnemar",
  alpha = 0.05
)

Arguments

strategy1_results

Results from first search strategy

strategy2_results

Results from second search strategy

gold_standard

Vector of relevant article IDs

test_type

Type of statistical test to run: one of "mcnemar" (the default), "paired_t", or "wilcoxon"
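
As an illustration of what the "mcnemar" option might compute, the sketch below applies base R's mcnemar.test() to paired retrieval outcomes. It is not the package's implementation. It assumes each strategy's results can be reduced to a vector of retrieved article IDs, and the names strategy1_ids, strategy2_ids, and gold_ids, along with the IDs themselves, are hypothetical.

```r
# Hypothetical inputs: IDs retrieved by each strategy, and the relevant IDs.
strategy1_ids <- c("a1", "a2", "a3", "a4", "a5", "a6")
strategy2_ids <- c("a1", "a2", "a7")
gold_ids      <- c("a1", "a2", "a3", "a4", "a5", "a6", "a7", "a8")

# For every relevant article, record whether each strategy retrieved it.
found1 <- gold_ids %in% strategy1_ids
found2 <- gold_ids %in% strategy2_ids

# 2x2 table of paired outcomes. Fixing the factor levels keeps the table
# square even if one outcome never occurs.
paired <- table(
  strategy1 = factor(found1, levels = c(TRUE, FALSE)),
  strategy2 = factor(found2, levels = c(TRUE, FALSE))
)

# McNemar's test uses only the discordant cells, that is, the articles
# found by exactly one of the two strategies.
result <- mcnemar.test(paired)

alpha <- 0.05
significant <- result$p.value < alpha
print(paired)
print(result)
```

With so few discordant pairs the chi-squared approximation is rough, so an exact binomial test on the discordant counts may be preferable for small gold standards.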

alpha

Significance level used to judge the test result (default 0.05)
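
The role of alpha can be sketched with the two paired tests named under test_type, using base R's t.test() and wilcox.test(). This assumes the paired tests operate on per-topic scores for the two strategies, which this page does not state. The vectors recall1 and recall2 are made-up illustrative values, not benchmark results.

```r
# Made-up per-topic recall for two strategies on the same eight topics.
recall1 <- c(0.82, 0.75, 0.91, 0.68, 0.88, 0.79, 0.85, 0.73)
recall2 <- c(0.78, 0.70, 0.85, 0.66, 0.80, 0.77, 0.79, 0.70)

alpha <- 0.05

# Paired t-test: assumes the per-topic differences are roughly normal.
t_res <- t.test(recall1, recall2, paired = TRUE)

# Wilcoxon signed-rank test: a rank-based alternative without that assumption.
# exact = FALSE requests the normal approximation, which tolerates ties.
w_res <- wilcox.test(recall1, recall2, paired = TRUE, exact = FALSE)

# A difference is called significant when the p-value falls below alpha.
decisions <- c(
  paired_t = t_res$p.value < alpha,
  wilcoxon = w_res$p.value < alpha
)
print(decisions)
```

A smaller alpha makes the comparison more conservative: fewer differences between strategies are reported as significant.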

Value

Statistical test results