Compute Jaccard Similarity Matrix Between Iterations
Source:R/compute_jaccard_matrix.R
compute_jaccard_matrix.Rd
Calculates pairwise Jaccard indices between iterations based on sample usage. Useful for assessing overlap and dependence in train/test/inference sets across iterations.
Arguments
- id_usage
A data.frame with columns
ids
,role
, anditeration
, such as the output fromtrack_sample_ids
.- role
A character string specifying which sample role to evaluate ("test", "train", or "inference").