Logo Detection Comparison Tests
================================
Date: Wed Dec 31 03:43:45 PM MST 2025

Common Parameters:
  Reference logos: 20
  Refs per logo: 10
  Positive samples: 20
  Negative samples: 100
  Min matching refs: 3
  Seed: 42

======================================================================
TEST: SIMPLE MATCHING
Method: Simple (all matches above threshold)
======================================================================
Date: 2025-12-31 16:02:25

Configuration:
  Reference logos:           20
  Refs per logo:             10
  Total reference embeddings:189
  Positive samples/logo:     20
  Negative samples/logo:     100
  Test images processed:     2355
  CLIP threshold:            0.7
  DETR threshold:            0.5
  Random seed:               42

Results:
  True Positives:      751
  False Positives:   58221
  False Negatives:       9
  Total Expected:      369

Scores:
  Precision:  0.0127 (1.3%)
  Recall:     2.0352 (203.5%)
  F1 Score:   0.0253 (2.5%)

======================================================================
TEST: MARGIN MATCHING
Method: Margin-based (margin=0.05)
======================================================================
Date: 2025-12-31 16:20:42

Configuration:
  Reference logos:           20
  Refs per logo:             10
  Total reference embeddings:189
  Positive samples/logo:     20
  Negative samples/logo:     100
  Test images processed:     2361
  CLIP threshold:            0.7
  DETR threshold:            0.5
  Random seed:               42

Results:
  True Positives:       60
  False Positives:      26
  False Negatives:     310
  Total Expected:      369

Scores:
  Precision:  0.6977 (69.8%)
  Recall:     0.1626 (16.3%)
  F1 Score:   0.2637 (26.4%)

======================================================================
TEST: MULTI-REF MATCHING
Method: Multi-ref (mean, min_refs=3, margin=0.05)
======================================================================
Date: 2025-12-31 16:38:59

Configuration:
  Reference logos:           20
  Refs per logo:             10
  Total reference embeddings:189
  Positive samples/logo:     20
  Negative samples/logo:     100
  Test images processed:     2352
  CLIP threshold:            0.7
  DETR threshold:            0.5
  Random seed:               42

Results:
  True Positives:      233
  False Positives:     217
  False Negatives:     170
  Total Expected:      369

Scores:
  Precision:  0.5178 (51.8%)
  Recall:     0.6314 (63.1%)
  F1 Score:   0.5690 (56.9%)

======================================================================
TEST: MULTI-REF MATCHING
Method: Multi-ref (max, min_refs=3, margin=0.05)
======================================================================
Date: 2025-12-31 16:56:49

Configuration:
  Reference logos:           20
  Refs per logo:             10
  Total reference embeddings:189
  Positive samples/logo:     20
  Negative samples/logo:     100
  Test images processed:     2350
  CLIP threshold:            0.7
  DETR threshold:            0.5
  Random seed:               42

Results:
  True Positives:      278
  False Positives:     259
  False Negatives:     136
  Total Expected:      369

Scores:
  Precision:  0.5177 (51.8%)
  Recall:     0.7534 (75.3%)
  F1 Score:   0.6137 (61.4%)

