Combine all test results in a single directory
This commit is contained in:
193
test_results/threshold_analysis/threshold_test_results.txt
Normal file
193
test_results/threshold_analysis/threshold_test_results.txt
Normal file
@ -0,0 +1,193 @@
|
||||
Threshold Optimization Tests
|
||||
=============================
|
||||
Date: Fri Jan 2 10:11:34 AM MST 2026
|
||||
|
||||
Common Parameters:
|
||||
Matching method: multi-ref (max)
|
||||
Reference logos: 20
|
||||
Refs per logo: 10
|
||||
Positive samples: 20
|
||||
Negative samples: 100
|
||||
Min matching refs: 3
|
||||
Seed: 42
|
||||
|
||||
======================================================================
|
||||
TEST: MULTI-REF MATCHING
|
||||
Model: openai/clip-vit-large-patch14
|
||||
Method: Multi-ref (max, min_refs=3, margin=0.05)
|
||||
======================================================================
|
||||
Date: 2026-01-02 10:29:26
|
||||
|
||||
Configuration:
|
||||
Embedding model: openai/clip-vit-large-patch14
|
||||
Reference logos: 20
|
||||
Refs per logo: 10
|
||||
Total reference embeddings:189
|
||||
Positive samples/logo: 20
|
||||
Negative samples/logo: 100
|
||||
Test images processed: 2358
|
||||
Similarity threshold: 0.7
|
||||
DETR threshold: 0.5
|
||||
Random seed: 42
|
||||
|
||||
Results:
|
||||
True Positives: 265
|
||||
False Positives: 288
|
||||
False Negatives: 141
|
||||
Total Expected: 369
|
||||
|
||||
Scores:
|
||||
Precision: 0.4792 (47.9%)
|
||||
Recall: 0.7182 (71.8%)
|
||||
F1 Score: 0.5748 (57.5%)
|
||||
|
||||
======================================================================
|
||||
TEST: MULTI-REF MATCHING
|
||||
Model: openai/clip-vit-large-patch14
|
||||
Method: Multi-ref (max, min_refs=3, margin=0.05)
|
||||
======================================================================
|
||||
Date: 2026-01-02 10:47:35
|
||||
|
||||
Configuration:
|
||||
Embedding model: openai/clip-vit-large-patch14
|
||||
Reference logos: 20
|
||||
Refs per logo: 10
|
||||
Total reference embeddings:189
|
||||
Positive samples/logo: 20
|
||||
Negative samples/logo: 100
|
||||
Test images processed: 2348
|
||||
Similarity threshold: 0.8
|
||||
DETR threshold: 0.5
|
||||
Random seed: 42
|
||||
|
||||
Results:
|
||||
True Positives: 233
|
||||
False Positives: 472
|
||||
False Negatives: 165
|
||||
Total Expected: 369
|
||||
|
||||
Scores:
|
||||
Precision: 0.3305 (33.0%)
|
||||
Recall: 0.6314 (63.1%)
|
||||
F1 Score: 0.4339 (43.4%)
|
||||
|
||||
======================================================================
|
||||
TEST: MULTI-REF MATCHING
|
||||
Model: openai/clip-vit-large-patch14
|
||||
Method: Multi-ref (max, min_refs=3, margin=0.1)
|
||||
======================================================================
|
||||
Date: 2026-01-02 11:05:34
|
||||
|
||||
Configuration:
|
||||
Embedding model: openai/clip-vit-large-patch14
|
||||
Reference logos: 20
|
||||
Refs per logo: 10
|
||||
Total reference embeddings:189
|
||||
Positive samples/logo: 20
|
||||
Negative samples/logo: 100
|
||||
Test images processed: 2357
|
||||
Similarity threshold: 0.8
|
||||
DETR threshold: 0.5
|
||||
Random seed: 42
|
||||
|
||||
Results:
|
||||
True Positives: 187
|
||||
False Positives: 375
|
||||
False Negatives: 208
|
||||
Total Expected: 369
|
||||
|
||||
Scores:
|
||||
Precision: 0.3327 (33.3%)
|
||||
Recall: 0.5068 (50.7%)
|
||||
F1 Score: 0.4017 (40.2%)
|
||||
|
||||
======================================================================
|
||||
TEST: MULTI-REF MATCHING
|
||||
Model: openai/clip-vit-large-patch14
|
||||
Method: Multi-ref (max, min_refs=3, margin=0.1)
|
||||
======================================================================
|
||||
Date: 2026-01-02 11:23:33
|
||||
|
||||
Configuration:
|
||||
Embedding model: openai/clip-vit-large-patch14
|
||||
Reference logos: 20
|
||||
Refs per logo: 10
|
||||
Total reference embeddings:189
|
||||
Positive samples/logo: 20
|
||||
Negative samples/logo: 100
|
||||
Test images processed: 2356
|
||||
Similarity threshold: 0.85
|
||||
DETR threshold: 0.5
|
||||
Random seed: 42
|
||||
|
||||
Results:
|
||||
True Positives: 160
|
||||
False Positives: 434
|
||||
False Negatives: 223
|
||||
Total Expected: 369
|
||||
|
||||
Scores:
|
||||
Precision: 0.2694 (26.9%)
|
||||
Recall: 0.4336 (43.4%)
|
||||
F1 Score: 0.3323 (33.2%)
|
||||
|
||||
======================================================================
|
||||
TEST: MULTI-REF MATCHING
|
||||
Model: openai/clip-vit-large-patch14
|
||||
Method: Multi-ref (max, min_refs=3, margin=0.15)
|
||||
======================================================================
|
||||
Date: 2026-01-02 11:41:47
|
||||
|
||||
Configuration:
|
||||
Embedding model: openai/clip-vit-large-patch14
|
||||
Reference logos: 20
|
||||
Refs per logo: 10
|
||||
Total reference embeddings:189
|
||||
Positive samples/logo: 20
|
||||
Negative samples/logo: 100
|
||||
Test images processed: 2359
|
||||
Similarity threshold: 0.85
|
||||
DETR threshold: 0.5
|
||||
Random seed: 42
|
||||
|
||||
Results:
|
||||
True Positives: 163
|
||||
False Positives: 410
|
||||
False Negatives: 220
|
||||
Total Expected: 369
|
||||
|
||||
Scores:
|
||||
Precision: 0.2845 (28.4%)
|
||||
Recall: 0.4417 (44.2%)
|
||||
F1 Score: 0.3461 (34.6%)
|
||||
|
||||
======================================================================
|
||||
TEST: MULTI-REF MATCHING
|
||||
Model: openai/clip-vit-large-patch14
|
||||
Method: Multi-ref (max, min_refs=3, margin=0.15)
|
||||
======================================================================
|
||||
Date: 2026-01-02 12:00:00
|
||||
|
||||
Configuration:
|
||||
Embedding model: openai/clip-vit-large-patch14
|
||||
Reference logos: 20
|
||||
Refs per logo: 10
|
||||
Total reference embeddings:189
|
||||
Positive samples/logo: 20
|
||||
Negative samples/logo: 100
|
||||
Test images processed: 2363
|
||||
Similarity threshold: 0.9
|
||||
DETR threshold: 0.5
|
||||
Random seed: 42
|
||||
|
||||
Results:
|
||||
True Positives: 84
|
||||
False Positives: 69
|
||||
False Negatives: 288
|
||||
Total Expected: 369
|
||||
|
||||
Scores:
|
||||
Precision: 0.5490 (54.9%)
|
||||
Recall: 0.2276 (22.8%)
|
||||
F1 Score: 0.3218 (32.2%)
|
||||
|
||||
Reference in New Issue
Block a user