Go to file

Rick McEwen 41c75356d9 Add --output-file option for clean results output

- Add --output-file argument to test_logo_detection.py that appends
  only the results summary (no progress indicators) to specified file
- Add write_results_to_file() with detailed header showing test type
  and method parameters
- Update run_comparison_tests.sh to use --output-file instead of
  tee/redirection, keeping console output separate from file output

2025-12-31 17:42:52 -05:00

.gitignore

Initial commit: Logo detection test framework

2025-12-31 10:42:36 -05:00

.python-version

Initial commit: Logo detection test framework

2025-12-31 10:42:36 -05:00

CLAUDE.md

Initial commit: Logo detection test framework

2025-12-31 10:42:36 -05:00

logo_detection_detr_usage.md

Add simple matching method as baseline for comparison tests

2025-12-31 17:36:18 -05:00

logo_detection_detr.py

Add simple matching method as baseline for comparison tests

2025-12-31 17:36:18 -05:00

logo_detection_test_methodology.md

Add simple matching method as baseline for comparison tests

2025-12-31 17:36:18 -05:00

main.py

Initial commit: Logo detection test framework

2025-12-31 10:42:36 -05:00

prepare_test_data.py

Initial commit: Logo detection test framework

2025-12-31 10:42:36 -05:00

pyproject.toml

Initial commit: Logo detection test framework

2025-12-31 10:42:36 -05:00

README.md

Initial commit: Logo detection test framework

2025-12-31 10:42:36 -05:00

requirements.txt

Initial commit: Logo detection test framework

2025-12-31 10:42:36 -05:00

run_comparison_tests.sh

Add --output-file option for clean results output

2025-12-31 17:42:52 -05:00

test_cuda_support.py

Initial commit: Logo detection test framework

2025-12-31 10:42:36 -05:00

test_logo_detection.py

Add --output-file option for clean results output

2025-12-31 17:42:52 -05:00

uv.lock

Initial commit: Logo detection test framework

2025-12-31 10:42:36 -05:00

README.md

Logo Detection Test Framework

A testing framework for evaluating logo detection accuracy using DETR (DEtection TRansformer) and CLIP (Contrastive Language-Image Pre-training) models.

Overview

This project provides tools to:

Detect logos in images using a fine-tuned DETR model
Match detected logos against reference images using CLIP embeddings
Evaluate detection accuracy with precision, recall, and F1 metrics

Architecture

The system uses a two-stage pipeline:

DETR - Identifies potential logo regions (bounding boxes) in images
CLIP - Extracts feature embeddings for each detected region and compares against reference logos

Installation

Requires Python 3.12+. Uses uv for package management.

# Install dependencies
uv sync

# Or using pip
pip install -r requirements.txt

Usage

Prepare Test Data

First, prepare the test database with logo mappings:

uv run python prepare_test_data.py

This creates test_data_mapping.db with ground truth mappings between test images and logos.

Run Detection Tests

# Basic test with default settings (margin-based matching)
uv run python test_logo_detection.py

# Test with more logos and custom threshold
uv run python test_logo_detection.py -n 20 --threshold 0.75

# Use multi-ref matching method
uv run python test_logo_detection.py --matching-method multi-ref \
    --refs-per-logo 5 --min-matching-refs 2

# Reproducible test with seed
uv run python test_logo_detection.py -n 50 --seed 42

Key Parameters

Parameter	Default	Description
`-n, --num-logos`	10	Number of reference logos to sample
`-t, --threshold`	0.7	CLIP similarity threshold
`-d, --detr-threshold`	0.5	DETR detection confidence threshold
`--matching-method`	margin	Matching method: `margin` or `multi-ref`
`--margin`	0.05	Margin over second-best match (margin method)
`--min-matching-refs`	1	Min refs that must match (multi-ref method)
`--refs-per-logo`	3	Reference images per logo
`-s, --seed`	None	Random seed for reproducibility

See --help for all options.

Project Structure

logo_test/
├── logo_detection_detr.py      # Core detection library (DetectLogosDETR class)
├── test_logo_detection.py      # Test script for accuracy evaluation
├── prepare_test_data.py        # Script to prepare test database
├── test_data_mapping.db        # SQLite database with ground truth
├── reference_logos/            # Reference logo images (not in git)
├── test_images/                # Test images (not in git)
├── logo_detection_detr_usage.md        # API usage guide
└── logo_detection_test_methodology.md  # Test methodology documentation

Accuracy Improvement Techniques

The framework implements several techniques to improve detection accuracy:

Non-Maximum Suppression (NMS) - Removes overlapping duplicate detections
Minimum Box Size Filtering - Filters out noise from tiny detections
Confidence Threshold Filtering - Removes low-confidence detections
Multiple Reference Images - Uses multiple refs per logo for robust matching
Margin-Based Matching - Requires confidence margin over second-best match
Multi-Ref Matching - Aggregates similarity scores across references
Embedding Caching - Caches embeddings to avoid recomputation

Models

The framework uses:

DETR: Pravallika6/detr-finetuned-logo-detection_v2
CLIP: openai/clip-vit-large-patch14

Models are automatically downloaded from HuggingFace on first run and cached in ~/.cache/huggingface/.

Documentation

API Usage Guide - How to use the DetectLogosDETR class
Test Methodology - Detailed explanation of test framework and tuning

License

MIT