Test scripts and utilities for evaluating vision-language models on jersey number detection using llama.cpp server.