Add accuracy test framework, prompts, results, and analysis reports
Includes accuracy test scripts for Qwen (local) and Gemini (cloud API), three prompt variants (original, capstone, constrained), test results from all runs, and two analysis reports with an HTML presentation version.
This commit is contained in:
5609
accuracy_test_results_all.txt
Normal file
5609
accuracy_test_results_all.txt
Normal file
File diff suppressed because it is too large
Load Diff
Reference in New Issue
Block a user