Made O'Meter
Discover where a brand or product originates
The Boswell Test is an innovative automated framework designed for the comparative analysis of Large Language Models (LLMs). Conceived by Dr. Peter Luh and implemented as the 'botwell' software project by independent developer Alan Wilhelm, the tool utilizes a peer-review methodology. In this system, multiple AI models generate essays on specific domains and then evaluate each other’s work to determine relative performance, grading bias, and an overall 'Boswell Quotient.'
As an open-source software project, its 'manufacturing' or development primarily takes place through community contributions and individual maintenance on platforms like GitHub. The framework is built to interact with various AI providers, such as OpenRouter, to facilitate testing across a diverse range of models including GPT, Claude, and Llama. It serves as a multidimensional alternative to traditional static benchmarks by leveraging the analytical capabilities of the AI models themselves.
Report a bug/Feedback
disclaimer
poweredBy