Developers are exploring new methods to benchmark generative AI models, with one initiative being the Minecraft Benchmark (MC-Bench), a platform for head-to-head competitions among AI models that generate unique Minecraft creations. Users vote on the performances without knowing which AI created each entry. The project, created by 12th-grade student Adi Singh, leverages Minecraft's universal recognition to evaluate AI capabilities. MC-Bench currently has eight volunteer contributors and has received support from major AI companies like Anthropic, Google, OpenAI, and Alibaba. The focus is on simple builds, with plans to scale to more complex tasks. MC-Bench requires models to write code for requested builds, making it easier for users to assess the quality of creations visually. Singh believes the scores from MC-Bench provide meaningful insights into AI performance compared to traditional text-based benchmarks.