benchmark

BetaBeacon
June 1, 2026
The AYANEO Pocket S Mini is a gaming device that hit the Japanese market on March 26, 2026. It is powered by a Snapdragon G3x Gen 2 SoC and allows users to enjoy Android game apps with a physical controller. The device scored 144464 on the AnTuTu Benchmark app, surpassing the performance of 35% of users. It features a quick menu called 'AYAWindow' and comes in two models with different RAM and storage options.
AppWizard
May 30, 2026
Call of Duty: Modern Warfare 4 introduces a movement system that enhances speed and fluidity, making player movement quicker and more responsive than in Modern Warfare 2. It features innovative parkour mechanics, smoother slides that maintain momentum, a new ledge grab mechanic, and overall enhanced fluidity in gameplay. The game is available on PlayStation 5, Xbox Series X and Series S, and Nintendo Switch 2.
Winsage
May 29, 2026
Hardware prices are increasing due to high demand for RAM and components driven by artificial intelligence growth, affecting sectors like gaming and smartphones. The Steam Deck has already seen a price rise. Qualcomm has launched the Snapdragon C platform, which may allow Windows ARM laptops to be priced as low as 9, potentially making them affordable alternatives to the MacBook Neo. Acer, HP, and Lenovo are among the first manufacturers to use the Snapdragon C, with the Acer Aspire Go 15 featuring this chip. The Snapdragon C utilizes Qualcomm's Kryo CPU cores, and while specific pricing and performance details are not yet available, the Acer Aspire Go 15 is expected to have a 512 GB storage drive, 8 GB of RAM, and a 1920 x 1080 display. Actual pricing will depend on hardware configurations, and a comparison with the MacBook Neo will require performance benchmarks.
AppWizard
May 28, 2026
Winlator is an application that enables Windows games to be played on Android devices by combining Wine with x86/x86_64 translation layers. The Snapdragon 8 Elite processor has compatibility issues, particularly with its Vulkan driver, making it less suitable for mobile gaming compared to the older Snapdragon 8 Gen 2. Users often resort to experimental drivers like Vortek to improve performance, but results can be inconsistent. The fragmentation of Winlator into various forks complicates the user experience, as each fork addresses different issues. GameHub, developed by GameSir, offers a more user-friendly experience but has faced criticism for invasive permissions. GameHub Lite is a modified version that improves performance and reduces tracking features. Ultimately, the Samsung Galaxy Z Fold 5 with the Snapdragon 8 Gen 2 provided a better gaming experience than the Snapdragon 8 Elite, benefiting from mature drivers and compatibility with GameHub Lite.
AppWizard
May 28, 2026
Ongoing shortages in memory chips, driven by increased demand from AI-focused data centers, have led to significant price increases for the Steam Deck OLED. The 512GB model's price rose from £479 to £649 in the UK (35% increase) and from 9 to 9 in the US (44% increase). The 1TB variant increased from £569 to £779 in the UK (36% increase) and from 9 to 9 in the US (46% increase). Valve attributes these hikes to rising component costs and logistical challenges. The 1TB Steam Deck OLED now competes closely with the Asus ROG Ally X, which offers better performance at a similar price point. The 512GB version has lost its budget-friendly appeal, as the Lenovo Legion Go S undercuts it at £549. Despite the price adjustments, the Steam Deck OLED remains favored for casual gaming due to its design and comfort, but the hikes may deter potential buyers and raise concerns about future products.
BetaBeacon
May 27, 2026
The REDMAGIC 11S Pro outperforms competitors in benchmarks, including the Galaxy S26 Ultra and OPPO Find X9 Pro. It excels in 3DMark's Wild Life Extreme test and can sustain 120fps in Call of Duty Mobile's Battle Royale. The handset struggles with Asphalt Legends but performs well in emulation tests. Cooling setups may not always be necessary for optimal gaming experiences on modern gaming phones.
AppWizard
May 26, 2026
Google launched the Android Bench benchmarking portal in March to help software developers evaluate AI models for Android app development. The leaderboard was updated last week to include open-weight models and new metrics for latency, tokens, and cost. Matthew McCullough, Google's VP of Product for Android Development, stated that the goal is to provide a benchmark for evaluating large language models (LLMs) in Android development. As of May 18, GPT 5.5 is the top AI model for Android app development, with Gemini 3.1 Pro and GPT 5.4 ranked as joint leaders. Android Bench evaluates LLMs based on real-world challenges and tasks sourced from public GitHub repositories. Other benchmarking tools in the Android ecosystem include Jetpack Microbenchmark, Jetpack Macrobenchmark, Firebase Performance Monitoring, Android Vitals, Apptim, and Android Performance Analyzer. The overall benchmark score on Android Bench is calculated using four core values: Confidence Interval Range, Average Latency Score, Average Total Tokens Score, and Average Cost. The test harness for Android Bench is publicly available on GitHub.
AppWizard
May 21, 2026
Google has updated its "Android Bench" rankings, introducing new AI models for Android app development, including open-weight models. The latest rankings, as of May 18, 2026, show GPT 5.5 at the top, surpassing GPT 5.4 and Gemini 3.1 Pro by nearly 2%. The update provides metrics such as average latency, total tokens used, and average cost per benchmark run. GPT 5.5 has a score of 74, with an average latency of 15.5, total tokens of 64.5, and an average cost of .9. In comparison, GPT 5.4 has a score of 72.4, with an average latency of 21.2, total tokens of 64.2, and an average cost of [openai_gpt model="gpt-4o-mini" prompt="Summarize the content and extract only the fact described in the text bellow. The summary shall NOT include a title, introduction and conclusion. Text: Google has refreshed its “Android Bench” rankings, unveiling a new lineup of AI models tailored for Android app development. This update introduces several “open-weight” models and provides deeper insights into the performance metrics, including token usage and associated costs. Large language models have increasingly demonstrated their prowess in coding, significantly enhancing the app development process. This trend has given rise to what is now known as “vibe coding.” Earlier this year, Google released a benchmark ranking that evaluated the top AI models for Android development, focusing on common tasks and adherence to best practices. Initially, the rankings were led by Gemini 3.1 Pro, with OpenAI’s GPT 5.4 later sharing the spotlight. However, as of the latest update on May 18, 2026, a new contender has emerged. GPT 5.5 has claimed the top position, surpassing GPT 5.4 and Gemini 3.1 Pro by nearly 2%. This update also enhances clarity by presenting average latency, total tokens utilized, and the average cost associated with each AI model. Google has provided documentation detailing the methodology behind these metrics. Average Latency: Time taken to complete 100 tasks across 10 runs Average Total Tokens: Token consumption for a complete benchmark run across 10 iterations Average Cost: Cost per benchmark run in US dollars at the time of testing While GPT 5.5 boasts superior performance, it comes at a cost—over twice that of Gemini 3.1 Pro for equivalent functions. Here’s a look at the top ten models based on Google’s latest data as of May 21, 2026: Model Score Avg Latency Avg Total Tokens Avg Cost New: GPT 5.5 74 15.5 64.5 3.9 GPT 5.4 72.4 21.2 64.2 .7 Gemini 3.1 Pro Preview 72.4 11.5 75.4 .0 New: Claude Opus 4.7 68.7 11.6 90.0 4.3 GPT 5.3 Codex 67.7 11.2 71.4 .6 Claude Opus 4.6 66.6 9.9 69.5 .4 GPT 5.2 Codex 62.5 24.3 124.4 1.9 Claude Opus 4.5 61.9 12.5 79.8 2.5 Gemini 3 Pro Preview 60.4 9.8 117.0 .7 New: GLM 5.1 59.7 33.4 80.2 .7 The rankings now feature a wider array of open-weight models, including Gemma, Qwen, DeepSeek, and MiMo, among others. GLM 5.1 has emerged as the highest scorer among these newcomers, closely followed by Kimi K2.6. Google is committed to updating the “Android Bench” on a monthly basis. With the anticipated release of Gemini 3.5 Pro and the already available 3.5 Flash, the competitive landscape will be intriguing to watch as Google seeks to reclaim its lead against OpenAI's advancements. More on Android: Follow Ben: Twitter/X, Threads, Bluesky, and Instagram FTC: We use income earning auto affiliate links. More." max_tokens="3500" temperature="0.3" top_p="1.0" best_of="1" presence_penalty="0.1" frequency_penalty="frequency_penalty"].7. Gemini 3.1 Pro has the same score as GPT 5.4 but with different latency and token metrics. The rankings also include other models like Claude Opus 4.7, GPT 5.3 Codex, and GLM 5.1, which has emerged as the highest scorer among newcomers. Google plans to update the rankings monthly.
AppWizard
May 19, 2026
The early access release of Forza Horizon 6 has set a new player count record, peaking at 273,148 players on Steam, more than tripling the previous record of 81,096 held by Forza Horizon 5. The game was launched in premium version on May 15 and had a broader release on May 19. The standard edition is available on Game Pass, which may increase the total player count beyond half a million. The game's Japan setting and strong review scores have contributed to its popularity.
Search