benchmarking

Tech Optimizer
June 26, 2026
EDB has introduced new features for its Postgres AI platform, including an agentic database and converged analytics capabilities, allowing enterprises to run AI agents alongside transactional workloads on a unified PostgreSQL foundation. The platform includes governance tools that position control mechanisms at the data layer and integrates AI processing with operational data, enabling businesses to connect live records with AI systems without transferring sensitive information. The agentic database can monitor over 200 metrics, identify issues, suggest changes, and apply fixes automatically based on user-defined policies. It consolidates various data types through a single SQL interface, significantly accelerating database tuning processes and enhancing application performance. EDB has also expanded its analytics capabilities with a zero-ETL architecture for real-time analysis and large-scale warehousing. EDB PG AI for ClickHouse targets real-time analysis, while EDB PG AI for WarehousePG focuses on historical analysis at petabyte scale. The platform claims up to 30 times faster query performance compared to legacy systems and improved scaling efficiency. EDB's platform integrates vector search and retrieval for AI agents, demonstrating lower query latency and higher retrieval accuracy than competitors. NTT East is using EDB PG AI for AI-driven network operations, while the governance feature manages agent access at the data querying point using native Postgres roles and row-level security. The platform can be deployed on-premises, in hybrid environments, or across cloud infrastructures, with partnerships including Dell, IBM, Nvidia, Red Hat, and Supermicro.
Tech Optimizer
June 18, 2026
Lakebase Search is a hybrid vector and full-text retrieval system integrated into Lakebase, now in beta on AWS and Azure. It utilizes two Postgres extensions: lakebase_vector and lakebase_text, allowing agents to operate on a single data backend. Agents manage four times more databases than human users and require real-time access to indexed data. The system features a tiered architecture that stores cold data in cost-effective object storage while keeping active data in local NVMe, significantly reducing costs. The lakebase_vector extension offers 32x compression for vectors, allowing a billion vectors to fit into under 10GB of RAM. The lakebase_text extension provides BM25 relevance ranking without high RAM usage. Benchmarking shows that Lakebase Search can efficiently handle large-scale workloads, achieving high recall and low latency with reduced resource requirements compared to traditional architectures. The system allows for continuous search experimentation and dedicated retrieval engines for each agent, enhancing operational efficiency and scalability.
AppWizard
June 15, 2026
THQ Nordic announced the Gothic 1 Remake for PC, utilizing Unreal Engine 5. The benchmarking was conducted on a high-end setup featuring an AMD Ryzen 9 7950X3D processor and various GPUs, including AMD Radeon RX 6900XT and NVIDIA RTX 4090. The game offers extensive graphics settings and supports technologies like NVIDIA DLSS 4.5 and AMD FSR 3.1. It does not include a built-in benchmark tool, so testing was done in a demanding scene. At 1080p with Very High settings, it consistently achieves 60FPS across many GPUs. At 1440p, leading NVIDIA GPUs maintained frame rates above 60FPS, while AMD's RX 9070XT and RX 7900XTX averaged 70FPS and 68FPS, respectively. For Native 4K at Very High settings, the NVIDIA RTX 5090 delivered a minimum of 65FPS. The game scales well with graphics presets, achieving over 60FPS at lower settings. DLSS 4.5 improved performance significantly, with frame rates exceeding 100FPS in some scenarios. The graphics are impressive, though character models may not match the quality of top-tier titles. Minor traversal stutters were noted but did not significantly impact gameplay. Overall, the game performs well across various PC configurations, especially at 1080p with compatible monitors.
AppWizard
May 26, 2026
Google launched the Android Bench benchmarking portal in March to help software developers evaluate AI models for Android app development. The leaderboard was updated last week to include open-weight models and new metrics for latency, tokens, and cost. Matthew McCullough, Google's VP of Product for Android Development, stated that the goal is to provide a benchmark for evaluating large language models (LLMs) in Android development. As of May 18, GPT 5.5 is the top AI model for Android app development, with Gemini 3.1 Pro and GPT 5.4 ranked as joint leaders. Android Bench evaluates LLMs based on real-world challenges and tasks sourced from public GitHub repositories. Other benchmarking tools in the Android ecosystem include Jetpack Microbenchmark, Jetpack Macrobenchmark, Firebase Performance Monitoring, Android Vitals, Apptim, and Android Performance Analyzer. The overall benchmark score on Android Bench is calculated using four core values: Confidence Interval Range, Average Latency Score, Average Total Tokens Score, and Average Cost. The test harness for Android Bench is publicly available on GitHub.
Winsage
May 12, 2026
Microsoft is reassessing its strategy following criticism and a decline in user satisfaction related to its AI tool, Copilot. The company is unwinding some Copilot integrations and reallocating resources to address issues with Windows 11, particularly focusing on improving File Explorer. Windows K2 will enhance File Explorer's performance, utilizing the WinUI 3 framework, which aims to streamline operations. Recent benchmarks show significant reductions in resource consumption for File Explorer, including 41% fewer allocations, 63% fewer transient allocations, 45% fewer function calls, and a 25% reduction in time spent in WinUI code. Improvements are expected to be rolled out soon.
Tech Optimizer
May 11, 2026
Databricks has enhanced its managed Postgres service using lakebase architecture, achieving write throughput improvements of up to five times. Traditional Postgres durability mechanisms, such as full page writes (FPW), impose overhead that can inflate Write-Ahead Log (WAL) volume by up to 15 times in write-heavy scenarios. The lakebase architecture decouples compute from storage, allowing compute nodes to stream WAL records to a distributed quorum of safekeepers, mitigating the risk of torn pages. Databricks has addressed read performance challenges by transferring image generation to the storage layer, which reconstructs data pages by identifying the latest materialized image and replaying corresponding WAL deltas. This results in a 94% reduction in WAL traffic and significant performance enhancements, with write throughput increasing by over 4.5 times on a 32-vCPU instance and WAL generation decreasing from 58KB per transaction to under 4KB. In production settings, steady-state WAL generation dropped from 30 MB/s to 1 MB/s, and read latencies improved by 30% to 50%. The optimization has been seamlessly integrated across Databricks' Serverless and Neon databases without requiring restarts or interruptions for customers.
Winsage
April 28, 2026
FinalWire has released AIDA64 version 8.30, featuring the AIDA FPS module for real-time FPS data capture in DirectX 11 and 12 games, available exclusively in the Extreme edition. The update includes an optimized performance test for APX SHA3 for Intel Diamond Rapids and Nova Lake processors, support for Turing 4.6 and 12.3-inch LCD displays, compatibility with Intel Core Ultra 250K Plus and 270K Plus, enhanced support for Intel Wildcat Lake and Nova Lake processors, preliminary support for AMD Zen 6 architecture APUs, support for Aqua Computer Ampinel and Thermal Grizzly WireView Pro II sensors, extended support for Adaptec RAID controllers, USB-NVMe pass-through support for Realtek RTL9220 controllers, support for EXPO 1.2 memory profiles, and detailed GPU information for Intel Arc Pro B65 and B70 as well as NVIDIA RTX Pro 4500 Blackwell Server Edition. The update enhances support for Intel's Nova Lake CPUs and introduces a new SHA3 benchmark optimized for APX architectures. It also lays groundwork for support of AMD's upcoming Zen 6 Medusa Point mobile processors and introduces support for AMD's EXPO 1.2 technology. AIDA64 version 8.30 discontinues support for 32-bit Windows and Windows XP x64, requiring users on those platforms to revert to an earlier version. The new web-based AIDA64 SensorPanel Tools allows users to create image sets for SensorPanel Manager. The update is available across the Extreme, Engineer, Business, and Network Audit editions.
Tech Optimizer
April 17, 2026
Efforts to merge storage roles into a single solution are ongoing, particularly with Amazon S3's durability and cost-effectiveness. In PostgreSQL, achieving a durable commit requires flushing the Write-Ahead Log (WAL) before signaling transaction completion, which can take tens of microseconds on high-performance NVMe drives but extend to milliseconds on slower storage. This latency impacts Online Transaction Processing (OLTP) systems and user response times. Benchmark studies show that systems with faster local storage outperform those with slower alternatives as workloads exceed memory capacity. The fsync operation in PostgreSQL is a commitment rather than a simple write, with enterprise-grade SSDs performing better due to power-loss protection. Read operations also face challenges, as PostgreSQL's need for small, latency-sensitive reads conflicts with S3's design for larger, higher-latency requests. As the working set exceeds memory, storage latency becomes a critical performance factor. Modern managed PostgreSQL systems typically do not place object storage in the critical commit path, instead maintaining a fast log or cache close to the database while relegating colder data to remote storage. Recent PostgreSQL developments, such as asynchronous I/O support in version 18, aim to leverage fast storage more effectively. S3 is valuable for tasks like WAL archiving and backups, but these should be kept separate from the commit path to avoid resource contention. The solution involves using both NVMe and S3, with fast storage managing commits and cache misses, while object storage handles archives and backups. PostgreSQL performs best when hot and cold storage functions are clearly delineated.
AppWizard
April 11, 2026
Cities Skylines 2 has faced performance issues and limited content since its launch. Iceflake Studios is working on improvements, collaborating with Unity to address performance challenges. The game is CPU-intensive, with unique challenges due to its dynamic nature. Key areas for enhancement include rendering performance, simulation performance, and pathfinding. Recent changes have simplified citizen modeling and improved pathing, leading to a better gameplay experience. The game's Steam rating has shifted from 'Mixed' to 'Mostly Positive.' Iceflake has developed a benchmarking tool to collect performance data, which will be integrated into an upcoming patch.
AppWizard
April 9, 2026
The "Android Bench," Google's benchmark for evaluating AI models in Android app development, has been updated, with OpenAI's GPT 5.4 and GPT 5.3 Codex now sharing the top ranking with Gemini. The benchmark evaluates models based on criteria such as compatibility with Jetpack Compose, use of Coroutines and Flows, and integration with Room and Hilt. The latest rankings are as follows: 1. GPT 5.4: 72.4% 2. Gemini 3.1 Pro Preview: 72.4% 3. GPT 5.3-Codex: 67.7% 4. Claude Opus 4.6: 66.6% 5. GPT-5.2 Codex: 62.5% 6. Claude Opus 4.5: 61.9% 7. Gemini 3 Pro Preview: 60.4% 8. Claude Sonnet 4.6: 58.4% 9. Claude Sonnet 4.5: 54.2% 10. Gemini 3 Flash Preview: 42% 11. Gemini 2.5 Flash: 16.1% The rankings have not changed since the initial assessment in late February, and the latest models were evaluated in mid-March. The findings should be interpreted cautiously, as real-world performance may vary based on specific workflows and project requirements.
Search