language model

Winsage
June 11, 2026
Microsoft is testing a new feature that allows developers to implement local language models on non-Copilot+ PCs running Windows 11. The Language Model APIs can now operate on any Windows 11 device with a compatible Nvidia GPU, specifically targeting GeForce RTX 30 series and newer models with at least 6 GB of video RAM. This initiative aims to democratize access to AI capabilities across a broader range of Windows 11 PCs, although not all PCs will gain access to exclusive Copilot+ AI functionalities.
Winsage
June 11, 2026
Microsoft has made its local Language Model APIs for Windows 11 accessible on non-Copilot+ PCs, requiring an NVIDIA RTX 30 series GPU with at least 6GB of VRAM. Copilot+ PCs, which debuted on June 18, 2024, require 16GB of RAM, an SSD, and a minimum of 40 TOPS NPU. The new APIs allow broader access to AI functionalities, including AI-powered text formatting, summarization, text rewriting, text-to-table conversion, and general prompt generation, through a compact language model called "Phi Silica." Users with compatible NVIDIA GPUs can utilize these features locally, enhancing privacy compared to cloud-based models. However, some features remain exclusive to devices with NPU chips.
Tech Optimizer
June 6, 2026
Microsoft announced the public preview of Azure HorizonDB, a fully managed PostgreSQL-compatible database designed for agentic AI workloads, during Microsoft Build 2026 in San Francisco. HorizonDB features a "database-as-logs" architecture, allowing for sub-millisecond multi-zone commit latency and independent scaling of compute and storage. It incorporates a Rust-based storage engine, native DiskANN vector search, and in-database AI model invocation. Additionally, Microsoft launched Web IQ, a web-grounding API layer integrated into Microsoft Copilot and OpenAI's ChatGPT, which provides passage-level structured evidence objects rather than full documents. Web IQ is model-agnostic and aims to enhance information density and reduce costs. Both services are currently in limited availability, with HorizonDB open for preview signups across five Azure regions.
Winsage
June 3, 2026
Researchers at Pureinfotech discovered a new AI Components page in the Windows 11 Insider Experimental Preview build 26300.8553, which provides detailed information about local AI models on Copilot+ PCs. The page includes details such as the publisher, version, installation date, size, and total usage of each model. Currently, users can only uninstall the Phi Silica model, while it's uncertain if other AI components will have similar capabilities. The build also introduces expanded customization options for the Start menu, enhanced search functionality, and support for touch swipe gestures. The management page aims to address user concerns about the automatic installation of AI features and the need for transparency regarding their usage. Although the feature is not officially accessible yet, it indicates potential plans for future updates.
Winsage
June 3, 2026
Build is an annual event focused on connecting with the global developer community and sharing innovations. Feedback from developers emphasizes the need for a frictionless and adaptable platform for seamless development across various environments. Key announcements include: - Coreutils for Windows: A suite of Linux-like command line utilities available natively on Windows. - WSL containers: A forthcoming public preview feature for creating and interacting with Linux containers. - Windows Development Skills: Structured knowledge for building native Windows apps using WinUI3 and WinApp CLI, now generally available. - Intelligent Terminal: An experimental feature integrating context-aware intelligence into the terminal for improved debugging and task execution. - Windows Developer Configurations: A WinGet-powered setup for distraction-free development, now generally available. - Windows 365 with Developer Configuration: A pre-configured cloud-based development environment in public preview. For security, the announcements include: - Microsoft Execution Containers (MXC) SDK: A policy-driven execution layer for agent access and containment, available in early preview. - Agent 365 integration: Enhances security for agents with protections from Defender, Entra, Intune, and Purview, available in preview. - OpenClaw: Runs securely on Windows using MXC for easy installation of agent connections. - NVIDIA's OpenShell: A deployment package for autonomous agents on Windows, built on MXC. - Windows 365 for Agents: Secure Cloud PCs for executing enterprise workflows, now generally available. For AI capabilities: - Aion 1.0 Instruct: An on-device small language model for local workloads, available soon. - Expanded Windows AI APIs: Now accessible on more Windows 11 PCs, enhancing local AI capabilities. - Surface RTX Spark Dev Box: A device for developers featuring NVIDIA RTX Spark silicon for local AI workloads. - DGX Station for Windows: A powerful AI supercomputer for developing agents, coming in Q4. - Project Solara: A platform for agent-driven experiences, simplifying development across environments. - New Speech Recognition API: Real-time, on-device transcription capabilities. - Expansion of Windows AI APIs across GPUs and CPUs: Now available for broader hardware support. Windows is enhancing its security framework to reduce risks and ensure robust protection for applications.
AppWizard
May 17, 2026
Oppo's Multi-X team has introduced X-OmniClaw, an open-source AI agent for Android that operates on the device without cloud processing. It uses the camera, screen, and voice functionalities to perform tasks across applications. Unlike cloud-based platforms, X-OmniClaw processes information locally, with the cloud serving as a supplementary resource. The architecture integrates three perception channels into a unified pipeline, allowing it to interpret scenes and user requests effectively. It transforms local data into semantic entries for long-term memory, processes gallery photos into descriptions, and filters out sensitive information. X-OmniClaw captures user behavior into reusable skills, enabling direct navigation to app pages through deeplinks. Demonstrations show its ability to retrieve product prices, assist with homework, and create highlight albums from photos. The project is built on the open-source HermesApp codebase and is accessible on GitHub. It draws inspiration from existing models, including Google's local model and ByteDance's UI-TARS, while enhancing functionality through on-device execution and structural XML data integration.
AppWizard
May 12, 2026
The RPCS3 development team has publicly addressed the influx of AI-generated pull requests (PRs) in their project, urging contributors to stop submitting what they call "AI slop code" and warning that they will ban those who do not disclose AI contributions. They expressed concern over poorly constructed PRs, particularly affecting their macOS build, and emphasized the importance of understanding the code being contributed. The team clarified that their issue is not with the use of AI code itself, but with the lack of disclosure. They have established new guidelines for AI contributions, allowing the use of AI tools for research and reverse engineering, but requiring contributors to fully understand and take ownership of their code. All communication with the team must come from human contributors, not AI.
Tech Optimizer
May 3, 2026
PostgreSQL 18 addresses the challenges of data quality and integration in artificial intelligence, emphasizing the importance of the relationship between AI embeddings and relational data. It features pgvector, a tool that helps maintain this relationship and reduces reliance on fragile code. The accompanying resources include working schemas, implementation scripts, and a demo showcasing the system's capabilities with evidence rows and explanations from a large language model. The approach combines semantic candidate retrieval with SQL filtering and business rule enforcement, enhancing the reliability of AI outputs and ensuring data accuracy.
Search