software stack

Winsage
March 8, 2025
AMD is working to extend its ROCm software stack from Linux to Windows. Anush Elangovan, AMD's Vice President of AI Software, confirmed that the company is looking to broaden ROCm support to additional GPUs on Windows. Currently, ROCm on Windows supports only select AMD Instinct GPUs and a few Radeon GPUs, including the RX 7900 XT and XTX, which leaves owners of the newer RX 9000 series GPUs unsupported. The latest version for Windows is 6.2.4, and users have reported issues such as crashes and driver timeouts. If AMD adds support for RDNA 4 GPUs, Windows users running deep-learning workloads would benefit. Additionally, tinygrad will receive two MI300X boxes from AMD to enhance its AI solutions.
Winsage
December 6, 2024
The Applied Sciences team has developed Phi Silica, a small language model (SLM) built for power efficiency, inference speed, and memory efficiency on Windows 11 Copilot+ PCs with Snapdragon X Series NPUs. Phi Silica is designed for on-device use, supports multiple languages, and has a 4k context length. Microsoft announced that developers will have access to the Phi Silica API starting January 2025. Copilot+ PCs can perform over 40 trillion operations per second, achieving significant performance improvements when connected to the cloud. Phi Silica is based on a Cyber-EO compliant derivative of Phi-3.5-mini, and its architecture includes a tokenizer, detokenizer, embedding model, transformer block, and language model head. Context processing consumes only 4.8 mWh of energy on the NPU, a 56% improvement in power consumption compared to running on the CPU. The model uses 4-bit weight quantization for efficiency and offers a rapid time to first token and high accuracy across languages. It was developed using QuaRot for low-precision inference, which achieves 4-bit quantization with minimal accuracy loss. Weight sharing and memory-mapped embeddings were used to optimize memory usage, resulting in a ~60% reduction in memory consumption. A sliding window for context processing and a dynamic KV cache were introduced to expand context length. The model has undergone safety alignment and is subject to Responsible AI assessments and content moderation measures.
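To make the idea of 4-bit weight quantization concrete, here is a minimal sketch in Python. It shows generic symmetric round-to-nearest quantization with one scale per group of weights; it is not QuaRot and not Microsoft's implementation. The function names (`quantize_4bit`, `dequantize_4bit`, `pack_nibbles`) and the group size are hypothetical, chosen only for illustration.

```python
from typing import List, Tuple


def quantize_4bit(weights: List[float], group_size: int = 4) -> Tuple[List[int], List[float]]:
    """Map floats to signed 4-bit codes in [-8, 7], one scale per group.

    Illustrative assumption: plain round-to-nearest, no rotation or calibration.
    """
    codes: List[int] = []
    scales: List[float] = []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        max_abs = max(abs(w) for w in group)
        # Scale so the largest magnitude in the group maps to code 7.
        scale = max_abs / 7.0 if max_abs > 0 else 1.0
        scales.append(scale)
        for w in group:
            q = round(w / scale)
            codes.append(max(-8, min(7, q)))  # clamp to the 4-bit signed range
    return codes, scales


def dequantize_4bit(codes: List[int], scales: List[float], group_size: int = 4) -> List[float]:
    """Reconstruct approximate floats from 4-bit codes and per-group scales."""
    return [c * scales[i // group_size] for i, c in enumerate(codes)]


def pack_nibbles(codes: List[int]) -> bytes:
    """Store two 4-bit codes per byte (low nibble first)."""
    out = bytearray()
    for i in range(0, len(codes), 2):
        lo = codes[i] & 0xF
        hi = (codes[i + 1] & 0xF) if i + 1 < len(codes) else 0
        out.append(lo | (hi << 4))
    return bytes(out)


weights = [0.72, -0.31, 0.0, 0.11, 1.4, -1.37, 0.2, 0.05]
codes, scales = quantize_4bit(weights)
restored = dequantize_4bit(codes, scales)
packed = pack_nibbles(codes)
```

Eight weights fit in four bytes after packing, plus one scale per group. In this sketch the reconstruction error of each weight is bounded by half of its group's scale, which is why smaller groups trade extra scale storage for accuracy.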
Winsage
August 7, 2024
Microsoft Windows is the leading desktop operating system, running on over a billion machines globally. Windows ME is often considered its least well-regarded version. Microsoft initiated a research project called Midori, which aimed to create a cloud-based operating system that would decouple software from hardware and which led to the development of a new programming language, M#. Midori became a project within Microsoft's Unified Operating System group in 2013 but was discontinued in 2015, with Microsoft stating that insights from it would inform future projects. Midori set out to build a new software stack and prioritize cloud computing, challenging the traditional Windows architecture. Joe Duffy, a former project member, has shared insights about Midori since its cancellation. Although Midori was never launched, its focus on cloud computing and security likely influenced later Microsoft projects like Azure and OneDrive. Windows remains Microsoft's main operating system focus, with anticipation building for a potential Windows 12 release.