<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>GPU on Rik Kisnah - Blog</title><link>https://www.rik-kisnah.ai/tags/gpu/</link><description>Recent content in GPU on Rik Kisnah - Blog</description><generator>Hugo</generator><language>en</language><lastBuildDate>Sat, 28 Feb 2026 09:00:00 -0700</lastBuildDate><atom:link href="https://www.rik-kisnah.ai/tags/gpu/feed.xml" rel="self" type="application/rss+xml"/><item><title>Attention Is All You Need — And All You Need to Know: An Infrastructure Engineer's Guide to the Paper That Built Your GPU Cluster</title><link>https://www.rik-kisnah.ai/posts/attention-is-all-you-need-and-all-you-need-to-know/</link><pubDate>Sat, 28 Feb 2026 09:00:00 -0700</pubDate><guid>https://www.rik-kisnah.ai/posts/attention-is-all-you-need-and-all-you-need-to-know/</guid><description>The 2017 Transformer paper replaced sequential processing with parallel attention and launched the GPU infrastructure boom. The same attention problem exists in your repository. Here is how to fix it.</description></item><item><title>The Complete NCCL Reference Guide: Commands, Errors, and Troubleshooting for OCI GPU Infrastructure</title><link>https://www.rik-kisnah.ai/posts/nccl-complete-reference-guide/</link><pubDate>Sat, 15 Nov 2025 00:00:00 -0700</pubDate><guid>https://www.rik-kisnah.ai/posts/nccl-complete-reference-guide/</guid><description>Disclaimer: This article reflects my personal research and analysis based on publicly available information and is not representative of my employer&amp;rsquo;s official position.
Executive Summary NCCL (NVIDIA Collective Communication Library) is the cornerstone of distributed GPU computing, enabling efficient communication between GPUs in multi-node clusters. This comprehensive guide provides every NCCL command, parameter, error message, and troubleshooting technique you need for successful deployment on Oracle Cloud Infrastructure (OCI).
Table of Contents Why NCCL Exists Understanding Collective Communications NCCL Fundamentals Complete NCCL Commands Reference All NCCL Environment Variables NCCL Error Messages and Solutions OCI GPU-Specific Configurations Advanced Troubleshooting Scenarios Performance Tuning Reference Quick Reference Tables Why NCCL Exists The Distributed Training Challenge Modern AI models have grown exponentially in size and complexity.</description></item><item><title>OCI Powers America's AI Future at NVIDIA GTC 2025: Supercomputers, AI Factories, and Strategic Leadership</title><link>https://www.rik-kisnah.ai/posts/oci-nvidia-gtc-2025-ai-supercomputing/</link><pubDate>Wed, 29 Oct 2025 10:00:00 -0700</pubDate><guid>https://www.rik-kisnah.ai/posts/oci-nvidia-gtc-2025-ai-supercomputing/</guid><description>Disclaimer: This article reflects my personal research and analysis based on publicly available information and is not representative of my employer&amp;rsquo;s official position.
Highlights from NVIDIA GTC DC 2025: OCI at the Heart of the AI Revolution Oracle Cloud Infrastructure (OCI) demonstrated its central role in shaping the future of AI infrastructure at NVIDIA&amp;rsquo;s GTC conference in Washington, DC (October 28-29, 2025). The event showcased groundbreaking announcements, technical innovations, and strategic partnerships that position OCI as a cornerstone of America&amp;rsquo;s AI leadership.</description></item><item><title>From First Principles to Zettascale: How OCI's GPU/RDMA Architecture Redefines AI Infrastructure</title><link>https://www.rik-kisnah.ai/posts/summary-gpu-oci-first-principles-blog/</link><pubDate>Sun, 26 Oct 2025 22:43:15 -0400</pubDate><guid>https://www.rik-kisnah.ai/posts/summary-gpu-oci-first-principles-blog/</guid><description>Disclaimer: This article reflects my personal research and analysis based on publicly available information and is not representative of my employer&amp;rsquo;s official position.
In the rapidly evolving landscape of AI infrastructure, one company has quietly revolutionized how we think about GPU computing at scale. Through a series of &amp;ldquo;First Principles&amp;rdquo; engineering blogs and groundbreaking deployments, Oracle Cloud Infrastructure (OCI) has demonstrated that starting from fundamental physics and systems design—rather than following industry conventions—can yield extraordinary results.</description></item></channel></rss>