From the course: AI Trends
NVIDIA DGX Spark
- [Instructor] This is the NVIDIA DGX Spark, also known as the world's smallest supercomputer. You can use it to run pretty large language models. You can also use the DGX Spark for image and video generation and to power developer tools. Now, local AI development isn't new, but this device offers a few features that were previously pretty hard to come across. First of all, the Spark has a lot of memory to work with, 128 gigabytes of unified memory available to the graphics processing unit. This means you can run pretty large language models, sometimes with up to 200 billion parameters. The device can also be used for fine tuning. Another helpful feature of the Spark is that it leverages NVIDIA's CUDA platform for parallel computing. This gives developers and researchers access to a broad ecosystem of models and fine tuning libraries. The unit comes with a Linux-based operating system, which you can use by plugging in a mouse, keyboard, and monitor. I have a feeling, though, that many developers are going to connect to the Spark using a different computer. This is a very powerful way of working with AI models because it means you can offload a lot of the computational work through the Spark. I can definitely see this becoming useful in industries where there's a great emphasis on data privacy and security. This can also be very useful in AI research and development to supplement cloud compute. Now, setting up the spark does require some technical know-how. This doesn't mean less technical users can't leverage this technology. A less technical team would need some IT support in order to set up the Spark unit and maintain it. Innovations like this can help democratize AI research and development by empowering more individuals and teams to solve more problems with emphasis on data privacy. To learn more about open weight models and local AI development, check out my course on developing with GPT-OSS.
Contents
-
-
-
Microsoft Ignite8m 23s
-
Google Gemini 39m 52s
-
NVIDIA DGX Spark2m 10s
-
Kimi K2: The thinking open-source model6m 59s
-
Agentic development changes with Cursor 24m 39s
-
Agentic browsers5m 12s
-
Claude Code on the web3m 33s
-
Claude Skills6m 10s
-
(Locked)
OpenAI Agent Builder6m 43s
-
(Locked)
OpenAI Apps SDK6m 45s
-
(Locked)
GitHub MCP Registry6m 22s
-
(Locked)
GPT-5-Codex: A coding paradigm shift3m 41s
-
(Locked)
MCP in ChatGPT Developer Mode7m 23s
-
(Locked)
Styles and modes in ChatGPT, Claude, and VS Code6m 34s
-
(Locked)
Context Engineering3m 27s
-
(Locked)
Vibe coding with Windsurf6m
-
(Locked)
OpenAI GPT-511m 41s
-
(Locked)
OpenAI gpt-oss7m 11s
-
(Locked)
Perplexity Comet7m 34s
-
(Locked)
ChatGPT agent6m 56s
-
(Locked)
OpenAI's o3-pro model4m 32s
-
(Locked)
Gemini Diffusion model7m 47s
-
(Locked)
Codex54s
-
(Locked)
Bolt45s
-
(Locked)
Vibe Coding3m 39s
-
(Locked)
Generative Engine Optimization (GEO)4m 9s
-
(Locked)
Github Copilot Spaces5m 22s
-
(Locked)
Model Context Protocol (MCP)4m 32s
-
(Locked)
Physical AI2m 17s
-
(Locked)
ChatGPT 4.53m 25s
-
(Locked)
Coding in Cursor1m 15s
-
(Locked)
OpenAI's Operator3m 20s
-
(Locked)
Choosing an OpenAI model5m 9s
-
(Locked)
DeepSeek7m 20s
-
(Locked)
OpenAI canvas4m 58s
-
(Locked)
Agentic computer use from Claude1m 58s
-
(Locked)
NPUs vs. GPUs vs. CPUs2m 45s
-
-
-