Local-Models

Running Ollama on a Jetson Orin Nano: From Gemma 3 to Gemma 4 with GPU Acceleration

9 June 2026·1990 words·10 mins

Ai Ollama Local-Models Ai Llm Homelab Inference Edge-Ai Gemma4 Jetson

The journey from Gemma 3 4B (17.5 tok/s CPU) to Gemma 4 E2B (25.5 tok/s GPU) on the Jetson Orin Nano. Covers model testing, QAT quantization, the JetPack CUDA rabbithole, CMA traps, and the keepalive architecture that makes it all work.

Boost Your AI Workflow: A Guide to Using Ollama, OpenwebUI, and Continue

25 July 2024·1510 words·8 mins

Ai Ai Llm Ollama Continue Open-Webui Local-Models Coding-Assistant

Run local LLMs with Ollama, manage conversations via OpenwebUI, and get AI code completion in VS Code with Continue. A complete local AI stack setup guide.

Leveraging Fabric and LM Studio for Advanced AI

6 June 2024·1150 words·6 mins

Ai Ai Llm Fabric Lm-Studio Local-Models Prompt-Engineering

How to run Fabric with local models through LM Studio for custom AI patterns and workflows. Setup, integration, and practical use cases for prompt-based automation.

↑