Ai

Running Ollama on a Jetson Orin Nano: From Gemma 3 to Gemma 4 with GPU Acceleration

9 June 2026·1990 words·10 mins

Ai Ollama Local-Models Ai Llm Homelab Inference Edge-Ai Gemma4 Jetson

The journey from Gemma 3 4B (17.5 tok/s CPU) to Gemma 4 E2B (25.5 tok/s GPU) on the Jetson Orin Nano. Covers model testing, QAT quantization, the JetPack CUDA rabbithole, CMA traps, and the keepalive architecture that makes it all work.

OpenCode Go: Can $10/Month Open Models Replace Frontier APIs?

30 May 2026·3103 words·15 mins

Ai Opencode Llm Benchmarks Ai-Coding Open-Models Agentic

12 open coding models benchmarked against Claude and GPT-5.5. DeepSeek V4 Flash handles 70% of tasks at 12x cheaper than DeepSeek V4 Pro. MiMo-V2.5 is now the cheapest high-volume option at 30,100 req/5h. Qwen3.7 Max leads on SWE-bench Pro (60.6%). Kimi K2.6 leads on agentic coding. Here’s how to route between them.

Unveiling the World of AI Chatbots: A Diverse Exploration

3 March 2025·417 words·2 mins

Ai Ai Chatbot Llm Tools Claude Gemini

Beyond ChatGPT: a curated list of 20+ AI chatbot platforms covering frontier models, research tools, and developer-focused interfaces with their unique strengths.

Boost Your AI Workflow: A Guide to Using Ollama, OpenwebUI, and Continue

25 July 2024·1510 words·8 mins

Ai Ai Llm Ollama Continue Open-Webui Local-Models Coding-Assistant

Run local LLMs with Ollama, manage conversations via OpenwebUI, and get AI code completion in VS Code with Continue. A complete local AI stack setup guide.

Leveraging Fabric and LM Studio for Advanced AI

6 June 2024·1150 words·6 mins

Ai Ai Llm Fabric Lm-Studio Local-Models Prompt-Engineering

How to run Fabric with local models through LM Studio for custom AI patterns and workflows. Setup, integration, and practical use cases for prompt-based automation.

↑