Ovi's Tech Blog

I love technology!

HOME
CATEGORIES
TAGS
ARCHIVES
ABOUT

Home Archives

Archives

Archives

2026

24 Jun Forecasting with Foundation Models: Capacity Planning and Incident Detection
20 Jun Running DeepSeek-V4-Flash at 700 tokens/s on 2x RTX Pro 6000
26 Apr Coding locally with Pi Coding agent and open weights models (April 2026 edition)
22 Feb What Happened When I Gave Luna Access to My Email
31 Jan Luna: An AI Assistant That Works While I Sleep
17 Jan Fixing RTX Pro 6000 Blackwell shutdowns with custom fan control

2025

30 Dec The age of hyper-personalized software
27 Dec Running MiniMax-M2.1 Locally with Claude Code on Dual RTX Pro 6000
25 Dec Guide on installing and running the best models on a dual RTX Pro 6000 rig with vLLM
21 Dec Injecting Knowledge into LLMs via Fine-Tuning
30 Nov Three Years of ChatGPT
16 Nov Getting Started with running LLM models locally
02 Nov Silicon Valley's New Secret: Chinese Base Models
26 Oct Speeding up local LLM inference 2x with Speculative Decoding
21 Oct Open Weights, Borrowed GPUs
11 Oct Harnessing GPT-OSS Built-in Tools

Recently Updated

Forecasting with Foundation Models: Capacity Planning and Incident Detection
Running DeepSeek-V4-Flash at 700 tokens/s on 2x RTX Pro 6000
Coding locally with Pi Coding agent and open weights models (April 2026 edition)
What Happened When I Gave Luna Access to My Email
Luna: An AI Assistant That Works While I Sleep

Trending Tags

llm vllm agents gpt-oss local-inference capacity-planning coding-agents deepseek email fine-tuning

© 2026 Ovidiu Dan. Some rights reserved.

Using the Chirpy theme for Jekyll.

Trending Tags

llm vllm agents gpt-oss local-inference capacity-planning coding-agents deepseek email fine-tuning

New content available