Archives

2025

02 Nov Silicon Valley's New Secret: Chinese Base Models
26 Oct Speeding up local LLM inference 2x with Speculative Decoding
21 Oct Open Weights, Borrowed GPUs
11 Oct Harnessing GPT-OSS Built-in Tools