GGUF, Q4, Q8, fp16: A Pleb’s Guide to LLM Quantization
Quantization is lossy compression for LLMs — same idea as JPEG for photos. It's the reason a used 3090 runs 70B models…
We're upgrading our operations to serve you better. Orders ship as usual from Laval, QC. Questions? Contact us
Bitcoin accepted at checkout | Ships from Laval, QC, Canada | Expert support since 2016
// d-central.tech / ai
Self-hosted AI on hardware you already own. Credits all the open-source projects that made this possible. One more layer decentralized — the same sovereignty move Bitcoin made for money.
D-Central is shipping AI content and, soon, AI hardware and firmware built for the individual Bitcoiner who already thinks like a miner. DCENT_Inference OS and DCENT Heatbox AI are in closed beta under GPL-3.0, with public beta landing summer 2026. None of this replaces the Bitcoin core of the shop — the AI vertical is additive. Same hashcenter mindset, new compute workload.
Every post on this site slots into one of these five. Pick the one that matches where your head is at.
Sovereignty
Why self-hosted AI is the next cypherpunk battle.
Self-Hosting
The pleb's whole-stack guide to running AI locally.
Hardware
Which GPU runs which model. Buyer guides for the plebs.
Hashcenter
Hashcenter, not datacenter. Heat, sats, and sovereign intelligence under one roof.
Bitcoin × AI
When public miners left Bitcoin Hashcenters. Why plebs didn't.
Read these in order. By the end you’ll have a local model running on your box, a real UI in front of it, and enough theory to pick the right quant.
The narrative anchor. Why sovereign AI is the same move Bitcoin made for money.
Whole-stack overview — models, runners, hardware, the works.
Ten minutes from zero to a local model answering prompts on your own box.
A ChatGPT-shaped front end that talks to your Ollama node. No cloud.
GGUF, Q4, Q8, FP16 — what actually fits in your VRAM and why.
Newest posts across all six AI categories.
Quantization is lossy compression for LLMs — same idea as JPEG for photos. It's the reason a used 3090 runs 70B models…
Three excellent open-source runners. Three different plebs. llama.cpp is the foundation Gerganov built. Ollama wraps it for daemon simplicity. LM Studio wraps…
The terminal is fine for testing, unusable for daily driving. Open WebUI is the ChatGPT-style interface that plugs into your local Ollama…
Hut 8, Core Scientific, IREN, and TeraWulf are leaving Bitcoin Hashcenters to become AI datacenter operators. It's a rational pivot for public…
24 GB of VRAM at $600–$800 used. For LLMs under 70B parameters at Q4–Q5 quants, the RTX 3090 is still the pleb…
The mining shed is the hardest part of an AI Hashcenter — and you already have it. 240V service, airflow, sound isolation,…
Ten minutes, three commands, one evening. By the end you'll have Llama 3.1 or Gemma 3 running locally on your own hardware.…
Bitcoin ASICs dump nearly all their power as heat — which is why mining heaters are a category. GPUs doing LLM inference…
Long-form model pages — architecture, quantizations, hardware that runs them. Built on the dc_ai_model CPT.
Mistral AI's January 2025 24B model — Apache 2.0, competitive with Llama 3.3 70B, fits on a single…
Cohere's April 2024 RAG-native flagship — 104B dense, first-class grounded citation and tool use, CC-BY-NC 4.0.
OpenAI's November 2023 open ASR model — 1.55B params, MIT-licensed, the open reference for multilingual speech-to-text.
Black Forest Labs' August 2024 Apache 2.0 FLUX variant — 12B distilled to 1-4 steps for fast, commercially-open…
Stability AI's October 2024 MMDiT flagship — 2B (Medium) and 8B (Large) variants with dramatically improved prompt adherence…
Alibaba's May 2025 release — first open family with hybrid reasoning (toggle-able chain of thought), Apache 2.0 across…
A dedicated hardware CPT plus benchmarks database ships in v1. Until then, the foundational reads:
Heads up: a full hardware CPT plus benchmarks taxonomy lands in a later task on the roadmap.
We use cookies to improve your experience and analyze site traffic. Privacy Policy
Choose which cookies you allow. Essential cookies are required for the site to function and cannot be disabled.
Required for basic site functionality, shopping cart, and security. Always active.
Help us understand how visitors use our site so we can improve your experience. Includes Google Analytics.
Used to deliver relevant ads and track campaign performance across platforms.