Skip to content

We're upgrading our operations to serve you better. Orders ship as usual from Laval, QC. Questions? Contact us

Bitcoin accepted at checkout  |  Ships from Laval, QC, Canada  |  Expert support since 2016

OC_UNSTABLE Warning

ASIC Miner – Overclocking Instability

Overclocking instability — dashboard HW% drifts above the ~1.5% healthy baseline, effective hashrate falls behind nameplate, per-chain `HW error` / `nonce error` log lines accumulate, and the miner intermittently reboots or rolls back autotune. Root causes span silicon-lottery ceiling, voltage-domain collapse, PSU sag, thermal drift, and firmware autotune regressions.

Warning — Should be addressed soon

Affected Models: All Antminer models with overclock or autotune profiles: S9, S17, S19, S19 Pro, S19j, S19j Pro, S19 XP, S19 XP Hydro, S19k Pro, S21, S21 Pro, T19, T21, L7, KA3, D9.

Symptoms

  • Rig-wide `HW%` drifted above its historical 0.5-1.5% baseline and is now climbing or pinned above 2-3%
  • One hashboard's chain-level `HW%` clearly worse than the others (stock firmware: chain 0/1/2 rollup)
  • Effective pool-reported hashrate lags nameplate by 5-20% even though the dashboard claims target hashrate
  • `kern.log` / cgminer / bmminer shows repeated `HW error`, `nonce error chain X`, or `hash unit fail` lines
  • Rejected or stale share rate on the pool side climbed after an OC/UV profile change
  • Intermittent cold reboots every 5-60 min during mining, never during idle — PSU click-dip audible at reboot
  • Per-chain temperature drifted up 3-5 °C with no change in ambient
  • Autotune target keeps rolling back — firmware steps target down within minutes of ramp-up
  • PSU fan at high duty during steady-state mining, not just startup — a voltage-sag tell
  • A chain reports `find 0 asic` or chip count below nameplate after a reboot cycle following an OC push
  • On DCENT_OS / Braiins OS+ / LuxOS / Vnish: one or more chip positions flagged red or showing isolated >20% HW%
  • Symptoms appeared immediately after a profile change (silicon/profile cause) or crept up gradually over weeks (thermal/cap/firmware-regression cause)

Step-by-Step Fix

1

Cold-reboot the miner — 30 seconds breaker-off, not a soft restart. Clears wedged driver state that lingers after an OC profile change and gives NVRAM a clean read on next boot. A one-shot instability from a transient can end here. If the symptoms return within 30 minutes of hashing, the profile or hardware is the cause — keep going.

2

Revert to nameplate stock profile: no OC, no UV, autotune disabled or set to stock target. Run 15 minutes. If HW% returns under 1.5% and reboots stop, your tune was past silicon lottery — rebuild slower in Tier 2. If instability persists at stock, the cause is hardware, thermal, power, or firmware — continue diagnosing.

3

Shop-vac the intake filter, wipe the grille, and verify no furniture, curtains, or dust buildup within 15 cm of the front of the miner. Dust on filters adds 3-6 °C to inlet temp — enough to destabilize a previously-stable OC. Confirm intake temp with an IR thermometer at the grille itself (not room-middle). Target ≤ 35 °C standard, ≤ 40 °C Hydro.

4

Roll firmware ±1 version within your hardware revision. Check the control-board sticker for BHB42 / BHB56902 / BHB68601 etc before flashing — wrong-revision FW bricks the board because newer Bitmain builds are cryptographically signed and don't downgrade cleanly without a physical SD-card flash. See D-Central's custom-firmware guide for the signing details.

5

Measure PSU output at the board connector under full hashing load with a multimeter on DC. Target ≥ 13.8 V sustained on an S19-class miner. Below that means tired PSU or undersized circuit. Swap with a known-good PSU before suspecting the hashboard. APW9/APW12 units degrade with age; a PSU that looked fine at startup can sag meaningfully under steady-state continuous load.

6

Measure line voltage at the panel under load. 240 V split-phase should be 235-245 V; 208 V commercial should be 202-212 V. If you're on 110 V with an S19, that's your answer — the chassis cannot sustain stock hashrate on 110 V and any OC push collapses the tune immediately. See antminer-s19-110v-insufficient-power. Move to a dedicated 240 V circuit before chasing firmware.

7

Re-seat every hashboard data and power cable. Breaker off, disconnect each connector, inspect for blackening, bent pins, corrosion, or oxidation; reconnect firmly and listen for the click. An almost-seated connector produces intermittent voltage on one domain — the silent killer of OC stability that masquerades as firmware misbehavior.

8

Rebuild the OC from stock, slowly. Start at nameplate (S19: 110 TH/s @ 3250 W; S21: 200 TH/s @ 3500 W; L7: 9500 MH/s). Step +50 MHz — half the usual community default — and wait 15 minutes per step for stability. Stop at the step *before* HW% crosses 2%. That's this miner's silicon-lottery ceiling; it varies per chip, per wafer, per board. No two identical-model miners share the same ceiling.

9

Swap hashboards between slots. On 3-board miners label slots 0/1/2 with tape. Move the suspect board to a known-good slot and observe HW% for 15 minutes. If the bad HW% follows the board, the board is the problem; if it stays in the slot, the control board or cable path owns it. This is the cleanest way to split board-level from control-path problems before you reach for a reflow station.

10

Replace the CR2032 RTC battery on the control board. A dead battery fails TLS cert expiry checks during pool auth; on some firmwares the cascade looks like OC instability or bmminer crashes. $2 fix, 5-minute job. Replace on any Antminer older than two years that has started acting weird, regardless of the leading symptom.

11

Flash DCENT_OS — D-Central's own open-source Antminer firmware. Per-chip HW%, closed-loop autotune (steps up and observes per chip rather than trusting an open-loop V-F table the way stock does), stratum v2, fully open-source (source: https://github.com/DCentralTech/DCENT_OS). Alternatives: Braiins OS+, LuxOS, Vnish. All four expose per-chip HW% and beat stock on OC stability. Run 20 minutes after flash to stabilize; then record the worst 3-5 chip positions by HW%.

12

Enable closed-loop per-chip autotune on DCENT_OS or Braiins OS+, conservative target (95% of nameplate). The tuner probes each chip at its own V-F curve rather than running all chips on one open-loop curve — this alone resolves most OC instability that stock firmware cannot. Braiins documents 10-15% efficiency gain and 38-45% power savings at an 80 TH/s S19 underclock (braiins.com/blog/smarter-clock-speed-management-for-bitcoin-miners-maximize-roi-minimize-waste).

13

Reflow the worst-performing chip. If per-chip HW% isolates 1-3 chips dominating the metric on a specific board, remove the heatsink, flux the BGA, preheat bottom-side to ~150 °C, top-side hot air 310-330 °C for ~30 s. Let cool naturally, re-apply thermal paste, reassemble. BM1398 / BM1362 / BM1368 packages tolerate a reflow cycle well. This is the lowest-risk chip-level repair on S19-class boards.

14

Re-apply thermal paste on the whole board with Arctic MX-6 or Thermal Grizzly Kryonaut. Uniform thin layer, not a glob. Pay attention to PCH and voltage-domain ICs — dried pads there drift chip junction temperatures on a tune that used to be stable. Paste that was fresh 18-24 months ago no longer protects the silicon on an OC profile that depended on it.

15

Visually inspect voltage-domain capacitors. Bulging electrolytics or cracked MLCCs near the PMIC need replacement — soldering iron + hot-air job, not a reflow job. Caps drift with age under continuous 80 °C operation, and that drift narrows the domain's stable voltage window, which destabilizes OC profiles that depended on the original margin.

16

Stop DIY and book a D-Central repair when: per-chip HW% isolates the same chip position on two different boards (PCB-level issue); OR a PMIC / voltage-domain IC is suspected (short, heat damage, sanity check failing); OR a reflowed chip's HW% returned within 30 days (silicon failing); OR visible cap bulging, MLCC cracking, or burnt smell; OR a cryptographically-signed firmware downgrade was interrupted mid-write (soft-brick). https://d-central.tech/services/asic-repair/ — turnaround 5-10 business days.

17

D-Central bench process: test fixture with programmable load, per-chip isolation using official Bitmain test binaries, chip replacement from graded salvage or new-old-stock BM1398/BM1362/BM1368/BM1387 inventory, reflow + reseal, 24-hour nameplate burn-in before return. We diagnose the failure mode first, quote second — no fix-then-pray.

18

Ship safely. Hashboards in anti-static bags, double-boxed with ≥ 5 cm foam on every side. Include a note: observed symptoms, firmware version at the time of instability, the exact OC profile that triggered it (frequency + voltage + tuning target), last-known-good configuration, your contact. This saves us diagnostic time and saves you money. Canada / US / international accepted.

When to Seek Professional Repair

If the steps above do not resolve the issue, or if you are not comfortable performing these repairs yourself, professional service is recommended. Attempting advanced repairs without proper equipment can cause further damage.

Related Error Codes

Still Having Issues?

Our team of Bitcoin Mining Hackers has been repairing ASIC miners since 2016. We have seen it all and fixed it all. Get a professional diagnosis.