Whatsminer M50S – Overclocking Instability
Warning — Should be addressed soon
Symptoms
- MinerTool shows 'Unstable at overclocked settings' or BTminer logs repeat 5110/5111/5112 frequency-up timeouts on SM0/SM1/SM2
- Realized hashrate 8-20% below 126 TH/s M50S nameplate (or M50S+/M50S++ High Performance target)
- Pool-side rejected share rate sustained above 2% (past the first-hour vardiff overshoot window)
- upfreq_test.log shows frequency downshifts back to a lower tier after completing the ramp - boards ramp then retreat
- Red or alternating red/green LED pattern during ramp, then green but hashrate never catches up
- One hashboard (commonly SM1, middle slot) reports per-board hashrate 15%+ below the other two
- Control board reboots or BTminer process restarts when High Performance mode engages
- Event log shows codes 272/273 (Warning of excessive power output) or 255/256 (Protection of over power output)
- Hashrate steady on Low or Normal but collapses when switched to High Performance
- Intake temperature climbs 4-6 C above baseline at the same ambient; fans ramp to 6000+ RPM trying to compensate
- Miner 'accepts' power-mode change but get_miner_info returns the previous mode after the ramp timeout
- api.log / chip-ID dump shows mismatched ChipID or ChipTemp distributions across the three boards (chip-bin mismatch)
Step-by-Step Fix
Hard power-cycle at the breaker for 30 seconds, then power back up and let the M50S complete its full boot plus ramp (5-8 minutes). Watch the LED cycle red-amber-green. This clears a wedged BTminer state that can persist after a failed ramp, a firmware update, or a network-flaky reboot - and it is the fastest Tier 1 win before you touch anything else.
In MinerTool, drop power mode from High Performance to Normal via Remote Ctrl. Reboot the miner. Observe for 20 minutes. If hashrate stabilizes at 126 TH/s +/- 3% with zero 5110-5112 codes, your chip bin / PSU / grid / thermals cannot support High Performance in this environment. Running Normal is not a workaround - it is the correct, stable setpoint for this specific unit.
Shop-vac the intake grille and clean the fan filters. Inspect for anything blocking the front 15 cm (furniture, curtains, boxes). Verify intake temperature at the grille with an IR thermometer - not room-middle, not the hallway. Dust on the intake side is the single most common cause of 'it was stable last month' M50S complaints because it raises inlet temp past the High Performance ceiling without you noticing.
Export logs via MinerTool: Remote Ctrl > Export Log. Five files come out: upfreq_test.log, power.log, miner.log, system.log, api.log. Archive them. upfreq_test.log is the one that matters for this error - it records each board's frequency ramp result and per-chip final frequencies. You will need it for Tier 3 work and for any D-Central repair ticket, so capture it now while the miner is still in the failing state.
Check your BTminer firmware version at support.whatsminer.com against the current release. If you are on a build in the approximately 20230101-20230601 range, update one step forward - there is a known High Performance ramp-overshoot regression in that window that MicroBT fixed in later releases. Verify the firmware signature and confirm the build is for your exact M50S hardware revision (M50S, M50S+, M50S++ are not firmware-interchangeable).
With a multimeter on DC, probe the PSU-to-hashboard connector (or the busbar) while the miner is mid-ramp to High Performance. Expect 13.8-14.2 V sustained. Anything below 13.5 V under load means the PSU cannot hold regulation - swap with a known-good M50S-spec PSU. Critical note: M30S PSUs are NOT cross-compatible with M50S - different ceiling, different firmware handshake, using the wrong one trips 201 (power-supply and configuration-file mismatch).
Measure AC input at the PSU input with the miner under full load. Target: 225-245 V on 240 V split-phase, 202-215 V on 208 V commercial. Sub-spec line voltage is the single most-diagnosed 'mystery ramp failure' cause on the D-Central bench. If your house voltage drops when the neighbour's A/C or a shop tool kicks on, you have found it - fix the circuit, not the miner.
Power off at the breaker. Re-seat all three hashboard data and power cables at both ends. Inspect for blackening, bent pins, or oxidation. M50S uses flat ribbon data cables to the control board - look for kinks or crush damage if the miner has been moved recently. Also check the adapter plate between the hashboard and the chassis, which Zeus Mining flags as a frequent miss on 53X hashboard-not-found escalations downstream of ignored overclock instability.
Label the three hashboard slots 0/1/2 with tape. Swap the suspect board to a different slot and re-run the ramp. If the 5110-5112 fault follows the board, that board is the culprit (chip bin ceiling, thermal paste drift, or a weak chip). If the fault stays with the slot, the control board, data cable, or adapter plate is the culprit. This is the cheapest definitive slot-vs-board isolation test on M50S without a test fixture.
Rebuild from Low to target: Low mode for 10 minutes, then Normal for 10 minutes, then High Performance for 20 minutes. Log at each step via MinerTool. This walks the voltage-frequency curve the way BTminer does internally but with human eyes on the intermediate steps. Stop at the last tier that holds stable for 20 minutes - that is your miner's true ceiling, regardless of what the power-mode selector claims it should be doing.
Re-apply thermal paste on all 198 chips across the three hashboards. Remove the heatsink, clean old paste with 99% IPA and lint-free wipes, apply a uniform thin layer of Arctic MX-6 or Thermal Grizzly Kryonaut. Re-torque the heatsink evenly. Replace thermal pads on the spacers if they look compressed or discolored. This alone recovers roughly 50% of 'drifting overclock' cases on 12+ month old M50S units - paste degradation is the silent killer of High Performance stability.
Pull per-chip final frequency from upfreq_test.log. Identify any chip consistently 50+ MHz below its siblings on the same board. Reflow it: flux the BGA, preheat bottom side to ~150 C, top-side hot air at 310-330 C for 30 seconds, let cool naturally, re-apply paste, reassemble. SM3H packages tolerate one reflow cycle well. A second reflow on the same chip rarely helps - at that point you are replacing the chip via a D-Central repair.
Visually inspect the voltage-domain MLCCs and electrolytic capacitors on the failing hashboard. Bulging caps, cracked MLCCs, or discoloration around the DC-DC converters need replacement before you re-flash or re-ramp anything. Continuing to run at High Performance on a cap-damaged board accelerates the damage and risks taking out the PSU - a $200+ failure from a $5 part. This is soldering-iron + hot-air work, not a reflow job.
Flash the last-known-good BTminer firmware for your exact M50S hardware revision. MicroBT publishes per-rev firmware on support.whatsminer.com - get it from the official portal, verify the signature (firmware signature verification is enforced on M50S/M60S generations), and confirm the build matches your revision. Wrong-rev firmware on an M50S can brick the control board. Do not cross-flash M50S+ firmware onto an M50S.
If you have a second M50S available, swap one hashboard between the rigs. If the 'failing' board ramps clean on the other rig, the problem is your rig's control board or slot, not the board. This is the cheapest way to definitively isolate slot vs board without a test fixture - essentially free if you already operate two M50S units, and the most valuable diagnostic data point you can generate before shipping for repair.
Stop DIY and book a D-Central ASIC Repair slot when: (a) two different boards in the same M50S fail ramp on the same chip position after a reflow, (b) visible cap damage or burnt-component smell, (c) a second reflow on the same chip fails within 30 days, or (d) get_miner_info shows chip-bin mismatch across boards and you cannot source matched-bin replacements. You are now in test-fixture territory - further DIY risks damaging adjacent components and wasting diagnostic effort.
D-Central bench process for M50S overclock instability: test fixture with programmable load, BTminer debug build exposing per-chip frequency targets, chip-bin matching from graded SM3H inventory, chip replacement (salvaged-grade or new-old-stock), full reflow and re-seal, and 24-hour burn-in at your intended power mode - not just Normal. We test at the mode you actually plan to run, because passing Normal burn-in does not mean passing High Performance in your environment.
Ship hashboards (not the full miner) in anti-static bags, double-boxed with at least 5 cm of foam on every side. Include the upfreq_test.log, power.log, and miner.log you exported in Step 4, plus a note with observed symptoms, firmware build, power mode in use at failure, and your intake / ambient temperature range. This saves D-Central diagnostic time, which saves you money on your repair invoice. Canada-wide shipping, 5-10 business days turnaround, US and international welcomed.
When to Seek Professional Repair
If the steps above do not resolve the issue, or if you are not comfortable performing these repairs yourself, professional service is recommended. Attempting advanced repairs without proper equipment can cause further damage.
Related Error Codes
Still Having Issues?
Our team of Bitcoin Mining Hackers has been repairing ASIC miners since 2016. We have seen it all and fixed it all. Get a professional diagnosis.
