IMM_PUMP_FAIL Critical

Immersion Cooling Pump Failure Creates Hashboard Hotspot

Immersion-cooling circulator pump has failed or is operating below design flow. Dielectric fluid stratifies (hot top, cool bottom), top-bank ASIC hashboards throttle and burn while bottom-bank stays at spec. Critical — minutes-to-hours to permanent silicon damage.

Critical — Immediate action required

Affected Models: Tank circulator pumps in single-phase immersion systems (Iwaki MD/MX, March, Liquidstack ChassisTank circulators, Engineered Fluids reference designs, custom DIY builds). Affects every ASIC submerged in the tank — Antminer, Whatsminer, Avalon, Bitaxe arrays — top-of-tank position cooks first.

Quick answer

Immersion-cooling circulator pump has failed or is operating below design flow. First step: Cut load immediately.

Symptoms

Top-bank ASICs in the tank reporting `ERR_TEMP_HIGH`, `OVER_TEMP`, or per-chip imbalance while bottom-bank ASICs run at spec
Thermocouple top-vs-bottom of tank reads delta `> 10 °C`; healthy single-phase immersion holds delta under `5 °C` with active circulation
Return-line flow meter reads `0 L/min` or sustained below the loop's design flow (typically `≥ 30 L/min` per `30 kW` of dissipation)
Pump head audibly silent (no hum, no whine) — total stop — or cavitating / chattering under partial flow
VFD / drive panel shows fault code: overcurrent, undervoltage, phase loss, or over-temperature on the pump motor
Magnetic-drive pump audibly slipping — characteristic high-frequency rattle from the magnet coupling, zero output flow
Visible fluid level rise / fall change in the reservoir or sight glass that's inconsistent with thermal expansion alone
IR camera of the tank top surface shows a clear hot zone over the top hashboards while bottom-row hashboards are uniform
Hashrate dashboard shows top-rack miners throttling or self-shutdown, bottom-rack miners at full nameplate — geographic, not random
Pump motor casing hotter than ambient by `> 20 °C` (overcurrent / locked-rotor / dragging bearing)
Strainer cup or pre-filter shows pressure drop above design (gauge across filter) — restriction starving the pump
Fluid in the sight glass appears cloudy, copper-tinted, or has visible particulate — chemistry / debris loading on the pump

Step-by-Step Fix

Cut load immediately. Power off every miner submerged in the tank from the PDU or breaker — not the dashboard. With circulation dead, every minute at hash load cooks more silicon. The whole tank's hashrate is worth less than one $400 hashboard you're about to lose. Free, takes 30 seconds, saves the most money of any step on this page.

Verify the pump motor is energised. Voltmeter across the motor terminals (or VFD output) — confirm three-phase or single-phase voltage is actually present at the pump head. Tripped breaker / blown fuse / dropped contactor accounts for a meaningful share of 'dead pump' tickets that are not the pump at all. Free, multimeter only.

Read the VFD / drive panel for active or last-recorded fault codes. Common codes: `OC` (overcurrent — locked rotor / debris jam), `UV` (undervoltage — facility supply), `OL` (overload — bearing drag or fluid viscosity high), `PHL` (phase loss — wiring), `OT` (motor over-temp — duty cycle / cooling). Note the code; it dictates the rest of the diagnostic branch.

Validate the temperature differential top-to-bottom of the tank with a calibrated thermocouple or thermal probe. Healthy single-phase circulation: delta `< 5 °C`. Stratified loop: `> 10 °C`. Dead pump under continued load: `> 20 °C` and climbing. The number tells you how long you've been stratified — useful for triaging which hashboards to inspect first.

Pull the strainer / suction-side filter. With the pump isolated and tank fluid level dropped below the pickup, remove the strainer cup. Inspect for debris: ASIC fasteners, gasket fragments, chip-cap shards, biofilm. A blocked suction strainer drops pump output to a fraction of nameplate — the pump is fine, the strainer is starving it. Clean, reinstall, restart, re-test flow.

Check for a magnetic-drive coupling slip. With facility power off, manually rotate the pump shaft / impeller magnet. It should spin freely and re-couple to the drive magnet on power-up. If the impeller magnet is decoupled (rotor spinning, magnet stationary) you'll hear a distinctive high-frequency rattle. Mag-drive slip is a known failure mode under cold-start high-viscosity conditions and after debris jams.

Failover to the redundant pump if the loop is built with one. Most production single-phase immersion designs ship (or should ship) with an active+standby pair on a manifold with isolation valves. Manually transfer with the valve handles, energise the standby, verify flow returns, then plan repair on the failed unit at leisure. Fifteen minutes if the manifold is built right; if the manifold isn't built right, that's the lesson — see Prevention.

Bleed air from the pump volute and high points of the secondary loop. After any tank top-up, valve isolation, or pump replacement, air pockets at the volute crown will starve the impeller and produce zero flow despite a healthy pump. Crack the volute bleed plug or the highest manifold tee, vent until steady fluid runs. Five minutes, prevents an hour of misdiagnosis.

Test pump shaft bearings before assuming pump-end failure. Energise the bare pump (suction valve closed briefly) and listen — smooth hum = bearings OK. Grinding / metallic wash = bearings shot. On mag-drive pumps the wear sleeve / journal bearing is the most common failure point, replaceable as a kit ($60-$180 USD typical, $80-$240 CAD).

Megger the motor windings. With the motor isolated, measure insulation resistance phase-to-ground and phase-to-phase at `500 VDC` or `1000 VDC` per the motor nameplate. Healthy: `> 10 MΩ`. Suspect: `1-10 MΩ`. Failed (water / fluid ingress / winding short): `< 1 MΩ`. Common failure: dielectric fluid migrated past a failed shaft seal on non-mag-drive designs, shorted the windings.

Replace the pump head / impeller assembly with an OEM-spec equivalent — same flow curve, same fluid compatibility, same connection geometry. Mag-drive over shaft-seal every time on dielectric immersion (no leak path). Materials: 316 stainless or PTFE-lined wetted parts, ceramic / silicon-carbide journal bearing. Upsize for `+25 %` flow margin over current-load demand to absorb future capacity growth and resist clogging-driven derate.

Pull and inspect every hashboard from the top half of the tank. Top-bank position is where the heat accumulated when circulation died — these are the boards most likely cooked. Look for discoloured silicon, lifted BGA solder balls, scorched solder mask, and cap drift. Bench-test before re-immersing. Replacement-or-rework call is cheaper than re-immersing a marginal board and watching it fail in the next stratification event.

Sample the dielectric fluid for contamination. Look for colour shift (clear → yellow / amber / copper-tinted = oxidation or copper leaching), particulate count, and acid number on synthetic fluids per the manufacturer's spec sheet. Engineered Fluids and LiquidStack publish acceptance limits. Out-of-spec fluid degrades pump bearings, attacks ASIC heatsinks, and reduces dielectric strength.

Install a redundant pump pair if the tank doesn't have one. The fix is mechanical: an active pump and a standby pump on a manifold with isolation valves and a check valve, both wired to the VFD with auto-transfer logic on flow loss. Cost is dwarfed by one cooked top-bank in a 30 kW tank. Mining Hacker default — never run a single-pump immersion loop in production.

Add flow alarms and tank-stratification monitoring. Inline turbine or paddle-wheel flow meter on the return line wired to a relay that powers off the PDU on flow loss; thermocouples top and bottom of tank wired to a comparator alarm at `> 8 °C` delta. Mining-rig PLCs / open-source controllers (Sonoff / Shelly / ESP32) all do this for under $200. The alarm is what saves the silicon — not the pump's reliability.

Right-size the secondary heat reject. The most common upstream cause of a chronically-overworked pump is an undersized dry cooler / fluid-to-air heat exchanger forcing the loop to run hotter than design. Hotter fluid = lower viscosity = same flow, but higher tank ΔT and longer time-to-stratify when flow is lost. Dual-coil dry-cooler with N+1 fan banks recommended for production-grade heat reject.

Stop DIY and book D-Central's immersion service when: more than one hashboard from the tank shows thermal damage, the dielectric fluid is out of spec, you've had two pump failures in 12 months (systemic problem, not a single component), or you need engineering review of a tank build. We carry mag-drive replacements, fluid testing relationships, and bench infrastructure for multi-board diagnostics.

Document the failure for warranty and insurance. Photograph the pump-out condition, save the VFD fault history, log the thermocouple delta time-series, and bag any debris found in the strainer. Pump manufacturers and dielectric fluid suppliers will both want documentation for a warranty claim. Include facility-power voltage logs covering the 24 hours before failure — undervoltage events are a common warranty defence.

Recommission the loop end-to-end before re-energising miners. Pump on, bleed, run 30 minutes at no-load, verify flow at meter ≥ design, verify thermocouple delta `< 5 °C` with miners off but fluid circulating. Only then re-energise miners — and ramp from one rack at a time, verifying delta stays under `8 °C` at each step. Cold-start a 30 kW load all at once and you can stratify before the loop even reaches steady state.

Re-confirm prevention scaffolding before walking away: redundant pump online + tested, flow alarm wired to PDU shutoff, thermocouple delta alarm at `8 °C`, dry-cooler at design capacity, strainer service interval logged, dielectric fluid sample queued for next quarter. Without these, you're scheduling the next failure — and you already paid the tuition on this one.

When to Seek Professional Repair

If the steps above do not resolve the issue, or if you are not comfortable performing these repairs yourself, professional service is recommended. Attempting advanced repairs without proper equipment can cause further damage.

All Repair Services

Frequently Asked Questions

What does the IMM_PUMP_FAIL error mean?

Immersion-cooling circulator pump has failed or is operating below design flow. Dielectric fluid stratifies (hot top, cool bottom), top-bank ASIC hashboards throttle and burn while bottom-bank stays at spec. Critical — minutes-to-hours to permanent silicon damage. Commonly reported on: Tank circulator pumps in single-phase immersion systems (Iwaki MD/MX, March, Liquidstack ChassisTank circulators, Engineered Fluids reference designs, custom DIY builds). Affects every ASIC submerged in the tank — Antminer, Whatsminer, Avalon, Bitaxe arrays — top-of-tank position cooks first..

Can I fix the IMM_PUMP_FAIL error myself?

This is an advanced repair best handled by someone experienced with ASIC hardware. Start with: Cut load immediately. Power off every miner submerged in the tank from the PDU or breaker — not the dashboard. With circulation dead, every minute at hash... If you are not equipped for board-level work, D-Central can diagnose and repair it at our Laval bench.

How much does it cost to repair?

A DIY repair typically runs $20-$3,500 CAD depending on which part the fault traces to. D-Central can also diagnose and quote a mail-in bench repair.

What parts might I need to fix this?

Common replacement parts for this fault: 18-pin Data Cable for Antminer, Hashboard Thermal Paste (Cyan), STASIC Hashboard MultiTester Pro. The exact part depends on diagnosis - measure first.

Key Terms in This Fault

Jump to the full definition of the technical terms involved in this fault:

Hashboard Thermal throttling Hashrate Firmware Undervolting Heatsink Thermal paste UART

Related Error Codes

Own your firmware — DCENT_OS (Antminer first)

DCENT_OS is D-Central’s open-source, GPL-3.0 firmware effort, now in public beta on Antminer (SHA-256) hardware — signed S9 and S19j Pro (Zynq/XIL) images are free to download. It is experimental and not production-ready. We build on the shoulders of the open-firmware projects that came before us, and we are starting with Antminer before widening hardware support. If you run Antminer gear, or just want firmware you can fully own and audit, grab the beta image. This is a free public beta, never a pre-order — collection only, we will not email you anything else yet.

I agree to D-Central storing my email to contact me about this. See our privacy policy.

Printable quick-reference cards

ASIC Miner Error-Code Quick-Reference Card — print-to-PDF one-pager
Stratum Share-Rejection Error Decoder Card — print-to-PDF one-pager
ASIC PSU & Connector Pinout Quick-Reference Card — print-to-PDF one-pager

Still Having Issues?

Our team of Bitcoin Mining Hackers has been repairing ASIC miners since 2016. We have seen it all and fixed it all. Get a professional diagnosis.

ASIC Repair Services Browse All Errors

Immersion Cooling Pump Failure Creates Hashboard Hotspot

Symptoms

Step-by-Step Fix

When to Seek Professional Repair

Frequently Asked Questions

Key Terms in This Fault

Related Error Codes

Own your firmware — DCENT_OS (Antminer first)

Printable quick-reference cards

Still Having Issues?

Find parts by failure

Related products, repair, and setup paths

Immersion Cooling Pump Failure Creates Hashboard Hotspot

Symptoms

Step-by-Step Fix

When to Seek Professional Repair

Frequently Asked Questions

Key Terms in This Fault

Related Error Codes

Immersion Cooled Miner - Fluid Temperature Too High

Antminer S19 Hydro - Coolant Temperature Too High

Whatsminer M30S - Water Cooling Leak Detection

Antminer S19 - ASIC Chip Temperature Imbalance

Antminer S19 - Hashboard Not Detected (0 ASIC)

Antminer - Hashboard Short Circuit

ASIC Miner - Thermal Paste Degradation

Antminer S19 XP - Temperature Too High

Whatsminer M60S - Temperature Too High

Antminer KA3 - Temperature Too High

Own your firmware — DCENT_OS (Antminer first)

Printable quick-reference cards

Still Having Issues?

Find parts by failure

Related products, repair, and setup paths