Passer au contenu

Nous améliorons nos opérations pour mieux vous servir. Les commandes sont expédiées normalement depuis Laval, QC. Questions? Contactez-nous

Bitcoin accepté au paiement  |  Expédié depuis Laval, QC, Canada  |  Soutien expert depuis 2016

CHIP_INIT Critical

Goldshell – Chip Init Failed on Startup

chip init fail | ASIC enumeration error | hashboard scan

Critical — Immediate action required

Affected Models: KD-BOX - KD-BOX Pro - KD-BOX II - KD5 - KD6 - KD-MAX - LT5 - LT5 Pro - LT6 - LT-LITE - HS5 - HS-BOX - CK5 - CK6 - CK-BOX - Mini-DOGE I/II/III+ - ST-BOX - KA-BOX - AL-BOX II

Symptoms

  • Web UI loads at `http://<ip>` and reports `Status: running` but hashrate is stuck at `0 H/s` or oscillates between `0` and a tiny fraction of nameplate
  • cgminer log (`/tmp/cgminer.log`) shows repeating `chain 0 init fail`, `chain 1 init fail`, `hashcore init fail`, `ASIC detect failed`, or `chain disabled`
  • Cold air comes out the exhaust under "load" - the ASICs are not generating heat because they're not running work
  • Wallplug power draw stuck in the 40-70 W range instead of nameplate (e.g., KD-BOX should pull 205 W, LT5 ~3,200 W, KD6 ~3,300 W)
  • Green LED solid OR slow-blinking; not the red+green firmware-brick pattern
  • `find.goldshell.com` discovery tool DOES find the device (firmware is alive)
  • Miner Status page in the web UI lists fewer ASICs than the hashboard's nameplate count, OR shows them all at frequency `0 MHz`
  • Pool dashboard at Dxpool / F2pool / your pool shows the worker briefly online then idles to no shares
  • Issue started after one of: firmware upgrade, PSU swap, ambient temp drop below 18 °C, a power outage, or hardware move
  • PSU rail measures 12.0 ± 0.2 V at the barrel jack at idle but sags below 11.6 V at boot ramp (PSU brownout indicator)
  • One specific hashboard chain shows the init failure across multiple cold boots while the others come up - points at one failed PMIC or chip
  • Issue is reproducible from cold but recovers after the miner has been powered for 30+ minutes (warm-restart-only failure pattern)

Step-by-Step Fix

1

Full cold boot, 60 seconds, in a 20-25 °C room. Unplug 12V at the wall AND the barrel. Wait a full minute. Move the miner to a warm room if it's been cold (garage, basement, outbuilding). Re-plug, boot, watch the LEDs for 5 minutes. Open the web UI - confirm hashrate climbs to nameplate. If it climbs: you were in a warm-reboot stall or a cold-start chip-init failure that's resolved by ambient temperature. Update the firmware over Ethernet only and call it done.

2

Check the 12V PSU is healthy. Multimeter on the 12V barrel jack. Idle = `12.0 ± 0.2 V`, stable, no wandering. If your PSU reads below `11.8 V` at idle, swap it - your `chip init fail` is likely a PSU brownout, not silicon. Spare known-good 12V bricks: Mean Well LRS-350-12 (CAD $35-50) with a 5.5x2.1 mm DC barrel pigtail covers most of the BOX series; LT5/KD6/LT6 require a higher-rated brick.

3

Confirm the firmware version is not on the regression list. Web UI -> System -> About. If KD-BOX is on `2.1.1` or `2.1.3`, or ST-BOX is on `2.2.0`: that is your bug. Plan for firmware downgrade in Tier 2. If your firmware is current and not on a known-broken release, the cause is hardware-side - plan for Tier 2 / 3.

4

Look for visible dust in the intake. If the intake fins are visibly clogged with dust or pet hair, the chip-init failure may be a thermal-protection abort during the boot ramp, not a true enumeration fail. Compressed air through the fins (with the miner unplugged), then cold-boot. KD5/KD6, LT5, and CK6 are the most dust-vulnerable. We see this often enough on the bench that it's worth ruling out before tearing into hashboards.

5

Cycle the network cable and switch port. Long shot but cheap: a flaky Ethernet PHY can starve the watchdog process and cascade into a chip-init reset on some firmware revisions. Swap the Ethernet cable for a known-good Cat 6, plug into a different port on the switch. Cold-boot. This fixes <5% of chip-init failures but it costs you 30 seconds.

6

Firmware downgrade via the web UI (if your firmware is on the regression list and the UI accepts the older image). Web UI -> System -> Firmware Upgrade -> upload `KD-BOX_2.0.6.bin` (or the model-specific known-good image). For KD-BOX: `2.0.6` is the proven stable. For ST-BOX: roll back from `2.2.0` to `2.1.x`. Use Ethernet, not WiFi. Wait the full 5-10 minute upgrade window without touching the miner. If the UI rejects the older image with "Version not supported" or hangs partway: stop, do not retry, and switch to SD-card recovery.

7

SD-card firmware recovery if the web UI won't downgrade. Email `hello@goldshell.com` for the `burn-<model>.img` recovery image (24-72h response time). Quality 8-16 GB microSD card (Samsung EVO Plus or SanDisk Industrial only - cheap cards corrupt mid-flash). Flash with balenaEtcher. Insert into the SD slot on the control board (4-8 screws to access on most BOX series). Cold-boot. The recovery script overwrites the eMMC with a known-good firmware.

8

Reseat the hashboard ribbon cables. Power off, unplug 12V. Top cover off (4-8 Phillips screws). Each hashboard has a ribbon cable to the control board - unplug each, mark Pin 1 with a Sharpie, inspect for bent pins/corrosion, reseat firmly until the connector clicks. On LT5 / LT5 Pro / KD6 with multiple hashboards, do all of them in one session - if one ribbon is marginal, the others are likely close behind. Reassemble, cold-boot, check the cgminer log via SSH.

9

Swap the 12V PSU with a known-good brick under a load tester. If diagnostics suggested a PSU brownout, do a clean swap. KD-BOX / Mini-DOGE / HS-BOX / ST-BOX / CK-BOX accept any clean 12V / 4-6 A brick. KD5/KD6/LT5/LT6 need 350W+ rated bricks. Use a load tester (or just monitor the rail with a multimeter while the miner ramps) to confirm the new PSU stays above `11.7 V` even at peak ramp. If the issue is gone post-swap, you've confirmed PSU brownout.

10

Deep clean the heatsink fins and the fan. With the miner case open: compressed air at low pressure through every fin, both directions. Soft brush for stubborn deposits. Q-tip + isopropyl on the fan blades and the bearing housing. KD5/KD6 and LT5 in particular have fin spacing that traps dust hard. After cleaning: reassemble, cold-boot, re-check. If the chip-init failure was thermal-throttle-driven, this fixes it.

11

Verify hashboard temperatures with the miner running for 30 minutes. SSH in: `cgminer-api stats` or read `/tmp/cgminer.log` for per-board temps. Expected: hashboard temp 50-75 °C under load; chip junction (where reported) 70-90 °C. If a specific board reports >95 °C at low load or doesn't respond at all on the API: that board is failing - go to Tier 3.

12

Per-chain isolation test. If your model has multiple hashboards (LT5, LT5 Pro, KD5, KD6, LT6, CK5, CK6), unplug the ribbon to one board at a time and cold-boot. The miner reports fewer chains but should hash on the boards that are connected. If hashing works with board A unplugged but not with B unplugged, board B has the dead chain. Document which board is which and label the chassis. Now you know which board to swap or repair.

13

Run a 1-hour stability test after any fix. Reconnect the pool, monitor wallplug power and the cgminer share log. Expected: power at nameplate ± 5%, share acceptance rate >95%, no `chain X init fail` events, no resets. If you get reset events even at low frequency, escalate - a half-fixed chip-init failure tends to come back hard.

14

PMIC inspection and replacement. If a specific chain consistently fails init, the PMIC on that hashboard is the most common culprit. With the case open, locate the PMIC chip on the suspect hashboard (typically a TI / MPS / RT-series part labeled near the chip rail). Power-on-and-touch-test (briefly!) for excessive heat. Reflow with a hot-air station at `260-280 °C` if you suspect cold solder joints; replace if the PMIC is visibly damaged or out of spec on a multimeter check. Source replacement PMICs from LCSC or Digi-Key.

15

Single dead chip identification and reflow. If the failure points at a single chip in the chain (specific position fails enumeration consistently), that chip is dead or has cold-joint fractures. Hot-air station at `260-280 °C`, pre-heat the board to `~120 °C` to avoid thermal stress, lift and replace the chip. ICT560 (LT5/LT5 Pro) replacements are sourceable from Goldshell parts brokers or Zeus BTC; KD-series Kadena chips and HS-BOX Handshake silicon are harder to find but not impossible. Cost: chips are CAD $5-25 each; the labor is the expensive part.

16

Verify with a thermal camera if you have one. Post-reflow / post-replacement: power on, run for 5 minutes, scan with a thermal camera. Every working chip should be 70-90 °C; cold spots = dead chips. This also catches solder bridges that didn't get caught visually.

17

Hashboard replacement. If 2+ chips on a board are dead, or the PMIC swap didn't recover the chain, the board itself is the path of least resistance. LT5/LT5 Pro hashboards: CAD $200-350 from Zeus BTC, BT-Miners, or D-Central when in stock. KD5/KD6 hashboards: $250-450. CK5/CK6 hashboards: similar. Some BOX-series hashboards (Mini-DOGE, ST-BOX) are not sold separately - the whole control + hashboard assembly is one unit and you swap the unit, not the board.

18

When to stop DIY and ship it to the bench. You've reached Tier 4 if: (a) all Tier 1-3 fixes ran clean and the miner still won't enumerate, (b) you've got a soldering iron in your hand and a chip in tweezers and you're about to YouTube-tutorial your way through a BGA reflow, (c) the hashboard shows compound damage (multiple dead components, scorched VRM, lifted pads), or (d) replacement parts cost approaches the resale price of the miner. Cost-benefit becomes obvious - sometimes the right answer is to harvest the miner for parts and buy current-gen hardware.

19

What D-Central does at the bench for chip-init failures. Cross-model expertise spanning the entire Goldshell lineup. Bench-validated PSU, ribbon cables, and `burn-<model>.img` library to rule out external causes in 30 minutes. Per-chain isolation, PMIC inspection with a TI BQ-series rail analyzer, BGA reflow at 220 °C preheat / 245 °C reflow on dead chips, hashboard swap when reflow fails, 24-hour burn-in at nameplate before shipping back. Canadian shop, no ship-to-China lottery, no 6-week turnaround.

20

Ship safely. ESD bag the control board if you're sending control only. Foam-cradle the whole miner if you're shipping the unit. Include: model, serial, firmware version, exact symptom (cgminer log paste preferred), what you've tried, ambient temp where it ran. Ship to D-Central HQ. Typical turnaround for Goldshell chip-init failures: 5-10 business days from receipt, faster when the diagnosis lands on PSU/firmware/ribbon (Tier 1-2 territory) and slower when it lands on chip-level rework.

When to Seek Professional Repair

If the steps above do not resolve the issue, or if you are not comfortable performing these repairs yourself, professional service is recommended. Attempting advanced repairs without proper equipment can cause further damage.

Related Error Codes

Still Having Issues?

Our team of Bitcoin Mining Hackers has been repairing ASIC miners since 2016. We have seen it all and fixed it all. Get a professional diagnosis.