I have an instrument built based on a WC32.
It has a pyranometer, low-level light sensor and 240-360nm UV sensor all driving analog inputs, I2C sensor for humidity and temperature and three DS18B20 (two for air temperature, one for internal case temperature).
Over the first year, it's operated adequately, requiring the occasional powercycle (once a week or so), but since the weather started warming up a couple of months ago, it's been stopping dead in its tracks anything up to 8 times a day!
Usually starts around 13:00 local, frequently peaks around 16:00 or so, then improves and rarely fails after about 19:00.
The temperatures outside during this time are frequently from 30 to 40 degrees C.
The device, when built, was powered via PoE. 48V coming up the ethernet to an end-span PoE terminator. That was supplying 12V DC to the WC32. WC32 switching a low-current (100mA) 12V fan via open-drain FET.
The internal case temperature was reading about 2 deg C higher than ambient overnight and up to 10 deg C higher than ambient when the sun was striking the widest face of the case.
Thinking perhaps the POE device was failing under the temperature, I replaced it with a passive PoE injector/splitter (midspan) and 12V supply. Absolutely no change.
I added a solar shield - gloss white stainless steel sheet, about 20mm off the case. This dropped the case temperature to about 3 deg above ambient in sun, but the problem persisted unchanged.
I double-checked all my firewall rules, spent ages doing TCPDUMP and ensuring there was absolutely no odd traffic to or from it. I reconfigured the WC32 to use my local NTP timeserver and DNS server, then completely blocked all outside traffic from being able to get to the WC32. The problem persisted.
It became evident that it would usually stop responding "soon after" it did a DHCP request. Not exactly co-incident, but the DHCP transaction was almost always the last thing that happened before it stopped responding.
When the WC32 was "dead", it doesn't respond to ICMP echo-request or reply, or to tcp connection requests on port 80.
On a whim, I changed it from DHCP to static IP. It still had the same address, just that it was set in its internal configuration rather than always being given the same address by the DHCP server. Interestingly, it now ran for 3 days straight, without failing! I thought I was onto a winner!
But then this afternoon, it died at 12:30, and again at 14:30, and again 5 more times! So that wasn't it.
When it stops talking, the network LINK light remains on on the switch, it's not like it's lost power...
Anyone got any similar experience, clues, hints or cures?!?
It has a pyranometer, low-level light sensor and 240-360nm UV sensor all driving analog inputs, I2C sensor for humidity and temperature and three DS18B20 (two for air temperature, one for internal case temperature).
Over the first year, it's operated adequately, requiring the occasional powercycle (once a week or so), but since the weather started warming up a couple of months ago, it's been stopping dead in its tracks anything up to 8 times a day!
Usually starts around 13:00 local, frequently peaks around 16:00 or so, then improves and rarely fails after about 19:00.
The temperatures outside during this time are frequently from 30 to 40 degrees C.
The device, when built, was powered via PoE. 48V coming up the ethernet to an end-span PoE terminator. That was supplying 12V DC to the WC32. WC32 switching a low-current (100mA) 12V fan via open-drain FET.
The internal case temperature was reading about 2 deg C higher than ambient overnight and up to 10 deg C higher than ambient when the sun was striking the widest face of the case.
Thinking perhaps the POE device was failing under the temperature, I replaced it with a passive PoE injector/splitter (midspan) and 12V supply. Absolutely no change.
I added a solar shield - gloss white stainless steel sheet, about 20mm off the case. This dropped the case temperature to about 3 deg above ambient in sun, but the problem persisted unchanged.
I double-checked all my firewall rules, spent ages doing TCPDUMP and ensuring there was absolutely no odd traffic to or from it. I reconfigured the WC32 to use my local NTP timeserver and DNS server, then completely blocked all outside traffic from being able to get to the WC32. The problem persisted.
It became evident that it would usually stop responding "soon after" it did a DHCP request. Not exactly co-incident, but the DHCP transaction was almost always the last thing that happened before it stopped responding.
When the WC32 was "dead", it doesn't respond to ICMP echo-request or reply, or to tcp connection requests on port 80.
On a whim, I changed it from DHCP to static IP. It still had the same address, just that it was set in its internal configuration rather than always being given the same address by the DHCP server. Interestingly, it now ran for 3 days straight, without failing! I thought I was onto a winner!
But then this afternoon, it died at 12:30, and again at 14:30, and again 5 more times! So that wasn't it.
When it stops talking, the network LINK light remains on on the switch, it's not like it's lost power...
Anyone got any similar experience, clues, hints or cures?!?