QNAP reboots every 16 hours or so

7,349

Solution 1

I could not find anything in the logs as per Wesley's answer. As a result I have gone on-site and swapped the QNAP chassis for a spare chassis I had and put the old drives into the new chassis.

It seems to have been stable for over 2 days now, whilst still under the same sort of load. I suspect the hardware has failed due to an overheating issue, as it was in a reasonably warm location for a very long time.

Solution 2

if you enable ssh login then

ssh admin@<qnap_ip_addr_here>
password: admin

then look at /var/log

grep ERR *log

look at hal_lib.log and hal_lib.log.bak specifically

look for stuff like this. I don't think my HDD was 60C and am suspicious about whether sometimes HDD temp is read incorrectly. It's possible these are setting thresholds not the actual readings. Not sure.

hal_lib.log:Get_Temp_Threshold() called, SYS_ERROR_TEMP=70
hal_lib.log:Get_Temp_Threshold() called, HDD_ERROR_TEMP=60
hal_lib.log:Get_Temp_Threshold() called, CPU_ERROR_TEMP=85
hal_lib.log:Get_Temp_Threshold() called, CPU_ERROR_TEMP=85

I'm also curious if other people see this:

manaRequest.cgi:Tue Oct 21 16:45:48 2014
PD_Is_Exist: can't retrieve port_id value!
hal_daemon:Tue Oct 21 16:45:59 2014
root_get_fan_speed(978): NOT implement.
se_sys_get_SAS_HBA_info(3606):System Error.
disk_manage.cgi:Tue Oct 21 16:46:03 2014

you may want to go inside the QNAP qui (hardware) and disable smart fan speed setting and switch to manual speed (medium always?)

when it says "root_get_fan_speed(979): NOT implement." I'm suspicious. Since fan speed seems correct in the gui. ~9k when I choose medium, and >10k when high speed is chosen

Solution 3

Connect to your QNAP with a terminal and look in /mnt/HDA_ROOT/.logs. Specifically, kmsg. You should see a lot of deep information for your perusal, interpretation, and ultimate edification.

Another option is going to the Web UI >> Select System Administration >> System Logs >> tabs across the top will give you UI access to various files.

As for what to look for, it's usually a crap shoot of what looks bad, and what Google tells you is worse.

Share:
7,349

Related videos on Youtube

Franz Wong
Author by

Franz Wong

Currently a Site Reliability Engineer at Take 2 Interactive. Formerly a Site Reliability Engineer for the Stack Exchange network. Prior to my work for Stack Exchange I worked for a small software developer in Sydney, Australia.

Updated on September 18, 2022

Comments

  • Franz Wong
    Franz Wong almost 2 years

    I have a QNAP TS-412. It has 4x 4TB WD Red drives in RAID10.

    The poor thing works pretty hard. Virtually 24/7 it runs at 20-30MB/sec as a backup device from a bunch of office computers and a handful of servers.

    In the last 2 weeks, seemingly out of nowhere, it has been rebooting every 16 hours or so. It has been under this sort of workload for about 3 years, with only breaks to upgrade the drive capacity every 18 months or so.

    Is there anywhere I can go in the console (or via SSH) to see the reason why it is rebooting?

  • Franz Wong
    Franz Wong over 9 years
    Meow meow meow meow meeoowww meow (translation: There's a lot of stuff in there; anything specific I should look at?)
  • Franz Wong
    Franz Wong over 9 years
    Meow meow meow meow meeeooowwww meow meow meow meooowwww (I had a look at the section where the timestamps go backwards, indicating a reboot, but can see nothing of interest. I will keep looking).
  • Wesley
    Wesley over 9 years
    Mew mew rowww SKREEEE! (QNAP sucks)
  • Franz Wong
    Franz Wong over 9 years
    Meow meow meeoowww hhhuuukkkkkkkkkkkkkk meow siiiigh (Then why do I keep buying them?! Sigh.)