New on LowEndTalk? Please Register and read our Community Rules.
All new Registrations are manually reviewed and approved, so a short delay after registration may occur before your account becomes active.
All new Registrations are manually reviewed and approved, so a short delay after registration may occur before your account becomes active.
Providers or managers of large linux server fleets
Q. How often do you see non fatal kernel OOPS'es, warnings or bug alerts. Do you monitor for them (netconsole, or other)?
Recently we have increased the number of managed servers quite significantly. On all systems we watch, log and triage.
One thing I've noticed is that the Linux kernel really isnt as defect free as one might hope.
Q. Do you find there is significant benifit in high patch number releases of LTS branches? Do you find them significantly more stable than say low (i.e <20) releases?
Comments
For those curious as to the spark.
Just today I found a netconsole (or virtio?) bug. Non fatal, but a crash risk for sure (for example if an IRQ occurred during the op).
An unsafe
printk
(or in this casenet_warn_ratelimited
) is a scary idea.Correct, they're not that experienced or do very little with it to know where and how often it shits the bed.