posted on 2016-01-21 00:03
Summary: check a linux system for problems immediatly after ssh'ing onto it.
htop- uptime, core diversity, load, swap on first sight via a TUI.
uptime- for load checking, likely unnnecesary after
dmesg | tail- check for errors like out of memory
vmstat 1- check amount of processes (r) and kernel/userland distribution and swap
mpstat -P ALL 1- check for a single hot core
pidstat 1- check for high load on single process
iostat -xz 1- high r/w load? awaits? util%?
free -m- memory available, likely unneded after
sar -n DEV 1- rxkb/s or txkb/s is 125mbytes max for 1G NICs, util% ok?
sar -n TCP,ETCP 1- act = egress, pasv = ingress traffic, retransmits = bad, usually
top- zxcV and 1 and < and > are your best friends, along with knowing status indices.
At 7. buffers = block device caching, cache = page cache for file system.
At 10. just switch columns through the angle brackets keys and have a look the waits (
wa) to see if there are disk related issues, after having pressed
1 to show all available cores.
d with a number after changes the refresh time to x seconds.
In general everything concerning
top can be found in the manual.
Lastly, a list of the process states from the mentioned
top man page:
D = uninterruptible sleep <<-- waiting for disk R = running S = sleeping T = traced or stopped Z = zombie
posted on 2015-03-04 12:54:59
To get a fast overview on what is running on your linux box, use
(If you want some fancy graphics, try
htop, but it has less intuitive shortcuts and is not always installed.)
Sad thing is, at first you don't really know what you are doing. So some guidance:
After starting top, press:
This will color top (
z), show current sort column (
x) and the full application path (
1 will show stats for all individual cpus.
If you have no idea, use
h for getting the help shown.
If you have a newer version of
V will also work:
This gives you a nice process-tree view.
d changes the update delay, which is at three seconds per default.
Straight from the manpage, the CPU statistics show the times spent in:
us = user mode sy = system mode ni = low priority user mode (nice) id = idle task wa = I/O waiting hi = servicing IRQs si = servicing soft IRQs st = steal (time given to other DomU instances)
If you have low cpu and ram usage but the system is unresponsive, have a look at the
Changing the sort column can be done via
Also available: (not shown in help)
N sort by PID
P sort by CPU usage
M sort by memory usage
T sort by time
R will reverse the output.
u to choose user name, show only this user's processes.
S for cululative time toggling.
f will toggle a window in which you can choose the info fields to be shown.
Pressing the character will toggle its state. (Shown or not shown.)
o also opens a window, in there you can reorder the columns.
Press the character of the column you want to move, depending on it being upper- or lowercase it gets moved up and down.
These should be self-explanatory:
k kill task
r renice task
View posts from 2017-05, 2017-04, 2017-03, 2017-02, 2017-01, 2016-12, 2016-11, 2016-10, 2016-09, 2016-08, 2016-07, 2016-06, 2016-05, 2016-04, 2016-03, 2016-02, 2016-01, 2015-12, 2015-11, 2015-10, 2015-09, 2015-08, 2015-07, 2015-06, 2015-05, 2015-04, 2015-03, 2015-02, 2015-01, 2014-12, 2014-11, 2014-10, 2014-09, 2014-08, 2014-07, 2014-06, 2014-05, 2014-04, 2014-03, 2014-01, 2013-12, 2013-11, 2013-10