Resumo A2
Resumo A2
zabbix_agentd
<-----response to poller-------collects metrics
- Alguns keys:
Host name system.hostname[<type>]
Availability agent.ping
Remote services net.tcp.service[service,<ip>,<port>]
Processes proc.num[<name>,<user>,<state>,<cmdline>]
Disk space availability vfs.fs.size[fs,<mode>]
Directory entry count vfs.dir.count[dir]
Network net.if.in/out/total[interface]
Memory availability vm.memory.size[<mode>]
CPU load (Unix) system.cpu.load[<cpu>,<mode>]
CPU utilization (Win) system.cpu.util[<cpu>,<type>,<mode>]
- Linha de comando (no agente): zabbix_agentd -t system.cpu.load
zabbix_agentd -t "vfs.file.regexp[/etc/passwd,root]"
- Retrieves information from remote passive Zabbix agent daemons:
podemos checar alguns par�metro do agente
zabbix_get -s 127.0.0.1 -k system.cpu.load
Listing of services
key: services[<type>,<state>,<exclude>]
� type - all (default), automatic, manual or disabled
� state - all (default), stopped, started etc.
� exclude - services to exclude from the result.
MACROS:
- Syntax: {MACRO.NAME}
- Used in various places
� Hosts and templates {HOST.NAME}, {HOST.CONN}
� Triggers {ITEM.VALUE}, {ITEM.LASTVALUE}
� Tags
� Notifications {EVENT.DATE}
� Web monitoring
- User: {$MACRO.NAME}
Three levels
� Global (Administration > General > Macros)
� Template (Configuration > Templates > {template} > Macros)
� Host (Configuration > Hosts > {host} > Macros)
- ex net.tcp.service[ssh,{$SSH.PORT}]
- Inheritance can be seen and modified on templates and hosts
Problem detection
Triggers
- Triggers are logical expressions that �evaluate� data gathered by items
- Represents current system state
- Allows to define a threshold (state of data is "acceptable"/"not acceptable")
- sTATUS:
OK normal trigger state.
PROBLEM something happened. For example, the processor load is too high.
- When trigger changes state new event is created
- Trigger status (expression) is recalculated every time Zabbix server receives a
new value
that is part of the expression
- Triggers are evaluated based on history data only; trend data are never
considered
- Suporta macros, como as de sistema:
{HOST.HOST}, {HOST.NAME}, {HOST.CONN}, {HOST.DNS}, {HOST.IP}, {ITEM.VALUE}
- Operational data: ex item.last (ultimo valor)
If not configured, latest values of all items from the expression are displayed
- Severity: High, Informative, etc
- ex: {host1:item1.func(5m)}>10 and {host1:item2.func(5m)}>5
{ Zabbix : system.uptime . last() } < 10m
- Functions: min(), max(), avg(), last(), diff(), count(), delta(), time(), etc
- pode ter a recovery expression para recolocar em ok (por ex, somente apos 5 pings
ok volta)
- Triggers have an option to define custom event tags
Trigger may have several tags with the same name, but different values
TEMPLATES
- Template is a set of entities that can be applied to multiple hosts
- Easily set up and manage monitoring for hundreds, thousands of hosts
- Ex: v�rios hosts tem m�tricas semelhantes, d� pra agrupar num template e
distribuir p cada host
- You can use the Clone and Full clone buttons in the form of an existing template
or host to create a new one
[Clone] Will retain all parameters and linkage (e.g. keeping all entities
from the templates)
[Full clone] Full clone will additionally retain directly attached entities
(applications, items,
triggers, etc.)