QRadar Troubleshooting Guide
QRadar Troubleshooting Guide
Contents
1 Basics ............................................................................................................................................4
1.1 QRadar Directory Structure ............................................................................................................... 4
1.2 Basic Commands................................................................................................................................ 5
2 Resilient ........................................................................................................................................7
2.1 Backup ............................................................................................................................................... 7
2.2 Health Checkup ................................................................................................................................. 7
2.3 Resilient Button goes missing from QRadar UI ................................................................................. 7
3 Basic Troubleshooting ...................................................................................................................8
3.1 Clearing Browser Cache ..................................................................................................................... 8
3.2 Enabling and Disabling Debug Logs ................................................................................................... 8
3.3 Collecting Logs via CLI and GUI .......................................................................................................... 8
3.3.1 Using GUI............................................................................................................................... 8
3.3.2 Using CLI ................................................................................................................................ 8
3.4 Restarting of services ........................................................................................................................ 9
3.5 Basic Troubleshooting ....................................................................................................................... 9
4 QRadar Agent (Wincollect) ............................................................................................................9
4.1 Reinstalling Wincollect without Rebooting ....................................................................................... 9
4.2 Verification of Agents on Console and Client. ................................................................................... 9
5 Rules and Offenses ...................................................................................................................... 10
5.1 Offenses Overview .......................................................................................................................... 10
5.2 Repopulating Offenses on Console ................................................................................................. 10
5.3 AQL for Offenses.............................................................................................................................. 11
6 Backups ...................................................................................................................................... 11
6.1 Backups Not Generated................................................................................................................... 11
7 High Availability .......................................................................................................................... 12
7.1 Commands ....................................................................................................................................... 12
7.2 HA host in Failed state due to hidden token ................................................................................... 12
8 RegEx , DSM and Parsing ............................................................................................................. 13
8.1 Some Common Regular Expressions ............................................................................................... 13
8.2 Installing RPMs ................................................................................................................................ 13
8.3 Checking the Expensive DSMs ......................................................................................................... 14
1
IBM Security | QRadar
2
IBM Security | QRadar
3
IBM Security | QRadar
1 Basics
1.1 QRadar Directory Structure
+-- cv = contains accumulated data
+-- events = Events top-level directory
¦ +-- md = created when encryption is enabled and contains hash values.
¦ +-- payloads = contains event payloads
¦ ¦ +--<YEAR-xxx1>
¦ ¦ ¦ +--<MONTH-1>
¦ ¦ ¦ ¦ +--<DAY-1>
¦ ¦ ¦ ¦ ¦ +--<HOUR-1>
¦ ¦ ¦ ¦ ¦ +--<HOUR-2>
¦ ¦ ¦ ¦ ¦ +-- .
¦ ¦ ¦ ¦ ¦ +--<HOUR-24>
¦ ¦ ¦ ¦ +--<DAY-2>
¦ ¦ ¦ ¦ +-- .
¦ ¦ ¦ ¦ +--<DAY-31>
¦ ¦ ¦ +--<MONTH-2>
¦ ¦ ¦ +-- .
¦ ¦ ¦ +--<MONTH-12>
¦ ¦ +-- .
¦ ¦ +--<YEAR-xxxN>
¦ +-- records = contains event records
¦ ¦ +--<YEAR-xxx1>
¦ ¦ ¦ +--<MONTH-1>
¦ ¦ ¦ ¦ +--<DAY-1>
¦ ¦ ¦ ¦ ¦ +--<HOUR-1>
¦ ¦ +-- .
¦ ¦ +-- .
¦ ¦ +--<YEAR-xxxN>
¦ +-- uncompressedCache = pointers to compressed files
+-- flows = flows top-level directory
¦ +-- payloads = contains flow payloads
¦ ¦ ¦ +--<MONTH-1>
¦ ¦ ¦ ¦ +--<DAY-1>
¦ ¦ ¦ ¦ ¦ +--<HOUR-1>
¦ ¦ +-- .
¦ ¦ +-- .
¦ ¦ +--<YEAR-xxxN>
¦ +-- records = contains flow records
¦ ¦ ¦ +--<MONTH-1>
¦ ¦ ¦ ¦ +--<DAY-1>
¦ ¦ ¦ ¦ ¦ +--<HOUR-1>
¦ ¦ +-- .
¦ ¦ +-- .
¦ ¦ +--<YEAR-xxxN>
¦ +-- uncompressedCache = pointers to compressed files
+-- gv = global views top-level directory
¦ +-- definitions = global view definitions
¦ +-- records = global view records
+-- hprof = host profiles top-level directory
¦ +-- uncompressedCache = cursors for searches
+-- persistent_data = pointer to compressed files
¦ +-- ariel.ariel_proxy_server = saved search results and searches done in the last 24 hours
+-- simarc = QRadar Risk Manager connection data
+-- simevent = QRadar Risk Manager event data
+-- statistics = statistics
4
IBM Security | QRadar
Description Command
To check the psql -U qradar -c " select * from tenant;"
tenant info on
console
Normalized /store/ariel/events/records/aux/tenant-ID/year/month
Events
Raw Events /store/ariel/events/payloads/aux/tenant-ID/year/month
Reference:
https://www-01.ibm.com/support/docview.wss?uid=swg22010279
5
IBM Security | QRadar
6
IBM Security | QRadar
2 Resilient
2.1 Backup
i. To back up the platform, you must ssh to the virtual appliance and run this
command:sudo resSystemBackup
This creates a backup in the /crypt/backups/ folder in the form of a gz file; for
example, resilient-backup-20170426201138.tar.gz. The time stamp is appended to
the file name for uniqueness. You can rename this file for clarity, and move it to a
secure location.
ii. To back up the platform, you must ssh to the virtual appliance and run this
command: sudo resSystemBackup
The backup file remembers the KeyVault password scheme (cleartext or gpg
encrypted as described in KeyVaults). When running a restore on that file, it
restores that scheme.
iii. You can encrypt the backup by using the ––encrypt option as follows: sudo
resSystemBackup –encryptCopy
It is recommended that you store the backup and its corresponding
backup_passphrase file to a secure location for future use.
Use the –help option to view all the options on the resSystemBackup and
resSystemRestore commands.
iv. To restore a backup, use the resSystemRestore command and the name of the
backup file. For example: sudo resSystemRestore -f /crypt/backups/resilient-
backup-20170426201138.tar.gz
7
IBM Security | QRadar
3 Basic Troubleshooting
3.1 Clearing Browser Cache
Perform the Steps in orderly fashion:
service hostcontext stop
service tomcat stop
service hostservices stop
rm -rf /opt/tomcat/Catalina/work/localhost/*
rm -rf /opt/tomcat-85/Catalina/work/localhost/*
service hostservices start
service tomcat start
service hostcontext start
8
IBM Security | QRadar
9
IBM Security | QRadar
Qradar dormancy period lasts 5 days. After these 5 days, an offense is marked as inactive. New
events triggering the Offense rule test will not contribute to the inactive offense. Our Offense
Model checks each day within these 5 days to determine which offenses are still dormant and
which are inactive. If an event is received during the dormant time, the dormant time is reset
back to zero. You will have to wait another 5 days of no events or flows triggering the rule test
in order for the offense to become inactive.
Note: By default, the system allows 2,500 open (active) offenses and 100,000 (inactive)
offenses. If these values are reached, a System Notification is generated to alert the
administrator that they might need to review offenses that can be closed or tune rules to
reduce the overall number of offenses that are being generated in QRadar. By default the
system will begin to remove 0.05 percent all inactive offenses every 2 hours.
When an offense is closed either by manually closing an offense or by magistrate, which makes
the offense inactive, the Offense Retention Period setting is then applied. The Offense
Retention Period determines how long inactive offenses are kept before being purged from the
Console.
For better management:
The administrator can manage offenses from Admin tab > Advanced> Clean SIM Model. The
options include:
Soft Clean - this option closes all offenses, but does not remove them from QRadar.
Hard Clean - this option closes and removes all offenses from the system. It is not advised to
Hard Clean your SIM Model, unless advised by QRadar Support.
10
IBM Security | QRadar
6 Backups
6.1 Backups Not Generated
Pre Checks:
i. Last Generated Backup
ii. Check Old qradar.log file and see if backups are completed.
iii. Check Old qradar.error file and see if there are any backup errors.
iv. Check space and /store size.
v. Check all services
11
IBM Security | QRadar
Example:
Found error in qradar.error file
May 28 07:50:10 ::ffff:10.254.158.209 [hostcontext.hostcontext] [Scheduled Backup]
com.q1labs.hostcontext.backup.core.BackupUtils: [ERROR]
[NOT:0000003000][10.254.158.209/- -] [-/- -]The Apache certificates on the managed host do
not match the certificates on the Console. Tomcat connection test failed.
Executed tomcat connection script and fixed the connected
7.Executed test_tomcat_connection script and it shows connected and also see connected in
qradar.log
May 28 10:30:06 ::ffff:10.254.158.209 [test_tomcat_connection] [main]
com.q1labs.hostcontext.backup.core.BackupUtils: [INFO] [NOT:0000006000][10.254.158.209/- -
] [-/- -]Connected to tomcat
7 High Availability
7.1 Commands
Description Command
Current state of the node /opt/qradar/ha/bin/ha cstate
To make the current node primary /opt/qradar/ha/bin/ha takeover
Validates the deployment configurations /opt/qradar/support/ validate_deployment.sh
Verfiying the DRBD Services cat /proc/drbd
HA diagnostics /opt/qradar/support/ha_diagnosis.sh
HA help /opt/qradr/ha2/bin/ha help
Giveback the assigned role /opt/qradar/ha2/bin/ha giveback
12
IBM Security | QRadar
13
IBM Security | QRadar
10 Log Sources
10.1 Linux Log Sources with same hostname
As syslog reads the data from the payload the moment it will find the hostname in the payload
it will parse the logs under one log source.
Also you won’t be able to add the 2nd log source with the same host name and protocol as it
will give you an error stating a log source already exist with same host name and protocol.
14
IBM Security | QRadar
15
IBM Security | QRadar
v. You have asked me couple of things related to license and EPS for that I told
you to raise a new case as there is separate team which looks into such type of
issues and AQL queries related issues.
vi. After some time the last event time of log source got updated and events
started flowing under log source.
vii. Then checked in log activity tab for confirmation and the events are flowing.
viii. You have also raised query related to stored events coming under the log
source so I checked the DSM version of CISCO firepower management and
common found that both are up to date. Then I conveyed you to raise another
case or allow me to raise a new case related to parsing issue for this but you
told me that you will discuss this within your team and let me know.
16
IBM Security | QRadar
17
IBM Security | QRadar
Q1: As per the technote, EPS is measure at 2 points SourceMonitor and StatsFilter. Which
value is checked against the license?
A1:The first number is coalesced and the 2nd number is raw. The peak is the peak since the last
time the service was restarted and again the first number is coalesced and the 2nd number is
raw.
Q2: EPS values being shown on CLI(StatFilter/Sourcemonitor) and UI are very different.
There are 4 values of EPS being shown on UI:
a. Events per second coalesced - Peak 1 sec
b. Events per second coalesced- Average 1 min
c. Events per second Raw - Peak 1 sec
d. Events per second Raw - Average 1 min
A2:
Oct 25 13:25:09 ::ffff:10.254.158.209 [ecs-ec.ecs-ec]
[[type=com.ibm.si.ec.filters.stat.StatFilter][parent=ISB-EP-PRI-primary.ptcl.net.pk:ecs-
ec/EC/Processor2]] com.ibm.si.ec.filters.stat.StatFilter: [INFO]
[NOT:0000006000][10.254.158.209/- -] [-/- -] Events per second: 1s:6829,14121 (peak
12713,34691) (compression: 52%) 5s:4456,10781 (peak 7483,22373) (compression: 59%)
10s:4747,13657 (peak 6910,15464) (compression: 65%) 30s:4332,13261 (peak 6297,14190)
(compression: 67%) 60s:4327,13216 (peak 6018,13855) (compression: 67%)
Values are:
EPS Coalesced - 1 Sec = 6829
EPS RAW - 1 Sec = 14121
EPS Coalesced - Peak 1 sec = 12713
EPS Raw - Peak 1 sec = 34691
EPS Coalesced - 1 Min Avg =4327
EPS RAW - 1 Min Avg =13216
EPS Coalesced - Peak 1 Min Avg =6018
EPS Raw - Peak 1 Min Avg =13855
18
IBM Security | QRadar
19
IBM Security | QRadar
20
IBM Security | QRadar
Partition info:
10.126.217.81:/persistent_queue 873G 30G 844G 4% /store/persistent_queue
Mount:
10.126.217.81:/persistent_queue /store/persistent_queue glusterfs defaults,_netdev 0 0
The first .dat file after the ecs-ec_EC_TCP_TO_EP_0.dat was from Apr 3rd and the last from Apr
8th
First: -rw-r--r-- 1 root root 100M Apr 3 18:59 ecs-ec_EC_TCP_TO_EP_1.dat
Last: -rw-r--r-- 1 root root 28M Apr 8 13:19 ecs-ec_EC_TCP_TO_EP_180.dat
Solution:
i. Stopped ecs-ec and ecs-ec-ingress
systemctl stop ecs-ec-ingress && systemctl stop ecs-ec
ii. Moved all .dat files to /store/ibm_support/persistent_queue_backup/
iii. Removed the ecs-ec_EC_TCP_TO_EP.cfg
iv. Recreated the ecs-ec_EC_TCP_TO_EP.cfg (6444 root:root) and set the its contents as:
3
1
1
0
0
0
false
0393985
true
false
v. Moved the ecs-ec_EC_TCP_TO_EP_1.dat back to /store/persistent_queue/ecs-ec.ecs-ec
and started ecs-ec and ecs-ec-ingress
21
IBM Security | QRadar
In this case, we need to find out which searches are associated with those Views. To do that,
you can run the following command on the console this time.
/opt/qradar/support/collectGvStats.sh -m 10094
/opt/qradar/support/collectGvStats.sh -m 10098
22
IBM Security | QRadar
23
IBM Security | QRadar
12.2.5APARs
APAR IJ15472: EVENT COUNT NUMBERS DOESN'T MATCH IN THE OFFENSE
DETAILS SCREEN ON CLICKING THE EVENT/FLOW COUNT
24