Motion Estimation: ECE 569 - Spring 2010 Toan Nguyen Shikhar Upadhaya
Motion Estimation: ECE 569 - Spring 2010 Toan Nguyen Shikhar Upadhaya
3
32
2 64
128
1
256
512
0
Image Size
256 threads/block give the best performance.
Simulation Results (cont.)
• The runtime of the serial versions and the parallel versions of different
algorithms are collected and compare to see what kind of performance
improvement we achieved.
30 Full Search serial vs. parallel
Hexagon Search serial vs. parallel
25 3.5
Runtime (seconds)
Runtime (seconds)
20
2.5
15 2
FSS_Serial 1.5
10 Hexagon_Serial
FSS_Parallel 1
5 Hexagon_parallel
0.5
0 0
3
2.5
2 improvement when the image size is
1.5 4SS_Serial
1
4SS_Parallel
256x256 or bigger. Any image of size
0.5
0
smaller than this will actually decrease
the performance.
Image Size
Simulation Results (cont.)
• So how much speed up do we get and which
algorithm is better, Full Search, Four Step
Search, or Hexagon Search?
Parallel vs. serial versions speedup
35
30
25
Speed up
20
15 Speed_UP_FS
10 Speed_UP_4SS
5 Speed_UP_Hexagon
Image size
Simulation Results (cont.)
• Overall performance
Full_Serial Full_Parallel 4SS_Serial 4SS_Parallel Hexagon_Serial Hexagon_parallel
16X16 0 0 0 0.016 0 0.078
32X32 0 0.016 0 0.015 0 0.047
64X64 0.01 0.016 0.01 0.015 0.01 0.062
128X128 0.02 0.016 0.01 0.015 0.01 0.062
256X256 0.09 0.031 0.02 0.016 0.02 0.047
512X512 0.41 0.078 0.06 0.016 0.06 0.063
1024X1024 1.64 0.265 0.236 0.032 0.22 0.062
2048X2048 6.56 0.922 0.87 0.047 0.85 0.078
4096X4096 26.29 3.719 3.38 0.11 3.3 0.157
30
25
20
FSS_Serial
FSS_Parallel
15
4SS_Serial
10 4SS_Parallel
Hexagon_Serial
5 Hexagon_parallel
0
Simulation Results (cont.)
• Performance comparison between NVIDIA
8400 GS and 9800 GT GPUs.
NVIDIA 8400 GS vs. 9800 GT performance
4.5
4
3.5
3
Speed up
2.5
2 Speed-up_FSS
1.5 Speed-up_4SS
1 Speed-up_Hexagon
0.5
0
Image Size
Simulation Results (cont.)
• Distortion measurement (motion estimation
quality).
Fast Search Distortion Min. SAD returned by different algorithms
600 1400
500 1200
1000
400
Distortion
Min. SAD
800
300
4SS distortion 600 Full-Step
200 Hexagon distortion 4SS
400
Hexagon
100 200
0 0