Computer Science - BotPro Case Study (May 2024)
Computer Science - BotPro Case Study (May 2024)
Challenge 2: Navigation
● Multimodalism is the process in an autonomous system that integrates data
from multiple sensing and operational modalities. By combining information
and processes from different sources, these systems can help in providing a
more comprehensive and accurate understanding and navigation of the
environment.
(Source: A Review of Radio Frequency Based Localisation for Aerial and Ground
Robots with 5G Future Perspectives)
Bundle Adjustment
● Technique used in computer vision and photogrammetry
● Refines parameters of a 3D reconstruction system
● Used to align multiple views of the scene points and camera points
● Bundle - a collection of features observed in multiple images of the same
scene.
● The bundles are then optimised while taking into account the camera
parameters such as distortion, calibration and relative positioning to rearrange
into a proper scene.
● Iterative minimisation of reprojection error provides a more precise
representation of the scene
Computer Vision
● Field of study and research focusing on enabling computers to understand,
interpret and analyse information from images and videos
● Development of algorithms, techniques and models to extract meaningful
information from visual data to replicate human vision capabilities.
● Steps in computer vision
○ Accepts the digital information and video frames as input
○ Extract high-level information from visual data using complex
mathematical and statistical algorithms to analyse patterns, shapes,
colours and textures to recognise objects.
Edge Computing
● Distributed computing paradigm
● Brings computation and data storage to the edge of the network where data is
generated or consumed compared to sole reliance on centralised cloud
computing infrastructure.
● Processing algorithms are located near the edge devices or sensors which
reduces the need to transmit data to remote cloud servers to compute data.
● Idea is to enable real time processing and processing closer to the data
source
● Advantages
○ Low latency: less access times, good for time-sensitive applications
such as autonomous vehicles and real-time analytics.
○ Bandwidth Optimisation: reduces amount of data needed to be
transferred across the internet, alleviates congestion and high
bandwidth costs
○ Privacy and security: data does not need to traverse external networks,
allows localised data storage and processing
○ Offline operation: allows operation during times of limited cloud/network
connectivity.
GPS Signal
● Refers to radio frequency signals emitted by GPS satellites that provide
information to receivers on earth
● Allows receivers to calculate precise location, position and time to
synchronise its motion
● Consists of satellites that continuously transmit signals about their orbital
parameters and exact time signals
● Components of the GPS signal
○ Navigation message - information about satellite orbit, clock eros and
other parameters are 50 bits per second
○ Carrier wave - radio wave carrying the navigation message on L1 or L2
in the form of modulated signals
○ Spread spectrum signal - to enhance signal quality and resistance to
interference or disruptions to spread the signal over a wider frequency
band.
● Receivers intercept the signal to analyse time delay between sending and
receiving to calculate distance.
GPS-Degraded Environment
● The GPS signals are severely compromised or degraded, leading to
challenges/limitations in accurate positioning and navigation.
● Causes:
○ Signal obstruction - physical obstructions along the line-of-sight of GPS
receiver and satellites
○ Multi-path interference - signals reflect off buildings and other terrain
before reaching receiver to interference with direct signals to cause
errors/inaccuracies.
○ Signal jamming - intentional or unintentional interference to disrupt or
block GPS using EM waves, concerned in high electronically active
areas.
● Alternative positioning methods such as inertial navigation system (INS) using
accelerometers and gyros.
GPS-Denied Environment
● Situation where the GPS signals are too weak/completely unavailable
● Indoor, underground, dense areas where signals may be weakened, distorted
or blocked
● Again, alternative positioning methods or technologies are required
Keyframe Selection
● Video processing technique involving the identification and selection of key
frames that represent the entire scene from a sequence of videos / images
● Capture essential information from the content of the visual sequence
● Reduces amount of data to be processed or analysed while limiting the data
to only relevant information
● Criteria for a frame to be considered keyframe
○ Visual Saliency: only capture visually salient regions or objects in the
video
○ Content Diversity: represent different scenes, perspectives and/or
actions to provide a comprehensive overview of text
○ Temporal Significance: select specific points in time of significance
○ Motion Characteristics: based on motion analysis
○ Redundancy: Selecting frames that only offer unique information
compared to neighbouring frames
○ Computational efficiency: strike balance between accuracy and
complexity
Object Occlusion
● Phenomena in which an object positioned in front of another obstructs the
visibility of obscured object from viewpoint of observer
● Affects tracking, segmentation and recognition
● Causes complexities due to partial visibility making it difficult to predict actual
nature
● Disadvantages
○ Full extent and boundary of object not visible
○ Loss of tracking on an object - continuity requires complex algorithms
○ Exhibit limited visual cues or fragmented appearance
○ Depth relationships between occluded and occluded objects are not
visible
Odometer Sensor
● Device used to measure movement and displacement of mobile robot or
vehicle
● Provides information about vehicle change in position based on motion
● Use rotational encoders or sensors on wheels / motor shafts to measure
movement
● Combined with other information such as from IMU or GPS to improve
accuracy and reliability of vehicle pose estimation and localisation
● Provide real-time feedback to allow for precise control and monitoring
Optimisation
● Process of finding best possible solution by maximising a specific objective
function within a given set of constraints
● Involves systematic exploration and performance improvement
● Process
○ Defining the problem and its constraints
○ Identification of the space of possible solutions through range and
bound determination
○ Objective function to measure quality of solution based on optimisation
goal
○ Reference to any constraints of the problem that solution must aim to
solve
○ Developing algorithms and techniques to solve the problem
○ Assessing the optimised solution and evaluating its performance
against defined objectives and constraints
Re-localization
● Relocating the position when camera or robot loses track or encounters
environmental change due to sensor drift, occlusion, movement or lighting
● Successful re-localization allows system to accurately cover its pose
estimation and continue operation
● Steps to re-localization
○ Map of the environment is created with key reference points for pose
estimation
○ Extraction of visual features including key points and key frames
○ Features matched against map
○ Estimation of camera or robot pose calculated using matched features
○ Refinement or verification of estimation to improve accuracy and
reliability
○ Most useful in SLAM situations where the robot/camera needs to
constantly update its location
Robot Drift
● Robot’s estimated position gradually deviates from actual position over time
● Factors that contribute
○ Sensor noise: inaccuracies or anomalies
○ Calibration errors: misalignment of components, incorrect calibration
○ Environmental changes: terrain, lighting or magnetic field affect
readings
○ Accumulative integration: adding sensor measurements causing error
propagation and accumulation of mistake
○ Uncertainty: complex or dynamic environments
● Methods for mitigation
○ Sensor fusion: integrating data from multiple sensors doing same thing
○ Kalman Filtering: to mitigate noise and uncertainties
○ Loop closure: correction mechanisms to correct accumulated error
○ Environmental Constraints: connect drift by aligning estimated pose
with actual environment
○ Online calibration/recalibration: reduce systematic errors