EEEM048 Lecture6
EEEM048 Lecture6
End-user
Operating
Systems? Core network
Gateway e.g. Internet
Protocols?
Protocols? Data
Aggregation/
In-node Data Fusion
Processing
Sink
Gateway Computer services
node
3
3
Beyond conventional sensors
Road block, A3
Road block, A3
4
Suggest a different route
The benefits of data processing in IoT
− Requires: real-time, reliable, efficient (for low power and resource limited
nodes), and scalable solutions.
5
Partially adapted from: What is Bog Data?, IBM
IoT Data Access
− Publish/Subscribe (long-term/short-term)
− Ad-hoc query
− The typical types of data request for sensory data:
− Query based on
− ID (resource/service) – for known resources
− Location
− Type
− Time – requests for freshness data or historical data;
− One of the above + a range [+ Unit of Measurement]
− Type/Location/Time + A combination of Quality of Information attributes
− An entity of interest (a feature of an entity on interest)
− Complex Data Types (e.g. pollution data could be a combination of
different types)
6
Sensor Data
7
Sensor Data
8
Data Processing and Interpretation
9
“Each single data item could be important.”
10
IoT Data Challenges
11
IoT Data Processing
Data collections
and processing WSN
within the Network-enabled
networks Devices
WSN
Network
services/storage
and processing
units Data/service access
at application level
WSN
WSN
WSN
Data Discovery
Network-enabled
Devices Service/
Resource
Discovery
In-network processing
− Mobile Ad-hoc Networks can be seen as a set of nodes that deliver bits
from one end to the other;
− WSNs, on the other end, are expected to provide information, not
necessarily original bits
− Gives additional options
− e.g., manipulate or process the data in the network
− Main example: aggregation
− Applying aggregation functions to a obtain an average value of measurement
data
− Typical functions: minimum, maximum, average, sum, …
− Not amenable functions: median
Source: Protocols and Architectures for Wireless Sensor Networks, Protocols and Architectures for Wireless Sensor Networks 13
Holger Karl, Andreas Willig, chapter 3, Wiley, 2005 .
In-network processing
Source: Protocols and Architectures for Wireless Sensor Networks, Protocols and Architectures for Wireless Sensor Networks 14
Holger Karl, Andreas Willig, chapter 3, Wiley, 2005 .
Data-centric networking
Source: Protocols and Architectures for Wireless Sensor Networks, Protocols and Architectures for Wireless Sensor Networks 15
Holger Karl, Andreas Willig, chapter 3, Wiley, 2005 .
Data Aggregation
1 1
1
1
3 1
1 1
6 1
1 1
Source: Holger Karl, Andreas Willig, Protocols and Architectures for Wireless Sensor Networks, Protocols and Architectures for Wireless Sensor Networks, chapter 3, Wiley, 2005 .
Efficacy of an aggregation mechanism
Publisher 1 Publisher 2
− Variations
− Topic-based P/S –
inflexible
− Content-based P/S – Software bus
use general predicates
over named data
Subscriber 1 Subscriber 2 Subscriber 3
Source: Holger Karl, Andreas Willig, Protocols and Architectures for Wireless Sensor Networks, Protocols and Architectures for Wireless Sensor Networks, chapter 12, Wiley, 2005
.
MQTT Pub/Sub Protocol
− The message header for each MQTT command message contains a fixed header.
− Some messages also require a variable header and a payload.
− The format for each part of the message header:
− The sensor data (or IoT data in general) can be seen as time-
series data.
− A sensor stream refers to a source that provide sensor data
over time.
− The data can be sampled/collected at a rate (can be also
variable) and is sent as a series of values.
− Over time, there will be a large number of data items
collected.
− Using time-series processing techniques can help to reduce
the size of the data that is communicated;
− Let’s remember, communication can consume more
energy than communication;
Sensor Data as time-series data
fggfffhfffffgjhghfff
jfhiggfffhfffffgjhgi
fggfffhfffffgjhghfff
Source: Jessica Lin, Eamonn Keogh, Stefano Lonardi, and Bill Chiu. 2003. A symbolic representation of time series, with implications for streaming algorithms. In
Proceedings of the 8th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery (DMKD '03). ACM, New York, NY, USA, 2-11.
SAX- normalisation before PAA
σ = 1.79691402131
μ = 3.11
z1 = (2- 3.11)/1.79691402131 = -0.617725716
z2 = (3-3.11)/ 1.79691402131= -0.061216062
z3 = (4.5-3.11)/ 1.79691402131= 0.773548419
z4 = (7.6-3.11)/ 1.79691402131= 2.498728346
z5 = (4-3.11)/ 1.79691402131= 0.495293592
z6 = (2-3.11)/ 1.79691402131= -0.617725716
z7 = (2-3.11)/ 1.79691402131= -0.617725716
z8 = (2-3.11)/ 1.79691402131= -0.617725716
z9 = (3-3.11)/ 1.79691402131= -0.061216062
z10 = (1-3.11)/ 1.79691402131= -1.17423537
Source: Jessica Lin, Eamonn Keogh, Stefano Lonardi, and Bill Chiu. 2003. A symbolic representation of time series, with implications for streaming algorithms. In Proceedings of the
8th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery (DMKD '03). ACM, New York, NY, USA, 2-11.
Alphabet representation in SAX
− SAX divides a time series data into equal segments and then
creates a string representation for each segment.
− The SAX patterns create the lower-level abstractions that are
used to create the higher-level interpretation of the
underlying data.
− The string representation of the SAX mechanism enables to
compare the patterns using a specific type of string similarity
function.
Interpretation of data
37
Quiz
40