Week 6 - Review On High Performance Energy Efficient Multicore Embedded Computing 1
Week 6 - Review On High Performance Energy Efficient Multicore Embedded Computing 1
Moore’s Law
One of the guiding principles of computer
architecture is known as Moore’s Law. In 1965
Gordon Moore stated that the number of transistors
on a chip will roughly double each year (he later
redefined this, in 1975, to every two years). What is
often quoted as Moore’s Law is Dave House’s
revision that computer performances will double
every 18 months.
As shown in Figure 2, the number of transistors has Figure 3(a) Shared Memory Model, (b)
roughly doubled every 2 years. Multicore processors Distributed Memory Model
are often run at slower frequencies, but have much
better performance than a single core processor Table 1 below shows a comparison of a single and
because “Two heads are better than one.” [5]. multicore (8 cores in this case) processor used by the
Packaging Research Center at Georgia Tech [5].
With the same source voltage and multiple cores run
at a lower frequency we see an almost tenfold
increase in bandwidth while the total power
consumption is reduced by a factor of four.
343
International Journal of Advanced Computer Research (ISSN (print): 2249-7277 ISSN (online): 2277-7970)
Volume-3 Number-3 Issue-11 September-2013
application requirements. HPEEC softwarebased
techniques include data forwarding, task scheduling,
task migration and load balancing.The workload
aware load unbalancing strategy reduces themean
waiting time of aperiodic tasks by 92 percent with
similar power efficiency as compared to a
workloadunaware load unbalancing strategy [3].
344
International Journal of Advanced Computer Research (ISSN (print): 2249-7277 ISSN (online): 2277-7970)
Volume-3 Number-3 Issue-11 September-2013
Tilera revolutionizes high performance multicore to incorrect results or even the embedded system
embedded computing by leveraging a tiled multicore failure. Depending on the target market, embedded
architecture (e.g., the TILEPro64 and TILE-Gx applications typically operate above 45OC (e.g.,
processor family [9], [10]). The TILEPro64 and telecommunication embedded equipment temperature
TILE-Gx processors offer 5.6 and 32 MB of on chip exceeds 55OC) in contrast to traditional computer
cache respectively and implement Tilera’s dynamic systems, which normally operate below 38OC.
distributed cache (DDC) technology that provides a
2x improvement on average in cache coherence 3. Reliability Constrained
performance over traditional cache technologies Embedded systems with high reliability constraints
using a cache coherence protocol. are typically required to operate for many years
without errors and/or must recover from errors since
B. Intel Xeon Processor many reliabilityconstrained embedded systems are
Intel leverages Hafnium Hi-K andmetal gates in next deployed in harsh environments where post
generation Xeon processors to achievehigher clock deployment removal and maintenance is infeasible.
speeds and better performance per watt. TheXeon Hence, hardware and software for
processors also implement hyper threading and reliabilityconstrained embedded systems must be
widedynamic execution technologies for high developed and tested more carefully than traditional
performance. Thewider execution pipelines enable computer systems.
each core to simultaneously fetch, dispatch, execute,
and retire up to four instructions per cycle [11]. 4. Real Time
Intel’s deep power down technology enables both In addition to correct functional operation, realtime
cores and the L2 cache to be powered down when the embedded applications have additional stringent
processor is idle [12]. timing constraints, which impose realtime operational
deadlines on the embedded system’s response time.
C. Graphics Processing Units Although realtime operation does not strictly imply
GPUs feature highmemorybandwidth that is typically high performance, realtime embedded systems
10xfaster thancontemporary CPUs. NVIDIA and require high performance only to the point that the
AMD/ATI are thetwomain GPU vendors. NVIDIA’s deadline is met, at which time high performance is no
PowerMizer technology available on all NVIDIA longer needed. Hence, realtime embedded systems
GPUs is a DPM technique that adapts the GPUto suit require predictable highperformance. Realtime
an application’s requirements [13]. operating systems (RTOSs) provide guarantees for
meeting the stringent deadline requirements for
5. Embedded Applications embedded applications [7].