H264 Encoder Short
Systems
Marco D. Santambrogio1,2 , Henry Hoffmann1 , Jonathan Eastep1 , Jason E. Miller1 , Anant Agarwal1
1
Politecnico di Milano
Dipartimento di Elettronica e Informazione
20133 Milano, Italy
[email protected]
I. INTRODUCTION
Resources such as quantities of transistors and memory, the level of integration, and the speed of components have increased dramatically over the years. Even though the technologies have improved, we continue to apply outdated approaches to our use of these resources. Key computer science abstractions have not changed since the 1960s. The operating systems, languages, etc. that we use today were designed for a different era. Therefore, this is the time for a fresh approach to the way systems are designed and used. Self-aware computing research leverages the new balance of resources to improve performance, utilization, reliability and programmability [1, 2].
Within this context, imagine a revolutionary computing system that can observe its own execution and optimize its behavior around a user's or application's needs. Imagine a programming capability by which users can specify their desired goals rather than how to perform a task, along with constraints in terms of an energy budget, a time constraint, or simply a preference for an approximate answer over an exact one. Imagine further a computing chip that performs better, according to a user's preferred goal, the longer it runs an application. Such an architecture will enable, for example, a hand-held radio or a cell phone that runs cooler the longer the connection lasts, or a system that performs reliably and continuously in a range of environments by tolerating hard and transient failures through self-healing. Self-aware computer systems [3] will be able to configure, heal, optimize and protect themselves without the need for human intervention.
A. Application Heartbeats
The Application Heartbeats framework provides a simple, standardized way for applications to report their performance and goals to external observers [4]. As shown in Figure 1, this progress can then be observed by either the application itself or an external system (such as the OS or another application). Having a simple, standardized interface makes it easy for programmers to add Heartbeats to their applications. A standard interface, or API, is also crucial for portability and inter-operability between different applications, runtime systems, and operating systems. Registering an application's goals with external systems enables adaptive systems to make optimization decisions while monitoring the program's performance directly, rather than having to infer that performance from low-level details. If performance is found to be unacceptable, information gleaned from hardware counters can help explain why and what should be changed.
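As a concrete illustration, the report-and-observe pattern above can be sketched in a few lines of Python. This is not the real Heartbeats API (the reference implementation is a C library that communicates through file I/O); the class and method names here are hypothetical stand-ins for the goal-registration, beat, and read-out operations.

```python
import time

class Heartbeat:
    """Sketch of a heartbeat monitor: the application registers its
    performance goals, emits one beat per unit of work, and any observer
    (the app itself, the OS, another application) can read the measured
    heart rate in beats per second over a sliding window."""

    def __init__(self, min_target, max_target, window=10):
        self.min_target = min_target   # minimum acceptable beats/sec
        self.max_target = max_target   # maximum useful beats/sec
        self.window = window           # number of beats kept for the rate
        self.timestamps = []

    def beat(self, now=None):
        """Called by the application once per unit of work."""
        self.timestamps.append(time.time() if now is None else now)
        self.timestamps = self.timestamps[-self.window:]

    def heart_rate(self):
        """Called by an observer: beats per second over the window."""
        if len(self.timestamps) < 2:
            return 0.0
        span = self.timestamps[-1] - self.timestamps[0]
        return (len(self.timestamps) - 1) / span if span > 0 else 0.0

    def meets_goal(self):
        return self.min_target <= self.heart_rate() <= self.max_target

# An application emitting one beat every 10 ms reports ~100 beats/sec:
hb = Heartbeat(min_target=30, max_target=120)
for i in range(10):
    hb.beat(now=i * 0.01)   # simulated timestamps, one per work unit
```

An observer can then poll `hb.heart_rate()` and `hb.meets_goal()` and react when the rate drifts outside the registered window.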
To achieve the vision described in the previous sections, a self-aware system must be able to monitor its behavior and update one or more of its components (hardware architecture, operating system and running applications) in order to achieve its goals. This paper proposes the vision of organic computation that will create such a self-aware computing system. An organic computer is given a goal and a set of resources together with their availability; it then finds the best way to accomplish the goal while satisfying the constraints of interest. An organic computer has four major properties:
• It is goal-oriented in that, given application goals, it takes actions automatically to meet them;
• It is adaptive in that it observes itself, reflects on its behavior to learn, computes the delta between the goal and the observed state, and finally takes actions to optimize its behavior towards the goal;
• It is self-healing in that it constantly monitors for faults and continues to function through them, taking corrective action as needed;
• It is approximate in that it uses the least amount of computation or energy to meet accuracy goals and accomplish a given task.
More importantly, much like biological organisms, an organic computer can go well beyond traditional measures
of goodness like performance and can adapt to different
environments and even improve itself over time.
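The adaptive property described above (observe, compute the delta between the goal and the observed state, then act) can be sketched as a minimal control step. The `adapt` function and the toy performance model below are illustrative assumptions, not part of the system described in the paper.

```python
def adapt(goal, observed, knob, knob_min, knob_max):
    """One step of the observe-reflect-act loop: compute the delta between
    the goal and the observed state, then nudge an abstract resource knob
    (e.g. a core count) in the direction that closes the delta."""
    delta = goal - observed
    if delta > 0:                          # under-performing: add resources
        knob = min(knob + 1, knob_max)
    elif delta < 0:                        # over-performing: reclaim resources
        knob = max(knob - 1, knob_min)
    return knob

# Toy model: observed performance equals the core count; the goal is 4.
cores, history = 1, []
for _ in range(6):
    observed = cores * 1.0                 # pretend perf == core count
    cores = adapt(goal=4, observed=observed, knob=cores,
                  knob_min=1, knob_max=8)
    history.append(cores)
# history climbs to the goal and then holds steady: [2, 3, 4, 4, 4, 4]
```

A real organic computer would replace the toy model with measured state (e.g. a heart rate) and the single knob with the full space of architectural, OS and algorithmic adaptations.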
To adapt what the organic computer is doing, or how it is doing a given task, at run time, it is necessary to develop a control system that observes execution, measures thresholds and compares them to goals, and then adapts the architecture, the operating system or the algorithms as needed. A key challenge is to identify which parts of a computer need to be adapted and to quantify the degree to which adaptation can afford savings in the metrics of interest. Examples of mechanisms that can be adapted include recent research on
Fig. 1: (a) Self-optimizing application using the Application Heartbeats framework. (b) Optimization of machine
parameters by an external observer.
The Application Heartbeats framework measures application progress toward goals using a simple and well-
Fig. 2: Smartlocks architecture. The ML engine tunes the Smartlock internally to maximize the monitor's reward signal. Tuning adapts the lock acquisition scheduling policy by configuring a priority lock and per-thread priority settings.
by adjusting the Smartlock's lock acquisition scheduling policy. The scheduler is implemented as a priority lock, and the scheduling policy is configured by dynamically manipulating per-thread priority settings.
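The priority-lock mechanism can be sketched as follows. This Python model only illustrates the scheduling policy (grant the lock to the highest-priority waiter, with priorities rewritable at run time); the actual Smartlocks synchronization is implemented in native code, and the class and thread names here are hypothetical.

```python
class PriorityLock:
    """Sketch of a priority lock: among the threads currently waiting,
    the lock is granted to the one with the highest priority. An external
    tuner can rewrite the priorities at any time, which changes the
    scheduling policy on the fly."""

    def __init__(self, priorities):
        self.priorities = dict(priorities)   # thread id -> priority

    def next_holder(self, waiters):
        """Choose which waiting thread acquires the lock next."""
        return max(waiters, key=lambda t: self.priorities[t])

# Initial policy: the main thread is favored over the workers.
lock = PriorityLock({"main": 10, "w0": 1, "w1": 1})
# A tuner can later promote a worker without touching the lock itself:
lock.priorities["w1"] = 5
```

With the updated priorities, `lock.next_holder(["w0", "w1"])` now selects `"w1"`; this is exactly the knob the ML engine in Fig. 2 turns.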
IV. PRELIMINARY RESULTS
This section presents several examples illustrating the use of the Heartbeats framework and Smartlocks. First, a brief study is presented using Heartbeats to instrument the PARSEC benchmark suite [37]; then the benefits of using Smartlocks are demonstrated on a synthetic benchmark.
A. Heartbeats in the PARSEC Benchmark Suite
We present several results demonstrating the simplicity and efficacy of the Heartbeat API. These results all make use of our reference implementation of the API, which uses file I/O for communication. Results were collected on an Intel x86 server with dual 3.16 GHz Xeon X5460 quad-core processors.
To demonstrate the broad applicability of the Heartbeats framework across a range of applications, we apply it to the PARSEC benchmark suite (v. 1.0). For each benchmark, we find the outer-most loop used to process inputs and insert a call to register a heartbeat in that loop. In some cases, the application is structured so that multiple inputs are consumed during one iteration of the loop. Table I shows both how the heartbeats relate to the input processed by each benchmark and the average heart rate (measured in beats per second) achieved running the native input data set1 . The Heartbeat interface is found to be easy to insert into an application, as it requires adding less than half a dozen lines of code per benchmark, and only requires identifying the loop that consumes input data. In addition, the interface is low-overhead, resulting in immeasurable overhead for 9
1 freqmine and vips are not included, as the unmodified benchmarks did not compile on the target system with our installed version of gcc.
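The instrumentation step described above can be sketched schematically. The `heartbeat` stub and the `BEAT_EVERY` constant below are hypothetical stand-ins for the real API call and for the per-benchmark beat granularity listed in Table I.

```python
def process(item):
    """Stand-in for the benchmark's real per-input work."""
    return item * 2

beats = 0
def heartbeat():
    """Stand-in for the Heartbeats API call registering one beat."""
    global beats
    beats += 1

BEAT_EVERY = 5                  # e.g. "every 5 inputs", chosen per benchmark
inputs = range(23)              # stand-in for the benchmark's input stream
for i, item in enumerate(inputs):
    process(item)
    if (i + 1) % BEAT_EVERY == 0:
        heartbeat()             # the only instrumentation added to the loop
```

This mirrors the paper's claim: the change is a handful of lines confined to the loop that consumes input data.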
[Table I residue: the "Heartbeat Location" column per benchmark reads: every 25000 options; every frame; every 1875 moves; every chunk; every frame; every query; every frame; every 200000 points; every swaption; every frame.]
[Fig. 3 (plot residue): heart rate over Time (Heartbeat); series: Heartrate, Target Min, Target Max, Cores.]
studies using heartbeats to develop adaptive video encoders. [39] describes the SpeedGuard run-time system, which can be automatically inserted into applications by the SpeedPress compiler. SpeedGuard uses heartbeats to monitor application performance and trade quality-of-service for performance in the presence of faults such as core failures or clock-frequency changes.
[Fig. 4 (plot residue): performance over Time (seconds), 0.0 to 4.0 s, across workload phases Workload #1 / Workload #2 / Workload #1; series: Optimal, Smartlock, Priority lock: policy 1, Priority lock: policy 2.]
Fig. 4: Performance results for the thermal throttling experiment. Smartlocks adapt to different workloads; no single static policy is optimal for all of the different conditions.
core speeds.2 No thread migration is assumed. Instead,
the virtual performance of each thread is adjusted by
adjusting heartbeats. The main thread always runs at 3.16
GHz. At any given time, 1 worker runs at 3.16 GHz and
the others run at 2.11 GHz. The thermal throttling events
change which worker is running at 3.16 GHz. The first
event occurs at time 1.4s. The second occurs at time 2.7s
and reverses the first event.
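Schematically, the adaptation this experiment exercises can be sketched as follows. The real Smartlock discovers its policy online via a machine-learning engine driven by the heart-rate reward [5]; this toy `retune` function simply encodes the outcome such tuning should converge to, namely giving lock priority to the worker on the fast core.

```python
def retune(speeds):
    """Sketch of one tuning step in the throttling experiment: the heart
    rate is maximized by granting the highest lock priority to the worker
    currently running at the highest clock speed (a hand-coded stand-in
    for what the ML engine learns online from the reward signal)."""
    fastest = max(speeds, key=speeds.get)
    return {w: (2 if w == fastest else 1) for w in speeds}

# Before the first throttling event, worker 0 holds the 3.16 GHz core:
prio = retune({"w0": 3.16, "w1": 2.11, "w2": 2.11})
# After the event at t = 1.4 s, the fast core moves to worker 1, and the
# priorities should follow it:
prio2 = retune({"w0": 2.11, "w1": 3.16, "w2": 2.11})
```

A static policy corresponds to fixing the priority map once; the experiment shows why that is suboptimal when the throttling events move the fast core.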
Figure 4 shows several things. First, it shows the performance of the Smartlock against existing reactive lock techniques. Smartlock performance is the curve labeled "Smartlock" and reactive lock performance is labeled "Spin-lock: reactive lock". The performance of any reactive lock implementation is upper-bounded by its best-performing internal algorithm at any given time. The best algorithm for this experiment is the write-biased readers-writer lock, so the reactive lock is implemented as that.3 The graph also compares Smartlock against a baseline test-and-set spin-lock, labeled "Spin-lock: test and set", for reference. The number of cycles required to perform each unit of work has been chosen so that the difference in acquire and release overheads between lock algorithms is not distracting but lock contention is high; what is important is the policy intrinsic to the lock algorithm (and the adaptivity of the policy in the case of the Smartlock). As the figure shows, Smartlock outperforms the reactive lock and the baseline, implying that reactive locks are sub-optimal for this and similar benchmark scenarios.
The second thing that Figure 4 shows is the gap between reactive lock performance and optimal performance. One lock algorithm / policy that can outperform standard techniques is the priority lock with prioritized access. The graph compares reactive locks against two priority locks with hand-coded priority settings (the curves labeled "Priority lock: policy 1" and "Priority lock: policy 2").
ACKNOWLEDGMENT
We'd like to thank all the many people we worked with over the last two years for all the useful discussions and for their thoughtful ideas. Special thanks to Jason E. Miller and all the members of the CARBON research groups, and to the Rocca Foundation for its support.
REFERENCES
[1] Jeffrey O. Kephart and David M. Chess. The vision of autonomic computing. Computer, 36(1):41–50, 2003.
[2] Mazeiar Salehie and Ladan Tahvildari. Self-adaptive software: Landscape and research challenges. ACM Trans. Auton. Adapt. Syst., 4(2):1–42, 2009.
[3] P. Dini. Internet, GRID, self-adaptability and beyond: Are we ready? Aug 2004.
[4] Henry Hoffmann, Jonathan Eastep, Marco D. Santambrogio, Jason E. Miller, and Anant Agarwal. Application heartbeats for software performance and health. In PPoPP, pages 347–348, 2010.
[5] Jonathan Eastep, David Wingate, Marco D. Santambrogio, and Anant Agarwal. Smartlocks: Self-aware synchronization through lock acquisition scheduling. SMART 2010: Workshop on Statistical and Machine learning approaches to ARchitectures and compilaTion, 2010. Online document, http://ctuning.org/dissemination/smart10-05.pdf.
[6] B. Sprunt. Pentium 4 performance-monitoring features. IEEE Micro, 22(4):72–82, Jul/Aug 2002.
[7] R. Kumar, K. Farkas, N. P. Jouppi, P. Ranganathan, and D. M. Tullsen. Processor power reduction via single-ISA heterogeneous multi-core architectures. Computer Architecture Letters, 2(1):2, January–December 2003.
[8] Intel Inc. Intel Itanium architecture software developer's manual, 2006.
[9] R. Azimi, M. Stumm, and R. W. Wisniewski. Online performance analysis by statistical sampling of microprocessor performance counters. In ICS '05: Proceedings of the 19th Inter. Conf. on Supercomputing, pages 101–110, 2005.
[10] Rich Wolski, Neil T. Spring, and Jim Hayes. The network weather service: a distributed resource performance forecasting service for metacomputing. Future Generation Computer Systems, 15(5–6):757–768, 1999.
[11] HP Labs. HP Open View self-healing services: Overview and technical introduction.
[12] David Breitgand, Maayan Goldstein, Ealan Henis, Onn Shehory, and Yaron Weinsberg. Panacea: towards a self-healing development framework. In Integrated Network Management, pages 169–178. IEEE, 2007.
[13] C. M. Garcia-Arellano, S. Lightstone, G. Lohman, V. Markl, and A. Storm. A self-managing relational database server: Examples from IBM's DB2 Universal Database for Linux, Unix and Windows. IEEE Transactions on Systems, Man and Cybernetics, 36(3):365–376, 2006.
[14] Andres Quiroz, Nathan Gnanasambandam, Manish Parashar, and Naveen Sharma. Robust clustering analysis for the management of self-monitoring distributed systems. Cluster Computing, 12(1):73–85, 2009.
[15] Salim Hariri, Guangzhi Qu, R. Modukuri, Huoping Chen, and Mazin S. Yousif. Quality-of-protection (QoP): an online monitoring and self-protection mechanism. IEEE Journal on Selected Areas in Communications, 23(10):1983–1993, 2005.
[16] Onn Shehory. Shadows: Self-healing complex software systems. In ASE Workshops, pages 71–76, 2008.
[17] S. S. Vadhiyar and J. J. Dongarra. Self adaptivity in grid computing. Concurr. Comput.: Pract. Exper., 17(2–4):235–257, 2005.
[18] J. Buisson, F. Andre, and J. L. Pazat. Dynamic adaptation for grid computing. Lecture Notes in Computer Science, Advances in Grid Computing (EGC), pages 538–547, 2005.
[19] P. Reinecke and K. Wolter. Adaptivity metric and performance for restart strategies in web services reliable messaging. In WOSP '08: Proceedings of the 7th International Workshop on Software and Performance, pages 201–212. ACM, 2008.
[20] John Strassner, Sung-Su Kim, and James Won-Ki Hong. The design of an autonomic communication element to manage future internet services. In Choong Seon Hong, Toshio Tonouchi, Yan Ma, and Chi-Shih Chao, editors, APNOMS, volume 5787 of Lecture Notes in Computer Science, pages 122–132. Springer, 2009.
[21] Armando Fox, Emre Kiciman, and David Patterson. Combining statistical monitoring and predictable recovery for self-management. In WOSS '04: Proceedings of the 1st ACM SIGSOFT workshop