0% found this document useful (0 votes)

5 views

Compilation_Time-Based_Analysis_using_Op

Uploaded by

Engr Mubashar Ch

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views

Compilation_Time-Based_Analysis_using_Op

Uploaded by

Engr Mubashar Ch

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/343007153

Compilation Time-Based Analysis using Optimized Iterative Techniques

Article · May 2020

CITATIONS READS
0 24

3 authors:

Ume Farwa Khurshid Asghar

Information Technology University of the Punjab University of Okara
1 PUBLICATION 0 CITATIONS 17 PUBLICATIONS 96 CITATIONS

SEE PROFILE SEE PROFILE

Engr Mubashar Ch
COMSATS University Islamabad
11 PUBLICATIONS 33 CITATIONS

SEE PROFILE

Some of the authors of this publication are also working on these related projects:

Component Based Software Testing View project

Video Forgery Detection View project

All content following this page was uploaded by Engr Mubashar Ch on 17 July 2020.

The user has requested enhancement of the downloaded file.

120 IJCSNS International Journal of Computer Science and Network Security, VOL.20 No.5, May 2020

Compilation Time-Based Analysis using Optimized Iterative

Techniques
Ume Farwa1, Khurshid Asghar2†, Mubbashar Saddique2
[email protected], [email protected] , [email protected]
1
Department of Computer Science, Information Technology University Lahore 54000, Pakistan
2
Department of Computer Science, University of Okara 56300, Pakistan

Abstract [2]. The third technique involves changing the loop order.
Compilation time has always been an important factor for The impact of loop order is an important count in execution
performance analysis of any system. This paper discusses the time. Basically, it reduces the jump calls between
various optimized iterative techniques to analyze the performance instruction execution which in return reduce the overall
of programs like loop unrolling, loop level parallelism or loop- execution time. It comes up with vertical and horizontal
carried dependence, and loop ordering. In first technique, loop is
unrolled up to scale 5 and then compared with rolled one to find
execution order of instructions.
out performance differences. In the second technique, it finds that 𝑓𝑜𝑟 (𝑖𝑛𝑡 𝑘 = 0; 𝑘 < 100; 𝑘 + +)
the loop carried dependence and inter-change the statements order {
and exposes the par-allelism. In the third one, the loop order is . . .
changed to reduce the jump calls during code execution. The
execution time of all three methods are compared, that is the proof 𝑓𝑜𝑟 (𝑖𝑛𝑡 𝑗 = 0; 𝑗 < 100; 𝑗 + +)
of high performance after implementing the optimized iterative {
techniques. The execution time may differ on different machines. . . .
The results are calculated on a core i5 machine with 2.7Ghz
processor under Linux kernel. }
Key words:
Compilation time; performance analysis; iterative techniques;
program optimization.
𝑓𝑜𝑟 (𝑖𝑛𝑡 𝑗 = 0; 𝑗 < 100; 𝑗 + +)
{
1. Introduction . . .

Optimization techniques are the helping hand of any system 𝑓𝑜𝑟 (𝑖𝑛𝑡 𝑘 = 0; 𝑘 < 100; 𝑘 + +)
developed to perform well. Different optimization {
techniques are developed to reduce the execution time/ . . .
memory usage by processor. Some of these are }
implemented on ma-chine level, while others are the source }
level implementation. This paper discusses the source level
optimization of iterative methods to reduce the execution This technique also counts for the high-performance system
time of a program. The first technique discusses the loop that needs the iterative solutions to be implemented. The
unrolling against loop rolling. If a large number of loop paper is further divided as follows. It discusses related work
iterations exist in the kernel pipeline, the loop iterations that is already gone through the experiment. Then the
could potentially be the critical path of the kernel pipeline. methodology is explained under all the techniques
UL can increase the pipeline throughput by allocating more experimented during this research. Then the results and
hardware resources to the loop [1]. It causes the reduction conclusion sum up the discussion about the experiment
of compilation time that indirectly adds up to performance. under discussion.
The loop-carried dependence is the iterative dependency
that exists in loops that causes a barrier to implement
parallelism. To implement the loop-level parallel-ism, we 2. Related Work
need to recognize the structure of loops, arrays and any
variable involved. If the findings show that the statements Loop unrolling is widely helpful in different types of
inside the loop are not circular dependent, then we can make applications. Some of its applications exist in image
alterations in statements to execute parallel and improve the processing where the image convolution takes help of loop
execution time. The results of this technique also count for unrolling while multiplying the matrix. It helps in creating
higher performance and reduction in compilation technique optimized algorithms. The performance after optimization

Manuscript received May 5, 2020

Manuscript revised May 20, 2020
IJCSNS International Journal of Computer Science and Network Security, VOL.20 No.5, May 2020 121

and parallelism speeds up over 2000x over baseline [3]. The 𝑥 = 𝑦;

work has been done on parallel frameworks that offer the
programming patterns which express the concurrency in 𝑥 = 𝑦;
applications that enables the usage of hardware in parallel
manners. This transforms the sequential instructions to }
parallel after identifying the map and pipeline the parallel
patterns. [5] The kernel level optimization is also done by }
finding the loop pattern for its different activities. The loop
order is an important factor that always impacts the Loop is unrolled up to scale 5. What we did is just replace
performance as well as compilation time [4]. the repetitive instruction with the same 5 instructions and
lower the number of iterations five times of total.

3. Proposed Methodology
3.2 Loop level parallelism
This approach comes up with three different methodologies
to find out the best possible optimization on source level.
All the three techniques are explained here with source code The following code is about eliminating the loop-carried
and results. Every function provokes from the 𝑚𝑎𝑖𝑛() dependence and exposing the loop-level parallelism [2].
function. While for comparing the results both scenarios are
discussed. 𝐿𝑜𝑜𝑝 − 𝑙𝑒𝑣𝑒𝑙 𝑑𝑒𝑝𝑒𝑛𝑑𝑒𝑛𝑐𝑒:

3.1 Loop unrolling 𝑣𝑜𝑖𝑑 𝐿𝐿𝑃1()

The following given code is about loop rolling and unrolling. {
𝐿𝑜𝑜𝑝 𝑟𝑜𝑙𝑙𝑒𝑑: 𝑖𝑛𝑡 𝐴[100], 𝐵[100 + 1], 𝐶[100], 𝐷[100];
𝑣𝑜𝑖𝑑 𝑙𝑜𝑜𝑝𝑅𝑜𝑙𝑙𝑒𝑑 ( ) 𝑓𝑜𝑟 (𝑖𝑛𝑡 𝑗 = 0; 𝑗 < 1000000; 𝑗 + +)
{
{
𝑖𝑛𝑡 𝑥 = 0, 𝑦 = 0;
𝑓𝑜𝑟 (𝑖𝑛𝑡 𝑖 = 0; 𝑖 < 100; 𝑖 + +)
𝑓𝑜𝑟 (𝑖𝑛𝑡 𝑖 = 0; 𝑖 < 100000000; 𝑖 + +)
{
{
𝐴[𝑖] = 𝐴[𝑖] + 𝐵[𝑖];
𝑥 = 𝑦;
} 𝐵[𝑖 + 1] = 𝐶[𝑖] + 𝐷[𝑖];
𝐿𝑜𝑜𝑝 𝑢𝑛𝑟𝑜𝑙𝑙𝑒𝑑: }
𝑣𝑜𝑖𝑑 𝑙𝑜𝑜𝑝𝑈𝑛𝑟𝑜𝑙𝑙𝑒𝑑() { }
𝑖𝑛𝑡 𝑥 = 0, 𝑦 = 0; }
𝑓𝑜𝑟 (𝑖𝑛𝑡 𝑖 = 0; 𝑖 < 20000000; 𝑖 + +) 𝐿𝑜𝑜𝑝 − 𝑙𝑒𝑣𝑒𝑙 𝑝𝑎𝑟𝑎𝑙𝑙𝑒𝑙𝑖𝑠𝑚
}
{ 𝑣𝑜𝑖𝑑 𝐿𝐿𝑃2()
{
𝑥 = 𝑦;
𝑖𝑛𝑡 𝐴[100], 𝐵[100 + 1], 𝐶[100], 𝐷[100];
𝑥 = 𝑦;
𝑓𝑜𝑟 (𝑖𝑛𝑡 𝑗 = 0; 𝑗 < 1000000; 𝑗 + +)
𝑥 = 𝑦;
{
122 IJCSNS International Journal of Computer Science and Network Security, VOL.20 No.5, May 2020

}
𝐴[0] = 𝐴[0] + 𝐵[0];
}
𝑓𝑜𝑟 (𝑖𝑛𝑡 𝑖 = 0; 𝑖 < 99; 𝑖 + +)
𝐿𝑜𝑜𝑝 𝑜𝑟𝑑𝑒𝑟 2:
{ 𝑣𝑜𝑖𝑑 𝑙𝑜𝑜𝑝2()

𝐵[𝑖 + 1] = 𝐶[𝑖] + 𝐷[𝑖]; {

𝐴[𝑖 + 1] = 𝐴[𝑖 + 1] + 𝐵[𝑖 + 1]; 𝑖𝑛𝑡 𝑥[100][100];

} 𝑓𝑜𝑟 (𝑖𝑛𝑡 𝑖 = 0; 𝑖 < 10000; 𝑖 + +)

} {

𝐵[100] = 𝐶[99] + 𝐷[99]; 𝑓𝑜𝑟 (𝑖𝑛𝑡 𝑘 = 0; 𝑘 < 100; 𝑘 + +)

} {

The above code is just finding out the dependency between 𝑓𝑜𝑟 (𝑖𝑛𝑡 𝑗 = 0; 𝑗 < 100; 𝑗 + +)
the instructions and finding if it can be removed or not.
{
3.3 Loop ordering
𝑥[𝑗][𝑘] = 𝑖;
The following code describes the loop ordering technique
that is also helpful in some scenarios to enhance the }
performance.
}
𝐿𝑜𝑜𝑝 𝑜𝑟𝑑𝑒𝑟 1:
}
𝑣𝑜𝑖𝑑 𝑙𝑜𝑜𝑝1()
}
{
It can be observed that here the most inner loop is
𝑖𝑛𝑡 𝑥[100][100]; interchanged with the second inner loop thus improves the
execution time.
𝑓𝑜𝑟 (𝑖𝑛𝑡 𝑖 = 0; 𝑖 < 10000; 𝑖 + +)
3.4 Execution process
{
The execution is the process where we can compare the
𝑓𝑜𝑟 (𝑖𝑛𝑡 𝑗 = 0; 𝑗 < 100; 𝑗 + +) results of its compilation time that indirectly is a sign of
performance evaluation of the system. To get the execution
{ time, we come up with the following strategies.

𝑓𝑜𝑟 (𝑖𝑛𝑡 𝑘 = 0; 𝑘 < 100; 𝑘 + +) // Get the system time before function start
𝑎𝑢𝑡𝑜 𝑠𝑡𝑎𝑟𝑡 = ℎ𝑖𝑔ℎ_𝑟𝑒𝑠𝑜𝑙𝑢𝑡𝑖𝑜𝑛_𝑐𝑙𝑜𝑐𝑘: : 𝑛𝑜𝑤();
{
// Invoke the function,
𝑥[𝑗][𝑘] = 𝑖;
𝑙𝑜𝑜𝑝1()
} //Get the system time after function stop execution

} 𝑎𝑢𝑡𝑜𝑠𝑡𝑜𝑝 = ℎ𝑖𝑔ℎ_𝑟𝑒𝑠𝑜𝑙𝑢𝑡𝑖𝑜𝑛_𝑐𝑙𝑜𝑐𝑘: : 𝑛𝑜𝑤();

// Get the total time duration.
IJCSNS International Journal of Computer Science and Network Security, VOL.20 No.5, May 2020 123

𝑎𝑢𝑡𝑜 𝑑𝑢𝑟𝑎𝑡𝑖𝑜𝑛 = 𝑑𝑢𝑟𝑎𝑡𝑖𝑜𝑛_𝑐𝑎𝑠𝑡 < 𝑚𝑖𝑐𝑟𝑜𝑠𝑒𝑐𝑜𝑛𝑑𝑠 > [5] D. del Rio Astorga, M. F. Dolz, L. M. Sánchez, J. D. García,
(𝑠𝑡𝑜𝑝 – 𝑠𝑡𝑎𝑟t); M. Danelutto, and M. Torquati, "Finding parallel patterns
through static analysis in C++ applications," The
International Journal of High Performance Computing
4. Results Applications, vol. 32, pp. 779-788, 2018.
[6] (2020, 15-Jan-2020). C++ Debugger. Available:
The results are taken from two different sources after https://www.onlinegdb.com/
compilation. The first source is the online compiler [6]. The
second source is the Linux OS. Table1 shows the results.
Ume Farwa completed BS (Information
Technology) from University of Education,
LR: Loop rolling Lahore in 2017. Presently, she is MPhil
Scholar at Information Technology
LUR: Loop unrolling University Lahore, Pakistan. Her research
interest is including HCI, machine learning,
LLP: Loop-level parallelism data mining, networks and programming.

Table 1: Margin specifications

LR() LUR() LLP1() LLP2() Loop1() Loop2()

1.05 0.3001 4.3828 1.8878 1.4375 0.3556 Khurshid Asghar is working as Associate
Professor at Department of Computer
0.2126 0.0421 3.3843 2.903 2.5339 2.3422 Science University of Okara. He earned
PhD degree in the field of image forensics
from COMSATS University Islamabad,
Pakistan. He also worked as research
associate at Cardiff School of Computer
5. Conclusion Science and Informatics, Cardiff
University, UK. His current research
The techniques for optimizing the iterative methods are interest includes Image Processing, Image
useful scenario to scenario. For example, some techniques and Video Forensics, Machine Learning, Deep Learning, Network
might not be as helpful as expected. On the other hand, there Security, Biometrics, Medical Imaging Brain Signals, Geometric
Modeling and Computer programming.
is machine to machine performance variation. So, the above
results may vary if the same code is run on some other
machine with different environments. Overall these Mubbashar Siddique is working as
techniques are helpful in various real time applications as Lecturer at Department of Computer
well as for the OS itself, as stated above. Image processing Sciences. He completed BSc
takes it into a great account to use the loops for matrix (Telecommunication Engineering) from
manipulation. These techniques are still on the way to Institute of Engineering & Technology,
improve time to time for achieving high performance on Lahore Campus, and Pakistan. He got merit
different systems. scholarship from COMSATS University
Islamabad (Abbottabad Campus), Pakistan
where he completed his MS computer
References science in 2010. Presently, he is a PhD
[1] Z. Wang, B. He, W. Zhang, and S. Jiang, "A performance Scholar at COMSATS University Islamabad, Pakistan. Mr.
analysis framework for optimizing OpenCL applications on Siddique also worked as a research associate at Department of
FPGAs," in 2016 IEEE International Symposium on High Cyber Defense Graduate School of Information Security, Korea
Performance Computer Architecture (HPCA), 2016, pp. 114- University, South Korea. Presently, he is working in video and
125. image forensic domain. Furthermore, his research interest is in the
[2] J. L. Hennessy and D. A. Patterson, Computer architecture: a area of image/video processing, computer vision, machine
quantitative approach vol. 6: Elsevier, 2019. learning, data mining and networks.
[3] A. Tousimojarad, W. Vanderbauwhede, and W. P. Cockshott,
"2D Image Convolution using Three Parallel Programming
Models on the Xeon Phi," arXiv preprint arXiv:1711.09791,
2017.
[4] T. M. Low, F. D. Igual, T. M. Smith, and E. S. Quintana-Orti,
"Analytical modeling is enough for high-performance BLIS,"
ACM Transactions on Mathematical Software (TOMS), vol.
43, pp. 1-18, 2016.

View publication stats

ISTQB Advanced Level Technical Test Analyst- Exam Insights: Q&A with Explanations
From Everand
ISTQB Advanced Level Technical Test Analyst- Exam Insights: Q&A with Explanations
SUJAN
No ratings yet
Computer-Controlled Systems: Theory and Design, Third Edition
From Everand
Computer-Controlled Systems: Theory and Design, Third Edition
Karl J Åström
3/5 (1)
Google Cloud Platform for Data Engineering: From Beginner to Data Engineer using Google Cloud Platform
From Everand
Google Cloud Platform for Data Engineering: From Beginner to Data Engineer using Google Cloud Platform
alasdair gilchrist
5/5 (1)
Sample Aspect and Impact Identification Procedure
50% (4)
Sample Aspect and Impact Identification Procedure
8 pages
Lexicon of Programming Terminology: Lexicon of Tech and Business, #17
From Everand
Lexicon of Programming Terminology: Lexicon of Tech and Business, #17
Mustafa Al-Dori
5/5 (1)
Graph Layout Support for Model-Driven Engineering
From Everand
Graph Layout Support for Model-Driven Engineering
Miro Spönemann
No ratings yet
Advanced Backend Code Optimization
From Everand
Advanced Backend Code Optimization
Sid Touati
No ratings yet
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
From Everand
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
Marcus Richards
No ratings yet
Computer Science, Career and Job
From Everand
Computer Science, Career and Job
Ramkrishna Ghosh
No ratings yet
Angular Performance Optimization: Everything you need to know
From Everand
Angular Performance Optimization: Everything you need to know
Abdelfattah Ragab
No ratings yet
Production System: Fundamentals and Applications
From Everand
Production System: Fundamentals and Applications
Fouad Sabry
No ratings yet
Mastering Data Structures and Algorithms in Python & Java
From Everand
Mastering Data Structures and Algorithms in Python & Java
Sachin Naha
No ratings yet
Python-Based Evolutionary Algorithms for Engineers
From Everand
Python-Based Evolutionary Algorithms for Engineers
Pankaj Jayaraman
No ratings yet
Feedback Control Theory
From Everand
Feedback Control Theory
Bruce Francis
5/5 (1)
Swift Programming Simplified: A Practical Guide with Examples
From Everand
Swift Programming Simplified: A Practical Guide with Examples
William E. Clark
No ratings yet
Java Streams Explained: A Practical Guide with Examples
From Everand
Java Streams Explained: A Practical Guide with Examples
William E. Clark
No ratings yet
Blockchain Foundation Courseware - English
From Everand
Blockchain Foundation Courseware - English
Eppo Luppes
No ratings yet
Model-Driven Online Capacity Management for Component-Based Software Systems
From Everand
Model-Driven Online Capacity Management for Component-Based Software Systems
André van Hoorn
No ratings yet
Key Principles of IT Architecture
From Everand
Key Principles of IT Architecture
Nelson Ambrose
No ratings yet
Data Structures and Algorithms with Python
From Everand
Data Structures and Algorithms with Python
Aadinath Pothuvaal
No ratings yet
50 Recipes for Programming Node.js
From Everand
50 Recipes for Programming Node.js
Jamie Munro
3/5 (4)
Learning PyTorch 2.0, Second Edition: Utilize PyTorch 2.3 and CUDA 12 to experiment neural networks and deep learning models
From Everand
Learning PyTorch 2.0, Second Edition: Utilize PyTorch 2.3 and CUDA 12 to experiment neural networks and deep learning models
Matthew Rosch
No ratings yet
Learning PyTorch 2.0, Second Edition
From Everand
Learning PyTorch 2.0, Second Edition
Matthew Rosch
No ratings yet
Mastering OpenTelemetry: Building Scalable Observability Systems for Cloud-Native Applications
From Everand
Mastering OpenTelemetry: Building Scalable Observability Systems for Cloud-Native Applications
Robert Johnson
No ratings yet
Optimization of Computer Programs in C
No ratings yet
Optimization of Computer Programs in C
2 pages
Live Trace Visualization for System and Program Comprehension in Large Software Landscapes
From Everand
Live Trace Visualization for System and Program Comprehension in Large Software Landscapes
Florian Fittkau
No ratings yet
Basics of Python Programming: A Quick Guide for Beginners
From Everand
Basics of Python Programming: A Quick Guide for Beginners
Krishna Kumar Mohbey
No ratings yet
Optimal Code Compiling in C: Nitika Gupta Nistha Seth Prabhat Verma
No ratings yet
Optimal Code Compiling in C: Nitika Gupta Nistha Seth Prabhat Verma
8 pages
Automatic Control: Experimental Approaches
From Everand
Automatic Control: Experimental Approaches
Subodh Keshari
No ratings yet
JAVA PROGRAMMING FOR BEGINNERS: Master Java Fundamentals and Build Your Own Applications (2023 Crash Course)
From Everand
JAVA PROGRAMMING FOR BEGINNERS: Master Java Fundamentals and Build Your Own Applications (2023 Crash Course)
Theo Houle
No ratings yet
Linear and Nonlinear Programming Essentials
From Everand
Linear and Nonlinear Programming Essentials
Tanushri Kaniyar
No ratings yet
Mivar NETs and logical inference with the linear complexity
From Everand
Mivar NETs and logical inference with the linear complexity
Varlamov, Oleg O.
No ratings yet
DESIGN ALGORITHMS TO SOLVE COMMON PROBLEMS: Mastering Algorithm Design for Practical Solutions (2024 Guide)
From Everand
DESIGN ALGORITHMS TO SOLVE COMMON PROBLEMS: Mastering Algorithm Design for Practical Solutions (2024 Guide)
ARCHER PAUL
No ratings yet
Oracle 11g Streams Implementer's Guide
From Everand
Oracle 11g Streams Implementer's Guide
Ann L. R. McKinnell
No ratings yet
IGNOU PGDCA MCS 206 Object Oriented Programming using Java Previous Years solved Papers
From Everand
IGNOU PGDCA MCS 206 Object Oriented Programming using Java Previous Years solved Papers
Manish Soni
No ratings yet
Advanced Dynamic-System Simulation: Model Replication and Monte Carlo Studies
From Everand
Advanced Dynamic-System Simulation: Model Replication and Monte Carlo Studies
Granino A. Korn
No ratings yet
Code Optimization Techniques
No ratings yet
Code Optimization Techniques
27 pages
Microsoft AZ-400: Designing and Implementing Microsoft DevOps Solutions - Certification Exam Prep
From Everand
Microsoft AZ-400: Designing and Implementing Microsoft DevOps Solutions - Certification Exam Prep
Steve Brown
No ratings yet
Hugging Face Transformers Essentials: From Fine-Tuning to Deployment
From Everand
Hugging Face Transformers Essentials: From Fine-Tuning to Deployment
Robert Johnson
No ratings yet
10
No ratings yet
10
76 pages
A Practical Approach To Optimize Code Implementation
No ratings yet
A Practical Approach To Optimize Code Implementation
11 pages
PLC Programming & Implementation: An Introduction to PLC Programming Methods and Applications
From Everand
PLC Programming & Implementation: An Introduction to PLC Programming Methods and Applications
Ojula Technology Innovations
No ratings yet
SCCharts - Language and Interactive Incremental Compilation
From Everand
SCCharts - Language and Interactive Incremental Compilation
Christian Motika
No ratings yet
Lecture 04
No ratings yet
Lecture 04
100 pages
Algorithms Made Simple: Understanding the Building Blocks of Software
From Everand
Algorithms Made Simple: Understanding the Building Blocks of Software
William E. Clark
No ratings yet
Visualised Systems Engineering on Railway Projects
From Everand
Visualised Systems Engineering on Railway Projects
Jong-Pil Nam
No ratings yet
Ian Talks Python A-Z
From Everand
Ian Talks Python A-Z
Ian Eress
No ratings yet
Concurrency and Multithreading in C: POSIX Threads and Synchronization
From Everand
Concurrency and Multithreading in C: POSIX Threads and Synchronization
Larry Jones
No ratings yet
Shell Scripting Step by Step: A Practical Guide with Examples
From Everand
Shell Scripting Step by Step: A Practical Guide with Examples
William E. Clark
No ratings yet
C# Package Mastery: 100 Essentials in 1 Hour - 2024 Edition
From Everand
C# Package Mastery: 100 Essentials in 1 Hour - 2024 Edition
Tenko
No ratings yet
Introduction to Quantum Computing & Machine Learning Technologies: 1, #1
From Everand
Introduction to Quantum Computing & Machine Learning Technologies: 1, #1
M. Sreedevi
No ratings yet
Azure Patterns for Real-World Apps: Resilient by Design
From Everand
Azure Patterns for Real-World Apps: Resilient by Design
Kameron Hussain
No ratings yet
Optimizing Machine Learning Pipelines: Advanced Techniques with TensorFlow and Kubeflow
From Everand
Optimizing Machine Learning Pipelines: Advanced Techniques with TensorFlow and Kubeflow
Adam Jones
No ratings yet
(PEY97) Implementing Functional Languages
No ratings yet
(PEY97) Implementing Functional Languages
296 pages
Data Mining Models: Techniques and Applications
From Everand
Data Mining Models: Techniques and Applications
Ravi Deshpande
No ratings yet
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
From Everand
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
César Pérez López
No ratings yet
Code Optimization Sept. 25, 2003: "The Course That Gives CMU Its Zip!"
No ratings yet
Code Optimization Sept. 25, 2003: "The Course That Gives CMU Its Zip!"
57 pages
Dataflow and Reactive Programming Systems
From Everand
Dataflow and Reactive Programming Systems
Matt Carkci
No ratings yet
Fundamentals of Control Engineering
From Everand
Fundamentals of Control Engineering
Aniruddh Mohan
No ratings yet
Introduction to Algorithms
From Everand
Introduction to Algorithms
S VASIST
No ratings yet
Learn Design and Analysis of Algorithms in 24 Hours
From Everand
Learn Design and Analysis of Algorithms in 24 Hours
Alex Nordeen
No ratings yet
3. Binary Oriented Feature Selection for Valid Product Derivation in Sofware-Corresponding
No ratings yet
3. Binary Oriented Feature Selection for Valid Product Derivation in Sofware-Corresponding
18 pages
14. Edge–texture feature‑based image forgery detection
No ratings yet
14. Edge–texture feature‑based image forgery detection
20 pages
12Digital Video Tampering Detection and Localization Review, Representations, Challenges and Algorithm
No ratings yet
12Digital Video Tampering Detection and Localization Review, Representations, Challenges and Algorithm
38 pages
Enhance stegnoanalysis for color images using curvlet features and svm
No ratings yet
Enhance stegnoanalysis for color images using curvlet features and svm
18 pages
B1PLUS U1 Accessible Reading
No ratings yet
B1PLUS U1 Accessible Reading
2 pages
Multiply and Divide Integers - Revised
No ratings yet
Multiply and Divide Integers - Revised
17 pages
Transmission Expansion Planning Springer Nature LaTeX Template
No ratings yet
Transmission Expansion Planning Springer Nature LaTeX Template
15 pages
Nirs Xds Rapidcontent Analyzer: Fast, Non-Destructive Analyses of Solid and Liquid Substances
No ratings yet
Nirs Xds Rapidcontent Analyzer: Fast, Non-Destructive Analyses of Solid and Liquid Substances
4 pages
Mechanical Vibrations 5th Edition Rao Solutions Manual - Full Version Is Now Available For Download
100% (4)
Mechanical Vibrations 5th Edition Rao Solutions Manual - Full Version Is Now Available For Download
34 pages
Enhancing The Pupils' Problem Solving Skills Involving Time Through Mobile Learning Approach (Mlearning)
No ratings yet
Enhancing The Pupils' Problem Solving Skills Involving Time Through Mobile Learning Approach (Mlearning)
6 pages
A Short History of Urban Planning
No ratings yet
A Short History of Urban Planning
14 pages
Egress Turnover Checklist MAJESTIC HAM
No ratings yet
Egress Turnover Checklist MAJESTIC HAM
1 page
Description and Application: 80%ar - 20%CO / 100%CO EN ISO 17633-A T 19 9 L P C1/M21 1 AWS A5.22 E308LT1-1/4 EN 1.4316
No ratings yet
Description and Application: 80%ar - 20%CO / 100%CO EN ISO 17633-A T 19 9 L P C1/M21 1 AWS A5.22 E308LT1-1/4 EN 1.4316
1 page
Guide 1812 01 Start
No ratings yet
Guide 1812 01 Start
1 page
Extracted Pages From ASME II PART A1 (2019) - SA210
No ratings yet
Extracted Pages From ASME II PART A1 (2019) - SA210
6 pages
EET306 - M3 Ktunotes - in
No ratings yet
EET306 - M3 Ktunotes - in
19 pages
Maths Worksheet Class - V
No ratings yet
Maths Worksheet Class - V
7 pages
Test Bank Ngan Hang Thuong Mai 2
No ratings yet
Test Bank Ngan Hang Thuong Mai 2
110 pages
96 Commom Idioms
No ratings yet
96 Commom Idioms
45 pages
A Review of User-Friendly Freely-Available Statistical Analysis Software For Medical Researchers and Biostatisticians
No ratings yet
A Review of User-Friendly Freely-Available Statistical Analysis Software For Medical Researchers and Biostatisticians
13 pages
Test English 8
No ratings yet
Test English 8
5 pages
Q4 Take-Home Activity 6
No ratings yet
Q4 Take-Home Activity 6
3 pages
Log Cat 1699712504331
No ratings yet
Log Cat 1699712504331
14 pages
Unitedaluminum.com-1100 Aluminum Alloy Alloy Data Sheet and Properties
No ratings yet
Unitedaluminum.com-1100 Aluminum Alloy Alloy Data Sheet and Properties
2 pages
Arrange and Display Pharmaceutical Products
No ratings yet
Arrange and Display Pharmaceutical Products
28 pages
Process Flow
No ratings yet
Process Flow
3 pages
Anatomy and Physiology of The Brain
No ratings yet
Anatomy and Physiology of The Brain
4 pages
pdf24_merged (2)
No ratings yet
pdf24_merged (2)
13 pages
Alexa Vazquez
No ratings yet
Alexa Vazquez
2 pages
PRAKASINA by Professor Puran Singh
No ratings yet
PRAKASINA by Professor Puran Singh
194 pages
Principle 9 12
No ratings yet
Principle 9 12
18 pages
Chess History
No ratings yet
Chess History
4 pages
What Are Activities To Convert Raw Data To Make Information in Information Systems
33% (3)
What Are Activities To Convert Raw Data To Make Information in Information Systems
4 pages

Compilation_Time-Based_Analysis_using_Op

Uploaded by

Compilation_Time-Based_Analysis_using_Op

Uploaded by

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

Compilation Time-Based Analysis using Optimized Iterative Techniques

Article · May 2020

Ume Farwa Khurshid Asghar

SEE PROFILE SEE PROFILE

Component Based Software Testing View project

Video Forgery Detection View project

The user has requested enhancement of the downloaded file.

Compilation Time-Based Analysis using Optimized Iterative

Manuscript received May 5, 2020

and parallelism speeds up over 2000x over baseline [3]. The 𝑥 = 𝑦;

3.1 Loop unrolling 𝑣𝑜𝑖𝑑 𝐿𝐿𝑃1()

𝐵[𝑖 + 1] = 𝐶[𝑖] + 𝐷[𝑖]; {

𝐴[𝑖 + 1] = 𝐴[𝑖 + 1] + 𝐵[𝑖 + 1]; 𝑖𝑛𝑡 𝑥[100][100];

} 𝑓𝑜𝑟 (𝑖𝑛𝑡 𝑖 = 0; 𝑖 < 10000; 𝑖 + +)

𝐵[100] = 𝐶[99] + 𝐷[99]; 𝑓𝑜𝑟 (𝑖𝑛𝑡 𝑘 = 0; 𝑘 < 100; 𝑘 + +)

} 𝑎𝑢𝑡𝑜𝑠𝑡𝑜𝑝 = ℎ𝑖𝑔ℎ_𝑟𝑒𝑠𝑜𝑙𝑢𝑡𝑖𝑜𝑛_𝑐𝑙𝑜𝑐𝑘: : 𝑛𝑜𝑤();

Table 1: Margin specifications

View publication stats

You might also like