
XAIR: Explainable Automated Program Repair Using Deep Learning and Explainable AI Techniques

Explainable AI (XAI): Explained


Published in: 2023 IEEE Open Conference of Electrical, Electronic and
Information Sciences (eStream)

The abstract provides a detailed summary of the paper's content, including an overview of
XAI, specific techniques like LIME and SHAP, and their applications across various domains
such as healthcare, finance, and law.
The paper addresses a current and pressing issue in AI: the need for explainability in complex models, which is particularly relevant given the growing use of AI in high-stakes domains.
The inclusion of ethical and legal implications highlights the broader impact of XAI and the
necessity for responsible AI deployment.

The use of technical terms like "Local Interpretable Model-Agnostic Explanations" (LIME)
and "SHapley Additive exPlanations" (SHAP) might be difficult for readers unfamiliar with
XAI.
The abstract mentions that few review papers are available but does not clearly state what
new insights or unique contributions this paper offers compared to existing literature.
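The SHAP technique the abstract surveys attributes a model's output to its input features via Shapley values. A minimal sketch of the underlying idea, computing exact Shapley values by brute-force coalition enumeration for a toy scoring function (all names here are illustrative and this is not the `shap` library's API):

```python
from itertools import combinations
from math import factorial

def shapley_values(predict, features, baseline):
    """Exact Shapley values by enumerating all feature coalitions.

    predict  -- function mapping a feature dict to a score
    features -- dict of feature name -> observed value
    baseline -- dict of feature name -> reference ("missing") value
    """
    names = list(features)
    n = len(names)
    phi = {}
    for i in names:
        others = [f for f in names if f != i]
        total = 0.0
        for size in range(n):
            for coalition in combinations(others, size):
                # Shapley weight of a coalition of this size
                w = factorial(size) * factorial(n - size - 1) / factorial(n)
                with_i = dict(baseline)
                for f in coalition:
                    with_i[f] = features[f]
                without_i = dict(with_i)
                with_i[i] = features[i]
                total += w * (predict(with_i) - predict(without_i))
        phi[i] = total
    return phi

# Toy linear model: for a linear model, Shapley values recover each
# feature's contribution relative to the baseline, and they sum to
# f(x) - f(baseline) (the additivity property SHAP guarantees).
model = lambda x: 2.0 * x["age"] + 3.0 * x["income"]
vals = shapley_values(model,
                      features={"age": 1.0, "income": 2.0},
                      baseline={"age": 0.0, "income": 0.0})
```

The exponential cost of enumerating coalitions is exactly why the practical SHAP tooling relies on sampling and model-specific approximations.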

Current Trends, Challenges and Techniques in XAI Field; A Tertiary Study of XAI Research

Published in: 2024 47th MIPRO ICT and Electronics Convention (MIPRO)

Increased Adoption Across Industries: XAI is being integrated into various high-stakes
domains such as healthcare, finance, law, and autonomous vehicles. The need for
transparency and trust in AI decision-making is driving this trend.
Regulatory Push: There is a growing emphasis on AI explainability due to regulatory
requirements. Governments and organizations are pushing for AI systems that can be
audited and understood by non-experts to ensure fairness and accountability.
Trade-off Between Performance and Explainability: High-performing models like deep
neural networks are often complex and less interpretable. Striking a balance between model
accuracy and interpretability remains a major challenge.
Scalability: Many XAI methods are computationally intensive and may not scale well with
large datasets or complex models, limiting their practical applicability.

Bayesian XAI Methods Towards a Robustness-Centric Approach to Deep Learning: An ABIDE I Study
The integration of Bayesian Neural Networks (BNNs) with Explainable AI (XAI) methods
represents a novel approach in the diagnosis of Autism Spectrum Disorder (ASD),
showcasing the potential for advancements in both model interpretability and reliability.
By using Layerwise Relevance Propagation (LRP), the study emphasizes the importance of
understanding model predictions, which is crucial in high-stakes fields like healthcare.
The combination of BNNs and LRP provides a robustness-centric deep learning approach,
enhancing the reliability of the model's predictions by quantifying epistemic uncertainty.

The use of BNNs and the repeated inference required for uncertainty estimation can be computationally intensive, which may limit the practical application of this approach in real-time scenarios.
The reliance on the ABIDE dataset, while comprehensive, may not fully capture the diversity of ASD presentations, potentially limiting the model's applicability to broader populations.
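The repeated-inference cost noted above comes from how BNNs estimate epistemic uncertainty: the network is evaluated many times with weights drawn from a posterior distribution, and the spread of the predictions is the uncertainty. A stripped-down sketch of that idea using a single random weight (purely illustrative; not the study's architecture):

```python
import random
import statistics

def stochastic_predict(x, seed):
    """Stand-in for one forward pass through a Bayesian network:
    the weight is drawn from a distribution rather than fixed."""
    rng = random.Random(seed)
    weight = rng.gauss(mu=2.0, sigma=0.1)  # weight posterior ~ N(2, 0.1^2)
    return weight * x

def predict_with_uncertainty(x, n_samples=500):
    # Repeated inference: one forward pass per posterior sample
    samples = [stochastic_predict(x, seed) for seed in range(n_samples)]
    mean = statistics.fmean(samples)
    # Spread of the sampled predictions ~ epistemic uncertainty
    return mean, statistics.stdev(samples)

mean, uncertainty = predict_with_uncertainty(3.0)
```

The `n_samples` forward passes per prediction are precisely what makes this approach hard to deploy in real-time settings.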

Automated Program Repair for Introductory Programming Assignments
Novel approach: Proposes CEMR, a new automated program repair tool that combines learning from existing code modifications with a large language model (CodeBERT).
Comprehensive evaluation: Tested on both an open online judge platform (LuoGu) and real classroom datasets, comparing against multiple baselines.
Strong performance: Achieves higher repair rates than baseline methods, especially for semantic and logical errors.
Efficiency: Repairs incorrect programs in about half the time required by AlphaRepair.

Limited scope: Only tested on introductory Python programming problems; may not generalize to more complex programs or other languages.
Inability to fix syntactical errors: Unlike AlphaRepair, CEMR cannot repair programs with
syntax errors due to its reliance on ASTs.
Dependence on existing solutions: As a data-driven approach, CEMR may struggle with
novel or uncommon problem-solving approaches not present in the training data.
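The AST limitation noted above is easy to demonstrate: any approach that starts from a parsed tree has no input to work with when the program does not parse. For example, with Python's built-in `ast` module:

```python
import ast

def can_build_ast(source):
    """An AST-based repair pipeline can only start if parsing succeeds."""
    try:
        ast.parse(source)
        return True
    except SyntaxError:
        return False

# A semantic bug (wrong operator) still parses, so AST-based tools
# like CEMR can analyze and transform it...
assert can_build_ast("def add(a, b):\n    return a - b\n")

# ...but a syntax error (missing colon) yields no tree at all, which
# is why such tools cannot repair it, unlike AlphaRepair.
assert not can_build_ast("def add(a, b)\n    return a + b\n")
```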

Towards JavaScript program repair with Generative Pre-trained Transformer (GPT-2)
Novel use of GPT-2 model for automated program repair (APR), which had not been done
before according to the authors.
Focus on JavaScript, which is an extremely popular programming language but
underrepresented in APR research.
Able to generate syntactically correct source code in most attempts.
Achieved an overall accuracy of up to 17.25% in generating correct fixes.
Created and used a large dataset of 16,863 JavaScript code snippets for training.

Failed to learn good bug-fixes in some cases, indicating inconsistent performance.


17.25% accuracy, while promising, still leaves significant room for improvement.
Limited to fixing single-line bugs only, not more complex multi-line issues.
Approach may be computationally intensive, given the size of the GPT-2 model.
Potential for data leakage or overfitting, as the model needs to be trained on project-specific
data to accurately predict variable names.
Lack of comparison to state-of-the-art APR techniques, making it difficult to assess relative
performance.
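At its core, the pipeline described here is generate-and-validate: the model proposes candidate replacements for the single buggy line, and a candidate is kept only if it is syntactically valid and plausible. A sketch of that loop (checked for Python rather than JavaScript, with a hard-coded candidate list standing in for GPT-2 sampling):

```python
import ast

def repair_single_line(lines, bug_index, candidates, test):
    """Try each model-proposed candidate for the buggy line; return the
    first patched program that both parses and passes the test suite."""
    for candidate in candidates:
        patched = lines[:bug_index] + [candidate] + lines[bug_index + 1:]
        source = "\n".join(patched)
        try:
            ast.parse(source)          # syntactic validity check
        except SyntaxError:
            continue
        if test(source):               # plausibility check via tests
            return source
    return None

buggy = ["def double(x):", "    return x + x + x"]   # one term too many
proposals = ["    return x * 3", "    return x * 2"]  # stand-in for model samples

def passes(source):
    scope = {}
    exec(source, scope)
    return scope["double"](5) == 10

fixed = repair_single_line(buggy, 1, proposals, passes)
```

The single-line restriction in the loop above mirrors the paper's stated limitation: multi-line bugs would require generating and validating coordinated edits.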

An Evaluation of the Effectiveness of OpenAI's ChatGPT for Automated Python Program Bug Fixing using QuixBugs
Uses a state-of-the-art language model (GPT-3.5) for automated bug fixing in Python code.

Evaluates the effectiveness using an established benchmark (QuixBugs), allowing for comparison with other methods.

Demonstrates high accuracy, successfully fixing 30 out of 40 bugs from the QuixBugs
benchmark.
Outperforms other tools, such as standard program-repair techniques and Codex, in bug-fixing capability.

Highlights the potential of ChatGPT as a powerful tool for enhancing code quality and
reducing manual bug-fixing efforts.

Limited scope - only tested on 40 Python bugs from a single benchmark suite.

Lack of details on the specific types of bugs that were fixed or not fixed.

No mention of the time or computational resources required for the bug-fixing process.

Doesn't address potential limitations or challenges of using ChatGPT for this task.

Doesn't discuss how the approach might generalize to more complex or real-world coding
scenarios beyond the benchmark.

No information on false positives or potential introduction of new bugs during the fixing
process.
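A QuixBugs-style evaluation reduces to a simple loop: for each buggy program, ask the model for a fix and check the result against the benchmark's tests. A sketch with the ChatGPT call stubbed out (`ask_model` is a placeholder; the actual study queried the chat interface):

```python
def ask_model(buggy_source):
    """Placeholder for a ChatGPT request such as
    'Does this program have a bug? How to fix it?'.
    Here it just applies one hard-coded edit."""
    return buggy_source.replace("n - 2", "n - 1")

def evaluate(benchmark):
    """benchmark: list of (buggy_source, test_fn) pairs."""
    fixed = 0
    for buggy_source, test_fn in benchmark:
        candidate = ask_model(buggy_source)
        try:
            scope = {}
            exec(candidate, scope)
            if test_fn(scope):
                fixed += 1
        except Exception:
            pass  # candidate crashed or failed to compile
    return fixed, len(benchmark)

# One toy entry in QuixBugs style: factorial with a wrong recursive step.
buggy = "def factorial(n):\n    return 1 if n <= 1 else n * factorial(n - 2)\n"
check = lambda scope: scope["factorial"](5) == 120
fixed, total = evaluate([(buggy, check)])
```

A harness like this also makes it straightforward to log which bug classes fail and whether a "fix" introduces new failures, two of the gaps noted above.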

DeepRepair: Style-Guided Repairing for Deep Neural Networks in the Real-World Operational Environment
Addresses an important real-world problem - repairing deployed deep neural networks
(DNNs) that fail due to mismatches between training and operational environments.

Proposes a novel approach using style-guided data augmentation to repair DNNs.

Introduces clustering-based failure data generation to improve the effectiveness of the augmentation.

Conducts large-scale evaluation across 15 different degradation factors/failure patterns.

Demonstrates significant accuracy improvements (62.88% for CNNs, 39.02% for RNNs on
average) compared to state-of-the-art methods.

Shows the repaired DNNs maintain or even improve accuracy on clean data.

Limited to image classification tasks - may not generalize to other types of DNN applications.

Requires collecting some failure examples from the operational environment - may be
challenging in some real-world scenarios.

Focuses only on naturally occurring degradations/noise - does not address adversarial attacks or malicious perturbations.

Evaluation is limited to CIFAR-10 dataset - more diverse datasets could strengthen the
results.

Does not provide theoretical guarantees on the effectiveness or generalizability of the approach.
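Style-guided augmentation of the kind DeepRepair uses transfers the low-level statistics of collected failure data onto clean training data. The essential operation can be sketched on raw values (real style transfer works on deep feature maps; this shows only the AdaIN-style normalize-and-rescale step, with made-up pixel values):

```python
import statistics

def transfer_style(content, style):
    """Renormalize content values to the mean/std of the style values,
    the core step of AdaIN-style statistics transfer."""
    c_mean, c_std = statistics.fmean(content), statistics.pstdev(content)
    s_mean, s_std = statistics.fmean(style), statistics.pstdev(style)
    return [s_mean + s_std * (x - c_mean) / c_std for x in content]

clean_pixels = [0.2, 0.4, 0.6, 0.8]    # clean training sample
foggy_pixels = [0.55, 0.6, 0.65, 0.7]  # failure sample from deployment
augmented = transfer_style(clean_pixels, foggy_pixels)
```

Retraining on such augmented samples is what lets the repaired model handle the operational degradation while its content (and hence clean-data accuracy) is preserved.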

Generative AI for Self-Healing Systems

Clear problem statement: The abstract effectively identifies the risk of component failures in
large-scale system production and the current reliance on human experts for system
monitoring.

Innovative approach: It proposes integrating generative AI technology into self-healing systems, which is a novel and potentially impactful solution.

Specific focus areas: The abstract outlines clear areas of application for generative AI,
including anomaly detection, code generation, debugging, and auto-generative reporting.

Practical application: The study aims to optimize system functionality and efficiency at scale,
which has real-world implications for large-scale systems.

Comprehensive solution: The proposed approach covers multiple aspects of system maintenance, from detection to repair and reporting.

Lack of quantitative goals: The abstract doesn't provide specific, measurable objectives for
improvement over current methods.

Limited discussion of challenges: It doesn't address potential challenges or limitations of integrating generative AI into self-healing systems.
Absence of methodology details: The abstract doesn't provide an overview of the research
methodology or experimental setup.

No mention of comparative analysis: There's no indication of how the proposed solution compares to existing self-healing systems or other AI-based approaches.

Vague on implementation details: While it mentions using GPT-4 for code completion, it
doesn't provide specifics on how other aspects of the generative AI integration will be
implemented.
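The detect-then-repair loop the abstract envisions can be made concrete with the simplest possible anomaly detector, a z-score threshold on a monitored metric, triggering a stubbed healing action (the threshold, metric, and action names here are all illustrative):

```python
import statistics

def detect_anomalies(history, threshold=2.0):
    """Flag readings more than `threshold` standard deviations
    from the mean of the monitoring window."""
    mean = statistics.fmean(history)
    std = statistics.pstdev(history)
    return [i for i, v in enumerate(history)
            if std > 0 and abs(v - mean) / std > threshold]

def self_heal(latencies_ms, repair_action):
    """Minimal monitor-detect-repair loop: on any anomaly,
    invoke the repair action (e.g. restart a component)."""
    anomalies = detect_anomalies(latencies_ms)
    actions = [repair_action(latencies_ms[i]) for i in anomalies]
    return anomalies, actions

readings = [12.0, 11.5, 12.2, 11.8, 95.0, 12.1]  # one latency spike
anomalies, actions = self_heal(readings, lambda v: f"restart: spike {v}ms")
```

In the abstract's vision, the hard-coded repair action would instead be generated (code, patches, or reports) by a model such as GPT-4.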

Examining Zero-Shot Vulnerability Repair with Large Language Models

Novel application of large language models (LLMs) to security bug repair, exploring an
important and timely research question.

Comprehensive evaluation using multiple types of scenarios: synthetic, hand-crafted, and real-world security bugs.

Examination of multiple commercial and open-source LLMs, providing a broad comparison.

Exploration of prompt engineering techniques to improve LLM performance on this task.

Demonstrates some promise, with LLMs collectively able to repair 100% of synthetic and
hand-crafted scenarios.

Limited to zero-shot performance, without fine-tuning LLMs specifically for this task.

Challenges identified in generating functionally correct code for real-world examples.

Focused only on vulnerabilities that can be fixed with localized changes in a single file.

Reliance on existing test suites and security tools to validate fixes, which may miss some
issues.

Does not fully solve the problem of automatic security bug repair, but provides initial
characterization of LLM capabilities in this domain.
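Zero-shot prompting for repair, as studied here, amounts to packing the vulnerable code and any security-tool output into a single completion prompt. A sketch of such a prompt builder (the template is illustrative, not the paper's exact format):

```python
def build_repair_prompt(filename, vulnerable_code, tool_report):
    """Assemble a zero-shot repair prompt: context, the reported
    weakness, and an open 'fixed version' section for the LLM
    to complete."""
    return (
        f"// File: {filename}\n"
        f"// A security analysis tool reported:\n"
        f"//   {tool_report}\n"
        f"// Vulnerable version:\n"
        f"{vulnerable_code}\n"
        f"// Fixed version, addressing the reported weakness:\n"
    )

prompt = build_repair_prompt(
    "login.c",
    "strcpy(buf, username);",
    "CWE-787: possible out-of-bounds write (unchecked strcpy)",
)
```

Varying exactly this kind of template (how much context, whether the tool report is included, where the completion cursor sits) is what the paper's prompt-engineering exploration consists of.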
