Advancing_Robotic_Control_Data-Driven_Model_Predictive_Control_for_a_7-DOF_Robotic_Manipulator
Advancing_Robotic_Control_Data-Driven_Model_Predictive_Control_for_a_7-DOF_Robotic_Manipulator
ABSTRACT In this study, we applied deep learning to improve the control of a KUKA LBR4 7 Degrees of
Freedom (DOF) robotic arm. We developed a dynamic model using a comprehensive dataset of joint angles
and actuator torques obtained from pick-and-place operations. This model was incorporated into a Model
Predictive Control (MPC) framework, enabling precise trajectory tracking without the need for traditional
analytical dynamic models. By integrating specific constraints within the MPC, we ensured adherence to
operational and safety standards. Experimental results demonstrate that deep learning models significantly
enhance robotic control, achieving precise trajectory tracking. This approach not only surpasses traditional
control methods in terms of accuracy and efficiency but also opens new avenues for research in robotics,
showcasing the potential of deep learning models in predictive control techniques.
INDEX TERMS Deep learning, model predictive control (MPC), robotic arm, trajectory tracking.
2024 The Authors. This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.
115926 For more information, see https://creativecommons.org/licenses/by-nc-nd/4.0/ VOLUME 12, 2024
H. El-Hussieny et al.: Advancing Robotic Control: Data-Driven MPC for a 7-DOF Robotic Manipulator
FIGURE 5. Results of point stabilization scenarios to evaluate the the FIGURE 6. Results of trajectory tracking scenarios evaluating the
proposed deep learning-based MPC for reaching predefined joints proposed deep learning-based MPC in following reference joint
references. trajectories.
C. TRAJECTORY TRACKING PREDICTIVE CONTROL overall system, it has been included to test the algorithm’s
To assess the performance of the proposed data-driven MPC robustness. The results demonstrate that the proposed MPC
controller in trajectory tracking, we utilized a reference achieves satisfactory tracking performance while adhering to
trajectory. This trajectory is particularly useful in scenarios this constraint and effectively managing constraints on the
where the robotic manipulator must follow a specific other joints.
sequence of goals, such as in pick-and-place applications.
Specifically, the reference trajectory, denoted as qr ∈ R7 , D. COMPARISON WITH PID CONTROL
consists of a series of desired joint values. In this simulation experiment, we compare the performance
The robot starts with initial joint values q0 = [0.26, of our data-driven MPC controller with a PID independent
0.24, −0.34, 1.76, −0.07, −1.61, −1.666]T (rad). The con- joint control approach. The PID control loops were applied
troller’s time step is set to t = 0.01 seconds, with a to the robot model obtained using a deep learning model
prediction horizon of N = 12, and the total simulation trained from data. The PID gains were determined through
time is 18 seconds. The weighting matrices are chosen trial and error. To evaluate the controller, we used tracking
as W 1 = 100 and W 2 = 0.01. The performance error, measuring the difference between the desired and actual
of the MPC is depicted in Figure 6, demonstrating the joint positions. Within this PID controller, the torque τi for
MPC’s effectiveness in minimizing the difference between each joint is determined individually in the following manner:
the measured and reference trajectories. The Mean Squared dei
Z
Error (MSE) between the reference trajectory and the actual τi = Kp ei + Kd + Ki ei dt (12)
dt
robot trajectory is calculated to be 2 × 10−4 (in radians
squared). In contrast, the PID independent joint control approach
Furthermore, Figure 7 illustrates the performance of yielded poor results. Despite the effort to tune the PID
the data-driven MPC in tracking a reference joint and gains, the controller struggled to maintain stability. The
the corresponding applied torque, considering a nonlinear tracking error was significantly higher than that of the MPC.
inequality constraint on the robot’s joint (q1 − 1)2 ≤ 0. This More concerningly, the applied torques exhibited signs of
constraint acts as a limitation on the first joint within the joint instability, with significant amplifications when the reference
space. While this constraint does not significantly impact the joints changed.
REFERENCES
[1] B. Siciliano, O. Khatib, and T. Kröger, Springer Handbook of Robotics,
vol. 200. Germany: Springer, 2008.
[2] S. Kalpakjian and S. R. Schmid, Manufacturing Engineering and
Technology. London, U.K.: Prentice-Hall, 2009, pp. 568–571.
[3] R. H. Taylor and D. Stoianovici, ‘‘Medical robotics in computer-integrated
surgery,’’ IEEE Trans. Robot. Autom., vol. 19, no. 5, pp. 765–781,
Oct. 2003.
FIGURE 9. (a) Training losses expressed as mean squared error (MSE) for [4] D. Arney, R. Sutherland, J. Mulvaney, D. Steinkoenig, C. Stockdale,
five deep neural network (DNN) models during 5-fold cross-validation, and M. Farley, ‘‘On-orbit servicing, assembly, and manufacturing
and (b) mean and standard deviations.
(OSAM) state of play,’’ Edition-NASA Tech. Rep. Server (NTRS), White
Paper 20210022660, 2021. Accessed: May 1, 2023. [Online]. Available:
TABLE 1. Chosen architectures for the 5-fold cross-validation experiment.
https://ntrs.nasa.gov/api/citations/20210022660/downloads/osam_state_
of_play%20(1).pdf
[5] C. C. Cheah, S. Kawamura, and S. Arimoto, ‘‘Feedback control
for robotic manipulator with uncertain kinematics and dynamics,’’
in Proc. IEEE Int. Conf. Robot. Autom., vol. 4, May 1998,
pp. 3607–3612.
[6] L. Sciavicco and B. Siciliano, Modelling and Control of Robot Manipula-
tors. Germany: Springer, 2012.
[7] J. Son, H. Kang, and S. H. Kang, ‘‘A review on robust control of
robot manipulators for future manufacturing,’’ Int. J. Precis. Eng. Manuf.,
vol. 24, no. 6, pp. 1083–1102, Jun. 2023.
rapid learning could potentially enhance the overall system’s [8] P. D. Nguyen, N. H. Nguyen, and H. T. Nguyen, ‘‘Adaptive control for
robustness. manipulators with model uncertainty and input disturbance,’’ Int. J. Dyn.
Control, vol. 11, no. 5, pp. 2285–2294, Oct. 2023.
[9] L. Sciavicco and B. Siciliano, Modeling and Control of Robot Manipula-
IV. CONCLUSION tors. Germany: Springer, 2012.
To address the challenges of constrained nonlinear joint [10] M. W. Spong, S. Hutchinson, and M. Vidyasagar, Robot Modeling and
control for a seven Degrees of Freedoms (DoF) robotic Control. Hoboken, NJ, USA: Wiley, 2006.
[11] W. Khalil and E. Dombre, Modeling, Identification and Control of Robots.
manipulator, this study introduces a new approach through London, U.K.: Butterworth, 2002.
the implementation of a data-driven-enhanced nonlinear [12] M. Ruderman, F. Hoffmann, and T. Bertram, ‘‘Modeling and identification
Model Predictive Control (MPC) method. Initially, a data- of elastic robot joints with hysteresis and backlash,’’ IEEE Trans. Ind.
Electron., vol. 56, no. 10, pp. 3840–3847, 2009.
driven predictive model was developed, capable of forecast-
[13] R. Fareh, S. Khadraoui, M. Y. Abdallah, M. Baziyad, and M. Bettayeb,
ing future joint positions based on current joint values and ‘‘Active disturbance rejection control for robotic systems: A review,’’
applied torques. This model was then integrated within an Mechatronics, vol. 80, 2021, Art. no. 102671.
MPC framework, employing an online optimization problem [14] V. Bargsten, P. Zometa, and R. Findeisen, ‘‘Modeling, parameter identifi-
cation and model-based control of a lightweight robotic manipulator,’’ in
with process constraints to determine the optimal control Proc. IEEE Int. Conf. Control Appl., 2013, pp. 134–139.
torques for accurate point stabilization and joint trajectory [15] T. W. Yang, W. L. Xu, and J. D. Han, ‘‘Dynamic compensation control
tracking of the KUKA LBR4 robotic manipulator. of flexible macro–micro manipulator systems,’’ IEEE Trans. Control Syst.
Technol., vol. 18, no. 1, pp. 143–151, 2009.
In conclusion, the data-driven MPC controller exhibited [16] F. Aghili, ‘‘Adaptive control of manipulators forming closed kinematic
strong performance in terms of tracking accuracy, handling chain with inaccurate kinematic model,’’ IEEE/ASME Trans. Mechtron.,
of joint and actuators constraints, and effectively adapting vol. 18, no. 5, pp. 1544–1554, 2012.
[17] T. Salzmann, E. Kaufmann, J. Arrizabalaga, M. Pavone, D. Scaramuzza,
to changing targets. The data-driven MPC outperformed
and M. Ryll, ‘‘Real-time neural MPC: Deep learning model predictive
the PID independent joint control in tracking reference control for quadrotors and agile robotic platforms,’’ IEEE Robot. Autom.
joint trajectories for the KUKA LBR4 robotic manipulator. Lett., vol. 8, no. 4, pp. 2397–2404, Apr. 2023.
The MPC approach provided better tracking accuracy, [18] H. El-Hussieny, I. A. Hameed, and A. A. Nada, ‘‘Deep CNN-based static
modeling of soft robots utilizing absolute nodal coordinate formulation,’’
consistent stabilization times, and stable control efforts, Biomimetics, vol. 8, no. 8, p. 611, Dec. 2023.
demonstrating its effectiveness and reliability for robotic [19] P.-B. Wieber, ‘‘Trajectory free linear model predictive control for stable
control applications. walking in the presence of strong perturbations,’’ in Proc. 6th IEEE-RAS
Int. Conf. Humanoid Robots, Dec. 2006, pp. 137–142.
Furthermore, a K-fold cross-validation was conducted [20] H. Li, R. J. Frei, and P. M. Wensing, ‘‘Model hierarchy predictive control of
to select the optimal model architecture, ensuring robust robotic systems,’’ IEEE Robot. Autom. Lett., vol. 6, no. 2, pp. 3373–3380,
performance and generalization across different data splits. Apr. 2021.
A promising avenue for future research is the development of [21] G. García, R. Griffin, and J. Pratt, ‘‘MPC-based locomotion control of
bipedal robots with line-feet contact using centroidal dynamics,’’ in Proc.
data-driven Moving Horizon Estimation (MHE) [32] along IEEE-RAS 20th Int. Conf. Humanoid Robots (Humanoids), Jul. 2021,
with the MPC, potentially mitigating the current assumption pp. 276–282.
of full state observability that this work presupposes. [22] T. Ohtsuka and K. Ozaki, ‘‘Practical issues in nonlinear model predictive
control: Real-time optimization and systematic tuning,’’ in Nonlinear
Such advancements promise to further refine the precision Model Predictive Control: Towards New Challenging Applications. Berlin,
and applicability of MPC in robotic manipulator control, Germany: Springer, 2009, pp. 447–460.
[23] T. Akbas, S. E. Eskimez, S. Ozel, O. K. Adak, K. C. Fidan, and K. Erbatur, IBRAHIM A. HAMEED (Senior Member, IEEE)
‘‘Zero moment point based pace reference generation for quadruped robots received the Ph.D. degree in industrial systems and
via preview control,’’ in Proc. 12th IEEE Int. Workshop Adv. Motion information engineering from Korea University,
Control (AMC), Mar. 2012, pp. 1–7. Seoul, South Korea, and the Ph.D. degree in
[24] S. Kolathaya, ‘‘Local stability of PD controlled bipedal walking robots,’’ mechanical engineering from Aarhus University,
Automatica, vol. 114, Apr. 2020, Art. no. 108841. Aarhus, Denmark. He is currently a Professor
[25] M. Sombolestan, Y. Chen, and Q. Nguyen, ‘‘Adaptive force-based control with the Department of ICT and Natural Sciences,
for legged robots,’’ in Proc. IEEE/RSJ Int. Conf. Intell. Robots Syst. (IROS), Faculty of Information Technology and Electrical
Sep. 2021, pp. 7440–7447.
Engineering, Norwegian University of Science
[26] A. S. Polydoros and L. Nalpantidis, ‘‘A reservoir computing approach
and Technology (NTNU), Norway. He is the
for learning forward dynamics of industrial manipulators,’’ in
Proc. IEEE/RSJ Int. Conf. Intell. Robots Syst. (IROS), Oct. 2016, Deputy Head of research and innovation within the Department of ICT
pp. 612–618. and Natural Sciences, Faculty of Information Technology and Electrical
[27] J. Bai, F. Lu, and K. Zhang. (2019). ONNX: Open Neural Network Engineering, NTNU. His current research interests include artificial
Exchange. [Online]. Available: https://github.com/onnx/onnx intelligence, machine learning, optimization, and robotics. He is elected as
[28] H.-G. Han, X.-L. Wu, and J.-F. Qiao, ‘‘Real-time model predictive control the Chair of the IEEE Computational Intelligence Society (CIS) Norway
using a self-organizing neural network,’’ IEEE Trans. Neural Netw. Learn. Section.
Syst., vol. 24, no. 9, pp. 1425–1436, Sep. 2013.
[29] G. Wang, Q.-S. Jia, J. Qiao, J. Bi, and M. Zhou, ‘‘Deep learning-based
model predictive control for continuous stirred-tank reactor system,’’
IEEE Trans. Neural Netw. Learn. Syst., vol. 32, no. 8, pp. 3643–3652,
Aug. 2021. TAMER F. MEGAHED received the B.Sc.,
[30] F. Fiedler, B. Karg, L. Lüken, D. Brandner, M. Heinlein, F. Brabender, and M.Sc., and Ph.D. degrees in electrical engineering
S. Lucia, ‘‘do-mpc: Towards FAIR nonlinear and robust model predictive from Mansoura University, Mansoura, Egypt,
control,’’ Control Eng. Pract., vol. 140, Nov. 2023, Art. no. 105676. in 2006, 2010, and 2015, respectively. Currently,
[31] D. Anguita, A. Ghio, S. Ridella, and D. Sterpi, ‘‘K-fold cross validation he is an Associate Professor and the Chairperson
for error rate estimate in support vector machines,’’ in Proc. DMIN, 2009,
of the Electrical Engineering Department, Egypt-
pp. 291–297.
Japan University of Science and Technology
[32] H. El-Hussieny, I. A. Hameed, and A. B. Zaky, ‘‘Plant-inspired soft
(E-JUST). Also, he has four patents in mag-
growing robots: A control approach using nonlinear model predictive
techniques,’’ Appl. Sci., vol. 13, no. 4, p. 2601, Feb. 2023. netic refrigeration, designed the IoT systems for
monitoring electricity consumption, thrust-vector-
control model rocket design, and magnetic gear. His research interests
include power control, renewable energy sources, smart grid, electric
vehicles, wireless charging, energy storage control and management, vector
control, power system protection, model predictive control, and green
hydrogen.