Dynamic Hand Gesture Detector Using Python and OpenCV
Lakshya Gaur1, Kapil Tomar2, Dr. Naveen Tyagi3
1 M.Tech Scholar, Department of Computer Science and Engineering, MIT, Bulandshahr
2 Assistant Professor, Department of Computer Science and Engineering, MIT, Bulandshahr
3 Professor, Department of Computer Science and Engineering, MIT, Bulandshahr
Abstract- Hand Gesture Recognition and Image Overlay Using OpenCV and MediaPipe. This research paper presents a method for real-time hand gesture recognition and image overlay using OpenCV and MediaPipe. The system captures live video feed from a webcam, detects hand gestures, and overlays corresponding images based on the detected gestures. The implementation leverages the capabilities of MediaPipe for hand tracking and OpenCV for image processing and display. The proposed method is efficient and runs in real-time, providing immediate feedback on detected gestures. This paper discusses the system model, the underlying algorithm, and the results obtained from the implementation.

1. INTRODUCTION

Hand gesture recognition is a crucial technology in the field of human-computer interaction. It allows users to interact with digital devices using natural hand movements, enhancing the user experience. This project focuses on creating a hand gesture recognition system that identifies specific gestures and overlays images accordingly. The system uses Python, OpenCV, and MediaPipe, leveraging their powerful image processing and machine learning capabilities.

The journey towards gesture-based interaction represents a departure from the conventional notion of computer interfaces as passive tools controlled solely through manual manipulation. Instead, it seeks to imbue technology with a deeper understanding of human intention and expression, enabling seamless interaction that mirrors the fluidity of human communication. At its core, gesture-based interaction draws inspiration from the rich tapestry of human movement, encompassing gestures, postures, facial expressions, and even subtle nuances of body language. By decoding and interpreting these non-verbal cues, computers can discern user intent and respond in a manner that feels more natural and intuitive.

The evolution of gesture-based interfaces has been propelled by advancements in a myriad of technologies, including computer vision, machine learning, sensor technology, and augmented reality.

The main objectives of this project are:
To develop a real-time hand gesture recognition system.
To overlay images based on detected gestures.
To provide a seamless and responsive user experience.

2. SYSTEM MODEL

2.1 The system comprises the following components:
Webcam: Captures live video feed.
OpenCV: Handles video capture, image processing, and display.
MediaPipe: Detects and tracks hand landmarks.
Gesture Recognition: Identifies specific hand gestures.
Image Overlay: Overlays corresponding images based on detected gestures.

2.2 The flow of the system is as follows:
1. Capture live video feed using OpenCV.
2. Process each frame to detect hand landmarks using MediaPipe.
3. Identify gestures based on the positions of the landmarks.
4. Overlay images on the video feed based on detected gestures.
5. Display the processed video feed in real-time.
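
The five-step flow in Section 2.2 can be illustrated with a minimal Python sketch. It is not the implementation used in this work, which is not listed here. The finger-counting rule (a fingertip above its middle joint counts as raised), the gesture names, the overlay file names, and the top-left overlay position are all assumptions chosen for illustration. The sketch also assumes an upright hand facing the camera and a MediaPipe release that still ships the `mp.solutions.hands` interface.

```python
# Landmark indices follow MediaPipe Hands' 21-point hand model.
FINGER_TIPS = (8, 12, 16, 20)   # index, middle, ring, pinky fingertips
FINGER_PIPS = (6, 10, 14, 18)   # the matching middle (PIP) joints


def count_raised_fingers(points):
    """Count non-thumb fingers whose tip lies above its PIP joint.

    `points` is a sequence of 21 (x, y) pairs with y growing downward.
    Heuristic only: assumes an upright hand facing the camera.
    """
    return sum(
        1 for tip, pip in zip(FINGER_TIPS, FINGER_PIPS)
        if points[tip][1] < points[pip][1]
    )


# Hypothetical gesture names and overlay files, for illustration only.
GESTURE_BY_COUNT = {0: "fist", 2: "peace", 4: "open_palm"}
OVERLAY_FILES = {"fist": "fist.png", "peace": "peace.png", "open_palm": "palm.png"}


def classify_gesture(points):
    """Map a raised-finger count to a gesture name, or None if unrecognized."""
    return GESTURE_BY_COUNT.get(count_raised_fingers(points))


def run(camera_index=0, size=100):
    """Run the capture, detect, classify, overlay, display loop."""
    # Imported here so the helpers above work without OpenCV or MediaPipe.
    import cv2
    import mediapipe as mp

    overlays = {}
    for name, path in OVERLAY_FILES.items():
        img = cv2.imread(path)                 # None if the file is missing
        if img is not None:
            overlays[name] = cv2.resize(img, (size, size))

    cap = cv2.VideoCapture(camera_index)       # step 1: capture live video
    with mp.solutions.hands.Hands(max_num_hands=1) as hands:
        while cap.isOpened():
            ok, frame = cap.read()
            if not ok:
                break
            # MediaPipe expects RGB input; OpenCV delivers BGR frames.
            rgb = cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)
            result = hands.process(rgb)        # step 2: detect hand landmarks
            if result.multi_hand_landmarks:
                lm = result.multi_hand_landmarks[0].landmark
                gesture = classify_gesture([(p.x, p.y) for p in lm])  # step 3
                if gesture in overlays and min(frame.shape[:2]) >= size:
                    # step 4: paste the overlay into the top-left corner
                    frame[:size, :size] = overlays[gesture]
            cv2.imshow("Gesture overlay", frame)   # step 5: display the frame
            if cv2.waitKey(1) & 0xFF == ord("q"):  # press q to quit
                break
    cap.release()
    cv2.destroyAllWindows()
```

Calling `run()` starts the loop on the default webcam. Because the classification helpers take plain coordinate pairs, they can be exercised without a camera, which makes it easy to swap in a different gesture rule.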
IJIRT 165596 INTERNATIONAL JOURNAL OF INNOVATIVE RESEARCH IN TECHNOLOGY 1