Presentation New
Presentation New
Student Details:
Anusha Garg – 21BCE0412
Mitali Gopinath Paul- 21BCE2447
Suhani Bhatnagar- 21BCE3129
Faculty Guide:
Dr. Shashank Mouli Satapathy - Professor Grade-I
Cataracts, a leading cause of visual impairment globally, occur when the eye's lens
becomes cloudy, resulting in blurred vision and potential blindness if left
untreated. Traditional diagnostic methods rely on manual evaluation by healthcare
professionals, which can be slow and inconsistent, particularly in regions with
limited access to specialized care. The rise of artificial intelligence (AI) and deep
learning (DL) offers a transformative solution, enabling automated and highly
accurate cataract detection. Leveraging advanced techniques such as convolutional
neural networks (CNNs), transfer learning, and ensemble models, AI-driven
systems can analyze medical images efficiently, reducing the need for manual
intervention and improving diagnostic precision. This project aims to harness these
technologies to develop a robust, early detection system for cataracts, enhancing
patient outcomes through faster and more reliable diagnoses.
Literature Survey
• Transfer learning with pre-trained models like VGGNet[15], ResNet[8], MobileNet[19], and Inception is widely applied for cataract
detection, often combined with SVM classifiers[18] to enhance performance, as seen in studies achieving up to 98.17% accuracy. Hybrid
models leveraging these architectures demonstrate high accuracy, such as 95.65% and 92.91%, with reduced data requirements.[1][5]
• Ensemble learning, combining multiple CNN models[3] like AlexNet[9], InceptionV3[22], and ResNet[7][18], enhances cataract
classification accuracy, achieving up to 99.20%. Hybrid methods, such as CNN-RNN combinations, further improve performance, with F1
scores reaching 95.90% and accuracies of 97.39% .[4]
• Several studies introduce advanced techniques for enhancing feature extraction in cataract classification. These include object detection
and multi-task learning[6] to focus on key areas , portable diagnostic systems using Inception-v3, real-time grading with YOLOv3[24][25]
and DenseNet[17], and the efficient CataractNet[11] model with reduced computational cost.[2]
• XAI techniques like Grad-CAM[20] enhance model transparency by generating visual heatmaps that highlight decision-relevant areas in
cataract detection. DeepSurgery uses AI to supervise cataract surgeries in real-time, improving surgical accuracy and outcomes.[14]
3
Research Gaps
Dataset Limitations:
• Most studies rely on small, homogeneous datasets, While effective, there is a need for larger, more
varied datasets which can improve model accuracy and robustness.
Focus on Binary Classification:
• Limited generalizability to diverse populations and real-world conditions. Current models often
overlook the subtleties of different cataract types and severity levels.
Real-Time & Portable Diagnostic Challenges:
• Difficulty integrating real-time, portable tools, especially in resource-constrained settings.
Novelties
Generated cataract images at 100 epochs Generated normal images at 100 epochs
Implementation Progress- GAN
Inception Resnet V2
Architecture
Design - Model
1.Resizing: Images resized to 224x224 to match the input size of the model.
2.Green (G) Channel Extraction: The G-channel of the image is extracted for enhanced clarity, as it provides
the best contrast for fundus images.
3.Image Augmentation: Random rotations, zooms, width/height shifts, and horizontal flips were applied to
reduce overfitting by artificially increasing the dataset.
1.CLAHE (Contrast Limited Adaptive Histogram Equalization): Applied to the G-channel to improve
contrast by limiting over-amplification of noise, especially in low-contrast areas.
2.Wavelet Denoising: Applied on top of CLAHE-enhanced G-channel images using wavelet transforms (Haar
wavelet). This method effectively reduces noise, particularly illumination and small artifacts, while preserving
important image details
Design -XAI
• Grad-CAM (Gradient-weighted Class Activation Mapping) is designed to provide visual explanations for predictions made by convolutional
neural networks (CNNs).
• It works by computing gradients of the target class score with respect to the feature maps of the final convolutional layer.
• These gradients highlight important regions of the image that contributed most to the prediction.
• A global average pooling is performed on the weighted sum of these feature maps which produces a heatmap that is overlaid on the input
image, showing the areas the model focused on.
• This method helps in visualizing which parts of the image are crucial for classification, aiding in model interpretability.
Implementation-XAI
Fundus image of eye containing cataract Heatmap superimposed on the original image highlighting the
regions which contributed the most to the prediction
Result
•Image Generation Quality (GANs): The GAN model trained with 64x64 resolution images for 100 epochs produced coherent
cataract images with structural clarity, though lacking detail for effective feature extraction. Extending to 200 epochs reduced
image quality due to instability and potential mode collapse.
•Training Metrics (GANs): Discriminator and generator losses achieved balance at 100 epochs, indicating stable training;
however, at 200 epochs, the generator loss increased, reflecting instability from overfitting or mode collapse.
•Model Training Outcomes: Without preprocessing, InceptionResNetV2 achieved a stable 97.7% training and 88.2% validation
accuracy, suggesting good generalization. After CLAHE and wavelet denoising, the model's validation accuracy dropped to
73.4%, with instability suggesting overfitting to enhanced features rather than generalized ones.
•Grad-CAM Visualization: Applying Grad-CAM to InceptionResNetV2 highlighted critical regions in cataract detection,
specifically the lens and surrounding eye tissues, aiding in interpretability and validating the model’s focus on relevant features.
References
Weblinks:
1. https://glassboxmedicine.com/2020/05/29/grad-cam-visual-explanations-from-deep-networks/
2. https://jovian.ai/aakashns/06b-anime-dcgan
Dataset Links:
3. https://www.kaggle.com/datasets/jr2ngb/cataractdataset
4. https://www.kaggle.com/datasets/drskprabhakar/cataract-dr-normal-glaucoma-fundus-images-dataset
5. https://dataverse.telkomuniversity.ac.id/dataset.xhtml?persistentId=doi:10.34820/FK2/CDWESA
6. https://data.mendeley.com/datasets/s9bfhswzjb/1
Journals:
7. Pratap, Turimerla, and Priyanka Kokil. "Computer-aided diagnosis of cataract using deep transfer learning." Biomedical
Signal Processing and Control 53 (2019): 101533.
8. Hasan, Md Kamrul, et al. "[Retracted] Cataract Disease Detection by Using Transfer Learning‐Based Intelligent
Methods." Computational and Mathematical Methods in Medicine 2021.1 (2021): 7666365.
9. Jidan, Omar Jilani, et al. "A comprehensive study of DCNN algorithms-based transfer learning for human eye cataract
detection." International Journal of Advanced Computer Science and Applications 14.6 (2023).
References
4. Imran, Azhar, et al. "Fundus image-based cataract classification using a hybrid convolutional and recurrent neural
network." The visual computer 37 (2021): 2407-2417.
5. Khan, Md Sajjad Mahmud, et al. "Cataract detection using convolutional neural network with VGG-19 model." 2021
IEEE World AI IoT Congress (AIIoT). IEEE, 2021.
6. Junayed, Masum Shah, et al. "CataractNet: An automated cataract detection system using deep learning for fundus
images." IEEE access 9 (2021): 128799-128808.
7. Imran, Azhar, et al. "Automated identification of cataract severity using retinal fundus images." Computer Methods in
Biomechanics and Biomedical Engineering: Imaging & Visualization 8.6 (2020): 691-698.
8. Kalyani, B. J. D., et al. "Smart cataract detection system with bidirectional LSTM." Soft Computing 27.11 (2023): 7525-
7533.
9. Kumari, Pammi, and Priyank Saxena. "Cataract detection and visualization based on multi-scale deep features by RINet
tuned with cyclic learning rate hyperparameter." Biomedical Signal Processing and Control 87 (2024): 105452.
10. Yadav, Sunita, and Jay Kant Pratap Singh Yadav. "Automatic Cataract Severity Detection and Grading Using Deep
Learning." Journal of Sensors 2023.1 (2023): 2973836.
11. Ganokratanaa, Thittaporn, Mahasak Ketcham, and Patiyuth Pramkeaw. "Advancements in Cataract Detection: The
Systematic Development of LeNet-Convolutional Neural Network Models." Journal of Imaging 9.10 (2023): 197.
References
12. Patil, Yogeshwar, et al. "Multiple ocular disease detection using novel ensemble models." Multimedia Tools and
Applications 83.4 (2024): 11957-11975.
13. Yadav, Jay Kant Pratap Singh, and Sunita Yadav. "Computer‐aided diagnosis of cataract severity using retinal fundus
images and deep learning." Computational Intelligence 38.4 (2022): 1450-1473.
14. Faizal, Sahil, et al. "Automated cataract disease detection on anterior segment eye images using adaptive thresholding
and fine tuned inception-v3 model." Biomedical Signal Processing and Control 82 (2023): 104550.
15. Goh, Jocelyn Hui Lin, et al. "Artificial intelligence for cataract detection and management." Asia-Pacific journal of
ophthalmology 9.2 (2020): 88-95.
16. Wang, Yong, et al. "Cataract detection based on ocular B-ultrasound images by collaborative monitoring deep
learning." Knowledge-based systems 231 (2021): 107442.
17. Hu, Shenming, et al. "ACCV: automatic classification algorithm of cataract video based on deep learning." BioMedical
Engineering OnLine 20 (2021): 1-17.
18. Keenan, Tiarnan DL, et al. "DeepLensNet: deep learning automated diagnosis and quantitative classification of cataract
type and severity." Ophthalmology 129.5 (2022): 571-584.
19. Wang, Ting, et al. "Intelligent cataract surgery supervision and evaluation via deep learning." International Journal of
Surgery 104 (2022): 106740.