Enhanced DR Classification

Abstract

Diabetic retinopathy (DR) is a severe complication of diabetes mellitus that can lead to vision loss. This project enhances DR classification using a novel ensemble of deep convolutional neural networks integrating VGG16, ResNet50, and InceptionV3. Techniques such as depthwise separable convolutions, residual learning, and inception modules were incorporated to improve accuracy and efficiency. With a dataset from Kaggle, preprocessing and data augmentation were applied, and results showed notable improvements—particularly after restructuring the classification into binary categories. The final ensemble model, deployed via a PyQt-based GUI, achieved a peak F1 score of 0.946, indicating high sensitivity and specificity in distinguishing DR from non-DR cases.

Key Contributions

Developed and compared three CNN models (VGG16, ResNet50, InceptionV3) for DR image classification.
Constructed a hybrid ensemble model integrating the strengths of VGG16 and ResNet50.
Performed extensive preprocessing and augmentation to handle class imbalance and improve generalization.
Deployed the final model with a user-friendly PyQt GUI for practical clinical testing.

Conclusion

This project demonstrates the efficacy of CNN-based ensemble learning for medical image classification, especially for diabetic retinopathy. While the initial multi-class models showed moderate performance, restructuring the task into a binary classification significantly enhanced sensitivity and overall accuracy, making the model suitable for early DR screening tools.

Limitation

The model's performance relies heavily on well-annotated data, and generalizability may be limited when deployed on unseen datasets from varied imaging devices. The ensemble strategy also introduces higher computational complexity.

Future Work

Future research will explore transfer learning from medical-specific datasets, integration with mobile health platforms, and real-time DR classification embedded in fundus camera hardware. Additionally, federated learning and cross-dataset adaptation will be investigated.

🔬 Student Innovator: Lihan Chen (Daniel) is a driven researcher in AI for medical diagnostics. Under the mentorship of Dr. Happy Nkanta Monday, he focused on CNN integration strategies for diabetic retinopathy classification, aiming to reduce misdiagnosis and improve clinical workflows. His ambition is to develop intelligent assistive tools for ophthalmology.

📄 Download Poster