Abstract
This study investigates the use of Residual Channel Attention Networks (RCAN) for 2x super-resolution in malaria cell imaging to enhance diagnostic capabilities. Traditional microscopy techniques often yield variable results contingent upon operator proficiency. RCAN was employed to upscale malaria cell images, allowing for extraction of more detailed features. The effectiveness was quantitatively assessed using PSNR (Peak Signal-to-Noise Ratio) and SSIM (Structural Similarity Index Measure). A comprehensive comparison of classification models demonstrated the superiority of super-resolved images, with our ensemble model achieving perfect classification (100% accuracy, 1.0 F1-score) on enhanced images versus 92.07% on originals. The model was successfully deployed via ONNX Runtime for real-world applications including static images, video analysis, and live camera feeds, demonstrating robust generalization across media types.
Key Performance Metrics
- Super-Resolution Quality: Average PSNR 32.63dB, SSIM 0.9002
- Classification Accuracy: 100% on super-resolved vs 92.07% on original images
- Processing Speed: 45 FPS for real-time camera analysis
- Model Efficiency: 99.998% accuracy with batch size 32
Key Contributions
- Implemented RCAN-based 2x super-resolution achieving PSNR 32.63dB and SSIM 0.9002 on malaria cell images
- Developed an ensemble CNN model with attention mechanisms achieving perfect classification (100% accuracy) on enhanced images
- Conducted comprehensive interpretability analysis using CAM, SHAP, and LIME to validate model decisions
- Created practical deployment solutions including web-based interface and ONNX Runtime optimization
- Demonstrated real-time processing capabilities (45 FPS) for clinical applications
Methodology
The project employed a two-stage approach combining super-resolution enhancement with advanced classification:
1. Super-Resolution with RCAN
The RCAN architecture consists of four key components: shallow feature extraction, deep feature extraction via residual-in-residual (RIR) blocks, upsampling, and reconstruction. The model was trained using L1 pixel loss over 3200 iterations, with channel attention mechanisms focusing on diagnostically relevant features.
2. Classification Models
We developed and compared four CNN architectures:
- Custom Ensemble Model: Combining factorized inception blocks with SE attention
- ShuffleNetV2: Lightweight architecture optimized for mobile deployment
- ResNet18: Baseline residual network for comparison
- RegNet: Regularized network for balanced performance
Results
The models demonstrated exceptional performance across different evaluation scenarios:
Super-Resolution Performance
- Average PSNR: 32.63dB (vs 28.42dB for bicubic interpolation)
- SSIM: 0.9002 indicating excellent structural preservation
- Visual analysis confirmed enhanced cellular details critical for diagnosis
Classification Performance
Model |
Original Accuracy |
SR Accuracy |
Improvement |
Ensemble Model |
92.07% |
100% |
+7.93% |
ShuffleNetV2 |
91.69% |
100% |
+8.31% |
RegNet |
92.30% |
100% |
+7.70% |
Conclusion
The integration of RCAN super-resolution with advanced CNN classifiers demonstrates significant potential for automated malaria diagnosis. Key findings include:
- Super-resolution enhances classification accuracy by 7-8% across all models
- Our custom ensemble model achieves perfect classification on enhanced images
- Real-time processing at 45 FPS enables practical clinical deployment
- Interpretability analysis confirms the model focuses on diagnostically relevant features
Future Work
Future extensions include:
- Integration with mobile microscopy platforms for field deployment
- Extension to other parasitic infections using similar methodology
- Development of cloud-based analysis for resource-constrained settings
- Incorporation of clinical metadata to improve diagnostic accuracy
🔬 Student Innovator: Leon Yu is passionate about applying cutting-edge computer vision techniques to medical diagnostics. Under the mentorship of Dr. Happy Nkanta Monday, Leon developed this comprehensive solution combining super-resolution and deep learning classification, demonstrating exceptional skills in both theoretical understanding and practical implementation. With a strong focus on real-world applicability, Leon aims to bridge the gap between advanced AI research and practical healthcare solutions.