Application of Convolutional Neural Network for Image Recognition

Abstract

This project explores the use of Convolutional Neural Networks (CNN) for image recognition using the CIFAR-10 dataset. By leveraging transfer learning through fine-tuning the VGG-16 model, the student developed a high-accuracy classifier. The model achieved 85% accuracy and was optimized by adjusting the last three fully connected layers. A graphical user interface (GUI) built using the Tkinter library enables non-programmers to use the model for classification. Users can load a trained model, upload a 24-bit image, and receive classification results directly on the interface. This system aims to simplify the use of deep learning for general users while maintaining reliable recognition performance.

Key Contributions

Fine-tuned a VGG-16 CNN model using the CIFAR-10 dataset, achieving 85% accuracy.
Designed and implemented a user-friendly GUI for real-time image classification with Tkinter.
Demonstrated the use of transfer learning by modifying only the last three layers of VGG-16.
Applied model visualization and evaluation techniques including accuracy/loss plots using Matplotlib.

Conclusion

The modified VGG-16 model effectively classifies images in the CIFAR-10 dataset with high accuracy and reduced overfitting. Its integration into a GUI allows broader accessibility and usability for non-technical users. The application is suitable for educational and practical use in image classification tasks.

Limitation

The model currently supports only 24-bit depth RGB images and may underperform on datasets with drastically different image characteristics. The reliance on VGG-16 and CIFAR-10 limits its adaptability without further training or domain-specific tuning.

Future Work

Planned enhancements include expanding the model’s compatibility to different image formats and bit depths, integrating other pre-trained models for comparative analysis, and deploying the tool as a web application to enhance accessibility. Additional work may include support for edge computing environments.

🔬 Student Innovator: Di Cui is dedicated to exploring practical applications of AI in image processing. Through this project, Di aimed to simplify machine learning access for general users by combining deep learning techniques with an intuitive GUI interface. Supervised by Dr. Happy Nkanta Monday, this project showcases Di’s skills in deep learning, model tuning, and user-centric software development.

📄 Download Poster 📁 View Final Model Draft