Modern Convolutional Neural Networks

What is modern convolutional neural networks?

Modern convolutional neural networks (CNNs) have revolutionized the field of computer vision by achieving remarkable performance in various tasks. These deep learning models are specifically designed to process visual data, mimicking the human visual system. Through their hierarchical architecture, CNNs effectively learn and extract increasingly complex features from input images, enabling them to identify patterns and objects with exceptional accuracy. Modern CNNs incorporate advanced techniques such as residual connections, attention mechanisms, and normalization layers, enhancing their ability to handle large-scale datasets and address challenges like overfitting. Their widespread adoption has led to breakthroughs in image classification, object detection, semantic segmentation, and other visual recognition tasks, making CNNs an indispensable tool in modern artificial intelligence research and applications.

About VGG network?

The VGG algorithm, short for the Visual Geometry Group algorithm, is a deep convolutional neural network architecture that has proven to be highly influential in the field of computer vision. Proposed by the Visual Geometry Group at the University of Oxford, the VGG algorithm consists of a series of convolutional layers followed by fully connected layers. What sets VGG apart is its uniform architecture, where convolutional layers have a small receptive field of 3×3 and are stacked on top of each other, making the network deeper. This depth allows VGG to learn increasingly complex features, leading to improved accuracy in image classification tasks. Despite its simplicity compared to more recent architectures, VGG has been widely adopted and serves as a strong baseline for benchmarking new models in the field of deep learning.

About this project

Many of my research projects focus on deep learning algorithms specifically designed for modern convolutional neural networks . For instance, at beginning, I have worked on 3,278 instances (5 classes) of apples for image processing and classification .

I use these libraries for classification of a dataset of apples:
Keras, Sklearn, pandas, TensorFlow, glob, Numpy and ImageDataGenerator.

Additionally, I have utilized these conditions for image data #augmentation, such as rotation or shifting:

“20 degrees for rotation and 10 percent for shift at x or y.”

Data splitted to 2450 for the train and 837 for the test phase at 5 classes.

I have employed the VGG and Inception (InceptionV3) networks for data classification within the context of modern convolutional neural networks.

Results

The results obtained from these experiments are presented below:

VGG accuracy: 0.9943
VGG val_accuracy: 0.8996
Inception accuracy: 0.9861
Inception val_accuracy: 0.9367

Sample plots of dataset

VGG network values of loss and val_loss

VGG network values of accuracy and val_accuracy

Inception network values of loss and val_loss

Inception network values of accuracy and val_accuracy

Posted in Python by Mohsen Majidi Pishkenari

Mohsen Majidi Pishkenari

Mohsen Majidi Pishkenari

Skype:

LinkedIn:

Email:

HARD SKILLS

SOFT SKILLS