VGGNet

Notes

Corpus

  1. VGGNet is a Convolutional Neural Network architecture proposed by Karen Simonyan and Andrew Zisserman from the University of Oxford in 2014.[1]
  2. Another variation of VGGNet has 19 weight layers, consisting of 16 convolutional layers and 3 fully connected layers, plus the same 5 pooling layers.[1]
  3. Both variations of VGGNet contain two fully connected layers with 4096 channels each, followed by a final fully connected layer with 1000 channels to predict the 1000 labels.[1]
  4. VGGNet is a neural network that performed very well in the ImageNet Large Scale Visual Recognition Challenge (ILSVRC) in 2014.[2]
  5. VGGNet apparently took 2-3 weeks to train on a computer with four NVIDIA Titan Black GPUs.[2]
  6. Let’s say you want to train a network such as VGGNet to recognize faces of celebrities.[2]
  7. Since the VGGNet we’re using was trained on ImageNet, it’s really good at distinguishing between different breeds of dogs, different types of fish, and so on.[2] (A fine-tuning sketch follows this list.)
  8. VGGNet was invented by the Visual Geometry Group at the University of Oxford.[3]
  9. The reason to understand VGGNet is that many modern image classification models are built on top of this architecture.[3]
  10. It achieves better accuracy than VGGNet and GoogLeNet while being computationally more efficient than VGGNet.[4]
  11. The architecture is similar to VGGNet, consisting mostly of 3x3 filters.[4]
  12. Starting from the VGGNet, shortcut connections as described above are inserted to form a residual network.[4] (A residual-block sketch follows this list.)
  13. VGGNet was a competitor in the ImageNet ILSVRC-2014 image classification competition and scored second place.[5]
  14. The runner-up in ILSVRC 2014 was the network from Karen Simonyan and Andrew Zisserman that became known as the VGGNet.[6]
  15. A downside of the VGGNet is that it is more expensive to evaluate and uses a lot more memory and parameters (140M).[6]
  16. Let’s break down the VGGNet in more detail as a case study.[6]
  17. The whole VGGNet is composed of CONV layers that perform 3x3 convolutions with stride 1 and pad 1, and of POOL layers that perform 2x2 max pooling with stride 2 (and no padding).[6] (See the architecture sketch after this list.)
  18. They compare HybridNet with VGGNet and CBP-CNN for 292, 100, and 200 sub-classes of the VegFru dataset.[7]
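
The layer pattern described in items 2, 3, and 17 (3x3 convolutions with stride 1 and pad 1, 2x2 max pooling with stride 2, and a 4096–4096–1000 fully connected head) can be written down compactly. Below is a minimal sketch of the 16-weight-layer variant (VGG-16) in PyTorch, assuming the standard 224x224 RGB ImageNet input; it illustrates the layer pattern only and is not the authors' original implementation.

```python
# Minimal VGG-16-style sketch (13 conv + 3 fully connected = 16 weight layers).
# All convolutions are 3x3 / stride 1 / pad 1; all poolings are 2x2 max pool / stride 2.
import torch
import torch.nn as nn

# Numbers are output channel counts; 'M' marks a 2x2 max-pooling layer.
VGG16_CFG = [64, 64, 'M', 128, 128, 'M', 256, 256, 256, 'M',
             512, 512, 512, 'M', 512, 512, 512, 'M']

def make_features(cfg):
    layers, in_channels = [], 3          # RGB input
    for v in cfg:
        if v == 'M':
            layers.append(nn.MaxPool2d(kernel_size=2, stride=2))
        else:
            layers.append(nn.Conv2d(in_channels, v, kernel_size=3, stride=1, padding=1))
            layers.append(nn.ReLU(inplace=True))
            in_channels = v
    return nn.Sequential(*layers)

class VGG16(nn.Module):
    def __init__(self, num_classes=1000):
        super().__init__()
        self.features = make_features(VGG16_CFG)
        # After 5 poolings a 224x224 input becomes 7x7x512 = 25088 values.
        self.classifier = nn.Sequential(
            nn.Linear(512 * 7 * 7, 4096), nn.ReLU(inplace=True), nn.Dropout(),
            nn.Linear(4096, 4096), nn.ReLU(inplace=True), nn.Dropout(),
            nn.Linear(4096, num_classes),          # 1000 ImageNet labels
        )

    def forward(self, x):
        x = self.features(x)
        x = torch.flatten(x, 1)
        return self.classifier(x)

if __name__ == "__main__":
    model = VGG16()
    n_params = sum(p.numel() for p in model.parameters())
    print(f"parameters: {n_params / 1e6:.0f}M")          # roughly 138M, in line with the ~140M quoted above
    print(model(torch.randn(1, 3, 224, 224)).shape)       # torch.Size([1, 1000])
```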
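
Items 6 and 7 describe the usual transfer-learning recipe: reuse the ImageNet-trained convolutional filters and retrain only the classifier head on the new classes. A sketch using torchvision's pretrained VGG-16 follows; the number of celebrity identities (100) and the optimizer settings are illustrative assumptions, not taken from the cited sources.

```python
# Sketch of fine-tuning an ImageNet-pretrained VGGNet for a new task
# (e.g. celebrity faces): freeze the convolutional features and replace
# the final 1000-way layer with a head for the new classes.
import torch
import torch.nn as nn
from torchvision import models

model = models.vgg16(weights=models.VGG16_Weights.IMAGENET1K_V1)

for p in model.features.parameters():    # keep the ImageNet-trained filters fixed
    p.requires_grad = False

num_celebrities = 100                    # hypothetical number of identities
model.classifier[6] = nn.Linear(4096, num_celebrities)   # new output layer

optimizer = torch.optim.SGD(model.classifier.parameters(), lr=1e-3, momentum=0.9)
# The classifier is then trained on the face dataset with a standard
# cross-entropy loss; the frozen conv layers act as a generic feature extractor.
```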
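
Item 12 refers to how a residual network is obtained from a VGG-style plain network: identity shortcut connections are added around small stacks of 3x3 convolutions. A minimal residual-block sketch is shown below (with batch normalization, as in ResNet rather than the original VGGNet).

```python
# A VGG-style stack of two 3x3 convolutions, with the block input added
# back to its output through a shortcut connection.
import torch
import torch.nn as nn

class ResidualBlock(nn.Module):
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, kernel_size=3, padding=1, bias=False)
        self.bn1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, kernel_size=3, padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(channels)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        out = self.relu(self.bn1(self.conv1(x)))
        out = self.bn2(self.conv2(out))
        return self.relu(out + x)        # shortcut connection: add the input back

# A plain VGG-style block would be the same two convolutions without the "+ x" term.
print(ResidualBlock(64)(torch.randn(1, 64, 56, 56)).shape)   # torch.Size([1, 64, 56, 56])
```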

Sources