Neocognitron: Hierarchically Structured Model

The basic ideas for the neocognitron are described as follows:
  1. Hirarchical Representation Structure. The neocognitron uses successive (say, M) stages that can recognize patterns. It pogresses stage by stage from the input layer to the output layer.The levels of representation in the layers exhibit a hierarchical structure. More precisely, the first layer (or the first few layers) extracts local features, such as a line at a particular orientation. More global features are extracted in later layers. The objective of such a hierarchical representation structure is that by going deeper through succesive layers, the position of the symbol in the input pattern becomes less important.
  2. Intra-Layer Structure. Each layer consist of one simple sublayer of  cells and one complex sublayer of  cells. Each layer of  cells or  cells is divided into subgroups according to the features to which they respond. The cells in each subgroup are arranged in a twodimensional array. The  cells match an input patten with the template of the receptive fields of the cells. The  cells receive excitatory signals from its correspondig  cells;
    1. Feature-Extracting  Cells: Connections converging to this cells are variable and reinforced by learning or training. The process of kearning and the mechanism of feature extraction by  are based on self-organizing networks discussed before. Briefly, only the one cell that gives the maximum response has its input connection reinforced. After finishing the learning,  cells can extract features from the input pattern. Only when a relevant feature is presented at a certain position in the input layer will the corresponding  cell be activated.
    2. Position readjusting  Cells: This sublayer inmediately follows the  sublayer. The  cells are used to compensate for positional errors. Connections from  cells to cells are fixed and invariable.
    Recall that, just like the  layer, each layer of  cells is divided into subgroups according to the features to which they respond. All the cells in a subgroup receive input connections of the same spatial distribution, but allowing a certain degree of positions shift. In other words, each  cell receives signal from a group of  cells that extract the same feature but have slightly different positions. The  cell is activated if at least one of these  cells is active. Even if the position shift of the stimulus feature causes a nearby  cell to be activated instead of the original one, the same cell will respond; thereby, the effect of a small shift can be nullified. Based on such a position-readjusting process, local features extracted in a lower stage can be smoothly and gradually integrated into more global features.
  3. Inter-Layer Structure. The interlayer structure, that is, the mapping from a  sublayer to  cells in the next layer, provides a further shift tolerance to combat deformation of the training pattern.
  4. Final Layer. Finally, each  cell of the final (recognition) layer integrates all the information of the input pattern. Due to the competitive learnng nature, only one cell in the final layer, corresponding to the category of the input pattern, will be activated. Other cells respond to the patterns of other categories.
In summary, neocognitron is a self-organized, competitve learning, hierarchical multilayer network. It is useful for pattern classification without supervised learning, especially when there are possible shifts in position or distortion in shape. However, it is difficult to determine how well can the network cope with deformation of patterns, because there is no mathematical measure or proper model for such study. Moreover, although the neocognitron appears to be attractive in its high biological fidelity, it incurs a vast computational cost which is not easily affordable for most applications.


SUPERVISED MODELS

Contents


Artificial Neural Networks
About this Tutorial