Structures of Neural Networks

The major structures factors are connection structures, network size, and ACON versus OCON aproaches.

Interlayer and intralayer Connectios Structures

A neural network comprises the neuron and weight building blocks. The behavior of the network depends largely on the interaction between these building blocks. There are three types of neuron layers: input, hidden and output layers. Two layers of neuron communicate via a weight conection network. There are four types of weighted connections: feedforward, feedback, lateral, and time-delayed connections, as you can see in this figure:

  1. Feedforward connections : For all the neural models, data from neurons of a lower layer are propagated forward to neurons of an upper layer via feedforward connections networks.
  2. Feddback Connections: Feedback networks bring data from neurons of an upper layer back to neurons of a lower layer.
  3. Lateral Connections: One tipical example of a lateral network is the winners-takes-all circuit, which serves the important role of selecting the winner. In the feature map example, by allowing neurons to interact via the lateral network, a certain topological ordering relationship can be preserved. Another example is the lateral orthogonalization network which forces the network to extract orthogonal components.
  4. Time-delayed Connections: Delay elements may be incorporated into the connections to yield temporal dynamics models. They are more suitable for temporal pattern recognitions.
The synaptic connections may be fully or locally interconnected, see figures. Also, a neural network may be either a single layer feedback model or a multilayer feed-forward model. It is possible to cascade several single layer feedback neural nets to form a larger net.

Sizes of Neural Networks

In a feed-forward multilayer neural net, there are one or more layers of hidden neuron units between the input and output neuron layers. The sizes of networks depends on the number of layers and the number of hidden-units per layer.

ACON versus OCON Approaches

The issue at hand is how many networks should be used for multicategory classification. Typically, one output node is used to represent one class. As example, in an alpha-numeric recognition problem, there are 36 classes, so there are in total 36 output nodes. Given an input pattern in the retrieven phase, the winner (i.e., the class that wins the recognition) is usually the output node that has the maximum among all the output values.

Two plausible network structures are All-Class-in-One-Network (ACON) and One-Class-in-One-Network (OCON). In the ACON approach, all the classes are lumped into one giant-size super-network. It is sometimes advantageous to decompose a huge network into many subnets, so that each subnet has a small size. For example, a 36-output net can be decomposed into 12 subnets, each responsible for 3 outputs. The most extreme decomposition is the so-called OCON structure, where one subnet is devoted to one class only. Although the number of subnets in the OCON is relatively large, each individual subnet has considerably smaller size than the ACON supper-network. This may be explaned by the next figures,

full-size net

is partitioned into

many subnets `

by eliminating all the "cross-class" connections int the upper layer.

For convenience, all the subnets are assumed to have a uniform size, say k. The number of hidden units of the ACON supper-network is denoted as K. (Obviously, k << K.) The ACON and OCON differ significantly in size and speed, that is the total numbers of synaptic weights and the training time. Let us denote the input and output vector dimensions as n and N. The number of the total synaptic weights for the ACON structure is (N+n) x K. Likewise, the number for the OCON structure is N x (n+1) x k::N x n x k . Two extreme situations are analyzed below. When N is relatively small (compared with n ), ACON could have compatible or less weights than OCON. If N is very large, then OCON could have a major advantage in terms of network size.

In addition, the OCON seems to prevail over ACON in training and recognition speed when the number of classes is large.

 In the ACON approach, the single supernet has the burden of having to simultaneously satisfy all these classes, so the number of hidden units K is expected to be very big.


Applications

Contents


Artificial Neural Networks
About this Tutorial