Artificial Neural Networks
This chapter discusses several issues that
are pertinent for the PSOM algorithm (which is described more
fully in Chap. 4). Much of its motivation derives from the field
of neural networks. After a brief historic overview of this
rapidly expanding field we attempt to order some of the
prominent network types in a taxonomy of important
characteristics. We then proceed to discuss learning from the
perspective of an approximation problem and identify several
problems that are crucial for rapid learning. Finally we focus
on the so-called “Self-Organizing Maps”, which emphasize the use
of topology information for learning. Their discussion paves the
way for Chap. 4 in which the PSOM algorithm will be presented.

In (1969) Minsky and Papert showed that
certain classes of problems, e.g. the “exclusive-or” problem,
cannot be learned with the simple perceptron. They doubted that
learning rules could be found for computationally more powerful
multi-layered networks and recommended to focus on the symbolic
oriented learning paradigm, today called artificial
intelligence (“AI”). The research funding for artificial
neural networks was cut, and it took twenty years until the
field became viable again. An important stimulus for the field
was the multiple discovery of the error back-propagation
algorithm. Its has been independently invented in several
places, enabling iterative learning for multi-layer perceptrons
(Werbos 1974, Rumelhart, Hinton, and Williams 1986, Parker
1985). The MLP turned out to be a universal approximator,
which means that using a sufficient number of hidden units, any
function can be approximated arbitrarily well. In general two
hidden layers are required - for continuous functions one
layer is sufficient (Cybenko 1989, Hornik et al. 1989). This
property is of high theoretical value, but does not guarantee
efficiency of any kind.
Other important developments where made: e.g.
v.d. Malsburg and Willshaw (1977, 1973) modeled the ordered
formation of connections between neuron layers in the brain. A
strongly related, more formal algorithm was formulated by
Kohonen for the development of a topographically ordered map
from a general space of input stimuli to a layer of abstract
neurons. We return to Kohonen's work later in Sec. 3.7. Hopfield
(1982, 1984) contributed a famous model of the
content-addressable
Hopfield network, which can be used e.g.
as associative memory for image completion. By
introducing an energy function, he opened the
mathematical toolbox of statistical mechanics to the class of
recurrent neural networks (mean field theory developed for
the physics of magnetism). The

The Radial Basis Function Networks (“RBF”)
became popular in the connectionist community by Moody and
Darken (1988). The RFB belong to the class of local
approximation schemes (see p. 33). Similarities and differences
to other approaches are discussed in the next sections.