Contrast this with human memory. We can be reminded of facts and episodes from what are often quite obscure cues which are often not unique when taken in isolation. We are able to locate records (or memories) based on any of the information stored in that record. For instance, if I say to a friend, "I like your blue shirt with the stripes.", they will often know exactly which one I'm talking about, despite the fact that I have not provided a unique identifier for that shirt. Furthermore, the cues given, that is, blue and striped, may only specify the shirt uniquely when taken in combination (if your friend has other blue shirts and other striped shirts).
Of course, it is possible with the appropriate query to retrieve the same information from a computer database. In human memory, however, retrieval from the content of a memory is automatic - human memory is fundamentally content addressable.
Robustness to Noise
Von Neumann architectures are discrete in nature. This discrete nature
allows them to retain information completely faithfully when subjected
to small amounts of noise. Provided the noise is not sufficient to
switch a bit from a one to a zero or vice versa the information will be
interpreted as intended. This is, of course, the secret behind the
improved performance of digital recording. For a great many
applications this is a very useful property. I would prefer that the bank
retained the exact balance of my account, not an approximation or best guess.
In contrast, neural networks use redundancy in their structure to provide a best guess of the information to be retrieved. Such an approach is very useful in situations of extreme noise (such as speech recognition) where the information is incomplete or even incorrect. The next exercise demonstrates how the IAC network is resistant to erroneous information.
Exercise 21: Run another 40 cycles (for a total of 60 cycles). What happens to the age units?
Generalization
One operation that people seem to be very good at is collapsing over a
set of instances to establish a general trend. For instance, we might
ask "Are Americans more extroverted than Australians?". Unless you have
read the studies claiming that in fact they are, then your only
recourse would be to collapse across the set of Americans and the set
Australians you know and to extract some form of central tendency
measure on the extrovert/introvert dimension. This is quite a
difficult computation, but one that people perform routinely. The IAC
network can accomplish spontaneous generalisation of this kind by
activating a property and cycling.
Exercise 23: The Art and Ralph instance units become active but not the Sam instance unit, despite the fact that Sam is Single also. Why is this?
Exercise 24: Why does the 40s unit become active?
Default Assignment
The final property that we will examine is the ability of the IAC
network to provide plausible default values if it does not "know" a
given piece of information. The human memory system makes extensive use
of plausible defaults. In fact, people can have difficulty
distinguishing actual memories from those that have been reconstructed
from other related information. In the IAC network the provision of
plausible defaults is closely related to generalisation. Items which
are similar to the target item are used to extrapolate what the missing
information should be. In the following exercise we will remove some of
the weights in the Jets and Sharks network and see if it can provide
reasonable answers.
[Return to the start of the tutorial]
References
McClelland, J. L. (1981). Retrieving general and specific information
from stored knowledge of specifics. Proceedings of the Third Annual
Meeting of the Cognitive Science Society, 170-172.
McClelland, J. L., & Rumelhart, D. E. (1981). An interactive activation model of context effects in letter perception: Part 1. An account of basic findings. Psychological Review, 88, 375-407.
McClelland, J. L. & Rumelhart, D. E. (Eds.). (1988). Explorations in parallel distributed processing: A handbook of models, programs, and exercises. Cambridge, MA: MIT Press.
Rumelhart, D. E., & McClelland, J. L. (1982). An interactive activation model of context effects in letter perception: Part 2. The contextual enhancement effect and some tests and extensions of the model. Psychological Review, 89, 60-94.
Rumelhart, D. E., & McClelland, J. L. (Eds.). (1986). Parallel distributed processing: Explorations in the microstructure of cognition (Vol. 1). Cambridge, MA: MIT Press.
Endnotes
[1] In the McClelland and Rumelhart formulation,
negative activations coming from other units are set to zero before entering
into the net input calculation. For simplicity's sake we have not included
this thresholding process. This restriction has little impact on the major
points we are trying to address.
[2] In the McClelland and Rumelhart version an activation which falls outside of the boundaries it is set either to max or min which ever is closest. This prevents the activations from becoming very large or very small quickly which can occur if parameter values are large. We have not included this component so that the natural tendency of the activation to keep activations within bounds can be observed. Be warned, though. If your activations are growing very large you may need to decrease some of the parameter values.