Definitions

Terms such as pattern, linkset and pattern intersection are helpful to limit the number of words needed to describe Pattern-atom (patomic) theory.

At a very high level all that a robotic brain is required to do is receive inputs from senses and then determine and execute appropriate actions. Actions result from controlled muscle contractions (or alternative motion devices like speakers or motors). In fact, the primary purpose of a brain may be to enable movement (Greenfield, 1996) to effectively react to the environment. The challenge of mechanical control appears to be in finding the right sensory pattern elements that identify real-world objects. Following this recognition, the machine then moves itself by selecting the right pattern of muscle elements to operate with the objects. As animals like dogs and cats recognise the function of a door, it is fair to suggest that many brain functions do not rely on the human brain’s unique capabilities, but rather on our brain’s common ancestral capabilities. Here we explore the terms behind the robotic brain concept: snapshots, linksets and Patoms.

Pattern-atoms

The first task is to recognise objects. Matching the indivisible pattern elements that comprise surrounding objects is needed. If there is an indivisible representation, an atom comprising the unique object’s patterns if you will, it must simply be matched. What pattern-atoms identify a square? What atoms comprise your friend, Sally? What atoms recognise your fourth grade school environment? What are the basic pattern elements necessary to recognise a wheelbarrow, archaic fonts or the mechanical workings of a door? This search for indivisible pattern-atoms has been seen in the computational quest to reduce sensory experience to single algorithmic outcomes since the 1950s. That approach seeks to represent visual objects and auditory words as single indivisible sets of time-independent values, stored in databases. The proposed alternative is a layered model in which pattern-atoms exist, but not at the layer that matches the stored instances. In language, for example, the set of elements comprising an object is anchored to a single, indivisible auditory pattern: a word. In fact both the visual object and the auditory word are proposed to represent sets of instances, as will be shown later.

Snapshots

The concept for robotic brain control is based first on recognition (finding and using sensor combinations comprising pattern-atoms) and second on moving using the correct combination of motion atoms. A snapshot pattern (or snapshot for short) is a set of active sensors, motor units or a set of active snapshots at a given point in time. Depending on the type of pattern stored, determined by its position (layer) within the system, a snapshot can be one instance of sensory attributes or the set of multi-sensory instances of objects. Snapshots are stored in sufficient detail to match or repeat the same pattern again. In conjunction with linksets, hierarchical snapshots allow for object identification. As an example, one hundred photographs of your friend Susan represents one hundred snapshot patterns.

Upon recognition of a previously stored snapshot pattern, a unique signal is sent both forwards to the next layer and backwards to the previous layer. Identifying a part of a snapshot is sufficient to identify the entire hierarchy. In other words, snapshots are bidirectional. Note that the snapshots stored in edge Patoms are key, being the only connection to the original experience.

Linksets

Snapshots are found at single points in time, but as objects are not experienced independently of time multiple snapshot patterns are necessary to represent physical objects. Objects seldom appear stationary in space, due to our own changing point of view and consequently, they rarely appear exactly the same way twice. To recognise objects, matching any one of the linked snapshots together is sufficient, removing the need for computation otherwise necessary to identify the atoms of mathematically idealised patterns.

The linked group of snapshots observed over time are called linkset patterns (or just linksets, for short). Like snapshots, elements of a linkset pattern are bi-directional. By automatically linking together snapshot patterns based on continuity (sequence), logical objects are formed therefore eliminating the need to identify reduced object atoms. Successful object recognition relies on correctly adding snapshots to existing linksets. The pattern-atoms that enable object identification are therefore collections of snapshot patterns joined into linksets over time. An example of a linkset is the set of box shapes caused due to a shift in the point of reference. Linksets are sequential patterns, logically represented as pointers to snapshot patterns.

Linkset patterns operate using the concept of intersection. By combining more than one linkset together, the weighted links will reduce the total active patterns to a subset in which the elements are fully consistent. There will typically be more than one consistent pattern, in which case the strongest links will determine the intersection’s best-fit.

Snapshot and linkset interaction

Snapshot patterns include not only the sensor patterns, but also links to other sensor patterns stored in higher layers. The difference between a snapshot and a linkset is that while snapshots are instantaneous patterns, linksets are the time-based, weighted collections of snapshot patterns. Snapshots rarely have much value without the time-based components. In language, for example, it is only at a specific point in time that a physical object is named. Linksets enable language by allowing a single association (a word) to be linked to numerous instances (snapshot patterns), effectively naming multiple instances in time with a single snapshot. While both the physical object and its name appear to be independent of time, the name may only be associated verbally for part of a second, despite hours of experience with the object. Yet our recall of the name for any of the object snapshots must be almost instantaneous. To stress the point, regardless of the shortness of object naming, immediate recall of this association is available through the linkset.

To illustrate the interaction, let’s walk through the sequence of events necessary to name a person you meet. Snapshots are established based on concurrency and linksets are established based on continuity. When introduced to Robert and hearing the sound “robert”, components of the images of Robert are stored as visual snapshots. These are grouped together in a linkset due to continuity. Elements of the sounds comprising the word “robert” are stored as snapshots, grouped together in an auditory linkset. Based on the concurrency of the two active linksets, at the layer above them, a snapshot connects the two sensory patterns. This connection effectively joins the two linksets. In other words, by association, the sounds comprising “robert” are joined with the visual images of Robert.

Linkset intersection

Linksets enable the concept of decision making through intersection. Intersection is the result of combining two or more linksets to establish the common ground between them. At a low level, intersection maintains consistency with the stored patterns involved. At a high level, intersection results in the most favourable or least unfavourable emotional outcome. The following example illustrates these concepts.

When viewing a known box, one of the previously stored snapshot patterns is matched. The snapshot signals the match to the layer above, which activates the linkset. The linkset signals forward and backward to its constituents including all the other snapshots comprising the box. These other snapshots are activated, in turn. One of the forward links may result in a link to the auditory linkset pattern for the word “box”, effectively naming the object. This forward and backward linking is bidirectional pattern matching since either pattern identifies the other, albeit indirectly.

Our brain needs to do more than simply name objects, of course. More typically the brain’s requirement is to direct motion based on numerous pattern inputs. Leaving aside the question of learning at this stage, let’s consider two examples in which a brain needs to take action based on previously established patterns.

If smell A, movement B, sight C and touch D happen simultaneously, take action E. In isolation to all other stored patterns, this allows for A and B only to activate action E. As linksets are bidirectional, the pattern E must identify A-D when receiving only one of the inputs. This bidirectional nature of linksets is invaluable to function in the world’s complex sensory environment.

If the sensory pattern group above results in E, but a similar one with A, B, C plus F leads to G, how does the brain select which action to take when presented with just A? The concept of pattern intersection comes into play. Linksets connect using activating and inhibiting weighted links, not binary switches. Provided the A link is slightly stronger than E, in isolation A leads to E (Fig. 1). In more complex intersections, the strongest links identified over a number of patterns dictate the action. If intersection fails to find a consistent pattern, no action is taken.

decision pattern

Fig. 1. Examples of bi-directional “best-fit” pattern matching: (a) Sense A, B, C and D link to action E (strength 5). (b) Sense A, B, C and F link to action G (strength 4). (c) A-only active leads to E due to strength 5 (5>4). If A and F were active, G would be the action (4+4>5). In this latter case, G still activates all of A, B, C and F in reverse.

In human examples when a decision cannot be reached, actions can be compelled with the addition of a time limit as the best fit can be selected by lowering the accuracy threshold. If the human brain uses this approach, full knowledge of all previous experiences in the relevant domain, including emotional patterns from all elements in the brain’s linksets, is necessary to predict a human decision. Even for simpler animals, like dogs and cats, predicting behaviour is theoretically complex.

Patoms

To control these complex theoretical constructs, the concept of the Patom is introduced. It is the basic building block for the robotic brain – a standard device enabling complex hierarchical and bidirectional pattern storage. Patoms are implemented in layers forming a hierarchy in which the lowest layers store sensor attribute patterns and the higher layers store complex, multi-sensory patterns. Despite the Patom itself being fairly simple in nature, the approach described rapidly results in complex capabilities.

A Patom (Pattern-matching ATOM) is defined as the smallest unit that stores, matches and uses patterns in a brain (Ball, 2000). The word Patom was selected since it sounds a bit like “pattern” and is a combination of the words pattern-matching and atom. The term Patom recognises the value to science from postulating the indivisible physical element (atom) upon which chemistry is built, an approach reignited from antiquity by John Dalton in the early 1800s (Gribbin, 2002). In cognitive science also, there is great value in finding reliable ways to convert complex experiences into groups of pattern-atoms.

In humans, there is evidence supporting the existence of such a construct. The observable effects of brain damage variations resulting in cell loss suggests that localised brain areas are collectively capable of capturing specific patterns. Damasio (1994), for example, describes effects resulting from losing the six layers of cortical cells. Greenfield (1997), McCrone (1999) and others popularise this understanding through the visual presentation of graphic brain scan techniques. The question of whether the brain is modular has moved to the question of how many modules there are in the brain (Jacobs, Jordan and Barto, 1991).

A Patom converts patterns (input signals) into links (output signals). Patoms are hierarchically structured with each Patom signalling uniquely (including a feedback signal) to correspond with the received patterns. By storing and linking snapshot patterns, the volume of data sent through the system is minimised. This effectively distributes pattern storage at the received location and not centrally in some form of processed abstraction. While higher-level Patoms are essential to multi-sensory patterns over long timeframes, only sensory patterns in edge Patoms interact directly with our body. Snapshots stored in edge Patoms contain sufficient detail to recognise the transduced sensory pattern, while the rest of the system simply supplies links. The contents of the links from the edge Patoms are incomprehensible without referencing back. Our brain’s architecture is structured with multiple layers of bidirectionally connected neural networks. While it may appear confusing when compared with a model in which senses are connected directly to motor control areas (Damasio, 1994), it aligns with the needs of a reasonably slow brain driven by evolutionary necessity to act rapidly and the proposed layered approach to control complex multi-sensory patterns.

Although the alignment between our brain’s topology and the theoretical Patom construct requires more research, current evidence is sufficiently detailed to propose the following functional specification: (1) A Patom automatically stores and matches patterns based on received inputs. The location, dictated by its topology of connected links, determines whether the patterns are single-sense or multiple-sense. (2) Matched patterns send a unique output, weighted by experience. This output directly links to other Patoms. (3) Patoms are hierarchically connected and operate bi-directionally based on concurrency and continuity. See Fig. 2 for a high-level diagram of interconnected Patoms.

high level patom

Fig. 2. Simplified Patoms for hearing, vision and grammar. Apparent complexity stems from the great number of forward and backward connections to multi-use Patom elements. Patom 5’s learned patterns link back to sensory patterns as the system is bidirectional. If Patom 1 receives input X, it sends output Y. If Patom 2 receives input Y, it sends output Q, which includes feedback Z. If Patom 1 receives input Z, it sends output Y. In other words, a linked pattern works both forward and backward and compels experience to align with “what you expect” since near matches direct to previous experience, if consistent.

References

Ball, J. S. (2000). ABC Radio National, Ockham’s Razor, Our Brain, the Patom-Matcher http://www.abc.net.au/rn/science/ockham/stories/s73842.htm.

Damasio, A.R. (1994). Descartes’ Error New York: Picador, 26-27, 92-93.

Greenfield, S. (1997). The Human Brain: A Guided Tour, London: Phoenix, 18-28.

Greenfield, S. A. (1996). The Human Mind Explained, Sydney: Reader’s Digest, 12-13.

Gribbin, J. (2002). Science: A History 1543-2001, London: Allen Lane, 359-360.

Jacobs, R.A., Jordan, M.I. & Barto A.G. (1991). Task Decomposition Through Competition in a Modular Connectionist Architecture: The What and Where Vision Tasks. Cognitive Science 15, 219-220.

McCrone, J. (1999). Going Inside, London: Faber and Faber, 186 and Plate 10.

 

© 2003-2009 Thinking Solutions Pty Ltd. ABN 31 122 884 637. All rights reserved.
Forum  Blog  Site Map  Privacy Policy  Legal Terms