Carpenter Grossberg Rosen1991b

Loading...

Flash Player 9 (or above) is needed to view presentations.
We have detected that you do not have it on your computer. To install it, go here.

0 comments

Post a comment

    Post a comment
    Embed Video
    Edit your comment Cancel

    Favorites, Groups & Events

    Carpenter Grossberg Rosen1991b - Presentation Transcript

    1. Neural Networks, Vol. 4, pp. 759-771, 1991 0893-61180/91$3.00 + .110 Printed in the USA. All rightsreserved. Copyright'~ 1991PergamonPressplc ORIGINAL CONTRIBUTION Fuzzy ART: Fast Stable Learning and Categorization of Analog Patterns by an Adaptive Resonance System G A I L A . C A R P E N T E R 1, STEPHEN G R O S S B E R G 2, AND D A V I D B . R O S E N ~ Boston University (Received 1 April 1991; revised and accepted 21 June 1991) Abstract--A Fuzzy Adaptive Resonance Theory (ART) model capable of rapid stable learning of recognition categories in response to arbitrary sequences of analog or binary input patterns is described. Fuzzy ART incorporates computations from fuzzy set theory into the ART 1 neural network, which learns to categorize only binary input patterns. The generalization to learning both analog and binary input patterns is achieved by replacing appearances of the intersection operator ( N ) in ART 1 by the MIN operator (/~ ) of fuzzy set theory. The MIN operator reduces to the intersection operator in the binarv case. Category proliferation is prevented by, normalizing input vectors at a preprocessing stage. A normalization procedure called complement coding leads to a symmetric theory in which the M1N operator (/) and the MAX operator (V) of fuzzy set theory play complementary roles. Complement coding uses on-cells and o f cells to represent the input pattern, and preserves individual feature amplitudes while normalizing the total on-cell/off-cell vector. Learning is stable because all adaptive weights can only decrease in time. Decreasing weights correspond to increasing sizes of categorv "boxes." Smaller vigilance values lead to larger category boxes. Learning stops when the input space is covered by boxes. With fast learning and a finite input set of arbitrary size and composition, learning stabilizes after just one presentation of each input pattern. A fast-commit slow-recode option combines fast learning with a forgetting rule that buffers system memory against noise. Using this option, rare events can be rapidly learned, yet previously learned memories are not rapidly erased in response to statistically unreliable input fluctuations. Keywords--Fuzzy set theory, Adaptive resonance theory, Neural network, Pattern recognition, Learning, ART 1, Categorization, Memory search. 1. INTRODUCTION: A CONNECTION of learning stable recognition categories in response BETWEEN ART SYSTEMS A N D to arbitrary input sequences with either fast or slow FUZZY LOGIC learning. Model families include A R T 1 (Carpenter & Grossberg, 1987a), which can stably learn to ca- Adaptive Resonance Theory, or ART, was intro- tegorize binary input patterns presented in an arbi- duced as a theory of human cognitive information trary order: A R T 2 (Carpenter & Grossberg, 1987b), processing (Grossberg, 1976, 1980). The theory has which can stably learn to categorize either analog or since led to an evolving series of real-time neural binary input patterns presented in an arbitrary order; network models for unsupervised category learning and A R T 3 (Carpenter & Grossberg, 1990a, 1990b), and pattern recognition. These models are capable which can carry out parallel search, or hypothesis testing, of distributed recognition codes in a multil- evel network hierarchy. Variations of these models ' Supported in part by British Petroleum (89-A-12041,DARPA adapted to the demands of individual applications (AFOSR 90-0083), and the National Science Foundation (NSF IRI-90-00530). have been developed by a number of authors (Baloch ~'Supported in part by the Air Force Office of Scientific Re- & Waxman, 1991; Baxter, 1991; Carpenter, Gross- search (AFOSR 90-0175), and DARPA (AFOSR 90-0083). berg, & Rosen, 1991a; Galindo & Michaux, 1990; Supported in part by DARPA (AFOSR 90-0083). Gjerdingen, 1990; Gochin, 1990; Harvey et al., 1990; Acknowledgements: The authors wish to thank Cynthia E. Hecht-Nielsen, 1990; Johnson, 1990: Kosko, 1987a, Bradford and Diana Meyers for their valuable assistance in the preparation of the manuscript. 1987b, 1987c; Kumar & Guez, 1989, 1991: Levine & Requests for reprints should be sent to Gall A. Carpenter, Penz, 1990; Li & Wee, 1990; Liao, Liang, & Lin, Center for Adaptive Systems, Boston University, 111 Cumming- 1990: Mekkaoui & Jespers, 199(I; Michalson & Heldt, ton Street, Boston, MA 02215. 1990; Moore, 1989; Nigrin, 199(/; Rajapakse, Jaku- 759
    2. 760 (;. A. Carpenter, S. Grossbep~ and D. B. Roser, bowicz, & Acharya, 1990; Ryan, 1988; Seibert & Waxman, 1990a, 1990b; Simpson, 1990; Weingard, i SEARCH ; 1990; Wilson, Wilkinson, & Ganis, 199(1: Winter, ATTENTIONAL ! ORIENTING i 1989; Winter, Ryan, & Turner, 1987). SUBSYSTEM Recently the ART 1 model has been used to de- sign a hierarchical network architecture, called ART- gain 2 I MAP, that can rapidly self-organize stable categor- STM F2 i ~ ical mappings between m-dimensional input vectors and n-dimensional output vectors (Carpenter, Gross- berg, & Reynolds, 1991a,b). Under supervised learn- ing conditions, ARTMAP's internal control mech- anisms create stable recognition categories of optimal size by maximizing predictive generalization while minimizing predictive error in an on-line setting. ARTMAP was originally used to learn mappings be- tween binary input and binary output vectors. The Fuzzy ART model (Carpenter, Grossberg, & Rosen, I+ oain, L___f .......... +t ' 1991b) developed herein generalizes ART l to be capable of learning stable recognition categories in response to both analog and binary input patterns. [. . . . MATCHING INTERNAL This Fuzzy ART model has been incorporated into ACTIVE /~ INPUT CRITERION: a Fuzzy A R T M A P architecture (Carpenter, Gross- REGULATION ................... VIGILANCE berg, Markuzon, Reynolds, & Rosen, 1991) that can PARAMETER rapidly learn stable categorical mappings between analog or binary input and output vectors. For ex- FIGURE 1. Typical ART 1 neural network (Carpenter & Gross- ample, Fuzzy A R T M A P learns in five training ep- berg, 1987a). ochs a benchmark that requires twenty thousand ep- ochs for back propagation to learn (Lang & Witbrock. one component of Y is nonzero after this choice takes 1989). The Fuzzy ART system is summarized in Sec- place. Activation of such a winner-take-all node de- tion 3. fines the category, or symbol, of the input pattern i. Fuzzy ART incorporates the basic features of all Such a category represents all the inputs I that max- ART systems, notably, pattern matching between imally activate the corresponding node. bottom-up input and top-down learned prototype Activation of an F, node may be interpreted as vectors. This matching process leads either to a res- "'making a hypothesis" about an input I. When Y is onant state that focuses attention and triggers stable activated, it generates a signal vector U that is sent prototype learning or to a self-regulating parallel top-down through the second adaptive filter. After memory search. If the search ends by selecting an multiplication by the adaptive weight matrix of the established category, then the category's prototype top-down filter, a net vector V inputs to F, (Figure may be refined to incorporate new information in 2b). Vector V plays the role ol a learned top-down the input pattern. If the search ends by selecting a expectation. Activation of V by ¥ may be interpreted previously untrained node, then learning of a new as "testing the hypothesis" ¥. or "'reading out the category takes place. category prototype" V. The ART l network is de- Figure 1 illustrates a typical example chosen from signed to match the "'expected p r o t o t y p e V of the the family of ART 1 models, and Figure 2 illustrates category against the active input pattern, or exem- a typical ART search cycle. As shown in Figure 2a, plar, I. an input vector ! registers itself as a pattern X of This matching process ma~ change the k acuv~tv activity across level Fj. The F~ output vector S is then pattern X by suppressing activauon of all the feature transmitted through the multiple converging and di- detectors in I that are not confirmed by V. The re- verging adaptive filter pathways emanating from F~. sultant pattern X* encodes the pattern of features to This transmission event multiplies the vector S by a which the network "pays attention." If the expec- matrix of adaptive weights, or long-term memory tation V is close enough to the input |. then a state (LTM) traces, to generate a net input vector T to of resonance occurs as the attentional focus takes level F:. The internal competitive dynamics of /< hold. The resonant state persists long enough for contrast-enhance vector T. A compressed activity learning to occur; hence the term adaptive resonance vector ¥ is thereby generated across F> In ART 1, theory. ART 1 learns prototypes, rather than ex- the competition is tuned so that the F2 node that emplars, because the attended feature vector X ~. receives the maximal F~ --~ Fz input is selected, Only rather than the input ! itself, is learned.
    3. Fuzzy ART 701 (a) , ~ 1 +S F, A I I (c) (d) .EsE, I+ +' I FIGURE 2. ART search for an F2 code: (a) The input pattern I generates the specific STM activity pattern X at F, as it nonspecificaily activates the orienting subsystem A. Pattern X both inhibits A and generates the output signal pattern S. Signal pattern S is transformed into the input pattern T, which activates the STM pattern Y across F2; (b) Pattern Y generates the top-down signal pattern U, which is transformed into the prototype pattern V. If V mismatches I at F1, then a new STM activity pattern X* is generated at F1. The reduction in total STM activity which occurs when X is transformed into X* causes a decrease in the total inhibition from F1 to A; (c) If the matching criterion fails to be met, A releases a nonspecific arousal wave to F2, which resets the STM pattern Y at F2; (d) After Y is inhibited, its top-down prototype signal is eliminated, and X can be reinstated at F, Enduring traces of the prior reset lead X to activate a different STM pattern Y* at F2. If the top-down prototype due to Y* also mismatches I at F1, then the search for an appropriate F2 code continues. The criterion of an acceptable match is defined egory, or hypothesis to represent input I at level F~. by a dimensionless parameter called vigilance. Vig- An orienting subsystem A mediates the search pro- ilance weighs how close the input exemplar I must cess (Figure l). The orienting subsystem interacts be to the top-down prototype V for resonance to with the attentional subsystem, as in Figures 2c and occur. Because vigilance can vary across learning trials, 2d, to enable the attentional subsystem to learn about recognition categories capable of encoding widely novel inputs without risking unselective forgetting of differing degrees of generalization, or morphological its previous knowledge. variability, can be learned by a single ART system. The search process prevents associations from Low vigilance leads to broad generalization and ab- forming between Y and X* if X* is too different from stract prototypes. High vigilance leads to narrow gen- I to satisfy the vigilance criterion. The search process eralization and to prototypes that represent fewer resets Y before such an association can form. A fa- input examplars. In the limit of very high vigilance, miliar category may be selected by the search if its prototype learning reduces to exemplar learning. Thus prototype is similar enough to the input I to satisfy a single ART system may be used, say, to recognize the vigilance criterion. The prototype may then be abstract categories of faces and dogs, as well as in- refined in light of new information carried by I. If I dividual faces and dogs. is too different from any of the previously learned If the top-down expectation V and the bottom- prototypes, then an uncommitted [2 node is selected up input I are too novel, or unexpected, to satisfy and learning of a new category is initiated. the vigilance criterion, then a bout of hypothesis test- A network parameter controls how deeply the ing, or memory search, is triggered. Search leads to search proceeds before an uncommitted node is cho- selection of a better recognition code, symbol, cat- sen. As learning of a particular category self-stabi-
    4. 762 G. /t. ('arpenter, S. Grossbet~g~ attd D. B lqose~ lizes, all inputs coded by that category access it di- learning translate into Fuzzy AR'I operations by re- rectly in a one-pass fashion, and search is automatically placing the set theory intersection operator ((~) of disengaged. The category selected is then the one ART 1 by the fuzzy set theory conjunction, or M1N whose prototype provides the globally best match to operator (/}. Despite this close formal homology, the input pattern. Learning can proceed on-line, and Fuzzy ART is described as an algorithm, rather than in a stable fashion, with familiar inputs directly ac- a locally defined neural model. A t~cural network tivating their categories, while novel inputs continue realization of Fuzzy ART is desc6bed elsewhere to trigger adaptive searches for better categories, un- (Carpenter, Grossberg, & Rosen. 19talc). For ~he til the network's memory capacity is fully utilized. special case of binary inputs and fast learning, the The read-out of the top-down expectation V may computations o[ Fuzzy ART arc identical to those be interpreted as a type of hypothesis-driven query. of the ART 1 neural network, 'Fhc [:uzzv ART al- The matching process at F~ and the hypothesis testing gorithm also includes two optional leatures, tree con- process at /~'~ may be interpreted as query-driven cerning learning and the other input preprocessmg, symbolic substitutions. From this perspective, AR'I as described in Section 2 systems provide examples of new types of self-or- ganizing production systems (Laird. Newell, & Ro- 2. FAST-LEARN S L O W - R E C O D E A N D senbloom, 1987). By incorporating predictive feed- COMPLEMENT CODING back into their control of the hypothesis testing cycle, Many' applications of ART I use [asi iearnmg, whereby ARTMAP systems embody self-organizing produc- adaptive weights fully converge tt~ new equilibrium tion systems that are also goal-oriented. ARTMAP values in response to each input pattern, Fast learn- systems are thus a new type of self-organizing experl ing enables a system to adapt quickly to inputs that system, which is capable of stable autonomous fast may occur only rarely and that may require imme- learning about nonstationary environments that ma3 diate accurate performance. The ability of humans contain a great deal of morphological variability. The to remember man~ details of an exciting movie is a fact that fuzzy logic may also be usefully incorporated typical example of fast learning. I~ has been math-- into ARTMAP systems blurs even further the tra- ematicallv proved that ART 1 c~m carry out fast ditional boundaries between artificial intelligence and learning of stable recognition categories in an on,line neural networks. setting in response to arbitrary lists of binary input The new Fuzzy ART model incorporates the de- patterns (Carpenter & Grossberg, 1987a), in con- sign features of other ART models due to the close trast, error-based learning models like backpropa- formal homolog between ART 1 and Fuzzy ART gation become unstable in this type of learning en- operations. Figure 3 summarizes how the ART 1 vironment. This is because back propagation learning operations of category choice, matching, search, and is driven by the difference between the actual output and a target output. Fast learning would zero this ART 1 FUZZY ART error signal on each input trial and would thus force (BINARY) (ANALOG) unsclective forgetting of past lear~m~g. This feature of back propagation restricts its domain to off-line CATEGORY CHOICE learning applications carried out ~ith a slow' learning r a t e . Off-line learning is needed because real-time presentation of inputs with variable durations has a a+lwjl similar effect on learning as presenting the same in- puts with a fixed duration but variable learning rates. MATCH CRITERION In particular, longer duration inputs reduce the error signal more on each input trial and thus have an effect P similar to fast learning. In addition, lacking the ke), teature of competition, a back propagation system FAST LEARNING tends to average rare events with similar frequent events that may have different consequences. Wj(new) = I f-'lwj(°la) Wj (new) = I A wj (°ld) For some applications, it is useful to combine fas~ initial learning with a slower rate of forgetting. We n = logical A N D A = fuzzy A N D call this the ,['ast-commit slow-recode option, This intersection minimum combination of properties retains the benefit of fast learning; namely, an adequate response to inputs FIGURE 3. Analogy between ART 1 and Fuzzy ART. The no- that may occur only rarely and in response to which tation wt In ART 1 denotes the index set of top-down LTM accurate performance may be quickly demanded. The traces of the j-th category that exceed a prescribed positive threshold value. See Carpenter and Grossberg (1987a) for slow-recode operation also prevents features that have details. already been incorporated into ?. category's proto-
    5. Fuzzy A R T 763 type from being erroneously deleted in response to relationship between A R T on-cell/off-cell r e p r e - noisy or partial inputs. With slow recoding, only a sentations, hypothesis testing, and probabilistic statistically persistent change in a feature's relevance logic that was outlined at the theory's inception to a category can delete it from the prototype of t h e and used to explain various perceptual and cognitive category. The fast-commit slow-recode option in Fuzzy data (Grossberg, 1980, Sections 7-9; Grossberg, ART corresponds to ART 2 learning at intermediate 1982, Section 47). learning rates (Carpenter, Grossberg, & Rosen, Section 4 discusses Fuzzy ART systems in a pa- 1991a). rameter range called the conservative limit. In this The input preprocessing option concerns normal- limit, an input always selects a category whose weight ization of input patterns. It is shown below that input vector is a fuzzy subset of the input, if such a category normalization prevents a problem of category pro- exists. Given such a choice, no weight change occurs liferation that could otherwise occur (Moore, 1989). during learning; hence the name conservative limit, A normalization procedure called complement cod- since learned weights are conserved wherever pos- ing is of particular interest from three vantage points. sible. Section 5 describes Fuzzy ART coding of two- From a neurobiological perspective, complement dimensional analog vectors that are preprocessed into coding uses both on-cells and off-cells to represent complement coding form before being presented to an input pattern, and preserves individual feature the Fuzzy ART system. The geometric interpretation amplitudes while normalizing the total on-cell/off- of Fuzzy ART dynamics is introduced here and il- cell vector. From a functional perspective, the on- lustrative computer simulations are summarized. The cell portion of the prototype encodes features that geometric formulation allows comparison between are critically present in category exemplars, while Fuzzy ART and aspects of the NGE (Nested Gen- the off-cell portion encodes features that are criti- eralized Exemplars) algorithms of Salzberg (199(I). cally absent. Features that are occasionally present Section 6 further develops the geometric interpre- in a category's input exemplars lead to low weights tation and provides a simulation of Fuzzy ART with- in both the on-cell and the off-cell portions of the out complement coding to show how category pro- prototype. Finally, from a set theoretic perspective, liferation can occur. Section 7 compares the stability complement coding leads to a more symmetric theory of Fuzzy ART to that of related clustering algorithms in which both the MIN operator (/~) and the MAX that were discussed by Moore (1989). The Fuzzy ART operator (V) of fuzzy set theory play a role (Figure computations of choice, search, learning, and com- 4). Using both the MIN and the MAX operations, plement coding endow the system with stability prop- a geometrical interpretation of Fuzzy ART learning erties that overcome limitations of the algorithms is given in terms of box-shaped recognition categories described by Moore. whose corners are iteratively defined in terms of the /~ and V operators. Complement coding hereby es- tablishes a connection between on-cell/off-cell re- presentations and fuzzy set theory operations. This 3. S U M M A R Y OF THE FUZZY linkage further develops a theme concerning the ART ALGORITHM Input vector: Each input I is an M-dimensional v e c - A Fuzzy AND (conjunction) tor (L . . . . . IM), where each component li is in the interval [0, 1]. V Fuzzy OR (disjunction) W e i g h t v e c t o r : Each category (j) corresponds to a vector w/-= (Wjl. . . . . win) of adaptive weights, or LTM traces. The number of potential categories N(j = i ..... N ) is arbitrary. Initially y , ........ x.Vy w,, = . . . - w , , = 1. (1) and each category is said to be uncommitted. Alter- natively, initial weights w/, may be taken greater than . xAy ...... • X 1. Larger weights bias the system against selection of uncommitted nodes, leading to deeper searches 0 of previously coded categories. After a category is selected for coding it becomes x= Y = (Yt,Y2) committed. As shown below, each LTM trace wii is (x A Y)l = m i n ( x l , y l ) (x A Y)2 = min(x2,Y2) monotone nonincreasing through time and hence (X V Y)l =max(xl,Yl) (x V Y)2 = max(x2,Y2) converges to a limit. The Fuzzy ART weight vector w, subsumes both the bottom-up and top-down weight FIGURE 4. Fuzzy operations. vectors of ART 1.
    6. 764 G. A. Carpenter, S. Grossberg, and D. B. Rosen P a r a m e t e r s : Fuzzy A R T dynamics are determined inputs erode the norm of weight vectors. Prolifera- by a choice parameter a > 0; a learning rate par- tion of categories is avoided in Fuzzy A R T if inputs ameter fl ~ [0, 1]; and a vigilance parameter p are normalized; that is, for some :, ::, (L [0, 11. Category c h o i c e : For each input I and category j, the choice function Ti is defined by for all inputs 1. Normalization can be achieved by preprocessing each incoming vector a, for example T/(I) = [I/%] setting + !w,l' (2) a where the fuzzy A N D (Zadeh, 1965) o p e r a t o r / ~ is I : ~-. (10) defined by An alternative normalization rule, called c o m p l e - (x A y), -=- min(x, y,), (3) ment coding, achieves normalization while preserv- and where the norm I'1 is defined by ing amplitude information. Complement coding rep- resents both the on-response and the off-response to M Ix}--- .~ Ix,I. (4) a (Figure 5). To define this operation in its simplest i=1 form, let a itself represent the on-response. The com- plement of a, denoted by a c, represents the off-re- For notational simplicity, Tj(I) in eqn (2) is often sponse, where written as Tj when the input I is fixed. The category choice is indexed by J, where a: -=-l - a~. (ll) 1": = max{T/:] = 1 . . . N}. (5) The complement coded input I to the recognition system is the 2M-dimensional vector If more than one T) is maximal, the category j with the smallest index is chosen. In particular, nodes ! = (a, a ~) ~- (at . . . . . aM, a', . . . . . a~). (12) become committed in order j = 1, 2, 3, . . . . Note that R e s o n a n c e o r r e s e t : R e s o n a n c e occurs if the match function of the chosen category meets the vigilance [!I = I(a, ac)l criterion; that is, if II/~ wj[ _> p. (O) =Zo,+(M-Z at ) ~4 ~f M, (13) Learning then ensues, as defined below. Mismatch r e s e t occurs if so inputs preprocessed into complement coding form are automatically normalized. Where complement w~____~t < p. tl A (7) coding is used, the initial condition (1) is replaced Ill by Then the value of the choice function Tj is reset to %1 = . . . : %2~ i (14) - 1 for the duration of the input presentation to pre- vent its persistent selection during search. A new index J is chosen, by eqn (5). The search process continues until the chosen J satisfies eqn (6). L e a r n i n g : The weight vector wj is updated according to the equation Fll I =(a,a c) iIl=M w~"~w) = fl(l A wy ''°)) + (1 - fl)w~°'d). (8) Fast learning corresponds to setting fl = 1 (Figure 3). The learning law (8) is the same as one used by al [aC = (1-a 1, ..., 1-a M) Moore (1989) and Salzberg (1990). F a s t . c o m m i t s i o w - r e c o d e o p t i o n : For efficient coding of noisy input sets, it is useful to set fl = 1 when J is an uncommitted node, and then to take fl < 1 after F0 a the category is committed. Then w(j"~w) = I the first time category J becomes active. l a = (a 1, ..., aM) Input normalization option: Moore (1989) described a category proliferation problem that can occur in FtGUM S. ~ codkw m m ~ m d off-~l ~s some analog A R T systems when a large number of to normalize input vectors.
    7. Fuzzy A R T 765 4. F U Z Z Y SUBSET CHOICE AND ONE- a fuzzy subset choice of I, by eqn (16). If ! is pre- SHOT FAST LEARNING IN T H E sented again, it will either choose J = j or make CONSERVATIVE LIMIT another fuzzy subset choice, maximizing ]w:], be- cause fuzzy subset choices (16) maximize the cate- In fast-learn A R T 1, if the choice parameter a in (2) gory choice function (2) in the conservative limit. In is chosen close to 0 (see Figure 3), then the first either case, w~n~w) = !/~ w~°~d) = w~°~d), which implies category chosen by (5) will always be the category that neither reset nor additional learning occurs. whose weight vector wl is the largest coded subset of the input vector I, if such a category exists (Car- penter & Grossberg, 1987a). In other words, wj is chosen if it has the maximal number of l's (wji = 1) 5. F U Z Z Y A R T W I T H at indices i where Ii = 1, and O's elsewhere, among COMPLEMENT CODING all weight vectors wj. Moreover, when wj is a subset A geometric interpretation of Fuzzy A R T with com- of I during resonance, w: is unchanged, or conserved, during learning. More generally, wj encodes the plement coding will now be developed. For definite- attentional focus induced by I, not I itself. The ness, let the input set consist of two-dimensional limit a ~ 0 is called the conservative limit because vectors a preprocessed into the four-dimensional small values of a tend to minimize recoding during complement coding form. Thus learning. I = (a, a') = (a,, a_,, 1 - a,, 1 - a~_). (18) For analog vectors, the degree to which y is a fuzzy subset of x is given by the term In this case, each category ) has a geometric repre- sentation as a rectangle Rj, as follows. Following eqn Ix A yl (18), the weight vector wj can be written in comple- lyl (15) ment coding form: (Kosko, 1986). In the conservative limit of Fuzzy w/ = (uj, v',), (19) ART, the choice function Tj in eqn (2) reflects the where u/and vj are two-dimensional vectors. Let vec- degree to which the weight vector wj is a fuzzy subset of the input vector I. If tor uj define one corner of a rectangle Rj and let vj define another corner of Rj (Figure 6a). The size of I1 A wjl _ 1, (16) R/is defined to be IR:I -= pv,- u,J, (20) then wj is a fuzzy subset of I (Zadeh, 1965), and which is equal to the height plus the width of Rj in category j is said to be a fuzzy subset choice for input Figure 6a. I. In this case, by eqn (8), no recoding occurs if j is selected since I / ~ wj = wr Resonance depends on the degree to which 1 is a In a fast-learn Fuzzy A R T system, with fl -- 1 in fuzzy subset of wj, by eqns (6) and (7). In particular, eqn (8), w~"ewl = I = (a, a c) when J is an uncommitted if category j is a fuzzy subset choice, then the match node. The corners of R~newl are then given by a and function value is given by (aC)c = a. Hence, R~"~wl is just the point a. Learning increases the size of each Rj. In fact the size of Rj II/~ w/l = ~ (17) grows as the size of wj shrinks during learning, and al al' the maximum size of Rj is determined by the vigilance Thus, choosing J tO maximize ]wjl among fuzzy subset parameter p, as shown below. During each fast-learn- choices also maximizes the opportunity for resonance ing trial, Rj expands to Rj ~ a, the minimum rec- in eqn (6). If reset occurs for the node that maximizes tangle containing Rj and a (Figure 6b). The corners Iwjl, reset will also occur for all other subset choices. of Rj G) a are given by a / ~ us and a V vl, where Consider a Fuzzy A R T system in the conservative limit with fast learning and normalized inputs. Then (x V y)i -= max(x,, y,) (21) c~ = 0 in eqn (2), fl = 1 in eqn (8), and eqn (9) holds. (Zadeh, 1965). Hence, by eqn (20), the size of R: • Under these conditions, one-shot stable learning oc- a is given by curs; that is, no weight change or search occurs after each item of an input set is presented just once, al- IR, • al = p(a V v,) - (a A u,)l. (22) though some inputs may select different categories However, reset leads to a new category choice if on future trials. To see this, note by eqns (6), (8), IRj • a[ is too large. These properties will now be and (9) that when I is presented for the first time, proved. w~"~) ~ I/~ w~°l~) for some category node J = j such Suppose that I = (a, a C) chooses category J, by that II/~ w)°'d)] -> p l I [ = PY- Thereafter, category j is eqn (5). The weight vector wj is updated according
    8. 766 (;. A. Carpenter, S. Grossberg, and D, B. RosetJ satisfies !R~ • a i ~ 2(1 -- p). ~26) By eqn (26), if vigilance p is close to 1, then all R,s are small. If p is close to 0, then some Rj's may grow to fill most of the unit square [0, lj ~: [0, 1]. Suppose now that the match criterion is satisfied. a; By eqn (23) w', '''~' = ! A wl, '''~ (a, a') A (u~ . ~v':"%') i, .... (a/", u',''~"', a' ~v}"~') ' ) 0 1 (a) : ( a / u~''~'~', ,,. v':'"")') (a =: (ul,,~ , ( v~,~,),~ . j . (27) Thus --?,aVvj R~"c~' = R~?~d' @ a In particular, no weight changes occur if a ~ R}°~d'. ~2~) Rj : In summary, with fast learning, each R / e q u a l s the I smallest rectangle that encloses all vectors a that have chosen category j, under the constraint that ]Rji i ai' 2(1 - - p ) . In general, if a has dimension M, the hyper-rec- • ................ e ]aAuj a tangle R/includes the two vertices/'~.ia and V/a, where the i-th component of each vector is 0 1 (A/a)~ = min{a,:a has been coded by category it (29) (b) and (V,a), = max{a,:a has been coded by category j} (30) FIGURE 6. (a) In complement coding form with M = 2, each weight vector w I has a geometric Interpretation as a rectangle (Figure 7). T h e size of R/is given bv Rj with corners (uj, vl); (b) During fast learning, Rj expands to Rj ~ a, the smallest rectangle that Includes Rj and a, iRit = iV~a -.''ial (31) provided that IRa • al -< 2(1 - p). to the fast-learn equation a2 W~"~r) ~ I A w)"tJ~ (23) 1 only if the match criterion eqn (6) is satisfied. Be- cause of complement coding, III = M, by eqn (13). Thus, when M = 2, the match criterion eqn (6) is R; satisfied iff Wc j4 tl A wjI _> 2p. (24) However, w j2 I[ A w,I = I(a, a") A (u,, v~;)l = I(a A u,), (a ~ A v~)l = t(a A u,), (a V v~)q = ka A u , l + 2 - lay v,I 0 w jl wj3 1 1 = 2 - IR: • al, (25) FIGURE 7. ~ fast ~ g ~ ~ ~ thej- by eqn (22). T h u s by eqns (24) and (25), the match th c e ~ ~ n, ~ © ~ d ~ ~ a l . the criterion is met iff the e x p a n d e d rectangle Rj @ a unit s q u m which have activated category ] withoutreset.
    9. Fuzzy A R T 767 As in eqn (27), 1,R1 w, = (A,a, (Via)') (32) SO !w,I = ~ (A,a), + ~" ll - (Via),] (a) (b) (c) - M-IV,a- A,a]. (33) The size of the hyper-rectangle R i is thus IR,I - M -Iw, I. (34) By eqns (6), (8), and (13), [w,l >- Mp. (35) (d) (e) if) By eqns (34) and (35), rR, I-< M(I - p). (36) Thus, in the M-dimensional case, high vigilance (p ~ 1) again leads to small R/while low vigilance (p ~ 0) permits large R i. If]" is an uncommitted node, W Iwii = 2 M , by eqn (14). and so, IR~] -= - M , by eqn (34). These observations may be combined into the (g) (h) (i) following theorem. FIGURE 8. Fuzzy ART complement coding simulation with ~ 0,/~ -- 1, p -- .4, and input vectors a <oselected at random THEOREM (Stable Category Learning): from the unit square. Rectangles Rj grow during learning, In response to an arbitrary sequence of analog or and new categories are established, until the entire square binary input vectors, a Fuzzy A R T system with com- is covered by 8 rectangles. Categories do not proliferate. A plement coding and fast learning forms stable hyper- new point rectangle, R2, is established at t = 4, since R1 • a ~4~is too large to satisfy the match criterion eqn (26). rectangular categories R i, which grow during learning to a maximum size IR/I -< M(1 - p) as Iw,I mono- tonically decreases. In the conservative limit, one- pass learning obtains such that no reset or additional conservative limit, and p = .4. When the first cat- learning occurs on subsequent presentations of any egory is established, R~ is just the point a (~. If a ('~ input. lies within one or more established R i, the rectangle Similar properties hold for the fast-learn slow-re- chosen is the one that has the smallest size IR/. In code case, except that repeated presentations of an this case, neither reset nor weight change occurs. input may be needed before stabilization occurs. Each new input that activates c a t e g o r y / , but does The geometry of the hyper-rectangles Rj resem- not lie within its previously established boundaries, bles part of the Nested Generalized Exemplar (NGE) expands R i unless (as in (d)) such an expansion would algorithm (Salzberg, 1990). Both N G E and Fuzzy cause the size o f R i t o exceed 2(1 - p) - 1.2. As A R T M A P (Carpenter, Grossberg, Reynolds, & Ro- more and more inputs sample the square, all points sen, 1991) construct hyper-rectangles that represent of the square are eventually covered by a set of eight category weights in a supervised learning paradigm. rectangles R i, as illustrated by (g)-(i). Both algorithms use the learning law eqn (8) to up- date weights when an input correctly predicts the output. The two algorithms differ significantly, how- 6. F U Z Z Y A R T W I T H O U T ever, in their response to an incorrect prediction. In COMPLEMENT CODING particular, N G E has no analogue of the A R T vigi- lance parameter p, and its rules for search differ from The advantages of complement coding are high- those of Fuzzy ART. In addition, N G E allows hyper- lighted by consideration of Fuzzy A R T without this rectangles to shrink as well as to grow, so the Fuzzy preprocessing component. Consider again a fast-learn A R T stability properties do not obtain. Fuzzy A R T system with M = 2. Let the input set In the computer simulation summarized in Figure consist of two-dimensional vectors I = a. During 8, M = 2 and vectors a ~), a ~2), . . . are selected at learning, random from the unit square. Each frame shows the w~°'~1 = a/x wj"'ul. (37) vector a t" and the set of rectangles Rj present after learning occurs. The system is run in the fast-learn, since/3 : 1 in eqn (8). Without complement coding,
    10. 768 G. A. Carpenter, S. Grossberg, and D. B. Rosen the monotone decrease in Iwjl that is implied by eqn 1], let (37) can lead to a proliferation of categories, as fol- lows. e la) : w: S = 2: TI, 40) Geometry of Fuzzy A R T : The choice, match, and learning computations of Fuzzy A R T (Figure 3) will where w is a vector in the unit square. For each T, now be described geometrically. Without comple- region Pr(a) is a choice polygon with boundary ment coding, these computations can be described in terms of polygonal regions. With complement cod- aeffa) : { iw:/ ' w L+ _!wi .... " i a . _ (4lt ing, the analogous regions would be four-dimen- sional sets. Progressively smaller values of I induce a family of For a given input a, Fuzzy A R T choice is char- progressively larger polygons that are nested within acterized by a nested set of polygons. These polygons each other. In the conservative limit (a ~ 0), are defined by the choice function P~(a) ~ a and all Pr(a) include a for 0 ~ T ~ 1. la A wfi By eqns (38) and (40), an LTM vector T~(a) - (38) + tw,l" wi C Pr(a) {42} and thus are called choice polygons (Figure 9), As iff in eqn (5), the category choice is indexed by J, where Tr(a) :>- I. (43) Tj(a) = max{Tj(a):j = 1,2 . . . . . N}. (39) Thus, to find a category choice J that maximizes Tj(a), the largest T must be found such that The choice J can be geometrically interpreted as fol- lows. For each input vector a and number T ~ [0, a ~ PT(a), (44) 0 a1 1 Tj=T ~(.L~.X~a1_ ~ (a) (b) Resonance 1 (ii) (i) ~ p ] ~ Resonl a2 (iv) 0 aI 1 0 (c) (d) FIGURE 9. Geometry of Fuzzy ART without complement coding. Input a divides the unit ~ into ~ r ~ ~ s (i) . . . . . (Iv). (a) Choke ~ s PT(a) for 0 < T < 1 and 0 < a < 1. if Input a ~ 8~ J~ Tj = T, then wj Is a point on dPT(a), but no weight vector w, t is ~ " - t s d onto the ~ Vlmgl.; (d) w, shrinks to 0 88 Iw,I ~ O. If 8 is In the~ change occum ¢lurlng IoBmlng,
    11. Fuzzy ART 769 wj E 0Pria), (45) but w/ ~ Ps(a) (46) for any j = 1, 2 . . . . . N and S > T. To accomplish this, T is decreased and the corresponding polygon Pr(a) expands until its boundary 0PT(a) intersects (a) (b) (c) the first LTM vector wj, ] = 1, 2 . . . . . N, at some value T. Then set J = j and ~(a) = T. Resonance and reset regions: Once chosen, node J remains active only if the vigilance criterion is met: namely, by eqn (6), la/~ wj] -> plal. (47) (d) (e) (f) The resonance and reset regions can be visualized in terms of the four rectangular regions into which a divides the unit square (Figure 9b). If wj is in region (i), la/~ w/I -- lal. (48) Thus, eqn (47) is satisfied given any vigilance value p ~ [0, i]. On the other hand, if wi is in region (iv), (g) (h) (i) then FIGURE 10. Fuzzy ART simulation with a -~ 0, /~ = 1, and ]a/~ w,] = Iw,[, (49) p = .4. The sequence of input vectors aro is the same as in Figure 8. The resonance triangle is shown for each category, so node J will be reset if with the triangle for category I shaded. Categories proliferate as Iw/I-* 0. Iw, j <: p[a[. (50) The boundary of the reset region m (iv) is thus de- fined by the straight line {w:lw[ = p[a[}, which ap- proaches a as p approaches 1. The fact that the reset a ~'~ as in Figure 8 were presented to a Fuzzy A R T boundary is a vertical line in region (ii) and a hori- system. Each frame shows the vector a l') and the zontal line in region (iii) is checked by evaluating triangular subset of the resonance region (Figure 9d) [a/~ wi] in these regions. Figure 9b pieces together for all established categories. As shown, proliferating these reset regions and depicts the complementary categories cluster near the origin, where they are resonance region in gray at a vigilance value p < 1. rarely chosen for resonance, while new categories Learning: After search, some node J is chosen with are continually created. This problem is solved by w: in the resonance region. During learning, a / ~ wj complement coding of the input vectors, as was il- becomes the new weight vector for category J, by lustrated in Section 5. eqn (37). That is, wj is projected to region (iv); spe- cifically, to the shaded triangle in Figure 9c. Thus, unless w~ already lies in region (iv), wj is drawn to- 7. STABILITY OF ward the origin during learning. However, as wl ap- CLUSTERING ALGORITHMS proaches the origin, it leaves the resonance region Moore (1989) described a variety of clustering al- of most inputs a (Figure 9b). To satisfy the resonance gorithms, some of them classical and others, based criterion, future inputs are forced to drag other weight on A R T 1, that are similar to Fuzzy ART. All use, vectors toward the origin, or to choose uncommitted however, a choice function that includes a dot prod- nodes, even though the choice value of these nodes uct or Euclidean distance measure that differs from is small. Figure 9d illustrates, for two different weight the choice function ~ in eqn (2). In addition, com- vectors wj and w~, the sets of points a where reso- plement coding is not used. For example, the Cluster nance will occur if category J or J' is chosen. As Euclidean algorithm (Moore, 1989, p. 176) chooses shown, this set shrinks to zero as [wjl approaches 0. the coded category J whose weight vector wj is the Category proliferation: Figure 10 shows how the minimal Euclidean distance d(w/, I) to I, and which properties of Fuzzy A R T described above can lead satisfies to category proliferation. In the simulation illus- trated, the same randomly chosen sequence of inputs d(w:, I) <- 0 (51)
    12. 770 (; A. ( arpenter, S. (.;rossbe,:<. am~ D. B. Rose/e If s u c h a J exists, w: is u p d a t e d by Pattern recognition by self orgatuzing ne:,rai networks' (('hap. 14L Cambridge, MA: MIT Press. w,I'°~w' = /fl + (1 - /0w~'<'. (52) Carpenter, G. A,, Grossberg, S., & Re~nolds, ,i II, (1991a) ARTMAP: A self-organizing neural network architecture (or If n o s u c h c a t e g o r y exists, a n u n c o m m i t t e d n o d e J fast supervised learning and pattern recognJ|ion. ]nlernaIiona/ is chosen and w5"~wl = l, as in the fast commitment Joint ('onlerence on Neural Networks (S~ attlell (pp 863 -g68! option. The Cluster Unidirectional algorithm (Moore. Piscataway, N J: IEEE Service Centre t989, p. 177) is similar except that weights are up- Carpenter, G. A., Grossberg, S., & Reyqoids. d. H. (19¢;ibl. ARTMAP: Supervised real-time learning,, and classification oi dated according to eqn (8). Moore pointed out that nonstationary dala by a self-organizing neural network. Neural the Cluster Euclidean algorithm is unstable in the Networks. 4, 565-588. Reprinted m (; A. Carpenter & S sense that weight vectors and category boundaries Grossberg (Eds.) ( 1991 ). Pattern recogni:ion by se([organizin~: may cycle endlessly. Moore also showed that the neural networks (Chap. 15), Cambridge, MA: MIT Press unidirectional weight update rule eqn (8) avoids this Carpenter, G. A., Grossberg, S , Markuzon, N., Reynolds. J 11., & Rosen. D, B, (19911. Fuz:~ 4RTMAP: A ne'urai type of instability, but introduces the category pro- network architecture/or in( remental vup('rvised learning ~:/ an liferation problem described in Section 6. alov multidimensional maps. Technics~ :~ci ~r¢ (TAS'CN,'-;-gi As noted in the Stable Category Learning Theo- (116. Boston, MA: Boston University rem, normalization of inputs using complement cod- Carpenter, G. A.,Grossberg, S.,& Rosen i ) B.(i9L}Ia), ARi 2-A: An adaptive resonance algorithm 1ol rapid category learning ing allows Fuzzy ART to overcome the category pro- and recognition. Neural Networks, 4, 4~13 504. liferation problem while retaining the stable coding Carpenter, G. A., Grossberg, S , & Rosen I). B. (t991b). t:"uzz~ properties of the weight update rule eqn (8). The ART: An adaptive resonance algorithm [or rapid, stable clas- strong stability and rapid convergence properties of sification of analog patterns (Technical Report CAS/CNS-9~- Fuzzy ART models are due to the direct relationship 1t1161. International ,Ioint Confereme ,,, Neural Networks (Seattle)li (pp. 411--4161, Piscatawav NJ IEEE Service between the choice function eqn (2), the reset rule Center. eqn (7), and the weight update rule eqn (8). Choice_ Carpenlcr, G. ,.. Grossberg, S, ,k R~>,',,, l), B. i I991~) 4 search, and learning are made computationally con- neural network realization o/ Fuzzy _,i/,~!" lechnical Report sistent by the common use of the vector i / ~ w/. This CAS/CNS-91-021. Boston, MA: Bosio;: J, nive~sit~ direct relationship enables Fuzzy ART models to be Galindo, P 1_.. & Michaux, T. {199111 AI; mqmwed c(mlpctimc embedded in multilevel Fuzzy ARTMAP systems for learning algorithm applicd to high lc',cl speech processing. International Joint Conference on ,'e~ra/ Netn'ork,v (Wash- supervised learning of categorical maps between ington, D('}, I (pp, 142-146t |-lills&dc. N,|: Erlbaum A> m-dimensional and n-dimensional analog vector pairs sociatcs. (Carpenter, Grossberg, Reynolds, & Rosen, 1991). Gjerdingen, R. (). (19911). Catcgorizamm c,| musical patterns b~ self-organizing neuronlike networks..,If u@ Perception, 7,339- 37!). Gochm, P. {1991D. Pattern recognition m |n'mmte temporal cortex: REFERENCES But is it ART? International Joint C'ot'.tk'rence on Neural Net- ~orks (Washington, DC), I (pp. 7"> ,>()) Hillsdale. N J: Erl- Baloch, A. A., & Waxman, A. M. 1t9911. Visual learning, adap- baum Associates, tive expectations, and behavioral conditioning of the mobile Grossberg, S, (1976). Adaptive pattern classdication and universal robot MAVIN. Neural Networks, 4, 271-3112, rccoding, II: Feedback, expectation, ,.;!laction. and illusions. Baxter, R. (199l). Error propagation and supervised learning in Biological Cybernetics, 23, 187-2tl2, Reprinted in (L A. (ar- adaptive resonance networks. International Joint Conference pcnter & S. Grossberg (Eds.) (1991) Pattern recognition b) on Neural Networks (Seattle)ll (pp. 423-429). Piscataway, N J: ~e!t:organiziag neural networks (Chap ~)) Cambridge, MA: IEEE Service Center. M1T Press. Carpenter, G. A., & Grossberg, S. (1987a). A massively parallel Grossberg, S. (1980). How does a blain t3ufld a cognitive code? architecture for a self-organizing neural pattern recognition Psychological Review, 1, 1-51. Reprinted in S, Grossberg (Ed.i machine. Computer Vision, Graphics, and Image Processing, (1982). Studies o] mind and brain: Neural principles of learning, 37, 54-115. Reprinted in G. A. Carpenter & S. Grossberg perc~Ttion, development, cognition, and motor control (Chap. (Eds.) (1991). Pattern recognition by self-organizing neural I). Boston, MA: Reidel Press, networks (Chap. 10). Cambridge, MA: MIT Press. Grossberg, S. (1982). Processing of expected and unexpected events Carpenter, G. A., & Grossberg, S. (1987b). ART 2: Stable self- during conditioning and attention: A psychophysiologica! the- organization of pattern recognition codes for analog input pat- ory. Psychological Review, 89,529-572, Reprinted in S. Gross- terns. Applied Optics, 26, 4919-4930. Reprinted in G. A. Car- berg (Ed.) (1987). The adaptive brain I (('hap. 3). Amsterdam: penter & S. Grossberg (Eds.) (199l). Pattern recognition by North-Holland. self-organizing neural networks (Chap. 12) Cambridge, MA: Harvcy, R. L., DiCaprio, P. N., Hmnemann, K. G,, Silverman, MIT Press. M. L., & Dugan, J. M. 1199111. A neural architecture for visual Carpenter, G. A., & Grossberg, S. (1990a). ART 3 hierarchical recognition of general stationary objects by machines. Inter- search: Chemical transmitters in self-organizing pattern rec- national Joint Conference on Neural Networks (San Diego), I1 ognition architectures. International Joint Conference on Neural (pp. 937-942). Piscdataway, NJ: IEEE Service Center. Networks (Washington, DC) 11 (pp. 30-33). Hillsdale, NJ: Hecht-Nielsen, R. (1990). Spatiotemporal pattern segmentation Erlbaum Associates. by expectation feedback. International Joint Conference on Carpenter, G. A., & Grossberg, S. (1990b). ART 3: Hierarchical Neural Networks (Washington, DC), !I (p. 40). Hillsdale, N J: search using chemical transmitters in self-organizing pattern Erlbaum Associates. recognition architectures. Neural Networks, 3, 129-152. Re- Johnson, K. (19901, Characteristics of nemal population codes m printed in G. A. Carpenter & S. Grossberg (Eds.) 11991). hierarchicak self-organizing vision machines, International Joint
    13. Fuzzv Art 771 Conference on Neural Networks (Washington, DC), II (pp. 35- Neural Networks (Washington, DC). 11, pp. 167-17(I. Hills- 39). Hillsdale, N J: Erlbaum Associates. dale, N J: Krlbaum Associates. Kosko, B. (1986), Fuzzy entropy and conditioning. Information Moore, B. (1989). ART 1 and pattern clustering. In D. Touretzky, Sciences, 40, 165-174. G. Hinton, & T. Sejnowski (Eds.), Proceedings of the 1988 Kosko, B. (1987a). Adaptive bidirectional associative memories. Connectionist Models Summer School (pp. 174-1985). San Ma- Applied Optics, 26, 4947-4960. Reprinted in G. A. Carpenter teo. CA: Morgan Kaufmann Publishers. & S. Grossberg (Eds.) (1991). Pattern recognition by self-or- Nigrin, A. L. (199(I). Real-time classification of temporal se- ganizing neural networks (Chap. 13). Cambridge, MA: MIT quences with an adaptive resonance circuit. International Joint Press. Conference on Neural Networks (Washington, DC), !, (pp. Kosko, B. (1987b). Bidirectional associative memories, lEEK 525-528). Hillsdale, N J: Erlbaum Associates. Transactions on Systems, Man, and Cybernetics, SMC-17. Rajapakse, J. C,, Jakubowicz, O. G., & Acharya, R. S. (1990). Kosko, B, (1987c). Competitive bidirectional associative memo- A real time ART-t based vision system for distortion invariant ries. In M. Caudill & C. Butler (Eds.), lEEK International recognition and autoassociation. International Joint Confer- Con[erence on Neural Networks. Piscataway. N J: IEKE Ser- enee on Neural Networks (Washington, DC), II (pp. 298-301). vice Center, 2, 759-766, Hillsdale, N J: Erlbaum Associates. Kumar, S. S., & Guez, A. (1989). A neural network approach to Ryan, T. W. (1988). The resonance correlation network. 1EEL target recognition. International Joint Conference on Neural International ConJorence on Neural Networks. Piscataway, N J: Networks" (Washington, DC), II (p. 573). San Diego, CA: IEEE Service Center, 1,673-680. lEEK. Salzberg, S. L. (1990). Learning with nested generalized exem- Kumar, S. S,, & Guez, A. (1991). ART based adaptive pole plars. Boston: Kluwer Academic Publishers. placement for neurocontrollers. Neural Networks, 4, 319- Seibert, M., & Waxman, A. M. (1990a). Learning aspect graph 335. representations from view sequences. In I). S. ~lburetzky (Ed.), Laird, J. E., NeweH, A., & Rosenbloom, P. S. (1987). SOAR: Advances in neural it~[brmation processing systems 2 (pp. 258- An architecture for general intelligence. Arti/icial Intelligence, 265). San Mateo, CA: Morgan Kaufmann. 33, 1-64. Seibert, M., & Waxman, A. M. (1990b). Learning aspect graph Lang, K. J., & Witbrock, M, J. (1989). Learning to tell two spirals representations of 3D objects in a neural network. Interna- apart. In D, Touretzky, G. Hinton, & T. Sejnowski (Eds.), tional Joint ('onferenee on Neural Networks (Washbtgton, DC), Proceedings c~fthe 1988 Connectionist Models Summer School. 11 (pp. 233-236). Hi[Isdale, N J: Erlbaum Associates. (pp. 52-59). San Mateo. CA: Morgan Kaufmann Pub- Simpson, P. K. (1990). Neural networks for sonar signal process- lishers. ing. In A. J. Maren, C. T, Harston, & R. M. Pap (Eds.), Levinc, D. S., & Penz, P. A. (1990). ART 1.5--A simplified Handbook ~[ neural computing applications (pp. 319-335). adaptive resonance network for classifying low-dimensional San Diego: Academic Press. analog data. International Joint Conference on Neural Net- Weingard, K S. (1990). Serf-organizing analog fields (SOAF). works (Washington, DC), !i (pp. 639-642). Hillsdale. N J: Erl- International Joint Con[erence on Neural Networks (Washing- baum Associates. ton, DC), il, (p. 34), Hillsdale, NJ: Erlbaum Associates. Li, D., & Wee, W. G. (1990). A new neocognitron structure Wilson, C. L., Wilkinson, R. A., & Ganis, M.D. (1990). Sell'- modified by ART and back propagation. International Joint organizing neural network character recognition on a mas- ('onference on Neural Networks ( Washington, DC), ] (pp. 42(I- sively parallel computer. International Joint Con/Erence on 423). Hi[Isdale, N J: Krlbaum Associates. Neural Networks (San Diego), II (pp. 325-329). Piscataway, Liao, F.-Y., Liang, C.-C., & Lin, W.-C. (1990). ART 1 ncural N J: IEEE Service Center. network tk)r reducing search space in 3-D object recognition Winter, C. L, (1989). An adaptive network that learns sequences using multiple views. International Joint Con/orence on Neural of transitions. In D. S. Touretzky (Ed,). Advances in neural Networks (San Diego), II (pp. 931-936). Piscataway, NJ: lEEK in/brmation processing systems 1 (pp. 653-660). San Mate(), Service Center. CA: Morgan Kaufmann. Mekkaoui, A., & Jcspers, P. (1990). An optimal self-organizing Winter, C. L., Ryan, T. W., & Turner, C. J. (1987). TIN: A pattern classifier. International Joint Conference on Neural trainable inference network. In M. Caudill & C. Butler (Eds.), Networks (Washington, DC), ! (pp. 447-450). Hillsdale. N J: lEEK International Conference on Neural Networks, 2, 777- Erlbaum Associates, 785. Piscataway, N J: IEEE Service Center. Michalson, W. R., & Heldt, P. (1990). A hybrid architecture for Zadch, L. (1965). Fuzzy sets. h~[brmation and ('ontrol, 8, 338- the AR~I 2 neural model. International Joint Con[~,rence on 353.

    + ESCOMESCOM, 3 months ago

    custom

    45 views, 0 favs, 0 embeds more stats

    Fuzzy ART: Fast Stable Learning and Categorization more

    More info about this document

    © All Rights Reserved

    Go to text version

    • Total Views 45
      • 45 on SlideShare
      • 0 from embeds
    • Comments 0
    • Favorites 0
    • Downloads 2
    Most viewed embeds

    more

    All embeds

    less

    Flagged as inappropriate Flag as inappropriate
    Flag as inappropriate

    Select your reason for flagging this presentation as inappropriate. If needed, use the feedback form to let us know more details.

    Cancel
    File a copyright complaint
    Having problems? Go to our helpdesk?

    Categories