The Principle of Presence: a heuristic for Growing Knowledge Structured Neural Networks







Neural Networks

  • Efficient at learning a single problem

    • Fully connected
    • Convergence in W³
  • Lifelong learning:

    • Specific cases can be important
    • More knowledge, more weights
    • Catastrophic forgetting
  • -> Full connectivity not suitable

  • -> Need locality



How can people learn so fast?

  • Focus, attention

  • Raw table storage?

    • Frog and
    • Car and
    • Running woman
  • With generalization



What do people memorize? (1)

  • 1 memory: a set of « things »

  • Things are made of other, simpler things

  • Thing = concept

  • Basic concept = perceptual event



What do people memorize? (2)

  • Remember only what is present in mind at the time of memorization:



What do people memorize? (3)

  • Not what is not in mind!

    • Too many concepts are known
    • What is present:
      • Few things
      • Probably important
    • What is absent:
      • Many things
      • Probably irrelevant
  • Good, but not always true -> a heuristic



Presence in everyday life

  • Easy to see what is present, hard to notice what is absent

  • Infants lose attention to a ball that has just disappeared

  • The number zero was invented long after the other digits

  • Etc.



The principle of presence

  • Memorization = creating a new concept from only the currently active concepts

  • Independent of the number of known concepts

  • Only a few concepts are active at a time



Implications

  • A concept can be active or inactive.

  • Activity must reflect importance and must be rare

  • ~ an event (in the programming sense)

  • New concept = conjunction of the active ones

  • Concepts must be re-usable (lifelong learning):

    • Re-use = create a link from this concept
    • 2 independent concepts = 2 units
  • -> More symbolic than an MLP, where a single neuron can represent too many things (see the sketch below)
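
As a rough illustration of concepts as discrete, re-usable units, here is a minimal Python sketch; it is an assumption of this summary, not the author's code, and the class and function names are invented:

    # Concepts are discrete units; memorizing builds a new concept only from
    # the concepts that are active at that moment, so the cost depends on the
    # number of active concepts, not on the number of known ones.

    class Concept:
        def __init__(self, name, parts=()):
            self.name = name
            self.parts = tuple(parts)   # links to the simpler concepts it is built from
            self.active = False

    def memorize(known_concepts, name):
        """Create a new concept as a conjunction of the currently active concepts."""
        return Concept(name, parts=[c for c in known_concepts if c.active])

    # Thousands of concepts may be known, but only the few active ones get linked.
    frog, pond, car = Concept("frog"), Concept("pond"), Concept("car")
    frog.active = pond.active = True
    scene = memorize([frog, pond, car], "frog-in-pond")
    print([p.name for p in scene.parts])   # ['frog', 'pond']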



Implementation: NN

  • Nonlinearity

  • Graph properties: local or global connectivity

  • Weights:

  • But more symbolic:

    • Inactivity: piecewise continuous activation function
    • Knowledge not too distributed
    • Concepts not overlapping too much (see the activation sketch below)
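
One way to realize the "inactivity" bullet is an activation function that outputs exactly zero below a threshold, so that a unit can be truly inactive rather than merely small. The sketch below only illustrates that idea; the threshold value and the linear ramp are assumptions, not the paper's exact function:

    import numpy as np

    # Piecewise continuous activation with a genuine inactivity region:
    # exactly 0 below `threshold`, then a continuous ramp up to 1.
    def presence_activation(x, threshold=0.5):
        x = np.asarray(x, dtype=float)
        return np.clip((x - threshold) / (1.0 - threshold), 0.0, 1.0)

    print(presence_activation([0.2, 0.5, 0.75, 1.2]))   # [0.  0.  0.5 1. ]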


First implementation

  • Inputs: basic events

  • Output: target concept

  • No macro-concepts yet:

  • -> 3-layer network

  • Hidden neuron = conjunction,

    • unless explicit (supervised learning),
    • -> DNF (disjunctive normal form)
  • Output weights simulate priorities between conjunctions (see the sketch below)
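
The 3-layer DNF structure can be pictured with the following minimal sketch (my assumed reading of the slide, not the paper's code): binary input events, hidden units that each encode a conjunction, and an output taking the priority-weighted disjunction (max) of the hidden activations:

    def conjunction(inputs, mask):
        """Active (1.0) only if every input selected by `mask` is active."""
        return 1.0 if all(inputs[i] for i in mask) else 0.0

    def dnf_output(inputs, hidden_masks, priorities):
        """Disjunction over the hidden conjunctions, weighted by priority."""
        return max(conjunction(inputs, m) * w for m, w in zip(hidden_masks, priorities))

    # The target concept "AB" stored as three specific conjunctions (cf. the AB example later).
    A, B, C, D, E = range(5)
    hidden = [{A, B, C}, {A, B, D}, {A, B, E}]
    print(dnf_output([1, 1, 0, 1, 0], hidden, [1.0, 1.0, 1.0]))   # 1.0: ABD is recognized
    print(dnf_output([1, 1, 0, 0, 0], hidden, [1.0, 1.0, 1.0]))   # 0.0: AB alone, not yet generalized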



Locality in learning

  • Only one neuron is modified at a time:

    • Nearest = the most activated one
  • If the target concept is not activated when it should be:

    • Generalize the nearest connected neuron
    • Add a neuron for that specific case
  • If the target is active, but not enough or too much:

    • Generalize the most activating neuron (a sketch follows below)
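
The following hedged sketch implements this local rule under assumptions of this summary: hidden neurons are conjunctions stored as sets of input events, "generalizing" drops the inputs absent from the current example, and the near-miss test that decides between generalizing and adding a neuron is an illustrative choice, so the trace may differ in detail from the slides that follow:

    # Local learning: at most one neuron is modified per example.
    def learn_step(neurons, active_inputs, target_should_fire):
        # "Nearest" = the neuron sharing the most active inputs (the most activated one).
        nearest = max(neurons, key=lambda n: len(n & active_inputs), default=None)
        fires = any(n <= active_inputs for n in neurons)   # is some conjunction satisfied?

        if target_should_fire and not fires:
            if nearest is not None and len(nearest & active_inputs) >= len(nearest) - 1:
                # Near miss: generalize the nearest neuron toward this case.
                nearest.intersection_update(active_inputs)
            else:
                # Otherwise memorize the specific case as a new conjunction.
                neurons.append(set(active_inputs))

    # Trace on the AB example of the next slides: examples ABC, ABD, ABE; target AB.
    neurons = []
    for example in [{"A", "B", "C"}, {"A", "B", "D"}, {"A", "B", "E"}]:
        learn_step(neurons, example, target_should_fire=True)
    print(neurons)   # a single neuron remains, generalized to {'A', 'B'}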


Learning: example (0)

  • Must learn AB.

  • Examples: ABC, ABD, ABE, but not AB.



Learning: example (1)

  • ABC:



Learning: example (2)

  • ABD:



Learning: example (3)

  • ABE: N1 is slightly active for AB



Learning: example (4)

  • Final state: N1 has generalized and is now active for AB



NETtalk task

  • TDNN: 120 neurons, 25,200 connections, 90% accuracy

  • Presence: 753 neurons, 6,024 connections, 74% accuracy

  • Then learns the remaining cases by heart

  • If the input activity is reversed (active <-> inactive):

    • -> catastrophic!
  • Are many cognitive tasks heavily biased toward the principle of presence?



Advantages w.r.t. standard NNs

  • As many inputs as wanted; only the active ones are used

  • Lifelong learning:

    • Large-scale networks
    • Learns specific cases and generalizes, both quickly
  • Weights can be lowered without causing wrong predictions -> imitation



But…

  • Few data, which limits the number of created neurons:

    • not as good as backprop
  • Creates many neurons (but they can be deleted)

  • No negative weights



Work in progress

  • Negative cases, which must stay rare:

    • Inhibitory links
  • Re-use of concepts:

    • Macro-concepts: each concept can become an input (see the sketch below)
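
Since these are only directions named on the slide, the sketch below is speculative: it assumes an inhibitory link simply vetoes a conjunction, and that a macro-concept is a learned concept re-injected as an input event. All names are invented for illustration:

    def concept_activation(active_inputs, positive, negative):
        """1.0 iff every positive input is active and no inhibitory input is."""
        ok = all(i in active_inputs for i in positive)
        vetoed = any(i in active_inputs for i in negative)
        return 1.0 if ok and not vetoed else 0.0

    # Macro-concept: once "AB" is recognized, it becomes an input event itself
    # and can feed a higher-level concept such as "AB-F".
    active = {"A", "B", "F"}
    if concept_activation(active, positive={"A", "B"}, negative={"X"}) == 1.0:
        active.add("AB")
    print(concept_activation(active, positive={"AB", "F"}, negative=set()))   # 1.0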

