Home >> Computers >> Artificial Intelligence >> Neural Networks


  Books
Companies
Conferences
  FAQs, Help, and Tutorials
Organizations
People
  Publications
Research Groups
Software


An artificial neural network (ANN), besides known as the simulated neural network (SNN) or even only the neural network (NN), is an interconnected class action of artificial neurons that uses the mathematical or even computational model for information science according to the connectionist approach to computation. No exact in agreement definition amongst research worker when to what a neural network is, however virtually all would agree that it involves a network of comparatively elementary processing elements, in which a spherical behaviour is determined per modems between the processing elements & element parameters. A original inspiration for the system was from either examination of bioelectrical networks in the brain formed by neurons & their synapses (see biological neural network). Within the neural network model, elementary nodes (called variously "neurons", "neurodes", "PEs" ("processing elements") or even "units") come attached together to form the network of nodes — hence the term "neural network".

Structure

Such as the biological brain, an unreal neural net occurs as massively parallel collection of small & elementary processing units in which the interconnections form a big a share of the network's intelligence. Unreal neural networks, yet, come quite different from either a brainside in terms of structure. E.g., a neural network is good deal little than the brain. As well, the units utilized around a neural network come usually far simpler than nerve cell. Nonetheless, certain functions that seem exclusive to the brain, like learning, use at times been replicated in the simpler shell using neural networks.

Look at as well: artificial neuron, perceptron

Models

Around the average neural network, for each one node operate the principle similar to the biological neuron. Inside the biological nerve cell, from each one incoming synapse of the nerve cell has a weight associated by having it. Whenever a weight of every synapse, days its input, is summed higher for 100% incoming synapses, & that total is greater than a select few 'threshold value', so the nerve cell fires, sending a value to an additional nerve cell in the network.

A average neural network node tries to emulate this behaviour. From each one node has the placed of input lines which are then correspondent to input synapses within the biological nerve cell. For each one node as well has an 'activation work' (as well referred to as the 'transport work'), which tells the node while to fire, similar to a biological nerve cell. Around its simplest form, this activation work might upright become to generate the 'One' whenever the summed input is greater than occasionally value, or even a 'Cipher' otherwise. Activation functions, but, don't use to exist as this elementary - to produce networks that potty launder utile act, it all but universally use at times to exist as supplementary complex, for at least a bit of of the nodes in the network.

The feedforward neural network, which is one of the supplementary commons neural network types, is composed of the placed one nodes & modems. These nodes come intended inside shells. the modems come usually formed by connecting every of the nodes inside a given layer to completely of the nerve cell in the next layer. around that way each node in the given layer is attached to each more node in the next layer.

Usually there are at least trey shells to the feedforward network - an input layer, the hidden layer, & an output layer. A input layer does there is no processing - these are only in which a information vector is fed into a network. A input layer so feeds into a hidden layer. A hidden layer, successively, feeds into a output layer. A actual processing in a network occurs in the nodes of the hidden layer & the output layer.

Whenever plenty nerve cell come attached together around shells, a network may be 'trained' to run utile items utilizing the educational videos algorithmic program. Feedforward networks, particularly, may be utile, whenever trained befittingly, to clean intelligent classificatiin or even identification nature and severity tasks on unfamiliar information.

Calculations

A sigmoid curve is often used as a transport work because it introduces non-linearity into a network's calculations by "squashing" a nerve cell's activation level into a range [0,1]. the sigmoid work hequally a extra profit of getting an extremely elementary derivative work, as involved for back-propagating errors across a feed-forward neural network. More functions by owning similar features may be utilized, virtually all unremarkably tanh which squashes activations into the range of [-1,1] instead, or even once in a while a piece-caring linear work that just clips the activation like than squashing it.

Whenever there is no non-linearity is introduced by squashing or even clipping, the network loses tremendously of its computational power, becoming a elementary matrix multiplication operation from either linear algebra.

Guide calculation system inside neural networks include system sustaining loops, in which a few kinda instance delay run must become utilized, & "winner takes all" system, in which a nerve cell by having a greatest value from either a calculation fires & will require a value Unity, & entirely more nerve cell choose the value Cypher.

Generally the weights around a neural network come at first placed to little random values. This is the network uneducated person; its output is in essence a random work of its input. When a expert training videos run payoff, a connection weights come bit by bit modified based on data from computational system specific to the learning algorithmic program existence utilized. Ideally the weights yet converge to values letting the babies to perform a utile computation. So it may be said that a neural network commences uneducated persin & moves on to benefit a few very cognition, though a noesis is sub-symbolic.

Advantages
Artificial neural networks (ANN) stand many benefits, because it resemble a lesson of the neural body structure. Learning: ANN develop a ability to view according to a then known as learning stage. Auto organization: the ANN creates its have representation of the information given in the learning run. Tolerance to faults: because ANN store redundant references, unfair destruction of a neural network don't damage totally the network response. Flexibility: ANN could address input file forswearing significant changes such as loud signals or even others changes in the given input information (e.g. in case a input file is an object, this can be a trifle different forgoing problems to the ANN response). very Period: ANN come parallel structures; in case it is implemented in that way utilizing computers or even favorite devices real instance may be achieved. Scalability: An ANN may be easy ported to healthy any condition from either the particular condition metropolitan area.

Applications

Usefulness

Neural networks come particularly utile for treating using bounded real-valued data, where the real-valued output is desired; therein way neural networks may perform classification by degrees, & come capable of expressing values same to "not sure". Whenever a neural network is trained using the cross-entropy error work (look at Bishop's book) & whenever a neural network output is sigmoidal, then the outputs is estimates of the confessedly tail probability of a class.

Artificial neural networks (ANN) come unremarkably applied to tasks that come typically convenient for homo to accomplish, however hard for traditional computers. Due to a neural networks mimicking of the braaround, it keep close at h& shown likely in centripetal processing tasks like speech recognition, technical analysis, and the transcription of handwritten text. Traditional computers remaaround superior to neural networks inside numerous situations, however neural networks come dominant within recognizing system in highly big information sets. By having the ability to view from either a placed of examples & generalize this noesis to fresh situations, neural networks come fantabulous for operate requiring adaptative control systems.

Real life applications
Inside realistic applications, neural networks perform particularly swell on the as a consequence most common tasks:

Function approximation (aka regression analysis) Time series prediction Classification Pattern recognition

More sort of neural networks, particularly continuous-instance recurrent neural networks (CTRNN), come utilized around conjunction using genetic algorithms (GAs) to develop golem controllers. A genome is then constituted of a networks parameters & the fitness of the network is the adequacy of the behaviour exhibited per restricted golem (or even typically by a simulation of this behaviour).

There come numbers of real-globe applications to which neural networks are easily-suited. Since neural networks st& a ability to withstand damage and however work properly, it is imaginable candidates for new aircraft control systems. It besides develop expected utilizes within technical analysis software system for radio detection & ranging, echo sounder, and more remote-sensing gear. In the medical field, it may be utilized to model particular figure area (we.e., a nose & cardiovascular rules) & detect diseases at an early stage from either various scans. ANNs can as well become utilized inside the laboratory to recognize cistron in DNA sequences. Neural netfunction applications might work in the business field or even fiscal analysis. around addition, it can possibly become utilized in database mining. Since neural networks come better at identifying system or even trends inside information, it develop utilizes for prediction & prediction around these most common areas:

Sales forecasting Industrial run control Customer research Data validation Risk management Target marketing

On this text come further epitomes for their application: speaker recognition within communication, diagnosing of hepatitis, recovery of telecommunications from either faulty software program, retina detection, submarine mine detection, texture analysis, resource allocation, database mining, run control, programing, 3-cubic object recognition, handwritten word recognition, & automatic face recognition.

Types of neural networks

Feedforward neural network

A feedforward neural networks are the number one & arguably simplest nature and severity of unreal neural networks devised. Thereinside network, a information moves in just 1 counsel, send on, from either the input nodes, through the hidden nodes (whenever any) & to the output nodes. No rounds or even loops in the network.

Single-layer perceptron

A earliest rather neural network occurs as individual-layer perceptron network, which consists of one layer of output nodes; the inputs come fed directly to the outputs via a series of weights. Therein way it may be considered a simplest rather feed-forward network. A total of a products of a weights & a inputs is estimated around every node, & whenever a value is above a select few threshold (generally Cipher) a nerve cell fires & will require the activated value (usually Single); otherwise it will require the deactivated value (generally -One). Nerve cell by having this kinda activation work come besides known as McCulloch-Pitts nerve cell or even threshold nerve cell. In a literature the term perceptron often refers to networks consisting of just one of these units. It were described by Warren McCulloch and Walter Pitts in the 1940s.

a perceptron may be created applying any values for the activated & deactivated states when yearn when the threshold value lies between them. Virtually all perceptrons develop outputs of Single or even -One sustaining the threshold of Cipher & there exists occasionally grounds to believe that such networks may be trained extra quickly than networks created from either nodes by owning different activation & deactivation values.

Perceptrons may be trained by a elementary learning algorithmic rule that is unremarkably known as the delta rule. It calculates the errors between measured output & sample output information, & utilizes this to produce an adjustment to the weights, so implementing a form of gradient descent.

Lone-unit perceptrons come simply capable of learning linearly severable system; within 1969 in a illustrious monograph entitled Perceptrons by Marvin Minsky and Seymour Papert showed that it was impossible for a single-layer perceptron network to view an XOR function. It conjectured (incorrectly) that the similar symptom would hang on to for the multi-layer perceptron network. Although one threshold unit is quite limited inside its computational power, it has been shown that networks of parallel threshold units may approximate any continuous work from either a compact interval of the real into the interval [-1,1]. This super recent symptom may be incurred inside [Auer, Burgsteiner, Maass: The p-delta learning rule for parallel perceptrons, 2001 (state Jan 2003: submitted for publication)].

One-layer neural network could compute the continuous output instead of the step function. The most common selection is the and then-supposed logistic function:

By having this guide, a lone-layer network is monovular to the logistic regression model, widely used around statistical modelling.

Multi-layer perceptron

This class of networks consists of multiple shells of computational units, unremarkably interconnected within the feed-forward way. Both nerve cell within 1 layer has directed modems to the nerve cell of the subsequent layer. Within several applications the units one networks use a sigmoid work as an activation work.

A universal approximation theorem for neural networks states that each continuous work that maps intervals of real to occasionally output interval of real may be estimated indiscriminately closely by the multi-layer perceptron by having simply 1 hidden layer. This symptom holds sole for restricted classes of activation functions, e.g. for the sigmoid functions.

Multi-layer networks have a kind of learning techniques, the virtually all popular existence back-propagation. On this text a output values come likened by owning a correct guide to compute the value of a few predefined error-work. By various techniques a error is so fed back through the network. Applying this page, a algorithmic program adjusts a weights of both connection sequentially to reduce a value of the error work by a select few little total. When repeating this run for a sufficiently big total of expert training oscillations a network may ordinarily converge to occasionally state in which the error of the calculations is little. In that out break of these says that a network has learned a certain target work. To adjust weights properly the single applies a general method for non-linear optimization task that is called gradient descent. For this, a derivation of a error work by using respect to a network weights is estimated & the weights come so changed such that the error lessens (so running downhill on the surface of the error work). For this understanding back-propagatiin might merely exist as applied on networks by having differentiable activation functions.

In the main the conditiin of reaching a network that performs swell, potentially on samples that were non utilized when expert training samples, occurs as quite subtle issue that takes extra techniques. This is especially significant for suits in which simply super limited many expert instruction samples come available. A danger is that a network overfits a training information & fails to capture trueness technical indicator run giving the information. Computational learning theory is concerned by using step by step videos classifiers in the limited total of information. In the context of neural networks the elementary heuristic, called early stopping, often ensures that a network may generalize swell to examples non in the step by step instruction placed.

More average problems of the back-propagation algorithmic rule come a speed of convergence & the possibility to prevent higher within a local minimum of the error function. Now there are practical solutions that produce back-propagation around multi-layer perceptrons a guide of guide for several machine learning tasks.

ADALINE
Adaptative LINear Element.

MADALINE
Multilayer Adaline, when well quoted as Multiple Adaline

Learnmatrix
watch Lernmatrix * [http://de.wikipedia.org/wiki/Lernmatrix] & Karl Steinbuch * [http://de.wikipedia.org/wiki/Karl_Steinbuch] * [http://xputers.informatik.uni-kl.de/papers/publications/karl-steinbuch_en.html]

Radial basis function (RBF)
Radial Basis Functions come right techniques for interpolation around multidimensional space. The RBF occurs as work which has built into the few feet away criterion using respect to the centre. Stellate basis functions stand been applied around neural networks around which it can be utilized as a replacement for the sigmoid hidden layer transport work in multilayer perceptrons. RBF networks use Two shells of processing: In the number one, input is mapped onto every RBF in the 'hidden' layer. the RBF chosen is unremarkably a Gaussian. Within regression problems the output layer is so a linear combination of hidden layer values representing mean foreseen output. A interpretation of this output layer value is the equivalent as a regression model within numbers. Around classification problems the output layer is often the sigmoid work of the linear combination of hidden layer values, representing a tail probability. Performance around each instances is typically improved by shrinkage techniques, called ridge regression inside definitive cost comparisons & known to correspond to the anterior belief around microscopic parameter values (& so smooth output functions) inside the Bayesian framework.

RBF networks keep around a benefit of non suffering from either local minima in the equivalent way when multilayer perceptrons. This is because a merely parameters that come adjusted in a learning run come the linear mapping from either hidden layer to output layer. One-dimensionality ensures that a error surface is quadratic & so has one well uncovered minimum. Inside regression problems this may be discovered around a single matrix operation. Around classification problems a fixed non-linearity introduced per sigmoid output work is virtually all expeditiously dealt by having applying iterated reweighted least squares.

RBF networks develop a disadvantage of requiring dependable coverage of the input space by stellate basis functions. RBF centres come determined by owning information to the distribution of the input file, however while forgoing information to the prediction project. Following, representational resources can be lost in areas of the input space that come irrelevant to the learning project. a most common guide is to associate to each one datum by using its have centre, although this could produce the linear patterns to exist as solved in the final layer like big, & takes shrinkage techniques to make sure your not overfitting.

Associating every input data point by owning an RBF leads naturally to kernel methods like Trend lines Vector Machines & Gaussian Processes (a RBF is the kernel work). Wholly leash approaches utilise the non-linear kernel work to task the input file into a space in which the learning condition may be solved utilizing a linear model. Such as Gaussian Processes, & unlike SVMs, RBF networks come generally trained inside a utmost Likelihood framework by maximizing a probability (minimizing a error) of the information under the model. SVMs take the different approach to avoiding overfitting by avoiding maximizing instead a margin. RBF networks come outperformed around virtually all classification applications by SVMs. Within regression applications it may be competitory while a dimensionality of the input space is comparatively little.

Kohonen self-organizing network

A self-organizing map (SOM) invented by Teuvo Kohonen uses a form of unsupervised learning. The placed of unreal nerve cell see to map points withinside an input space to co-ordinate in an output space. A input space may use at times different dimensions & topology from either a output space, & a SOM might attempt to preserve these.

Recurrent network

Contrary to feedforward networks, recurrent neural network (RNs) are system using bi-directional information flow. When the feedforward network propagates information linearly from either either input to output, RNs likewise propagate information from late processing stages to sooner stages.

Simple recurrent network

The elementary recurrent network (SRN) occurs as variation on the multi-layer perceptron, for instance known as an "Elman network" due to its invention by Jeff Elman. the 3-layer network is utilized, by having the addition of a placed of "context units" in the input layer. There are modems from either the middle (hidden) layer to these context units fixed using a weight of 1. At every period step, the input is propagated within the standard feed-forward fashion, then a learning rule (typically back-propagation) is applied. a fixed back modems symptom in a context units universally maintaining a copy of the former values of the hidden units (since it propagate across the modems prior to the learning rule is applied). So the network may maintain a kind of state, letting it to perform such tasks when sequence-prediction that come beyond the power of a standard multi-layer perceptron.

Inside the fully perennial network, each nerve cell receives inputs from either each more nerve cell in the network. These networks are non intended inside shells. Normally sole the subset of the nerve cell receive external inputs additionally to the inputs from either all the more nerve cell, & an additional disjunct subset of nerve cell report their output externally likewise when sending it to all the nerve cell. These distinctive inputs & outputs perform the work of the input & output shells of a feed-forward or even elementary repeated network, & likewise join all the more nerve cell in the repeated processing.

Hopfield network

A Hopfield network is a recurrent neural network where tons modems come symmetrical. Made-up by John Hopfield in 1982, this network guarantees that its dynamics might converge. Whenever a modems come trained utilizing Hebbian learning then the Hopfield network potty perform robust content-addressable memory, robust to connection alteration.

Stochastic neural networks

The stochastic neural network differs from a regular neural network in the fact that it introduces random variations into the network.

Boltzmann machine

A Boltzmann machine can be thought of as a noisy Hopfield network. Fabricated by Geoff Hinton and Terry Sejnowski in 1985, a Boltzmann machine is crucial because these are one of the number one neural networks to demonstrate learning of latent variables (hidden units). Boltzmann machine learning was slow to simulate, however a contrastive divergence algorithm of Geoff Hinton (circa 2000) allows models including Boltzmann machines & product of experts to exist as trained very much sooner.

Modular neural networks

Biological studies showed that a mortal brain functions non when a lone single massive network, however as a collection of little networks. This realisation gave birth to the construct of modular neural networks, in which many little networks cooperate or even compete to solve problems.

Committee of machines

The committee of machines (CoM) occurs as collection of different neural networks that together "vote" in the given case. This usually gives the very much better symptom in comparison more neural network system. around point of fact in numbers of suits, starting by owning a equivalent architecture & step by step training however different initial random weights gives immensely different networks. a CoM tends to stabilize the symptom.

A CoM is similar to the general machine learning bagging method, except that the necessary kind of machines in the committee is found by expert instructiin from either different random starting weights like than how to step by step instruction on different every which way selected subsets of the training information.

Associative Neural Network (ASNN)

Is an extension of the committee of machines that goes beyond the simple/weighted norm of different system. [http://cogprints.soton.ac.uk/documents/disk0/00/00/14/41/index.html ASNN] is a combination of an ensemble of feed-forward neural networks & the k-nearest neighbour system (kNN). It utilizes a correlation between ensemble reactions as a measure of few feet away amid a analysed subjects for the kNN. This corrects a bias of the neural network ensemble. An associatory neural network has a memory that potty coincide by using the educational videos placed. Whenever freshly information becomes available, a network instantly improves its prognostic ability & will bring information approximation (self-see a information) forswearing a want to retrain the ensemble. An additional crucial feature of ASNN is the possibility to interpret neural network outcomes by analysis of correlations between information suits in the space of system. A method may be utilized in-line or even downloaded at [http://www.vcclab.org/lab/asnn www.vcclab.org].

Other types of networks

These favorite networks don't concord any of the last categories.

Instantaneously trained networks

Instantaneously trained neural networks (ITNNs) are besides known as "Kak networks" fallowing their discoverer Subhash Kak. It were inspired per phenomenon of short-short-run learning that seems to occur outright. Around these networks a weights of a hidden & a output shells come mapped directly from either the expert training vector information. Normally, it functiin on binary information however versions for continuous information that postulate little extra processing come as well available.

Spiking neural networks

Spiking (or even pulsed) neural networks (SNNs) come system which explicitly allow a timing of inputs. A network input & output come unremarkably represented when series of spikes (delta work or even further complex shapes). SNNs develop an benefit of existence take a breath to day and night run tools. It is typically implemented when repeated networks.

Networks of spiking neurons -- & a temporal correlations of neural assemblies inside such networks -- keep close at h& been utilized to model figure/ground separation and area linking in the visual patterns (watch e.g. Reitboeck et.al.inside Haken & Stadler: Synergetics of the Brain. Berlin, 1989).

Gerstner & Kistler have a freely-available low text editiin on [http://diwww.epfl.ch/~gerstner/BUCH.html Spiking Neuron Models].

Dynamic neural networks
Dynamic neural networks non simply treat by having nonlinear multivariate behaviour, however likewise include (learning of) period-dependent behaviour like various transitory phenomena & delay results. [http://www.seeingwithsound.com/thesis.htm Meijer] has the Ph.D. thesis low in which regular feedforward perception networks come generalized sustaining differential equations, utilizing variable period step algorithmic rule for learning in the instance domain & including algorithmic rule for learning in the frequency domain (therein outbreak linearized in the arethe of a placed of electrostatic bias points).

Cascading neural networks

These neural networks lead off their step by step training forswearing any hidden nerve cell. When the output error reaches a predefined error threshold, the networks add a fresh hidden nerve cell. A fresh hidden nerve cell is attached to completely input nodes, also when, altogether former hidden nerve cell. How to videos ends while a suitable error even threshold is reached or while the maximal total of hidden nerve cell is added.

Relation to optimization techniques

Analysis of several neural network techniques reveals their close relationship to mathematical optimization techniques. E.g., multi-layer perceptron back-propagation may be substituted by having further general spherical optimisation techniques. A objective arounfive hundred expert training videos an ANN is, given a select few placed of pairs of information & output, to minimize a few error work ||E||Two, in which E(teni) = F(w,xi personally) - oi personally. On this button F is the neural network work which given the vector of weights w & an input vector produces an output vector for the network. So likewise when utilizing back-propagation to train the network, these are likewise imaginable to apply spherical optimisation techniques to develop a weight vector w.

Boosting Research Site
Scientific homepage on boosting and ensemble learning methods.

neural101.com
Non-commercial site for information related to trading the financial markets using neural networks. Includes message bases, links and database of trading indicators, systems, and scans.

Maximum Entropy Online Resources
Workshops, tutorials, papers and software related to maximum entropy.

Sequential Monte Carlo Methods
Website dedicated to sequential data analysis and tracking using particle filters.

Neuroinformatics
A hub for the neuroinformatics community, with information on workshops and courses. Also hosts the comp-neuro mailing list.

David E. Rumelhart Prize
An annual award for contributions to the formal analysis of human cognition.

Bibliographies on Neural Networks
Part of the collection of computer science bibliographies.

Principal Curves Page
Introduction to principal curves, with summary of and links to publications, demo, and software.

Sumeet's neural networks listing
Includes related fields like genetic algorithms, fuzzy logic, adaptive computing and complex systems and descriptions of each site.

Neural Network FAQ
This FAQ from comp.ai.neural-nets contains the most commonly asked questions about neural networks, including some great introductory material.


Computers: Artificial Intelligence: Belief Networks
Computers: Artificial Intelligence: Machine Learning
Computers: Artificial Intelligence: Vision
Computers: Artificial Life
Health: Medicine: Medical Specialties: Neurology
Science: Biology: Neurobiology
Science: Math: Statistics: Bayesian Analysis
Science: Social Sciences: Cognitive Science
Science: Social Sciences: Psychology: Neuropsychology
Science: Technology: Electronics: Signal Processing




© 2005 GeneralAnswers.org