《人工神经网络》PPT课件.ppt

资源描述

人工神经网络中国科学院自动化研究所吴高巍gaowei wu 2016 11 29 联结主义学派又称仿生学派或生理学派认为人的思维基元是神经元而不是符号处理过程认为人脑不同于电脑核心智能的本质是联接机制原理神经网络及神经网络间的连接机制和学习算法麦卡洛可 McCulloch 皮茨 Pitts 什么是神经网络所谓的人工神经网络就是基于模仿生物大脑的结构和功能而构成的一种信息处理系统计算机个体单元相互连接形成多种类型结构的图循环非循环有向无向自底向上 Bottom Up AI起源于生物神经系统从结构模拟到功能模拟仿生人工神经网络内容生物学启示多层神经网络Hopfield网络自组织网络生物学启示神经元组成细胞体轴突树突突触神经元之间通过突触两两相连信息的传递发生在突触突触记录了神经元间联系的强弱只有达到一定的兴奋程度神经元才向外界传输信息生物神经元神经元神经元特性信息以预知的确定方向传递一个神经元的树突细胞体轴突突触另一个神经元树突时空整合性对不同时间通过同一突触传入的信息具有时间整合功能对同一时间通过不同突触传入的信息具有空间整合功能神经元工作状态兴奋状态对输入信息整合后使细胞膜电位升高当高于动作电位的阈值时产生神经冲动并由轴突输出抑制状态对输入信息整合后使细胞膜电位降低当低于动作电位的阈值时无神经冲动产生结构的可塑性神经元之间的柔性连接突触的信息传递特性是可变的学习记忆的基础神经元模型从生物学结构到数学模型人工神经元 M P模型 McCllochandPitts Alogicalcalculusoftheideasimmanentinnervousactivity 1943 f 激活函数 ActivationFunction g 组合函数 CombinationFunction WeightedSumRadialDistance 组合函数 e f Threshold Linear SaturatingLinear LogisticSigmoid HyperbolictangentSigmoid Gaussian 激活函数人工神经网络多个人工神经元按照特定的网络结构联接在一起就构成了一个人工神经网络神经网络的目标就是将输入转换成有意义的输出生物系统中的学习自适应学习适应的目标是基于对环境信息的响应获得更好的状态在神经层面上通过突触强度的改变实现学习消除某些突触建立一些新的突触生物系统中的学习 Hebb学习律神经元同时激活突触强度增加异步激活突触强度减弱学习律符合能量最小原则保持突触强度需要能量所以在需要的地方保持在不需要的地方不保持 ANN的学习规则能量最小ENERGYMINIMIZATION对人工神经网络需要确定合适的能量定义可以使用数学上的优化技术来发现如何改变神经元间的联接权重 ENERGY measureoftaskperformanceerror 两个主要问题结构Howtointerconnectindividualunits 学习方法HowtoautomaticallydeterminetheconnectionweightsorevenstructureofANN SolutionstothesetwoproblemsleadstoaconcreteANN 人工神经网络前馈结构 FeedforwardArchitecture withoutloops static反馈循环结构 Feedback RecurrentArchitecture withloops dynamic non lineardynamicalsystems ANN结构 Generalstructuresoffeedforwardnetworks Generalstructuresoffeedbacknetworks 通过神经网络所在环境的模拟过程调整网络中的自由参数Learningbydata学习模型Incrementalvs Batch两种类型Supervisedvs Unsupervised ANN的学习方法若两端的神经元同时激活增强联接权重UnsupervisedLearning 学习策略 HebbrianLearning 最小化实际输出与期望输出之间的误差 Supervised DeltaRule LMSRule Widrow Hoff B PLearning Objective Solution 学习策略 ErrorCorrection 采用随机模式跳出局部极小如果网络性能提高新参数被接受否则新参数依概率接受学习策略 StochasticLearning 胜者为王 Winner take all UnsupervisedHowtocompete HardcompetitionOnlyoneneuronisactivated SoftcompetitionNeuronsneighboringthetruewinnerareactivated 学习策略 CompetitiveLearning 重要的人工神经网络模型多层神经网络径向基网络Hopfield网络Boltzmann机自组织网络多层感知机 MLP 感知机实质上是一种神经元模型阈值激活函数 Rosenblatt 1957 感知机判别规则输入空间中样本是空间中的一个点权向量是一个超平面超平面一边对应Y 1另一边对应Y 1 单层感知机学习调整权值减少训练集上的误差简单的权值更新规则初始化对每一个训练样本 ClassifywithcurrentweightsIfcorrect nochange Ifwrong adjusttheweightvector 30 学习 BinaryPerceptron 初始化对每一个训练样本 ClassifywithcurrentweightsIfcorrect i e y y nochange Ifwrong adjusttheweightvectorbyaddingorsubtractingthefeaturevector Subtractify is 1 多类判别情况 Ifwehavemultipleclasses Aweightvectorforeachclass Score activation ofaclassy Predictionhighestscorewins 学习 MulticlassPerceptron 初始化依次处理每个样本PredictwithcurrentweightsIfcorrect nochange Ifwrong lowerscoreofwronganswer raisescoreofrightanswer 感知机特性可分性 trueifsomeparametersgetthetrainingsetperfectlycorrectCanrepresentAND OR NOT etc butnotXOR收敛性 ifthetrainingisseparable perceptronwilleventuallyconverge binarycase Separable Non Separable 感知机存在的问题噪声不可分情况 ifthedataisn tseparable weightsmightthrash泛化性 findsa barely separatingsolution 改进感知机线性可分情况 Whichoftheselinearseparatorsisoptimal SupportVectorMachines Maximizingthemargin goodaccordingtointuition theory practiceOnlysupportvectorsmatter othertrainingexamplesareignorableSupportvectormachines SVMs findtheseparatorwithmaxmargin SVM 优化学习问题描述训练数据目标发现最好的权值使得对每一个样本x的输出都符合类别标签样本xi的标签可等价于标签向量采用不同的激活函数平方损失单层感知机单层感知机单层感知机单层感知机采用线性激活函数权值向量具有解析解批处理模式一次性更新权重缺点收敛慢增量模式逐样本更新权值随机近似但速度快并能保证收敛多层感知机 MLP 层间神经元全连接 MLPs表达能力 3layers Allcontinuousfunctions4layers allfunctions Howtolearntheweights waitingB Palgorithmuntil1986 B PNetwork 结构Akindofmulti layerperceptron inwhichtheSigmoidactivationfunctionisused B P算法学习方法 Inputdatawasputforwardfrominputlayertohiddenlayer thentooutlayer Errorinformationwaspropagatedbackwardfromoutlayertohidderlayer thentoinputlayer Rumelhart Meclelland Nature 1986 B P算法 GlobalErrorMeasure desiredoutput generatedoutput squarederror Theobjectiveistominimizethesquarederror i e reachtheMinimumSquaredError MSE B P算法 Step1 Selectapatternfromthetrainingsetandpresentittothenetwork Step2 Computeactivationofinput hiddenandoutputneuronsinthatsequence Step3 Computetheerrorovertheoutputneuronsbycomparingthegeneratedoutputswiththedesiredoutputs Step4 Usethecalculatederrortoupdateallweightsinthenetwork suchthataglobalerrormeasuregetsreduced Step5 RepeatStep1throughStep4untiltheglobalerrorfallsbelowapredefinedthreshold 梯度下降方法 OptimizationmethodforfindingouttheweightvectorleadingtotheMSE learningrate gradient vectorform elementform 权值更新规则 Foroutputlayer 权值更新规则 Foroutputlayer 权值更新规则 Forhiddenlayer 权值更新规则 Forhiddenlayer 应用 Handwrittendigitrecognition 3 nearest neighbor 2 4 error400 300 10unitMLP 1 6 errorLeNet 768 192 30 10unitMLP 0 9 errorCurrentbest SVMs 0 4 error MLPs 讨论实际应用中PreprocessingisimportantNormalizeeachdimensionofdatato 1 1 Adaptingthelearningrate t 1 t MLPs 讨论优点很强的表达能力容易执行缺点收敛速度慢过拟合 Over fitting 局部极小采用Newton法加正则化项约束权值的平滑性采用更少但足够数量的隐层神经元尝试不同的初始化增加扰动 Hopfield网络反馈结构可用加权无向图表示DynamicSystem两种类型Discrete 1982 andContinuous science 1984 byHopfield Hopfield网络 Combinationfunction WeightedSumActivationfunction Threshold 吸引子与稳定性 Howdowe program thesolutionsoftheproblemintostablestates attractors ofthenetwork Howdoweensurethatthefeedbacksystemdesignedisstable Lyapunov smodernstabilitytheoryallowsustoinvestigatethestabilityproblembymakinguseofacontinuousscalarfunctionofthestatevector calledaLyapunov Energy Function Hopfield网络的能量函数 WithinputWithoutinput Hopfield模型 Hopfield证明了异步Hopfield网络是稳定的其中权值定义为Whateverbetheinitialstateofthenetwork theenergydecreasescontinuouslywithtimeuntilthesystemsettlesdownintoanylocalminimumoftheenergysurface Hopfield网络联想记忆 Hopfield网络的一个主要应用基于与数据部分相似的输入可以回想起数据本身 attractorstate 也称作内容寻址记忆 content addressablememory StoredPattern MemoryAssociation 虞台文 FeedbackNetworksandAssociativeMemories Hopfield网络 AssociativeMemories StoredPattern MemoryAssociation 虞台文 FeedbackNetworksandAssociativeMemories Hopfield网络的一个主要应用基于与数据部分相似的输入可以回想起数据本身 attractorstate 也称作内容寻址记忆 content addressablememory Howtostorepatterns Howtostorepatterns Dimensionofthestoredpattern 权值确定外积 OuterProduct Vectorform Elementform Why SatisfytheHopfieldmodel AnexampleofHopfieldmemory 虞台文 FeedbackNetworksandAssociativeMemories Stable E 4 E 0 E 4 Recallthefirstpattern x1 Stable E 4 E 0 E 4 Recallthesecondpattern x2 Hopfield网络组合优化 CombinatorialOptimization Hopfield网络的另一个主要应用将优化目标函数转换成能量函数 energyfunction 网络的稳定状态是优化问题的解例 SolveTravelingSalesmanProblem TSP Givenncitieswithdistancesdij whatistheshortesttour IllustrationofTSPGraph 1 2 3 4 5 6 7 8 9 10 11 HopfieldNetworkforTSP HopfieldNetworkforTSP Citymatrix Constraint1 Eachrowcanhaveonlyoneneuron on 2 Eachcolumncanhaveonlyoneneuron on 3 Foran cityproblem nneuronswillbeon HopfieldNetworkforTSP 1 2 4 3 5 Time City Thesalesmanreachescity5attime3 WeightdeterminationforTSP DesignEnergyFunction Constraint 1 Constraint 2 Constraint 3 能量函数转换为2DHopfield网络形式 Networkisbuilt Hopfield网络迭代 TSP Theinitialstategeneratedrandomlygoestothestablestate solution withminimumenergy A4 cityexample阮晓刚神经计算科学 2006 自组织特征映射 SOFM WhatisSOFM NeuralNetworkwithUnsupervisedLearningDimensionalityreductionconcomitantwithpreservationoftopologicalinformation Threeprincipals Self reinforcing Competition Cooperation StructureofSOFM 竞争 Competition Findingthebestmatchingweightvectorforthepresentinput Criterionfordeterminingthewinningneuron MaximumInnerProduct MinimumEuclideanDistance 合作 Cooperation Identifyaneighborhoodaroundthewinningneuron TopologicalneighborhoodcanbeofdifferentshapessuchasSquare Hexagonal orGaussian Thewidthoftheneighborhoodisafunctionoftime asepochsoftrainingelapse theneighborhoodshrinks 权值自适应 Adaptation Weightsofneuronswithinthewinningclusterareupdated SOFM算法 Repeat Selection PickasampleSimilarityMatching FindthewinningneuronAdaptation UpdatesynapticvectorsofONLYthewinningcluster Update Updatethelearningrateandneighborhood Until thereisnoobservablechangeinthemap 小结人工神经网络是人工神经元组成的并行自适应网络目标是对人类神经系统的某个功能进行抽象和建模人工神经元基本元素 Asetofconnectinglinks Acombinationfunction Anactivationfunction ANN中的两个关键问题ArchitectureandLearningApproachSolutionstothesetwoproblemsleadstoanANNmodel两种ANN结构Feedforwardvs Feedback Recurrent 学习策略Hebbrain ErrorCorrection Stochastic Winner take all 人工神经网络发展历程谢谢

展开阅读全文

《人工神经网络》PPT课件.ppt

最新文档