LongTailsandNavigation

上传人:c****d 文档编号:243023540 上传时间:2024-09-14 格式:PPT 页数:14 大小:107KB
返回 下载 相关 举报
LongTailsandNavigation_第1页
第1页 / 共14页
LongTailsandNavigation_第2页
第2页 / 共14页
LongTailsandNavigation_第3页
第3页 / 共14页
点击查看更多>>
资源描述
Click to edit Master title style,Click to edit Master text styles,Second level,Third level,Fourth level,Fifth level,*,Long Tails and Navigation,Networked Life,CSE 112,Spring 2007,Prof. Michael Kearns,1,One More (,Structural,) Property,A properly tuned,a,-model can,simultaneously,explain,small diameter,high clustering coefficient,other models can, too (e.g. cycle+random rewirings),But what about connectors and heavy-tailed degree distributions?,a,-model and simple variants will,not,explain this,intuitively, no “bias” towards large degree evolves,all vertices are created equal,Can concoct many,bad,generative models to explain,generate NW according to Erdos-Renyi, reject if tails not heavy,describe,fixed,NWs with heavy tails,all connected to v1; N/2 connected to v2; etc.,not clear we can get a precise power law,not modeling,variation,why would the world evolve this way?,As always, we want a “natural” model,2,Quantifying Connectors:Heavy-Tailed Distributions,3,Heavy-tailed Distributions,Pareto,or,power law,distributions:,for random variables assuming integer values 0,probability of value x 1/x,a,typically 0 ,a, 2; smaller,a,gives heavier tail,here are some,examples,sometimes also referred to as being,scale-free,For binomial, normal, and Poisson distributions the tail probabilities approach 0,exponentially,fast,Inverse,polynomial,decay vs.,inverse,exponential decay,What kind of phenomena does this distribution model?,What kind of process would,generate,it?,4,Distributions vs. Data,All these distributions are,idealized models,In practice, we do not see distributions, but,data,Thus, there will be some,largest,value we observe,Also, can be difficult to “eyeball” data and choose model,So how do we distinguish between Poisson, power law, etc?,Typical procedure:,might restrict our attention to a,range,of values of interest,accumulate,counts,of observed data into equal-sized bins,look at counts on a,log-log plot,note that,power law:,log(PrX = x) = log(1/x,a,) = -,a,log(x),linear, slope ,a,Normal:,log(PrX = x) = log(a exp(-x2/b) = log(a) x2/b,non-linear, concave near mean,Poisson:,log(PrX = x) = log(exp(-,l,),l,x/x!),also non-linear,Lets look at the assigned paper on,dollar bill migration,5,Zipfs Law,Look at the frequency of English words:,“the” is the most common, followed by “of”, “to”, etc.,claim: frequency of the n-th most common 1/n (power law,a,= 1),General theme:,rank,events by their,frequency of occurrence,resulting distribution often is a power law!,Other examples:,North America city sizes,personal income,genus sizes (number of species),lets look at,log-log plots,of these,People seem to dither over exact form of these distributions,e.g. value of,a,but not over heavy tails,6,Generating Heavy-Tailed Degrees: (Just) One Model,7,Preferential Attachment,Start with (say) two vertices connected by an edge,For i = 3 to N:,for each 1 = j i, let d(j) be degree of vertex j (so far),let Z =,S,d(j) (sum of all degrees so far),add new vertex i with k edges back to 1,i-1:,i is connected back to j with probability d(j)/Z,Vertices j with high degree are likely to get,more,links!,“Rich get richer” or “Matthew Effect”,Natural model for many processes:,hyperlinks on the web,new business and social contacts,transportation networks,Generates a power law distribution of degrees,exponent depends on value of k,Lets look at the NetLogo Simulation,8,Two Out of Three Isnt Bad,Preferential attachment explains,heavy-tailed degree distributions,small diameter (log(N), via “hubs”),Will,not,generate high clustering coefficient,no bias towards local connectivity, but towards hubs,Can we simultaneously capture,all three,properties?,probably, but well stop here,soon there will be a fourth property anyway,9,Search and Navigation,10,Finding,the Short Paths,Milgrams experiment, Columbia Small Worlds, E-R,a,-model,all emphasize,existence,of short paths between pairs,How do individuals,find,short paths?,in an incremental, next-step fashion,using purely local information about the NW and location of target,This is not a,structural,question, but an,algorithmic,one,statics vs. dynamics,Navigability,may impose additional restrictions on formation model!,Briefly investigate two alternatives:,a local/long-distance mixture model,a “social identity” model,11,Kleinbergs Model,Start with an n by n,grid,of vertices (so N = n2),add,local,connections: all vertices within grid distance,p,(e.g. 2),add distant connections:,q,additional connections,probability of connection at distance d: ,(1/d)r,so full model given by choice of p, q and r,large r: heavy bias towards “more local” long-distance connections,small r: approach uniformly random,Kleinbergs question:,what value of r permits effective,search?,Assume parties know only:,grid address of target,addresses of their own direct links,Algorithm,:,pass message to neighbor closest to target,12,Kleinbergs Result,Intuition:,if r is too,large,(strong local bias), then “long-distance” connections never help much; short paths may not even exist,if r is too,small,(no local bias), we may quickly get close to the target; but then well have to use local links to finish,think of a transport system with only long-haul jets or donkey carts,effective search requires a delicate,mixture,of link distances,The result (informally):,r = 2 is the,only value,that permits rapid navigation (log(N) steps),any other value,of r will result in time Nc for 0 c log(N) for large N,a,critical value,phenomenon or “knifes edge”; very sensitive,contrast with 1/d(1.59) from dollar bill migration paper,Note:,locality of information,crucial to this argument,centralized algorithm may compute short paths at r 2,can recognize when “backwards” steps are beneficial,13,Navigation via Identity,Watts et al.:,we dont navigate social networks by purely “geographic” information,we dont use any,single,criterion; recall Dodds et al. on Columbia SW,different criteria used a different points in the chain,Represent individuals by a,vector,of attributes,profession, religion, hobbies, education, background, etc,attribute values have distances between them (tree-structured),distance between individuals: minimum distance in,any,attribute,only need,one thing in common,to be close!,Algorithm:,given attribute vector of target,forward message to neighbor closest to target,Permits fast navigation under broad conditions,not as sensitive as Kleinbergs model,all jobs,scientists,athletes,chemistry,CS,baseball,tennis,14,
展开阅读全文
相关资源
正为您匹配相似的精品文档
相关搜索

最新文档


当前位置:首页 > 图纸专区 > 课件教案


copyright@ 2023-2025  zhuangpeitu.com 装配图网版权所有   联系电话:18123376007

备案号:ICP2024067431-1 川公网安备51140202000466号


本站为文档C2C交易模式,即用户上传的文档直接被用户下载,本站只是中间服务平台,本站所有文档下载所得的收益归上传人(含作者)所有。装配图网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。若文档所含内容侵犯了您的版权或隐私,请立即通知装配图网,我们立即给予删除!