<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Tore Opsahl</title>
	<atom:link href="http://toreopsahl.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://toreopsahl.com</link>
	<description>bouncing ideas</description>
	<lastBuildDate>Mon, 07 Jun 2010 22:49:21 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='toreopsahl.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://1.gravatar.com/blavatar/105a4b0a3d05cff775cd2428e34d2e9c?s=96&#038;d=http://s2.wp.com/i/buttonw-com.png</url>
		<title>Tore Opsahl</title>
		<link>http://toreopsahl.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://toreopsahl.com/osd.xml" title="Tore Opsahl" />
	<atom:link rel='hub' href='http://toreopsahl.com/?pushpress=hub'/>
		<item>
		<title>Article: Node centrality in weighted networks: Generalizing degree and shortest paths</title>
		<link>http://toreopsahl.com/2010/04/21/article-node-centrality-in-weighted-networks-generalizing-degree-and-shortest-paths/</link>
		<comments>http://toreopsahl.com/2010/04/21/article-node-centrality-in-weighted-networks-generalizing-degree-and-shortest-paths/#comments</comments>
		<pubDate>Wed, 21 Apr 2010 10:37:10 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[betweenness]]></category>
		<category><![CDATA[centrality]]></category>
		<category><![CDATA[closeness]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[degree]]></category>
		<category><![CDATA[directed networks]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[gregariousness]]></category>
		<category><![CDATA[hubs]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[local]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[popularity]]></category>
		<category><![CDATA[reinforcement]]></category>
		<category><![CDATA[shortest distance]]></category>
		<category><![CDATA[shortest path]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[strength of nodes]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[undirected networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[weighted networks]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=2204</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2010/04/21/article-node-centrality-in-weighted-networks-generalizing-degree-and-shortest-paths/"><img src="http://thetore.wordpress.com/files/2009/01/fig1_betweenness_s3.gif" alt="Betweenness example" title="Betweenness example" width="146" height="100" class="alignright size-full wp-image-623" /></a>A paper called "Node centrality in weighted networks: Generalizing degree and shortest paths" that I have co-authored will be published in Social Networks. Ties often have a strength naturally associated with them that differentiate them from each other. Tie strength has been operationalized as weights. A few network measures have been proposed for weighted networks, including three common measures of node centrality: degree, closeness, and betweenness. However, these generalizations have solely focused on tie weights, and not on the number of ties, which was the central component of the original measures. This paper proposes generalizations that combine both these aspects. We illustrate the benefits of this approach by applying one of them to Freeman's EIES dataset.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=2204&subd=thetore&ref=&feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>A paper called &#8220;Node centrality in weighted networks: Generalizing degree and shortest paths&#8221; that I have co-authored with <a href="http://home.fsw.vu.nl/f.agneessens/index.htm" target="_blank">Filip Agneessens<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> and <a href="http://www.insna.org/member/profiles/32.html" target="_blank">John Skvoretz<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> will be published in <a href="http://www.sciencedirect.com/science/journal/03788733" target="_blank">Social Networks<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>. Unfortunately, the copyright agreement prevents me from uploading a pdf of the published paper to this blog. However, if you have access to Social Networks, you can <a href="http://dx.doi.org/10.1016/j.socnet.2010.03.006" target="_blank">download the paper directly<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>. Otherwise, a <a href="http://thetore.files.wordpress.com/2010/04/node_centrality_in_weighted_networks1.pdf" target="_blank">preprint with the exact same text<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> is available.</p>
<p><strong>Abstract</strong></p>
<p>Ties often have a strength naturally associated with them that differentiate them from each other. Tie strength has been operationalized as weights. A few network measures have been proposed for weighted networks, including three common measures of node centrality: degree, closeness, and betweenness. However, these generalizations have solely focused on tie weights, and not on the number of ties, which was the central component of the original measures. This paper proposes generalizations that combine both these aspects. We illustrate the benefits of this approach by applying one of them to Freeman&#8217;s EIES dataset.</p>
<p><strong>Motivation</strong></p>
<p><div id="attachment_2215" class="wp-caption alignright" style="width: 260px"><img src="http://thetore.files.wordpress.com/2010/04/freeman3_egos.gif?w=250&#038;h=500" alt="" title="Ego networks" width="250" height="500" class="size-full wp-image-2215" /><p class="wp-caption-text">Ego networks of Phipps Arabie (A), John Boyd (B), and Maureen Hallinan (C) from Freeman's third EIES network. The width of a tie corresponds to the number of messages sent from the focal node to their contacts. Adopted from the paper.</p></div> Centrality is the concept of being &#8220;in the thick of things.&#8221; In 1978, Freeman reviewed and clarified a growing field of research on centrality of nodes for binary networks in an article published in the first issue of <em>Social Networks</em>. Three measures were formalised: degree, closeness, and betweenness. Degree was the number of ties or neighbours of a node; closeness was the inverse of the sum of all shortest paths to others or the smallest number of ties to go through to reach all others individually; and betweeness was the number of shortest paths on which a node was on. </p>
<p>The three measures have already been generalised to weighted networks. Barrat et al. (2004) generalised degree to weighted networks by taking the sum of weights instead of the number ties, while Newman (2001) and Brandes (2001) utilised Dijkstra&#8217;s (1959) algorithm of shortest paths for generalising closeness and betweenness to weighted networks, respectiviely. Dijkstra&#8217;s algorithm defined the length of paths as the sum of cost (e.g., time in GPS calculations), which is generally only defined as the sum of the inversed tie weights. All these generalisations fail to take into account the main feature of the original measures formalised by Freeman (1978): the number of ties. </p>
<p>This limitation is highlighted for degree centrality by the three ego networks from Freeman&#8217;s third EIES network. The three nodes have roughly sent the same amount of messages; however, to a quite different number of others. If Freeman&#8217;s (1978) original measure was applied, the centrality score of the node in panel A is almost five times as high as the node in panel C attains. However, when using Barrat et al.&#8217;s generalisation, they get roughly the same score. </p>
<p>This articles proposes a new generation of node centrality measures for weighted networks. The second generation of measures takes into consideration both the weight of ties and the number of ties. The relative importance of these two aspects are controlled by a tuning parameter.  </p>
<p><strong>Want to test it with your data?</strong></p>
<p><img src="http://thetore.files.wordpress.com/2008/12/fig1.png?w=271&#038;h=177" alt="Sample network" title="Sample network" width="271" height="177" class="alignright size-full wp-image-160" />The degree_w, closeness_w, and betweenness_w-functions in <a href="http://opsahl.co.uk/tnet/" target="_blank">tnet<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> allows you to calculate the binary, weighted, and the measures that combine these two aspects on your own dataset.</p>
<p>For example, to calculate second generation node centrality measures (alpha = 0.5) on the sample network above, you can run the code below in R. The degree function easily calculates the binary and first generation measures as well; however, this is not the case for the closeness and betweenness-functions. If you would like the binary version, you can either use the dichotomise function or set alpha=0. If you would like the first generation weighted measures, you can set alpha=1 (default value). </p>
<pre class="brush: plain;">
# Load tnet
library(tnet)

# Load network
net &lt;- cbind(
i=c(1,1,2,2,2,2,3,3,4,5,5,6),
j=c(2,3,1,3,4,5,1,2,2,2,6,5),
w=c(4,2,4,4,1,2,2,4,1,2,1,1))

# Calculate degree centrality
degree_w(net, measure=c(&quot;degree&quot;, &quot;output&quot;, &quot;alpha&quot;), alpha=0.5)

# Calculate closeness centrality
closeness_w(net, alpha=0.5)

# Calculate betweenness centrality
betweenness_w(net, alpha=0.5)
</pre>
<p>To test it on Freeman&#8217;s third EIES network from <a href="http://toreopsahl.com/datasets/">the datasets page<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> and recreate Table 3 of the paper, you can do the following:</p>
<pre class="brush: plain;">
# Load tnet
library(tnet)

# Load network
data(Freemans.EIES)
net &lt;- Freemans.EIES.net.3.n32

# Calculate measures
tmp &lt;- data.frame(
  Freemans.EIES.node.Name.n32,
  degree_w(net, measure=c(&quot;degree&quot;, &quot;output&quot;, &quot;alpha&quot;), alpha=0.5),
  degree_w(net, measure=&quot;alpha&quot;, alpha=1.5)[,&quot;alpha&quot;], stringsAsFactors=FALSE)
dimnames(tmp )[[2]] &lt;- c(&quot;name&quot;, &quot;node&quot;, &quot;a00&quot;, &quot;a10&quot;, &quot;a05&quot;, &quot;a15&quot;)
tmp &lt;- tmp[,c(&quot;name&quot;,&quot;a00&quot;,&quot;a05&quot;,&quot;a10&quot;,&quot;a15&quot;)]

# Merge names and order table
out &lt;- data.frame(
  seq.int(nrow(tmp)),
  tmp[order(-tmp[,&quot;a00&quot;], -tmp[,&quot;a10&quot;]),c(&quot;name&quot;, &quot;a00&quot;)],
  tmp[order(-tmp[,&quot;a05&quot;], -tmp[,&quot;a10&quot;]),c(&quot;name&quot;, &quot;a05&quot;)],
  tmp[order(-tmp[,&quot;a10&quot;], -tmp[,&quot;a10&quot;]),c(&quot;name&quot;, &quot;a10&quot;)],
  tmp[order(-tmp[,&quot;a15&quot;], -tmp[,&quot;a10&quot;]),c(&quot;name&quot;, &quot;a15&quot;)])
dimnames(out)[[2]] &lt;- c(&quot;Rank&quot;,
  &quot;a00.name&quot;,&quot;a00&quot;,
  &quot;a05.name&quot;,&quot;a05&quot;,
  &quot;a10.name&quot;,&quot;a10&quot;,
  &quot;a15.name&quot;,&quot;a15&quot;)

# Display table
out
</pre>
<p><strong>References</strong></p>
<p>Brandes, U., 2001. A Faster Algorithm for Betweenness Centrality. Journal of Mathematical Sociology 25, 163-177.</p>
<p>Dijkstra, E. W., 1959. A note on two problems in connexion with graphs. Numerische Mathematik 1, 269-271.</p>
<p>Freeman, L. C., 1978. Centrality in social networks: Conceptual clarification. Social Networks 1, 215-239.</p>
<p>Newman, M. E. J., 2001. Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality. Physical Review E 64, 016132. </p>
<p>Opsahl, T., Agneessens, F., Skvoretz, J. (2010). <a href="http://toreopsahl.com/2010/04/21/article-node-centrality-in-weighted-networks-generalizing-degree-and-shortest-paths/">Node centrality in weighted networks: Generalizing degree and shortest paths</a>. Social Networks 32, 245-251.</p>
<div class="knobcite">Please cite the paper if you use it.</div>
<br />Filed under: <a href='http://toreopsahl.com/category/network-thoughts/'>Network thoughts</a> Tagged: <a href='http://toreopsahl.com/tag/actors/'>actors</a>, <a href='http://toreopsahl.com/tag/arcs/'>arcs</a>, <a href='http://toreopsahl.com/tag/betweenness/'>betweenness</a>, <a href='http://toreopsahl.com/tag/centrality/'>centrality</a>, <a href='http://toreopsahl.com/tag/closeness/'>closeness</a>, <a href='http://toreopsahl.com/tag/complex-networks/'>complex networks</a>, <a href='http://toreopsahl.com/tag/degree/'>degree</a>, <a href='http://toreopsahl.com/tag/directed-networks/'>directed networks</a>, <a href='http://toreopsahl.com/tag/edges/'>edges</a>, <a href='http://toreopsahl.com/tag/graphs/'>graphs</a>, <a href='http://toreopsahl.com/tag/gregariousness/'>gregariousness</a>, <a href='http://toreopsahl.com/tag/hubs/'>hubs</a>, <a href='http://toreopsahl.com/tag/links/'>Links</a>, <a href='http://toreopsahl.com/tag/local/'>local</a>, <a href='http://toreopsahl.com/tag/network/'>network</a>, <a href='http://toreopsahl.com/tag/nodes/'>nodes</a>, <a href='http://toreopsahl.com/tag/popularity/'>popularity</a>, <a href='http://toreopsahl.com/tag/reinforcement/'>reinforcement</a>, <a href='http://toreopsahl.com/tag/shortest-distance/'>shortest distance</a>, <a href='http://toreopsahl.com/tag/shortest-path/'>shortest path</a>, <a href='http://toreopsahl.com/tag/social-network-analysis/'>social network analysis</a>, <a href='http://toreopsahl.com/tag/strength-of-nodes/'>strength of nodes</a>, <a href='http://toreopsahl.com/tag/strength-of-ties/'>strength of ties</a>, <a href='http://toreopsahl.com/tag/ties/'>ties</a>, <a href='http://toreopsahl.com/tag/undirected-networks/'>undirected networks</a>, <a href='http://toreopsahl.com/tag/valued-networks/'>valued networks</a>, <a href='http://toreopsahl.com/tag/vertices/'>vertices</a>, <a href='http://toreopsahl.com/tag/weighted-networks/'>weighted networks</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/thetore.wordpress.com/2204/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/thetore.wordpress.com/2204/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/thetore.wordpress.com/2204/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/thetore.wordpress.com/2204/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/thetore.wordpress.com/2204/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/thetore.wordpress.com/2204/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/thetore.wordpress.com/2204/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/thetore.wordpress.com/2204/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/thetore.wordpress.com/2204/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/thetore.wordpress.com/2204/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/thetore.wordpress.com/2204/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/thetore.wordpress.com/2204/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/thetore.wordpress.com/2204/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/thetore.wordpress.com/2204/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=2204&subd=thetore&ref=&feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2010/04/21/article-node-centrality-in-weighted-networks-generalizing-degree-and-shortest-paths/feed/</wfw:commentRss>
		<slash:comments>23</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://thetore.files.wordpress.com/2010/04/freeman3_egos.gif" medium="image">
			<media:title type="html">Ego networks</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2008/12/fig1.png" medium="image">
			<media:title type="html">Sample network</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />
	</item>
		<item>
		<title>Closeness centrality in networks with disconnected components</title>
		<link>http://toreopsahl.com/2010/03/20/closeness-centrality-in-networks-with-disconnected-components/</link>
		<comments>http://toreopsahl.com/2010/03/20/closeness-centrality-in-networks-with-disconnected-components/#comments</comments>
		<pubDate>Sat, 20 Mar 2010 13:03:15 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[centrality]]></category>
		<category><![CDATA[closeness]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[directed networks]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[global]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[hubs]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[local]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[shortest distance]]></category>
		<category><![CDATA[shortest path]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[undirected networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[weighted networks]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=2163</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2010/03/20/closeness-centrality-in-networks-with-disconnected-components/"><img src="http://thetore.files.wordpress.com/2010/03/social-network_closeness_small.png" alt="Closeness in disconnected components" title="Disconnected components" width="300" height="139" class="alignright size-full wp-image-2187" /></a>A key node centrality measure in networks is closeness centrality (Freeman, 1978; Wasserman and Faust, 1994). It is defined as the inverse of farness, which in turn, is the sum of distances to all other nodes. As the distance between nodes in disconnected components of a network is infinite, this measure cannot be applied to networks with disconnected components (Opsahl et al., 2010; Wasserman and Faust, 1994). This post highlights a possible work-around, which allows the measure to be applied to these networks and at the same time maintain the original idea behind the measure.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=2163&subd=thetore&ref=&feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>A key node centrality measure in networks is closeness centrality (Freeman, 1978; Opsahl et al., 2010; Wasserman and Faust, 1994). It is defined as the inverse of farness, which in turn, is the sum of distances to all other nodes. As the distance between nodes in disconnected components of a network is infinite, this measure cannot be applied to networks with disconnected components (Opsahl et al., 2010; Wasserman and Faust, 1994). <strong>This post highlights a possible work-around, which allows the measure to be applied to these networks and at the same time maintain the original idea behind the measure.</strong></p>
<p><img src="http://thetore.files.wordpress.com/2010/03/social-network_closeness.png?w=430&#038;h=199" alt="Disconnected components" title="Disconnected components" width="430" height="199" class="alignright size-full wp-image-2178" />This network gives a concrete example of the closeness measure. The distance between node G and node H is infinite as a direct or indirect path does not exist between them (i.e., they belong to separate components). As long as at least one node is unreachable by the others, the sum of distances to all other nodes is infinite. As a consequence, researchers have limited the closeness measure to the largest component of nodes (i.e., measured intra-component). The distance matrix for the nodes in the sample network is:</p>
<table class="tore" cellspacing="1" cellpadding="3" align="center">
<tr align="center">
<th></th>
<th colspan="11">Nodes</th>
<th></th>
<th colspan="2">All inclusive</th>
<th colspan="2">Intra-component</th>
</tr>
<tr align="center">
<th></th>
<th>A</th>
<th>B</th>
<th>C</th>
<th>D</th>
<th>E</th>
<th>F</th>
<th>G</th>
<th>H</th>
<th>I</th>
<th>J</th>
<th>K</th>
<th></th>
<th>Farness</th>
<th>Closeness</th>
<th>Farness</th>
<th>Closeness</th>
</tr>
<tr align="center">
<th>A</th>
<td>&#8230;</td>
<td>1</td>
<td>1</td>
<td>2</td>
<td>2</td>
<td>3</td>
<td>3</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<th></th>
<td>Inf</td>
<td>Inf</td>
<td>12</td>
<td>0.08</td>
</tr>
<tr align="center">
<th>B</th>
<td>1</td>
<td>&#8230;</td>
<td>1</td>
<td>2</td>
<td>1</td>
<td>2</td>
<td>3</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<th></th>
<td>Inf</td>
<td>Inf</td>
<td>10</td>
<td>0.10</td>
</tr>
<tr align="center">
<th>C</th>
<td>1</td>
<td>1</td>
<td>&#8230;</td>
<td>1</td>
<td>2</td>
<td>2</td>
<td>2</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<th></th>
<td>Inf</td>
<td>Inf</td>
<td>9</td>
<td>0.11</td>
</tr>
<tr align="center">
<th>D</th>
<td>2</td>
<td>2</td>
<td>1</td>
<td>&#8230;</td>
<td>2</td>
<td>1</td>
<td>1</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<th></th>
<td>Inf</td>
<td>Inf</td>
<td>9</td>
<td>0.11</td>
</tr>
<tr align="center">
<th>E</th>
<td>2</td>
<td>1</td>
<td>2</td>
<td>2</td>
<td>&#8230;</td>
<td>1</td>
<td>3</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<th></th>
<td>Inf</td>
<td>Inf</td>
<td>11</td>
<td>0.09</td>
</tr>
<tr align="center">
<th>F</th>
<td>3</td>
<td>2</td>
<td>2</td>
<td>1</td>
<td>1</td>
<td>&#8230;</td>
<td>2</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<th></th>
<td>Inf</td>
<td>Inf</td>
<td>11</td>
<td>0.09</td>
</tr>
<tr align="center">
<th>G</th>
<td>3</td>
<td>3</td>
<td>2</td>
<td>1</td>
<td>3</td>
<td>2</td>
<td>&#8230;</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<th></th>
<td>Inf</td>
<td>Inf</td>
<td>14</td>
<td>0.07</td>
</tr>
<tr align="center">
<th>H</th>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>&#8230;</td>
<td>1</td>
<td>2</td>
<td>Inf</td>
<th></th>
<td>Inf</td>
<td>Inf</td>
<td>3</td>
<td>0.33</td>
</tr>
<tr align="center">
<th>I</th>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>1</td>
<td>&#8230;</td>
<td>1</td>
<td>Inf</td>
<th></th>
<td>Inf</td>
<td>Inf</td>
<td>2</td>
<td>0.50</td>
</tr>
<tr align="center">
<th>J</th>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>2</td>
<td>1</td>
<td>&#8230;</td>
<td>Inf</td>
<th></th>
<td>Inf</td>
<td>Inf</td>
<td>3</td>
<td>0.33</td>
</tr>
<tr align="center">
<th>K</th>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>&#8230;</td>
<th></th>
<td>Inf</td>
<td>Inf</td>
<td>0</td>
<td>&#8230;</td>
</tr>
</table>
<p>Although the intra-component closeness scores are not infinite for all the nodes in the network, it would be inaccurate to use them as a closeness measure. This is due to the fact that the sum of distances would contain different number of paths (e.g., there are two distance from node H to other nodes in its component, while there are six distances from node G to other nodes in its component). In fact, nodes in smaller components would generally be seen as being closer to others than nodes in larger components. Thus, researchers has focused solely on the largest component. However, this leads to a number of methodological issues, including sample selection.</p>
<p>To develop this measure, I went back to the original equation:</p>
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cmbox%7Bcloseness%7D%28i%29+%3D+%5Csum_j+%5Cleft%5B+d_%7Bij%7D+%5Cright%5D%5E%7B-1%7D+%3D+%5Cfrac%7B1%7D%7B%5Csum_j+d_%7Bij%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mbox{closeness}(i) = \sum_j \left[ d_{ij} \right]^{-1} = \frac{1}{\sum_j d_{ij}}' title='\mbox{closeness}(i) = \sum_j \left[ d_{ij} \right]^{-1} = \frac{1}{\sum_j d_{ij}}' class='latex' /></p>
<p>where <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' /> is the focal node, <img src='http://l.wordpress.com/latex.php?latex=j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='j' title='j' class='latex' /> is another node in the network, and <img src='http://l.wordpress.com/latex.php?latex=d_%7Bij%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d_{ij}' title='d_{ij}' class='latex' /> is the shortest distance between these two nodes. In this equation, the distances are inversed after they have been summed, and when summing an infinite number, the outcome is infinite. To overcome this issue while staying consistent with the existing measure of closeness, I took advantage of the fact that the limit of a number divided by infinity is zero. Although infinity is not an exact number, the inverse of a very high number is very close to 0. In fact, 0 is returned if you enter 1/Inf in the statistical programme <em>R</em>. By taking advantage of this feature, it is possible to rewrite the closeness equation as <em>the sum of inversed</em> distances to all other nodes instead of the <em>inversed of the sum </em>of distances to all other nodes. The equation would then be:</p>
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cmbox%7Bcloseness%7D%28i%29+%3D+%5Csum_j+%5Cfrac%7B1%7D%7Bd_%7Bij%7D%7D+&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mbox{closeness}(i) = \sum_j \frac{1}{d_{ij}} ' title='\mbox{closeness}(i) = \sum_j \frac{1}{d_{ij}} ' class='latex' /></p>
<p>To exemplify this change, for the example network above, the inversed distances and closeness scores are:</p>
<table class="tore" cellspacing="1" cellpadding="3" align="center">
<tr align="center">
<th></th>
<th colspan="11">Nodes</th>
<th></th>
<th colspan="2">Closeness</th>
</tr>
<tr align="center">
<th></th>
<th>A</th>
<th>B</th>
<th>C</th>
<th>D</th>
<th>E</th>
<th>F</th>
<th>G</th>
<th>H</th>
<th>I</th>
<th>J</th>
<th>K</th>
<th></th>
<th>Sum</th>
<th>Normalized</th>
</tr>
<tr align="center">
<th>A</th>
<td>&#8230;</td>
<td>1.00</td>
<td>1.00</td>
<td>0.50</td>
<td>0.50</td>
<td>0.33</td>
<td>0.33</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<th></th>
<td>3.67</td>
<td>0.37</td>
</tr>
<tr align="center">
<th>B</th>
<td>1.00</td>
<td>&#8230;</td>
<td>1.00</td>
<td>0.50</td>
<td>1.00</td>
<td>0.50</td>
<td>0.33</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<th></th>
<td>4.33</td>
<td>0.43</td>
</tr>
<tr align="center">
<th>C</th>
<td>1.00</td>
<td>1.00</td>
<td>&#8230;</td>
<td>1.00</td>
<td>0.50</td>
<td>0.50</td>
<td>0.50</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<th></th>
<td>4.50</td>
<td>0.45</td>
</tr>
<tr align="center">
<th>D</th>
<td>0.50</td>
<td>0.50</td>
<td>1.00</td>
<td>&#8230;</td>
<td>0.50</td>
<td>1.00</td>
<td>1.00</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<th></th>
<td>4.50</td>
<td>0.45</td>
</tr>
<tr align="center">
<th>E</th>
<td>0.50</td>
<td>1.00</td>
<td>0.50</td>
<td>0.50</td>
<td>&#8230;</td>
<td>1.00</td>
<td>0.33</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<th></th>
<td>3.83</td>
<td>0.38</td>
</tr>
<tr align="center">
<th>F</th>
<td>0.33</td>
<td>0.50</td>
<td>0.50</td>
<td>1.00</td>
<td>1.00</td>
<td>&#8230;</td>
<td>0.50</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<th></th>
<td>3.83</td>
<td>0.38</td>
</tr>
<tr align="center">
<th>G</th>
<td>0.33</td>
<td>0.33</td>
<td>0.50</td>
<td>1.00</td>
<td>0.33</td>
<td>0.50</td>
<td>&#8230;</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<th></th>
<td>3.00</td>
<td>0.30</td>
</tr>
<tr align="center">
<th>H</th>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>&#8230;</td>
<td>1.00</td>
<td>0.50</td>
<td>0</td>
<th></th>
<td>1.50</td>
<td>0.15</td>
</tr>
<tr align="center">
<th>I</th>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1.00</td>
<td>&#8230;</td>
<td>1.00</td>
<td>0</td>
<th></th>
<td>2</td>
<td>0.20</td>
</tr>
<tr align="center">
<th>J</th>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0.50</td>
<td>1.00</td>
<td>&#8230;</td>
<td>0</td>
<th></th>
<td>1.50</td>
<td>0.15</td>
</tr>
<tr align="center">
<th>K</th>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>&#8230;</td>
<th></th>
<td>0</td>
<td>0</td>
</tr>
</table>
<p>As can be seen from this table, a closeness score is attained for all nodes taking into consideration an equal number of distances for each node irrespective of the size of the nodes&#8217; component. Moreover, nodes belonging to a larger component generally attains a higher score. This is deliberate as these nodes can reach a greater number of others than nodes in smaller components. The normalized scores are bound between 0 and 1. It is 0 if a node is an isolate, and 1 if a node is directly connected all others. </p>
<p>This measure can easily be extended to weighted networks by introducing Dijkstra&#8217;s (1959) algorithm as proposed in <a href="http://toreopsahl.com/2009/01/09/average-shortest-distance-in-weighted-networks/">Average shortest distance in weighted networks<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>.</p>
<p><strong>References</strong></p>
<p>Dijkstra, E. W., 1959. A note on two problems in connexion with graphs. Numerische Mathematik 1, 269-271.</p>
<p>Freeman, L. C., 1978. Centrality in social networks: Conceptual clarification. Social Networks 1, 215-239.</p>
<p>Opsahl, T., Agneessens, F., Skvoretz, J. (2010). <a href="http://toreopsahl.com/2010/04/21/article-node-centrality-in-weighted-networks-generalizing-degree-and-shortest-paths/">Node centrality in weighted networks: Generalizing degree and shortest paths</a>. Social Networks 32, 245-251.</p>
<p>Wasserman, S., Faust, K., 1994. Social Network Analysis: Methods and Applications. Cambridge University Press, New York, NY.</p>
<p><strong>What to try it with your data?</strong></p>
<p>Below is the code to calculate the closeness measure on a binary version of Freeman&#8217;s third EIES network. From tnet version 2.7, this measure will be included in the closeness-function.</p>
<pre class="brush: plain;">
# Load tnet 2.7 or newer
library(tnet)

# Load network
# Node K is assigned node id 8 instead of 10 as isolates at the end of id sequences are not recorded in edgelists
net &lt;- cbind(
  i=c(1,1,2,2,2,3,3,3,4,4,4,5,5,6,6,7,9,10,10,11),
  j=c(2,3,1,3,5,1,2,4,3,6,7,2,6,4,5,4,10,9,11,10),
  w=c(1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1))

# Calculate measures
closeness_w(net, gconly=FALSE)
</pre>
<div class="knobcite">This post is the explaination of a footnote in Opsahl et al. (2010). Please cite that article if you use this measure.</div>
<br />Filed under: <a href='http://toreopsahl.com/category/network-thoughts/'>Network thoughts</a> Tagged: <a href='http://toreopsahl.com/tag/actors/'>actors</a>, <a href='http://toreopsahl.com/tag/arcs/'>arcs</a>, <a href='http://toreopsahl.com/tag/centrality/'>centrality</a>, <a href='http://toreopsahl.com/tag/closeness/'>closeness</a>, <a href='http://toreopsahl.com/tag/complex-networks/'>complex networks</a>, <a href='http://toreopsahl.com/tag/directed-networks/'>directed networks</a>, <a href='http://toreopsahl.com/tag/edges/'>edges</a>, <a href='http://toreopsahl.com/tag/global/'>global</a>, <a href='http://toreopsahl.com/tag/graphs/'>graphs</a>, <a href='http://toreopsahl.com/tag/hubs/'>hubs</a>, <a href='http://toreopsahl.com/tag/links/'>Links</a>, <a href='http://toreopsahl.com/tag/local/'>local</a>, <a href='http://toreopsahl.com/tag/network/'>network</a>, <a href='http://toreopsahl.com/tag/nodes/'>nodes</a>, <a href='http://toreopsahl.com/tag/shortest-distance/'>shortest distance</a>, <a href='http://toreopsahl.com/tag/shortest-path/'>shortest path</a>, <a href='http://toreopsahl.com/tag/social-network-analysis/'>social network analysis</a>, <a href='http://toreopsahl.com/tag/strength-of-ties/'>strength of ties</a>, <a href='http://toreopsahl.com/tag/ties/'>ties</a>, <a href='http://toreopsahl.com/tag/undirected-networks/'>undirected networks</a>, <a href='http://toreopsahl.com/tag/valued-networks/'>valued networks</a>, <a href='http://toreopsahl.com/tag/vertices/'>vertices</a>, <a href='http://toreopsahl.com/tag/weighted-networks/'>weighted networks</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/thetore.wordpress.com/2163/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/thetore.wordpress.com/2163/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/thetore.wordpress.com/2163/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/thetore.wordpress.com/2163/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/thetore.wordpress.com/2163/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/thetore.wordpress.com/2163/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/thetore.wordpress.com/2163/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/thetore.wordpress.com/2163/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/thetore.wordpress.com/2163/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/thetore.wordpress.com/2163/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/thetore.wordpress.com/2163/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/thetore.wordpress.com/2163/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/thetore.wordpress.com/2163/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/thetore.wordpress.com/2163/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=2163&subd=thetore&ref=&feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2010/03/20/closeness-centrality-in-networks-with-disconnected-components/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2010/03/social-network_closeness.png" medium="image">
			<media:title type="html">Disconnected components</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />
	</item>
		<item>
		<title>Local clustering coefficient for two-mode networks</title>
		<link>http://toreopsahl.com/2010/01/06/local-clustering-coefficient-for-two-mode-networks/</link>
		<comments>http://toreopsahl.com/2010/01/06/local-clustering-coefficient-for-two-mode-networks/#comments</comments>
		<pubDate>Wed, 06 Jan 2010 18:29:35 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[weighted networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[local]]></category>
		<category><![CDATA[clustering coefficient]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[embeddedness]]></category>
		<category><![CDATA[two-mode networks]]></category>
		<category><![CDATA[affiliation networks]]></category>
		<category><![CDATA[bipartite networks]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[graphs]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=2108</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2010/01/06/local-clustering-coefficient-for-two-mode-networks/"><img src="http://thetore.wordpress.com/files/2009/09/abstract_picture.png" alt="" title="Local clustering coefficient for two-mode networks" width="250" height="73" class="alignright size-full wp-image-1964" /></a>Similar to the motivation of the global clustering coefficient that I proposed in <a href="http://http://toreopsahl.com/2009/09/11/clustering-in-two-mode-networks/">Clustering in two-mode networks<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>, the local clustering coefficient is biased if applied to a projection of a two-mode network. It is biased in the sense that the randomly expected value is not obtained on the projection of a random two-mode network. To overcome this methodological bias, I redefine the local clustering coefficient for two-mode networks. The new coefficient is a mix between the global clustering coefficient for two-mode networks and Barrat's (2004) local coefficient for a weighted one-mode network. The coefficient is tested on Davis' (1940) Southern Women dataset. <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=2108&subd=thetore&ref=&feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<div class="knobinfo">This post is part of a preprint titled Triadic closure in two-mode networks: Redefining the global and local clustering coefficient. The preprint is available on the <a href="http://toreopsahl.com/publications/">Publications-page</a>.</div>
<p>In a similar vein as the global clustering coefficient that I proposed in <a href="http://http://toreopsahl.com/2009/09/11/clustering-in-two-mode-networks/">Clustering in two-mode networks<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>, the local clustering coefficient can be redefined for two-mode networks. Originally, Watts and Strogatz (1998) defined the local clustering coefficient for a focal node as the fraction of present ties among a node’s neighbors over the possible number of ties between them. It can be formalized for a focal node, <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' /> as follows:</p>
<p><img src='http://l.wordpress.com/latex.php?latex=C_%7Blocal%7D%28i%29+%3D+%5Cfrac%7B%5Cmbox%7Bactual+ties+between+a+node%27s+neighbours%7D%7D%7B%5Cmbox%7Bpossible+ties+between+a+node%27s+neighbours%7D%7D%3D%5Cfrac%7B%5Ctau_%7Bi%2C%5CDelta%7D%7D%7B%5Ctau_%7Bi%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=1' alt='C_{local}(i) = \frac{\mbox{actual ties between a node&#039;s neighbours}}{\mbox{possible ties between a node&#039;s neighbours}}=\frac{\tau_{i,\Delta}}{\tau_{i}}' title='C_{local}(i) = \frac{\mbox{actual ties between a node&#039;s neighbours}}{\mbox{possible ties between a node&#039;s neighbours}}=\frac{\tau_{i,\Delta}}{\tau_{i}}' class='latex' /></p>
<p>where <img src='http://l.wordpress.com/latex.php?latex=%5Ctau_%7Bi%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\tau_{i}' title='\tau_{i}' class='latex' /> is the number of 2-paths centered on a node, and <img src='http://l.wordpress.com/latex.php?latex=%5Ctau_%7Bi%2C%5CDelta%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\tau_{i,\Delta}' title='\tau_{i,\Delta}' class='latex' /> is the number of these that are closed. While the global clustering coefficient is an aggregation of all 2-paths, the local one can be seen as simply an intermediary level of aggregation as it can be conceptualized in terms of 2-paths.</p>
<p>When applying the traditional local clustering coefficient to the <a href="http://toreopsahl.com/2009/05/01/projecting-two-mode-networks-onto-weighted-one-mode-networks/">projection of two-mode network<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>, cliques among nodes connected to common nodes in the two-mode network are created. These cliques contain a high number of triangles. This has an impact on measures that rely on ego network density, such as the clustering coefficients and structural holes measures (Burt, 1992). The average of local clustering coefficients is over-estimated for these networks as projections of random two-mode networks contain an above random clustering coefficient. Therefore, a new measure that does not over-estimate the level is needed.</p>
<p>Given the extention of 2-paths in one-mode networks to 4-paths in two-mode networks for <a href="http://http://toreopsahl.com/2009/09/11/clustering-in-two-mode-networks/">the global clustering coefficient<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>, the denominator and numerator of the local clustering coefficient can also be redefined in terms of 4-paths. While the original local coefficient was based on 2-paths centered on the focal node, this can be extended to 4-paths centered on a focal node in two-mode networks. This would imply that the first and last nodes of the path are of the same mode as the focal node. Formally, I propose:</p>
<p><img src='http://l.wordpress.com/latex.php?latex=C%5E%7B%2A%7D_%7Blocal%7D%28i%29+%3D+%5Cfrac%7B%5Cmbox%7Bclosed+4-paths+centered+on+ego%7D%7D%7B%5Cmbox%7B4-paths+centered+on+ego%7D%7D%3D%5Cfrac%7B%5Ctau%5E%7B%2A%7D_%7Bi%2C%5CDelta%7D%7D%7B%5Ctau%5E%7B%2A%7D_%7Bi%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=1' alt='C^{*}_{local}(i) = \frac{\mbox{closed 4-paths centered on ego}}{\mbox{4-paths centered on ego}}=\frac{\tau^{*}_{i,\Delta}}{\tau^{*}_{i}}' title='C^{*}_{local}(i) = \frac{\mbox{closed 4-paths centered on ego}}{\mbox{4-paths centered on ego}}=\frac{\tau^{*}_{i,\Delta}}{\tau^{*}_{i}}' class='latex' /></p>
<p>where <img src='http://l.wordpress.com/latex.php?latex=%5Ctau%5E%7B%2A%7D_%7Bi%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\tau^{*}_{i}' title='\tau^{*}_{i}' class='latex' /> is the number of 4-paths with ego as the middle node, and <img src='http://l.wordpress.com/latex.php?latex=%5Ctau%5E%7B%2A%7D_%7Bi%2C%5CDelta%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\tau^{*}_{i,\Delta}' title='\tau^{*}_{i,\Delta}' class='latex' /> is the subset of these in which the first and the last nodes of the path share a common node that is not part of the 4-path.</p>
<p>This coefficient has similar properties as the local clustering coefficient. First, for each node, the coefficient varies between 0 and 1 as the numerator and denominator are positive numbers, and the numerator is a subset of the denominator. Second, all 4-paths are closed in a fully connected network, and therefore, the coefficient is equal to 1. Third, if ties are randomly placed in the network, the expected value of the local clustering coefficient is the same as the one for the global coefficient, <img src='http://l.wordpress.com/latex.php?latex=1-%281-d%5E%7B2%7D%29%5E%7B%28N_%7Bp%7D-2%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1-(1-d^{2})^{(N_{p}-2)}' title='1-(1-d^{2})^{(N_{p}-2)}' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=d&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d' title='d' class='latex' /> is the density of the network. </p>
<p><strong>Empirical test</strong></p>
<p>To empirically test the proposed local clustering coefficient for two-mode networks, I have also used the Davis Southern Women dataset as this dataset has a limited number of nodes. The table below shows the local clustering coefficients attained from the two-mode network and projected one-mode network as well as the two-mode and one-mode degree scores (i.e., the number of events attended and the number of other women attending the same events, respectively). </p>
<table class="tore" cellspacing="1" cellpadding="3">
<tr align="center">
<th align="left">Node</th>
<th>Events attended</th>
<th>Other women<br /> attending<br /> same events</th>
<th>One-mode LCC</th>
<th>Two-mode LCC</th>
</tr>
<tr>
<td>EVELYN</td>
<td align="right">8</td>
<td align="right">17</td>
<td align="right">0.8971</td>
<td align="right">0.7667</td>
</tr>
<tr>
<td>LAURA</td>
<td align="right">7</td>
<td align="right">15</td>
<td align="right">0.9619</td>
<td align="right">0.8422</td>
</tr>
<tr>
<td>THERESA</td>
<td align="right">8</td>
<td align="right">17</td>
<td align="right">0.8971</td>
<td align="right">0.7523</td>
</tr>
<tr>
<td>BRENDA</td>
<td align="right">7</td>
<td align="right">15</td>
<td align="right">0.9619</td>
<td align="right">0.8388</td>
</tr>
<tr>
<td>CHARLOTTE</td>
<td align="right">4</td>
<td align="right">11</td>
<td align="right">1</td>
<td align="right">1</td>
</tr>
<tr>
<td>FRANCES</td>
<td align="right">4</td>
<td align="right">15</td>
<td align="right">0.9619</td>
<td align="right">0.869</td>
</tr>
<tr>
<td>ELEANOR</td>
<td align="right">4</td>
<td align="right">15</td>
<td align="right">0.9619</td>
<td align="right">0.7959</td>
</tr>
<tr>
<td>PEARL</td>
<td align="right">3</td>
<td align="right">16</td>
<td align="right">0.9333</td>
<td align="right">0.6463</td>
</tr>
<tr>
<td>RUTH</td>
<td align="right">4</td>
<td align="right">17</td>
<td align="right">0.8971</td>
<td align="right">0.6703</td>
</tr>
<tr>
<td>VERNE</td>
<td align="right">4</td>
<td align="right">17</td>
<td align="right">0.8971</td>
<td align="right">0.6741</td>
</tr>
<tr>
<td>MYRNA</td>
<td align="right">4</td>
<td align="right">16</td>
<td align="right">0.9333</td>
<td align="right">0.7139</td>
</tr>
<tr>
<td>KATHERINE</td>
<td align="right">6</td>
<td align="right">16</td>
<td align="right">0.9333</td>
<td align="right">0.7696</td>
</tr>
<tr>
<td>SYLVIA</td>
<td align="right">7</td>
<td align="right">17</td>
<td align="right">0.8971</td>
<td align="right">0.7462</td>
</tr>
<tr>
<td>NORA</td>
<td align="right">8</td>
<td align="right">17</td>
<td align="right">0.8971</td>
<td align="right">0.838</td>
</tr>
<tr>
<td>HELEN</td>
<td align="right">5</td>
<td align="right">17</td>
<td align="right">0.8971</td>
<td align="right">0.8159</td>
</tr>
<tr>
<td>DOROTHY</td>
<td align="right">2</td>
<td align="right">16</td>
<td align="right">0.9333</td>
<td align="right">0.5407</td>
</tr>
<tr>
<td>OLIVIA</td>
<td align="right">2</td>
<td align="right">12</td>
<td align="right">1</td>
<td align="right">0.5806</td>
</tr>
<tr>
<td>FLORA</td>
<td align="right">2</td>
<td align="right">12</td>
<td align="right">1</td>
<td align="right">0.5806</td>
</tr>
</table>
<p>The two-mode and one-mode degree scores and the traditional and proposed local clustering coefficients (LCC) of the women in Davis’ (1940) Southern Women dataset. The randomly expected one-mode clustering coefficient is 0.9085, while the one for two-mode networks is 0.7978.</p>
<p>There are a number of observations. First, for all the nodes that did not have the maximum value, the two-mode coefficient is smaller than the coefficient attained on the projected network. This feature is not given as multiple 4-paths might exist among three primary nodes, and therefore, the two-mode coefficient might be higher than the one attained on projected one-mode network. It gives, however, an indication of the bias that is created by three or more primary nodes are connected to a common node. Second, the reduction difference between the two coefficients is greater for the women attending fewer events (pair-wise correlation between the number of events and the difference is -0.69, with a <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' />-value of 0.001). This might suggest that the bias is greatest for nodes that attend few events. This is not unexpected as a woman attending a single event with at least two others would automatically attain a coefficient of 1 in the binary network.</p>
<p>To further highlight some of the features of the redefined local clustering coefficient, Flora and the network around her up to three steps is shown below. In a one-mode projection, all the possible ties among Flora’s contacts are present. This is due to the fact that eleven out of the twelve contacts attended event 9. The twelfth contact that did not attend event 9, Helen, is connected to all others connect through other events. The redefined clustering coefficient is less than 1 for Flora. This is because event 9 and 11 are not used to form closing ties among the women attending them (i.e., close 4-paths). More specifically, 4-paths exist from the nodes attached to event 11 to the nodes connected to event 9 (excluding themselves). In total, there are 31 4-paths, out of which 18 are closed by the event 6, 7, 8, and 10.</p>
<p><img src="http://thetore.files.wordpress.com/2010/01/flora.png?w=600&#038;h=387" alt="" title="Flora&#39;s local network up to three steps" width="600" height="387" class="aligncenter size-full wp-image-2131" /><br />
Flora&#8217;s local network up to three steps. Only non-redudant ties are shown between the second and third steps.</p>
<p><strong>References</strong></p>
<p>Burt, R.S., 1992. Structural Holes. Harvard University Press, Cambridge, MA.<br />
Davis, A., Gardner, B. B., Gardner, M. R., 1941. Deep South. University of Chicago Press, Chicago, IL.<br />
Watts, D.J., Strogatz, S.H., 1998. Collective dynamics of small-world networks. Nature 393, 440-442.</p>
<p><strong>What to try it with your data</strong></p>
<p>The redefined local clustering coefficient is implemented in tnet as clustering_tm_local. Below is the code for analysing a sample network and Davis&#8217; (1940) Southern Women dataset is shown. </p>
<pre class="brush: plain;">
# Load tnet
library(tnet)

# Load networks
net &lt;- cbind(
 i=c(1,1,2,2,2,3,3,4,5,5,6),
 p=c(1,2,1,3,4,2,3,4,3,5,5),
 w=c(3,5,6,1,2,6,2,1,3,1,2))

# Obtain the binary local clustering coefficients of the nodes in the sample network
clustering_tm_local(net[,1:2])

# Obtain the weighted local clustering coefficients of the nodes in the sample network
clustering_tm_local(net)

# Obtain the binary local clustering coefficients of the women in Davis' (1940) Southern Women dataset
data(&quot;Davis.Southern.women&quot;)
clustering_tm_local(Davis.Southern.women.2mode)
</pre>
<p>The output from the binary and weighted analyses of the sample network is:</p>
<pre class="brush: plain;">
 node  lc
    1 1.0
    2 0.2
    3 0.5
    4 NaN
    5 0.0
    6 NaN

 node  lc     lc.am     lc.gm     lc.ma     lc.mi
    1 1.0 1.0000000 1.0000000 1.0000000 1.0000000
    2 0.2 0.2400000 0.2313222 0.2608696 0.2000000
    3 0.5 0.4666667 0.4317651 0.5000000 0.3333333
    4 NaN       NaN       NaN       NaN       NaN
    5 0.0 0.0000000 0.0000000 0.0000000 0.0000000
    6 NaN       NaN       NaN       NaN       NaN
</pre>
<div class="knobinfo">I would like to acknowledge John Skvoretz and Filip Agneessens in helping to develop the idea behind this post.</div>
<div class="knobcite">Please cite or link to this post if you use it.</div>
<br />Posted in Network thoughts Tagged: actors, affiliation networks, arcs, bipartite networks, clustering coefficient, complex networks, edges, embeddedness, graphs, Links, local, network, nodes, social network analysis, strength of ties, ties, two-mode networks, valued networks, vertices, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/thetore.wordpress.com/2108/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/thetore.wordpress.com/2108/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/thetore.wordpress.com/2108/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/thetore.wordpress.com/2108/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/thetore.wordpress.com/2108/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/thetore.wordpress.com/2108/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/thetore.wordpress.com/2108/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/thetore.wordpress.com/2108/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/thetore.wordpress.com/2108/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/thetore.wordpress.com/2108/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/thetore.wordpress.com/2108/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/thetore.wordpress.com/2108/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/thetore.wordpress.com/2108/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/thetore.wordpress.com/2108/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=2108&subd=thetore&ref=&feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2010/01/06/local-clustering-coefficient-for-two-mode-networks/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://thetore.files.wordpress.com/2010/01/flora.png" medium="image">
			<media:title type="html">Flora&#39;s local network up to three steps</media:title>
		</media:content>
	</item>
		<item>
		<title>Online Social Network-dataset now available</title>
		<link>http://toreopsahl.com/2009/11/10/online-social-network-dataset-now-available/</link>
		<comments>http://toreopsahl.com/2009/11/10/online-social-network-dataset-now-available/#comments</comments>
		<pubDate>Tue, 10 Nov 2009 17:46:53 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[weighted networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[directed networks]]></category>
		<category><![CDATA[online]]></category>
		<category><![CDATA[communication]]></category>
		<category><![CDATA[social networking site]]></category>
		<category><![CDATA[online communication]]></category>
		<category><![CDATA[online social networks]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[graphs]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=2054</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2009/11/10/online-social-network-dataset-now-available/" rel="Online Social Network-dataset now available"><img src="http://thetore.wordpress.com/files/2009/11/af-page20-fixed_150.png" alt="Online Social" title="Online Social Network-dataset now available" width="150" height="151" class="alignright size-full wp-image-2059" /></a>The Online Social Network-dataset used in my <a href="http://toreopsahl.com/2009/05/15/thesis-structure-and-evolution-of-weighted-networks/">Ph.D. thesis<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> is now available on <a href="http://toreopsahl.com/datasets/">the Dataset-page<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>. This network has also been described in <a href="http://toreopsahl.com/2009/03/06/article-patterns-and-dynamics-of-users-behaviour-and-interaction-network-analysis-of-an-online-community/">Patterns and Dynamics of Users’ Behaviour and Interaction: Network Analysis of an Online Community<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> and used in <a href="http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/">Prominence and control: The weighted rich-club effect<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> and <a href="http://toreopsahl.com/2009/04/03/article-clustering-in-weighted-networks/">Clustering in weighted networks<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>. The network originate from an online social network among students at University of California, Irvine. The edgelist includes the users that sent or received at least one message during that period (1,899). A total number of 59,835 online messages were sent among these over 20,296 directed ties.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=2054&subd=thetore&ref=&feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>The Online Social Network-dataset used in my <a href="http://toreopsahl.com/2009/05/15/thesis-structure-and-evolution-of-weighted-networks/">Ph.D. thesis<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> is now available on <a href="http://toreopsahl.com/datasets/">the Dataset-page<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>. This network has also been described in <a href="http://toreopsahl.com/2009/03/06/article-patterns-and-dynamics-of-users-behaviour-and-interaction-network-analysis-of-an-online-community/">Patterns and Dynamics of Users’ Behaviour and Interaction: Network Analysis of an Online Community<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> and used in <a href="http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/">Prominence and control: The weighted rich-club effect<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> and <a href="http://toreopsahl.com/2009/04/03/article-clustering-in-weighted-networks/">Clustering in weighted networks<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>. The network originate from an online social network among students at University of California, Irvine. The edgelist includes the users that sent or received at least one message during that period (1,899). A total number of 59,835 online messages were sent among these over 20,296 directed ties.</p>
<p><em>Citation:</em> Opsahl, T., Panzarasa, P., 2009. Clustering in weighted networks. Social Networks 31 (2), 155-163, doi: 10.1016/j.socnet.2009.02.002</p>
<div id="attachment_2055" class="wp-caption aligncenter" style="width: 610px"><img src="http://thetore.files.wordpress.com/2009/11/af-page20-fixed.png?w=600&#038;h=604" alt="A sample page from the online social network" title="A sample page from the online social network" width="600" height="604" class="size-full wp-image-2055" /><p class="wp-caption-text">A sample page from the online social network</p></div>
<br />Posted in Network thoughts Tagged: arcs, communication, complex networks, directed networks, edges, graphs, Links, network, nodes, online, online communication, online social networks, social network analysis, social networking site, ties, valued networks, vertices, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/thetore.wordpress.com/2054/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/thetore.wordpress.com/2054/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/thetore.wordpress.com/2054/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/thetore.wordpress.com/2054/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/thetore.wordpress.com/2054/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/thetore.wordpress.com/2054/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/thetore.wordpress.com/2054/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/thetore.wordpress.com/2054/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/thetore.wordpress.com/2054/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/thetore.wordpress.com/2054/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/thetore.wordpress.com/2054/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/thetore.wordpress.com/2054/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/thetore.wordpress.com/2054/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/thetore.wordpress.com/2054/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=2054&subd=thetore&ref=&feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/11/10/online-social-network-dataset-now-available/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://thetore.files.wordpress.com/2009/11/af-page20-fixed.png" medium="image">
			<media:title type="html">A sample page from the online social network</media:title>
		</media:content>
	</item>
		<item>
		<title>Similarity between node degree and node strength</title>
		<link>http://toreopsahl.com/2009/10/16/similarity-between-node-degree-and-node-strength/</link>
		<comments>http://toreopsahl.com/2009/10/16/similarity-between-node-degree-and-node-strength/#comments</comments>
		<pubDate>Fri, 16 Oct 2009 12:57:38 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[weighted networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[local]]></category>
		<category><![CDATA[global]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[degree]]></category>
		<category><![CDATA[strength of nodes]]></category>
		<category><![CDATA[directed networks]]></category>
		<category><![CDATA[centrality]]></category>
		<category><![CDATA[reinforcement]]></category>
		<category><![CDATA[gregariousness]]></category>
		<category><![CDATA[popularity]]></category>
		<category><![CDATA[hubs]]></category>
		<category><![CDATA[social networking site]]></category>
		<category><![CDATA[online communication]]></category>
		<category><![CDATA[online social networks]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[graphs]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=1974</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2009/10/16/similarity-between-node-degree-and-node-strength"><img src="http://thetore.wordpress.com/files/2009/10/in_small.png" alt="Correlation between node degree and node strength" title="Correlation between node degree and node strength" width="125" height="106" class="alignright size-full wp-image-2002" /></a>This post explores the relationship between node degree and node strength in an online social network. In the online social network, heterogeneity in nodes’ average tie weight across different levels of degree had been <a href="http://toreopsahl.com/2009/03/06/article-patterns-and-dynamics-of-users-behaviour-and-interaction-network-analysis-of-an-online-community/">reported<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>. Although degree and average tie weight are significantly correlated, this post argues for the similarity of degree and node strength.  In particular, high pair-wise correlation between degree and strength is found. In addition, power-law exponents of degree distributions and strength distribution are reported. The exponents are strikingly similar, in fact, they are almost identical.<div align="right">Technical:&#160;3/10</div><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=1974&subd=thetore&ref=&feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>In the paper <a href="http://toreopsahl.com/2009/03/06/article-patterns-and-dynamics-of-users-behaviour-and-interaction-network-analysis-of-an-online-community/">Patterns and Dynamics of Users’ Behaviour and Interaction: Network Analysis of an Online Community<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>, we found that those individuals with many connections (i.e., high degree) sent on average more messages to each of their contacts than those with fewer connections:</p>
<p><em>&#8220;we measured users’ average out-strength (instrength) as the average number of messages sent to (received from) others (Opsahl, Colizza, Panzarasa,&amp;Ramasco, 2008). We expected hubs to be weakly connected to others, based on the conjecture that all users are homogeneously limited by the same constraints of resources and time. In this case, having more contacts should reduce the amount of resources and time spent on each of them (Burt, 1992). We were surprised to find a positive and significant (p&lt;0.001) Pearson’s pairwise correlation coefficient between average out-strength (in-strength) and out-degree (in-degree) of 0.28 (0.44). This signals that hubs spend more time and resources with each of their contacts than the less connected users.&quot; </em> (excerpt from page 919).</p>
<p>The heterogeneity in average tie weight for users with different levels of gregariousness might indicate that node degree and node strength are not correlated. <strong>This post aims to test this for the online social network used in the paper and compare degree and strength distributions.</strong></p>
<p>Given that this is a directed network, each analysis is conducted twice &#8211; once for outgoing ties and once for incoming ties. The simplest way to test the association between two variables is to calculate the <a href="http://en.wikipedia.org/wiki/Correlation" target="_blank">Pearson pair-wise correlation coefficient <img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>. This coefficient tests the linear relationship between two variables, and ranges from -1 to 1. If it is equal to 1, then there is perfect correlation between the two-variables, whereas if it is -1, the two variables are opposites of each other. A value of 0 is attained if there is no linear relationship between the two variables. For out-degree and out-strength, the coefficient is 0.90, and for in-degree and in-strength, the coefficient is 0.89. This indicates that degree and strength is highly correlated with each other (Cohen, 1988). </p>
<p>Since high correlation coefficients were found, it might be interesting to plot the relationships to ensure that extreme values are not distorting the coefficient. The relationships between the two types of degree and strength are: </p>
<p><img src="http://thetore.files.wordpress.com/2009/09/out.png?w=300&#038;h=296" alt="out-degree/strength" title="out-degree/strength" width="300" height="296" class="alignnone size-full wp-image-1978" /><img src="http://thetore.files.wordpress.com/2009/09/in.png?w=300&#038;h=295" alt="in-degree/strength" title="in-degree/strength" width="300" height="295" class="alignnone size-full wp-image-1977" /></p>
<p>As it is possible to see from the above plots, there are a number of nodes with extremely high values of degree and strength. However, there are clear trajectories at low values of degree and strength, which might indicate that the outliers are not distorting the correlation coefficients. The fact that there are nodes with extremely high values of degree is not surprising given that power-law degree distributions with exponents of 0.89 and 1.005 were found in the paper:</p>
<p><img src="http://thetore.files.wordpress.com/2009/09/distko.png?w=300&#038;h=296" alt="Out-degree distribution" title="Out-degree distribution" width="300" height="296" class="alignnone size-full wp-image-1981" /><img src="http://thetore.files.wordpress.com/2009/09/distki.png?w=300&#038;h=296" alt="In-degree distribution" title="In-degree distribution" width="300" height="296" class="alignnone size-full wp-image-1980" /></p>
<p>Given the similarity between degree and strength, it would be interesting to test whether the strength distributions also follow a power-law distribution, and if so, if the exponent is similar to the ones for the degree-distributions:</p>
<p><img src="http://thetore.files.wordpress.com/2009/09/distso.png?w=300&#038;h=296" alt="Out-strength distribution" title="Out-strength distribution" width="300" height="296" class="alignnone size-full wp-image-1983" /><img src="http://thetore.files.wordpress.com/2009/09/distsi.png?w=300&#038;h=296" alt="In-strength distribution" title="In-strength distribution" width="300" height="296" class="alignnone size-full wp-image-1982" /></p>
<p>The exponents of the strength distributions are 0.87 and 1.004. Although I expected some similarity between the degree distributions&#8217; exponents (0.89 and 1.005) and the strength distributions&#8217; exponents, the numerical similarity is striking.</p>
<p><strong>References</strong></p>
<p>Burt, R. S., 1992. Structural Holes: The Social Structure of Competition. Harvard University Press, Cambridge, MA.</p>
<p>Cohen, J., 1988. Statistical power analysis for the behavioral sciences (2nd edition). Hillsdale, NJ: Erlbaum.</p>
<p>Opsahl, T., Colizza, V., Panzarasa, P., Ramasco, J. J., 2008. <a href="http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/">Prominence and control: The weighted rich-club effect</a>. Physical Review Letters 101 (168702). <a href="http://arxiv.org/abs/0804.0417" target="_blank">arXiv:0804.0417</a>. </p>
<p>Panzarasa, P., Opsahl, T., Carley, K.M., 2009. <a href="http://toreopsahl.com/2009/03/06/article-patterns-and-dynamics-of-users-behaviour-and-interaction-network-analysis-of-an-online-community/">Patterns and dynamics of users’ behavior and interaction: Network analysis of an online community</a>. Journal of the American Society for Information Science and Technology 60 (5), 911-932, doi: 10.1002/asi.21015</p>
<p><strong>What to try it with your data?</strong></p>
<p>Below is the code to calculate the numbers and create the diagrams used in this post. If you also would like to calculate the power-law with exponential cut-off, then you should remove the # on line 41.</p>
<pre class="brush: plain;">
# Load tnet
library(tnet)

# Load network
data(OnlineSocialNetwork.n1899)

`script` &lt;-
function(net){
  output &lt;- list()
  # Calculate out/in-degree/strength
  k &lt;- cbind(degree_w(net), degree_w(net, type=&quot;in&quot;))
  dimnames(k)[[2]] &lt;- c(&quot;node&quot;,&quot;ko&quot;,&quot;so&quot;,&quot;node2&quot;,&quot;ki&quot;,&quot;si&quot;)
  if(sum(k[,&quot;node&quot;] == k[,&quot;node2&quot;])!=nrow(k))
    stop(&quot;Node ids does not match&quot;)
  k &lt;- k[,c(&quot;node&quot;,&quot;ko&quot;,&quot;ki&quot;,&quot;so&quot;,&quot;si&quot;)]
  output[[1]] &lt;- k

  # Get pair-wise correlation coefficients
  corro &lt;- cor.test(k[,&quot;ko&quot;], k[,&quot;so&quot;])
  corri &lt;- cor.test(k[,&quot;ki&quot;], k[,&quot;si&quot;])
  cat(paste(&quot;Pair-wise correlation between degree and strength:\n Out: &quot;, corro$estimate, &quot; (p-value: &quot;, corro$p.value, &quot;)\n In:  &quot;, corri$estimate, &quot; (p-value: &quot;, corri$p.value, &quot;)\n Note: If p-value equal 0, p-value is less than 2.2e-16\n&quot;, sep=&quot;&quot;))
  output[[2]] &lt;- corro
  output[[3]] &lt;- corri

  # Degree distributions
  cat(&quot;Degree distributions\n&quot;)
  looprange &lt;- c(&quot;ko&quot;,&quot;so&quot;,&quot;ki&quot;,&quot;si&quot;)
  for(j in 1:length(looprange)) {
    i &lt;- looprange[j]
    tmp &lt;- table(k[,i])
    tmp &lt;- tmp[which(rownames(tmp)!=&quot;0&quot;)]
    tmp &lt;- tmp/(sum(tmp))
    tmp &lt;- as.data.frame(cbind(k=as.numeric(rownames(tmp)), pk=tmp))
    plaw &lt;- nls(pk ~ C*k^(-t), data=tmp, start=list(C=1, t=1))
    plaweco &lt;- nls(pk ~ C*k^(-t)*exp(-k/K), data=tmp, start=list(C=1, t=1, K=30))
    cat(switch(i,
      &quot;ko&quot; = &quot; Out-degree&quot;,
      &quot;so&quot; = &quot; Out-strength&quot;,
      &quot;ki&quot; = &quot; In-degree&quot;,
      &quot;si&quot; = &quot; In-strength&quot;))
    cat(paste(&quot;\n  Powerlaw:  pk =&quot;, plaw$call$formula[3], &quot;\n   Coefficients:\n    Con =&quot;, coef(plaw)[&quot;C&quot;], &quot;\n    tau =&quot;, coef(plaw)[&quot;t&quot;]))
    # cat(paste(&quot;\n  Powerlaw with exponential cut-off: pk &quot;, plaweco$call$formula[3], &quot;\n   Coefficients:\n    Con =&quot;, coef(plaweco)[&quot;C&quot;], &quot;\n    tau =&quot;, coef(plaweco)[&quot;t&quot;], &quot;\n    cut =&quot;, coef(plaweco)[&quot;K&quot;]))
    cat(&quot;\n&quot;)
    output[[(length(output)+1)]] &lt;- tmp
    output[[(length(output)+1)]] &lt;- plaw
    output[[(length(output)+1)]] &lt;- plaweco
  }
  cat(&quot; Note: These regressions in the article were performed in Stata 9\n The value of the cut-off parameter varies slightly between R and Stata\n&quot;)
  return(output)
}
output &lt;- script(OnlineSocialNetwork.n1899.net)
k &lt;- output[[1]]
plot(k[,&quot;ko&quot;], k[,&quot;so&quot;], main=&quot;Outgoing ties&quot;, xlab=&quot;out-degree&quot;, ylab=&quot;out-strength&quot;)
plot(k[,&quot;ki&quot;], k[,&quot;si&quot;], main=&quot;Incoming ties&quot;, xlab=&quot;in-degree&quot;,  ylab=&quot;in-strength&quot; )

plot(output[[4]][,1], output[[4]][,2], main=&quot;Out-degree distribution&quot;, xlab=&quot;out-degree&quot;, ylab=&quot;p(out-degree)&quot;, log=&quot;xy&quot;)
lines(output[[4]][,1], fitted(output[[5]]))

plot(output[[7]][,1], output[[7]][,2], main=&quot;Out-strength distribution&quot;, xlab=&quot;out-strength&quot;, ylab=&quot;p(out-strength)&quot;, log=&quot;xy&quot;)
lines(output[[7]][,1], fitted(output[[8]]))

plot(output[[10]][,1], output[[10]][,2], main=&quot;In-degree distribution&quot;, xlab=&quot;in-degree&quot;, ylab=&quot;p(in-degree)&quot;, log=&quot;xy&quot;)
lines(output[[10]][,1], fitted(output[[11]]))

plot(output[[13]][,1], output[[13]][,2], main=&quot;In-strength distribution&quot;, xlab=&quot;in-strength&quot;, ylab=&quot;p(in-strength)&quot;, log=&quot;xy&quot;)
lines(output[[13]][,1], fitted(output[[14]]))
</pre>
<div class="knobinfo">I would like to acknowledge <a href="http://vcolizza.googlepages.com/" target="_blank">Vittoria Colizza <img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> in helping to develop the idea behind this post.</div>
<div class="knobcite">Please cite or link to this post if you use it.</div>
<br />Posted in Network thoughts Tagged: actors, arcs, centrality, complex networks, degree, directed networks, edges, global, graphs, gregariousness, hubs, Links, local, network, nodes, online communication, online social networks, popularity, reinforcement, social network analysis, social networking site, strength of nodes, strength of ties, ties, valued networks, vertices, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/thetore.wordpress.com/1974/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/thetore.wordpress.com/1974/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/thetore.wordpress.com/1974/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/thetore.wordpress.com/1974/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/thetore.wordpress.com/1974/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/thetore.wordpress.com/1974/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/thetore.wordpress.com/1974/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/thetore.wordpress.com/1974/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/thetore.wordpress.com/1974/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/thetore.wordpress.com/1974/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/thetore.wordpress.com/1974/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/thetore.wordpress.com/1974/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/thetore.wordpress.com/1974/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/thetore.wordpress.com/1974/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=1974&subd=thetore&ref=&feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/10/16/similarity-between-node-degree-and-node-strength/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://thetore.files.wordpress.com/2009/09/out.png" medium="image">
			<media:title type="html">out-degree/strength</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2009/09/in.png" medium="image">
			<media:title type="html">in-degree/strength</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2009/09/distko.png" medium="image">
			<media:title type="html">Out-degree distribution</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2009/09/distki.png" medium="image">
			<media:title type="html">In-degree distribution</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2009/09/distso.png" medium="image">
			<media:title type="html">Out-strength distribution</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2009/09/distsi.png" medium="image">
			<media:title type="html">In-strength distribution</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />
	</item>
		<item>
		<title>Clustering in two-mode networks</title>
		<link>http://toreopsahl.com/2009/09/11/clustering-in-two-mode-networks/</link>
		<comments>http://toreopsahl.com/2009/09/11/clustering-in-two-mode-networks/#comments</comments>
		<pubDate>Fri, 11 Sep 2009 00:00:29 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[local]]></category>
		<category><![CDATA[global]]></category>
		<category><![CDATA[clustering coefficient]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[degree]]></category>
		<category><![CDATA[undirected networks]]></category>
		<category><![CDATA[embeddedness]]></category>
		<category><![CDATA[reinforcement]]></category>
		<category><![CDATA[community]]></category>
		<category><![CDATA[two-mode networks]]></category>
		<category><![CDATA[affiliation networks]]></category>
		<category><![CDATA[bipartite networks]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[graphs]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=1382</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2009/09/11/clustering-in-two-mode-networks/"><img src="http://thetore.wordpress.com/files/2009/09/abstract_picture.png" alt="Clustering in two-mode networks" title="Clustering in two-mode networks" width="250" height="73" class="alignright size-full wp-image-1964" /></a>Many network dataset are by definition two-mode networks. Yet, few network measures can be directly applied to them. Therefore, two-mode networks are often projected onto one-mode networks by selecting a node set and linking two nodes if they were connected to common nodes in the two-mode network. This process has a major impact on the level of clustering in the network. If three or more nodes are connected to a common node in the two-mode network, the nodes form a fully-connected clique consisting of one or more triangles in the one-mode projection. Moreover, it produces a number of modeling issues. For example, even a one-mode projection of a random two-mode network with same number of nodes and ties will have a higher clustering coefficient than the randomly expected value. This post represents an attempt to overcome this issue by redefining the clustering coefficient so that it can be calculated directly on the two-mode structure. I illustrate the benefits of such an approach by applying it to two-mode networks from four different domains:  event attendance, scientific collaboration, interlocking directorates, and online communication.
<div align="right">Technical:&#160;6/10</div><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=1382&subd=thetore&ref=&feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<div class="knobinfo">A much updated version of this post is available as a preprint on the <a href="http://toreopsahl.com/publications/">Publications-page</a> with the title Triadic closure in two-mode networks: Redefining the global and local clustering coefficient. This includes, in addition to the global measure, a local clustering coefficient for two-mode networks.</div>
<div class="knobinfo">The measure proposed in this post was presented at the <a href="http://home.fsw.vu.nl/f.agneessens/2mode/overview.htm">Conference and Workshop on Two-Mode Social Analysis</a> in Amsterdam on October 1, 2009.</div>
<p><strong>Introduction</strong></p>
<p>Networks are representations of systems in which the elements (or nodes) are connected by ties (Wasserman and Faust, 1994). Most networks are defined as one-mode networks with one set of nodes that are similar to each other. However, many network dataset are in fact two-mode networks (also known as affiliation or bipartite networks). These networks are a particular kind of networks with two different sets of nodes (e.g., employees and projects), and ties are only established between nodes belonging to different sets (e.g., employees are linked to a project if they have worked on it). In most two-mode networks, the nodes of one node set are considered more responsible for tie creation (primary node set) than the other (secondary node set). For example, employees decide whether or not to collaborate on a project. </p>
<p>One of the first two-mode datasets to be analysed was the Davis Southern Club Women dataset (Davis et al., 1941), which recorded the attendance of a group of women (node set 1) to a series of events (set 2). A woman would be linked to an event if she attended it. Another category of two-mode networks that has become popular in recent years is scientific collaboration networks (e.g., Newman, 2001). In these networks, the two sets of nodes are scientists (node set 1) and papers (node set 2), and a tie is established if a scientist have authored a paper. </p>
<p>Two-mode networks are rarely analysed without transforming them. A likely cause of this is that most network measures are defined for one-mode network, and few of them have been redefined for two-mode networks. Therefore, researchers have constructed one-mode networks from the two-mode networks. This is often done using a method referred to as projection. This method operates by selecting one of the two node sets and linking two nodes from that set if they were connected to a common node of the other set. For example, interlocking directorates can be studied in two ways from a network perspective (Mizruchi, 1996; Zajaz, 1988). In this kind of networks, the two node sets are directors and corporate boards, and ties represent the affiliation of directors with boards. On the one hand, it is possible to study the interdependence among companies. Levine (1979) studied how companies were connected by having the same individual on both their boards of directors. By being connected, communication, knowledge transfer, and coordination are facilitated between companies (Zajaz, 1988). On the other, it is possible to study the interpersonal relationships among individuals that are directors on a set of boards of directors. In this case, individuals can be linked by being members of the same boards (Opsahl and Seierstad, 2009).</p>
<p>Projection of two-mode networks creates a number of modelling issues. First, each tie in a regular one-mode network is created separately; however, this is not the case in projected two-mode networks. For example, while a standard phone call creates a communication tie from one person to another, a director forms ties with all the other directors on a board when she or he joins that board. This has direct implications for models that utilize random networks to create benchmark values (e.g., Opsahl et al., 2008) and when comparing values found in an observed network with those found in corresponding random networks. Ties in random networks are assumed to be independent of each other. (Footnote 1) Although this is neither the case in observed one-mode nor two-mode networks, in projected two-mode networks, ties are not even formed separately. Thus, they are less comparable to the observed network than to genuine one-mode networks. Second, depending on the degree distribution of the non-selected node set, a projected one-mode network tends to have more and larger fully-connected cliques than non-projected one-mode networks (Wasserman and Faust, 1994). These are produced when a group of nodes are connected to the same other node in the two-mode network (e.g., all the directors on a single board are all connected). This feature impacts a number of network measures, and in particular, the clustering coefficient (for a review, see Opsahl and Panzarasa, 2009). This measure is the fraction of 2-paths (i.e., three nodes connected by two ties) that are part of a triangle. Given that the cliques contain a number of triangles, this measures might be biased. To exemplify the cliques, and the many triangles, produced when projecting a two-mode network, Figure 1 shows the main component of the interpersonal network among Norwegian directors (Opsahl and Seierstad, 2009).</p>
<div id="attachment_1948" class="wp-caption alignright" style="width: 610px"><a href="http://thetore.files.wordpress.com/2009/09/net_02.png"><img src="http://thetore.files.wordpress.com/2009/09/net_02.png?w=600&#038;h=535" alt="Interpersonal network among Norwegian directors" title="Board network" width="600" height="535" class="size-full wp-image-1948" /></a><p class="wp-caption-text">Figure 1: The network structure among directors (circles) who form part of the largest group of interconnected directors. Two directors are connected if they are members of the same board. The solid circles refer to women, whereas the hollow circles refer to men. </p></div>
<p>The rest of the paper is organised as follows. First, I will describe the binary clustering coefficient and its properties. Then, I will propose a new measure that detects triadic closure in two-mode networks while omitting triangles formed by three nodes solely being connected to the same other nodes (e.g., three scientists writing a paper). I will then suggest a method for integrating the generalization by Opsahl and Panzarasa (2009) of the clustering coefficient to weighted one-mode networks by offering a generalisation of the proposed coefficient to weighted two-mode networks. This will be followed by applications of the proposed coefficients to four two-mode networks from the domains of event attendance, scientific collaboration, interlocking directorates, and online communication. Finally, I will offer some concluding remarks.</p>
<p><strong>Clustering coefficient for one-mode networks</strong></p>
<p>A measure that has long received much attention in both theoretical and empirical research is concerned with the degree to which nodes tend to cluster together. Evidence suggests that in most real-world networks, and especially social networks, nodes tend to cluster into densely connected groups (Holland and Leinhardt, 1970; Opsahl and Panzarasa, 2009; Watts and Strogatz, 1998). Two main measures have been developed for testing this tendency: the local clustering coefficient (Watts and Strogatz, 1998) and the global clustering coefficient (Luce and Perry, 1949; Opsahl and Panzarasa, 2009). The global coefficient assesses the overall level of clustering in the network, whereas the local coefficient measures the density of ties among a node’s contacts. This paper is concerned with the former of these two measures. This coefficient is the fraction of triplets or 2-paths that are closed by the presence of a tie between the first and the third node. It is formally defined as:<br />
<img src='http://l.wordpress.com/latex.php?latex=C%3D%5Cfrac%7B3+%5Ctimes+%5Cmbox%7Btriangles%7D%7D%7B%5Cmbox%7Btriplets%7D%7D%3D%5Cfrac%7B%5Cmbox%7Bclosed+triplets%7D%7D%7B%5Cmbox%7Btriplets%7D%7D%3D%5Cfrac%7B%5Ctau_%7B%5CDelta%7D%7D%7B%5Ctau%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='C=\frac{3 \times \mbox{triangles}}{\mbox{triplets}}=\frac{\mbox{closed triplets}}{\mbox{triplets}}=\frac{\tau_{\Delta}}{\tau}' title='C=\frac{3 \times \mbox{triangles}}{\mbox{triplets}}=\frac{\mbox{closed triplets}}{\mbox{triplets}}=\frac{\tau_{\Delta}}{\tau}' class='latex' /><br />
where <img src='http://l.wordpress.com/latex.php?latex=%5Ctau&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\tau' title='\tau' class='latex' /> is the number of 2-paths, and <img src='http://l.wordpress.com/latex.php?latex=%5Ctau_%7B%5CDelta%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\tau_{\Delta}' title='\tau_{\Delta}' class='latex' /> is the number of these 2-paths that are closed by being part of a triangle. This coefficient varies between 0 and 1. It is equal to 0 if no triangles exist in the network, and equal to 1 if all 2-paths are closed. In a completely connected network, the coefficient would be 1 as all 2-paths would be closed. Moreover, in a classical random network, the expected value of the clustering coefficient is equal to the probability of a tie being formed (i.e., the density) as ties are independent of each other (Erdos and Rényi, 1959). </p>
<p><strong>Clustering coefficient for two-mode networks</strong></p>
<p>In two-mode networks, the denominator and numerator of the clustering coefficient can be redefined in terms of 4-paths and closed 4-paths, respectively. (Footnote 2) First, a 4-path is a path that starts at a node, goes via two nodes of the other node set and one node of the same node set and, and ends up at a node of the same node set. For example, in Figure 2a, a 4-path exists between node 1 and node 3 via node A, node 2, and node C. Second, a closed 4-path is a 4-path in which the two end nodes are connected to a common node (excluding the nodes that are already part of the path). For example, the mentioned 4-path between node 1 and node 3 is closed as these nodes are both connected to node B, which is not part of the path.</p>
<div id="attachment_1949" class="wp-caption alignright" style="width: 610px"><img src="http://thetore.files.wordpress.com/2009/09/net_sample_binary.png?w=600&#038;h=197" alt="Binary two-mode network and projection" title="net_sample_binary" width="600" height="197" class="size-full wp-image-1949" /><p class="wp-caption-text">Figure 2: (a) A two-mode network where the shape and colour of nodes represent the set to which a node belongs, and (b) the one-mode projection of the two-mode network in panel a. The round blue nodes are the primary nodes in this projection. The clustering coefficient of the two-mode network (panel a) is 0.3, while the clustering coefficient of the one-mode projection (panel b) is 0.4615. </p></div>
<p>All 4-paths in a two-mode network are 2-paths in a one-mode projection of the network; however, not all 2-paths in a one-mode projection are created from 4-paths. In fact, 2-paths can also be created due to multiple nodes being connected to the same node. The 2-paths created by the latter mechanism would be excluded when only considering 4-paths in the two-mode structure. This feature is illustrated in Figure 2. In the first panel (a), there are ten 4-paths, three of which are closed. (Footnote 3) These 4-paths represent ten 2-paths in the one-mode projection (panel b). (Footnote 4) However, in the one-mode projection (panel b), there are an additional three 2-paths. These are created among node 2, node 3, and node 5 as these nodes are all connected to node C in the two-mode network. </p>
<p>Formally, the proposed coefficient can be defined as:<br />
<img src='http://l.wordpress.com/latex.php?latex=C%5E%7B%2A%7D%3D%5Cfrac%7B%5Cmbox%7Bclosed+4paths%7D%7D%7B%5Cmbox%7B4paths%7D%7D%3D%5Cfrac%7B%5Ctau%5E%7B%2A%7D_%7B%5CDelta%7D%7D%7B%5Ctau%5E%7B%2A%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='C^{*}=\frac{\mbox{closed 4paths}}{\mbox{4paths}}=\frac{\tau^{*}_{\Delta}}{\tau^{*}}' title='C^{*}=\frac{\mbox{closed 4paths}}{\mbox{4paths}}=\frac{\tau^{*}_{\Delta}}{\tau^{*}}' class='latex' /><br />
where <img src='http://l.wordpress.com/latex.php?latex=%5Ctau%5E%7B%2A%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\tau^{*}' title='\tau^{*}' class='latex' /> is the number of 4-paths, and <img src='http://l.wordpress.com/latex.php?latex=%5Ctau%5E%7B%2A%7D_%7B%5CDelta%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\tau^{*}_{\Delta}' title='\tau^{*}_{\Delta}' class='latex' /> is the number of these 4-paths that are closed by being part of a 6-cycle (i.e., a loop of six ties with five nodes). </p>
<p>The proposed coefficient has a number of properties. First, it varies between 0 and 1 as the numerator and denominator are positive numbers, and the numerator is a subset of the denominator. Second, in a fully connected network, the coefficient is equal to 1 as all 4-paths are closed. Third, in an ensemble of random two-mode networks with a set number of nodes and ties, the average clustering coefficient is approximately <img src='http://l.wordpress.com/latex.php?latex=1-%281-d%5E2%29%5E%7B%28N_p-2%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1-(1-d^2)^{(N_p-2)}' title='1-(1-d^2)^{(N_p-2)}' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=d&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d' title='d' class='latex' /> is the density (i.e., <img src='http://l.wordpress.com/latex.php?latex=%5Cfrac%7BE%7D%7B%28N_i+%5Ctimes+N_p%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\frac{E}{(N_i \times N_p}' title='\frac{E}{(N_i \times N_p}' class='latex' /> for two-mode networks), <img src='http://l.wordpress.com/latex.php?latex=N_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N_i' title='N_i' class='latex' /> is the number of primary nodes, and <img src='http://l.wordpress.com/latex.php?latex=N_p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N_p' title='N_p' class='latex' /> is the number of secondary nodes). (Footnotes 5 and 6) </p>
<p><strong>Weighted networks</strong></p>
<p>The one-mode clustering coefficient has been extended to weighted networks by assigning a value to each triplet or 2-path, <img src='http://l.wordpress.com/latex.php?latex=%5Comega&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\omega' title='\omega' class='latex' /> (Opsahl and Panzarasa, 2009). This value is based on the weights of the two ties that compose the 2-path, e.g., the arithmetic mean, geometric mean, minimum, or maximum of the tie weights. The coefficient incorporates these values by being defined as the total value of closed 2-paths over the total value of all 2-paths. In addition to having the same parameters as the original coefficient, this coefficient produces the same outcome as the original one if all 2-paths have the same value (e.g., a binary network) or if the weights are randomly reshuffled in the network. </p>
<p>The coefficient proposed in this paper can be generalized to weighted two-mode networks, such as those created from online forums where the weights is the number of messages or characters posted to a topic. In a similar spirit as the generalization to weighted networks of the one-mode clustering coefficient, a 4-path-value could be defined. This valued should be constructed based on the four tie weights, and could be defined as the arithmetic mean, geometric mean, minimum, or maximum. For example, the 4-path from node 1 to node 4 (via nodes A, 2, and D) in Figure 3 could be assigned a value of 3, 2.45, 1, or 6, respectively. The clustering coefficients based on the four methods for the sample network shown in Figure 3 would be 0.4, 0.398, 0.385, and 0.383. The topology of the networks is identical to the binary two-mode network in Figure 2, which had a clustering coefficient of 0.3. The increase in the coefficient when tie weights are considered is a reflection of the fact that the closed 4-paths have relatively stronger ties than the open 4-paths. The various explanations given in Opsahl and Panzarasa (2009) for the differences between the methods for defining 2-paths values are also applicable in the case of 4-path values. Formally, the coefficient for weighted two-mode networks could be defined as:<br />
<img src='http://l.wordpress.com/latex.php?latex=C%5E%7B%2A%5Comega%7D%3D%5Cfrac%7B%5Cmbox%7Btotal+value+of+closed+4paths%7D%7D%7B%5Cmbox%7Btotal+value+of+4paths%7D%7D%3D%5Cfrac%7B%5Csum_%7B%5Ctau%5E%7B%2A%7D_%7B%5CDelta%7D%7D+%5Comega%7D%7B%5Csum_%7B%5Ctau%5E%7B%2A%7D%7D+%5Comega%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='C^{*\omega}=\frac{\mbox{total value of closed 4paths}}{\mbox{total value of 4paths}}=\frac{\sum_{\tau^{*}_{\Delta}} \omega}{\sum_{\tau^{*}} \omega}' title='C^{*\omega}=\frac{\mbox{total value of closed 4paths}}{\mbox{total value of 4paths}}=\frac{\sum_{\tau^{*}_{\Delta}} \omega}{\sum_{\tau^{*}} \omega}' class='latex' /></p>
<div id="attachment_1950" class="wp-caption aligncenter" style="width: 310px"><img src="http://thetore.files.wordpress.com/2009/09/net_sample_weighted.png?w=300&#038;h=161" alt="Weighted two-mode network" title="net_sample_weighted" width="300" height="161" class="size-medium wp-image-1950" /><p class="wp-caption-text">Figure 3: A weighted two-mode network.</p></div>
<p>The generalized coefficient has a number of properties. In addition to having the same properties as the binary two-mode clustering coefficient, the generalized coefficient is equal to the binary one when all ties have the same value (i.e., binary). Moreover, the outcome of the generalised coefficient is approximately equal to the binary one if tie weights are randomly reshuffled in the network.</p>
<p><strong>Empirical test</strong></p>
<p>To illustrate the proposed coefficient, I will apply it to Davis Southern Club Women dataset (Davis et al., 1941), Newman’s (2001) scientific collaboration network, the interpersonal network among directors on Norwegian public limited company boards (Opsahl and Seierstad, 2009), and data collected from an online forum. First, Davis Southern Club Women was collected in the 1930s, and contains the attendance of 18 women to 14 events. This network is relatively dense as 91 percent of the possible ties are present in the one-mode projection, and a slightly higher clustering coefficient is found (0.9284). Thus, one would assume that a triadic closure effect drives tie formation. However, the observed two-mode clustering coefficient (0.77) is less than the randomly expected value (0.80). This suggests that a triadic closure effect is not at play, in fact, it suggests an opposite reverse effect. </p>
<p>Second, Newman’s (2001) collaboration network is based upon 22,016 co-authored papers published on the arXiv e-repository website between 1995 and 1999. In total, these papers have 16,726 authors listed. The one-mode projection of this network using the authors as the primary node set has often been used as an empirical example for a variety of network measures (e.g., Opsahl et al., 2008). In the projected one-mode network, a clustering coefficient of 0.3596 is obtained, while the randomly expected value is 0.0003. Due to the great difference between the observed coefficient and the expected one, it has been argued that there is a strong tendency towards clustering (Newman, 2001). However, this number includes many of the triangles that are form by construction as the average number of authors per paper is 2.66. (Footnote 7) The proposed clustering coefficient for two-mode networks can be applied in an effort to exclude these triangles. This coefficient is 0.2769, while the randomly expected value is 0.0006. Albeit a weaker effect than the one found in the projected one-mode network, this suggests that there is an actual triadic closure effect in this network.</p>
<p>Third, the interlocking directorate dataset is composed of the 367 Norwegian public limited company boards as of August 1, 2009 (Opsahl and Seierstad, 2009). Based on this dataset, it is possible to construct an interpersonal network among the 1,413 directors, and link two individuals if they sit on the same board. In this one-mode projection, the observed and randomly expected clustering coefficients are 0.6477 and 0.0035, respectively, giving a ratio between them of 185. However, in the two-mode network, the observed and randomly expected clustering coefficients are 0.0119 and 0.0041, respectively. As the ration between these two coefficients is only 2.93, a much weaker triadic closure effect in the network is suggested than assumed from the projected one-mode network.</p>
<p>Fourth, the online forum data was collected from an online community of students at University of California, Irvine, in 2004. This community consisted of 2,995 students who could create any group that they wanted, and post messages to any group that they were a member of. In total, 889 students posted 33,720 messages to 552 groups. On average, each student posted 4.76 messages or 480.19 characters to each group they actively participated in. A two-mode network can be constructed from this dataset by linking a student to a group if she or he posted to it. Unlike the other datasets, it is possible to create both binary and weighted to-mode networks from this one. As weights can be assigned to each tie based on either the number of messages or characters posted, I have created a binary and two weighted two-mode networks from this dataset. The various clustering coefficients obtained from these networks are listed in Table 1. (Footnote 8)</p>
<table class="tore" cellspacing="1" cellpadding="3">
<tr align="center">
<th align="left">Network</th>
<th><img src='http://l.wordpress.com/latex.php?latex=N_i&#038;bg=eeeeee&#038;fg=000000&#038;s=0' alt='N_i' title='N_i' class='latex' /></th>
<th><img src='http://l.wordpress.com/latex.php?latex=N_p&#038;bg=eeeeee&#038;fg=000000&#038;s=0' alt='N_p' title='N_p' class='latex' /></th>
<th>2M <img src='http://l.wordpress.com/latex.php?latex=E&#038;bg=eeeeee&#038;fg=000000&#038;s=0' alt='E' title='E' class='latex' /></th>
<th>1M <img src='http://l.wordpress.com/latex.php?latex=E&#038;bg=eeeeee&#038;fg=000000&#038;s=0' alt='E' title='E' class='latex' /></th>
<th>1M CC</th>
<th>Random 1M CC</th>
<th>2M CC</th>
<th>Random 2M CC</th>
</tr>
<tr>
<td>Davis Southern Women Club</td>
<td align="right">18</td>
<td align="right">14</td>
<td align="right">89</td>
<td align="right">139</td>
<td align="center">0.9284</td>
<td align="center">0.9085</td>
<td align="center">0.7719</td>
<td align="center">0.7978</td>
</tr>
<tr>
<td>Scientific collaboration network</td>
<td align="right">16,726</td>
<td align="right">22,016</td>
<td align="right">58,595</td>
<td align="right">47,594</td>
<td align="center">0.3596</td>
<td align="center">0.0003</td>
<td align="center">0.2769</td>
<td align="center">0.0006</td>
</tr>
<tr>
<td>Interpersonal network among directors</td>
<td align="right">1,413</td>
<td align="right">367</td>
<td align="right">1,734</td>
<td align="right">3,414</td>
<td align="center">0.6477</td>
<td align="center">0.0035</td>
<td align="center">0.0119</td>
<td align="center">0.0041</td>
</tr>
<tr>
<td>Online forum (binary)</td>
<td align="right">899</td>
<td align="right">522</td>
<td align="right">7,089</td>
<td align="right">16,860</td>
<td align="center">0.5049</td>
<td align="center">0.1768</td>
<td align="center">X</td>
<td align="center">0.1119</td>
</tr>
<tr>
<td>Online forum (weighted, messages)</td>
<td align="right">…</td>
<td align="right">…</td>
<td align="right">…</td>
<td align="right">…</td>
<td align="center">…</td>
<td align="center">…</td>
<td align="center">X</td>
<td align="center">0.1119</td>
</tr>
<tr>
<td>Online forum (weighted, characters)</td>
<td align="right">…</td>
<td align="right">…</td>
<td align="right">…</td>
<td align="right">…</td>
<td align="center">…</td>
<td align="center">…</td>
<td align="center">X</td>
<td align="center">0.1119</td>
</tr>
</table>
<p>Table 1: The various clustering coefficients (CC) obtained on the four empirical datasets. 2M refers to two-mode networks, and 1M to one-mode ones. The one-mode clustering coefficients are obtained on binary projections.</p>
<p><strong>Concluding remarks</strong></p>
<p>Two-mode networks are rarely analysed without projecting them onto one-mode networks as there are few methods for this purpose. However, by projecting two-mode networks, certain assumptions in the one-mode methods might be violated, such as the ability of each tie to be formed separately. Moreover, measures based on triangles or ties among nodes’ contacts might be biased. This is due to the fact that projected two-mode networks generally contain more and larger fully-connected cliques than regular one-mode networks. In particular, depending on the degree distribution of the non-projected nodes, the measure that assesses the overall level of clustering in a one-mode network, the clustering coefficient, could be biased. More specifically, if the non-projected nodes have a degree greater than 2, triangles will be automatically formed in the one-mode projection, which will increase the coefficient. Thus, there is a need to redefine one-mode measures for two-mode networks, and in particular, the ones that are directly affected by the increase in triangles. </p>
<p>This paper proposed such a redefinition for the clustering coefficient. It subtracts the level of triadic closure caused by groups of nodes that are connected to common nodes in the two-mode structure. More specifically, it excludes triplets that are formed due to this effect, and only counts the number of triplets that are formed based on independent dyadic interactions. </p>
<p>The proposed coefficient is not without its limitations. A major one is that one of the node sets must be considered to be responsible for tie generation, and designated as the primary node set. Nodes from this set would be first and last nodes of the 4-paths. Generally, the selection of the primary node set depends on the perspective of analysis. If the network is projected onto a one-mode network, the chosen node set is generally thought of as the primary one. In certain empirical cases, the selection of these nodes is obvious. For example, in scientific collaboration networks where researchers come together and co-author papers, the researchers are often assumed to be the primary nodes (Newman, 2001). However, in the case of interlocking directorates, it is not obvious whether directors or boards are the primary node set (Levine, 1979; Opsahl and Seierstad, 2009). This is due to the fact that tie formation is a mutual process where the directors must (1) be invited to join the board, and (2) accept the invitation. </p>
<p>The proposed coefficient represents only a first step in the process of redefining one-mode network measures for two-mode networks. Although the clustering coefficient is particularly affected by the projection procedure, there are many other measures that are also affected, such as the local clustering coefficient (Watts and Strogatz, 1998) and the array of structural holes measures (Burt, 1992). Redefining these measures for two-mode networks could increase their accuracy, and might give rise to novel insights.</p>
<p><strong>Footnotes</strong></p>
<p>1: In random networks where the present of each tie is given by a sought after density (i.e.,  <img src='http://l.wordpress.com/latex.php?latex=%5Cfrac%7BE%7D%7B%28N%5Ctimes%28N-1%29%2F2%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\frac{E}{(N\times(N-1)/2)}' title='\frac{E}{(N\times(N-1)/2)}' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=E&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='E' title='E' class='latex' /> is the number of ties, and <img src='http://l.wordpress.com/latex.php?latex=N&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N' title='N' class='latex' /> is the number of nodes), ties are independent of each other. However, ties are dependent upon each other in random networks with a set number of nodes and ties. For example, in an undirected random network, the likelihood of the first tie to be present is equal to the density. However, the likelihood of the second tie being present is dependent upon whether the first tie is present. Thus, it is equal to <img src='http://l.wordpress.com/latex.php?latex=%5Cfrac%7BE-x_1%7D%7B%28%28N%5Ctimes%28N-1%29%2F2%29-1%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\frac{E-x_1}{((N\times(N-1)/2)-1)}' title='\frac{E-x_1}{((N\times(N-1)/2)-1)}' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=x_1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x_1' title='x_1' class='latex' /> is equal to 1 if the first was created and 0 otherwise. In fact, the presence of the last possible tie in a random network is given by the presence or lack thereof of the other ties in the network. Nevertheless, in large and sparse random networks, this dependence plays a negligible role as the number of actual ties is marginal relative to the number of possible ties.</p>
<p>2: I have knowingly chosen not to define the measure as the number of 4-cycles over the number of 3-paths. This is due to the fact that a 3-path would simply be, in the case of collaboration networks, the number of projects that a person’s collaborators are affiliated with. A 4-cycle would be, in that case, two individuals collaborating twice. Although this could be viewed as a form of clustering, it would not be triadic closure. In fact, it is a measure of reinforcement between two individuals rather than clustering of a group of individuals. </p>
<p>3: The 4-paths are 1-A-2-C-3 (closed); 1-A-2-C-5; 1-A-2-D-4; 1-B-3-C-2 (closed); 1-B-3-C-5; 2-A-1-B-3 (closed); 2-C-5-E-6; 3-C-2-D-4; 3-C-5-E-6; 4-D-2-C-5.</p>
<p>4: These 2-paths are: 1-2-3 (closed); 1-2-4; 1-2-5; 1-3-2 (closed); 1-3-5; 2-1-3 (closed); 2-5-6; 3-2-4; 3-5-6; 4-2-5.</p>
<p>5: The components of this express are the following. The density of the two-mode network, <img src='http://l.wordpress.com/latex.php?latex=d&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d' title='d' class='latex' />, is the likelihood that a tie is present. The square of <img src='http://l.wordpress.com/latex.php?latex=d&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d' title='d' class='latex' /> is the likelihood that (1) a tie is present from the first node on a 4-path to a node, and (2) a tie is present from that node to the last node on the 4-path. The inner brackets, <img src='http://l.wordpress.com/latex.php?latex=%281-d%5E2%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(1-d^2)' title='(1-d^2)' class='latex' />, is the likelihood that these two ties are not present. The power of the inner bracket, <img src='http://l.wordpress.com/latex.php?latex=N_p-2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N_p-2' title='N_p-2' class='latex' />, ensures that <img src='http://l.wordpress.com/latex.php?latex=%281-d%5E2%29%5E%7B%28N_p-2%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(1-d^2)^{(N_p-2)}' title='(1-d^2)^{(N_p-2)}' class='latex' />, is the likelihood that no node connect the first and last node on the 4-path. As the two nodes that are on the 4-path cannot close it, 2 is subtracted from <img src='http://l.wordpress.com/latex.php?latex=N_p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N_p' title='N_p' class='latex' />. By subtracting the inner bracket and its power from 1, the outcome is the likelihood that a 4-path is closed. This is the randomly expected value of Eq. 2.</p>
<p>6: Simulations conducted on ensembles of random networks with different number of nodes and ties produced outcomes that were not statistically different from the outcome attained with the above expression. Each ensemble contained 1,000 random networks.</p>
<p>7: To highlight the intrinsic level of clustering in this network, I randomised the two-mode structure while keeping the degree distributions (i.e., randomly assign the ties in the two-mode network while keeping each author’s number of co-authored papers, and each paper’s number of authors) before projecting it onto a one-mode network and calculating the clustering coefficient (Eq. 1). The average clustering coefficient across 100 random networks was 0.1235764, which is over 350 times larger than the coefficient found in a random one-mode network with the same number of nodes and ties.</p>
<p>8: Unless otherwise stated, the weighted clustering coefficients are based on the arithmetic mean method for defining the 2-path and 4-path values.</p>
<p><strong>References</strong></p>
<p>Burt, R.S., 1992. Structural Holes. Harvard University Press, Cambridge, MA.<br />
Davis, A., Gardner, B. B., Gardner, M. R., 1941. Deep South. University of Chicago Press, Chicago, IL.<br />
Erdo s, P., Rényi, A., 1959. On random graphs. Publicationes Mathematicae 6, 290-297.<br />
Holland, P.W., Leinhardt, S., 1970. A method for detecting structure in sociometric data. American Journal of Sociology 76, 492-513.<br />
Levine, J., 1979. Joint-space analysis of “pick-any” data: Analysis of choices from an unconstrained set of alternatives. Psychometrika 44(1), 85-92.<br />
Luce, R.D., Perry, A.D., 1949. A method of matrix analysis of group structure. Psychometrika 14 (1), 95-116.<br />
Mizruchi, M. S., 1996. What Do Interlocks Do? An Analysis, Critique, and Assessment of Research on Interlocking Directorates. Annual Review of Sociology 22, 271-298.<br />
Newman, M. E. J., 2001. Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality. Physical Review E 64, 016132.<br />
Opsahl, T., Colizza, V., Panzarasa, P., Ramasco, J.J., 2008. Prominence and control: The weighted rich-club effect. Physical Review Letters 101, 168702.<br />
Opsahl, T., Panzarasa, P., 2009. Clustering in weighted networks. Social networks 31, 155-163.<br />
Opsahl, T., Seierstad, C., 2009. For the few, not the many: The effects of affirmative action on presence, prominence, and social capital of female directors in Norway. Unpublished manuscript available on http://toreopsahl.com/publications/<br />
Wasserman, S., Faust, K., 1994. Social Network Analysis: Methods and Applications. Cambridge University Press, New York, NY.<br />
Watts, D.J., Strogatz, S.H., 1998. Collective dynamics of small-world networks. Nature 393, 440-442.<br />
Zajac, E., 1988. Interlocking directorates as an interorganizational strategy: A test of critical assumptions. Academy of Management Journal 31(2), 428-438.</p>
<p><strong>What to try it with your data</strong></p>
<pre class="brush: plain;">
# Load tnet
library(tnet)

# This is the code for the binary two-mode network above
net &lt;- cbind(
i=c(1,1,2,2,2,3,3,4,5,5,6),
p=c(1,2,1,3,4,2,3,4,3,5,5))

# Calculate measure
clustering_tm(net)

# This is the code for the weighted two-mode network above
net &lt;- cbind(
i=c(1,1,2,2,2,3,3,4,5,5,6),
p=c(1,2,1,3,4,2,3,4,3,5,5),
w=c(3,5,6,1,2,6,2,1,3,1,2))

# Calculate measure
clustering_tm(net)
</pre>
<div class="knobcite">Please cite or link to this post if you use it.</div>
<br />Posted in Network thoughts Tagged: actors, affiliation networks, arcs, bipartite networks, clustering coefficient, community, complex networks, degree, edges, embeddedness, global, graphs, Links, local, network, nodes, reinforcement, social network analysis, strength of ties, ties, two-mode networks, undirected networks, valued networks, vertices <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/thetore.wordpress.com/1382/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/thetore.wordpress.com/1382/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/thetore.wordpress.com/1382/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/thetore.wordpress.com/1382/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/thetore.wordpress.com/1382/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/thetore.wordpress.com/1382/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/thetore.wordpress.com/1382/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/thetore.wordpress.com/1382/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/thetore.wordpress.com/1382/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/thetore.wordpress.com/1382/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/thetore.wordpress.com/1382/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/thetore.wordpress.com/1382/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/thetore.wordpress.com/1382/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/thetore.wordpress.com/1382/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=1382&subd=thetore&ref=&feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/09/11/clustering-in-two-mode-networks/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2009/09/net_02.png" medium="image">
			<media:title type="html">Board network</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2009/09/net_sample_binary.png" medium="image">
			<media:title type="html">net_sample_binary</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2009/09/net_sample_weighted.png?w=300" medium="image">
			<media:title type="html">net_sample_weighted</media:title>
		</media:content>
	</item>
		<item>
		<title>tnet: Software for Analysing Weighted Networks</title>
		<link>http://toreopsahl.com/2009/06/12/tnet-software-for-analysing-weighted-networks/</link>
		<comments>http://toreopsahl.com/2009/06/12/tnet-software-for-analysing-weighted-networks/#comments</comments>
		<pubDate>Fri, 12 Jun 2009 00:00:54 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[weighted-richclub]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[weighted networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[local]]></category>
		<category><![CDATA[global]]></category>
		<category><![CDATA[clustering coefficient]]></category>
		<category><![CDATA[shortest distance]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[degree]]></category>
		<category><![CDATA[strength of nodes]]></category>
		<category><![CDATA[directed networks]]></category>
		<category><![CDATA[undirected networks]]></category>
		<category><![CDATA[embeddedness]]></category>
		<category><![CDATA[betweenness]]></category>
		<category><![CDATA[centrality]]></category>
		<category><![CDATA[shortest path]]></category>
		<category><![CDATA[closeness]]></category>
		<category><![CDATA[reciprocation]]></category>
		<category><![CDATA[reinforcement]]></category>
		<category><![CDATA[hubs]]></category>
		<category><![CDATA[software]]></category>
		<category><![CDATA[r-package]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[graphs]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=1105</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2009/06/12/tnet-software-for-analysing-weighted-networks/"><img src="http://thetore.wordpress.com/files/2008/12/social-network_weighted.png?w=300" alt="Weighted Social Network" title="Weighted Social Network" width="300" height="136" class="alignright size-medium wp-image-144" /></a><em>tnet</em> is a package written in <em>R</em> that can calculate weighted social network measures. Almost all of the ideas posted on this blog are related to weighted networks as, I believe, taking into consideration tie weights enables us to uncover and study interesting network properties. Not only are few social network measures applicable to weighted networks, but there is also a lack of software programmes that can analyse this type of networks. In fact, there are no open-source programmes. This hinders the use and development of weighted measures. <em>tnet</em> represents a first step towards creating such a programme. Through this platform, weighted network measures can easily be applied, and new measures easily implemented and distributed.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=1105&subd=thetore&ref=&feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<div class="knobinfo">At the Sunbelt conference 2010, I will hold a workshop on using <em>tnet</em>.</div>
<p><em>tnet</em> is a package written in <a href="http://www.r-project.org/" target="_blank"><em>R</em><img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> that currently can calculate weighted social network measures, from version 0.1.0 analyse two-mode networks, and from version 0.2.0 detect underlying principles that guide tie formation in datasets with time-stamped ties. It forms part of a wider effort to analyse richer networks datasets without transforming the networks into simple static binary undirected one-mode ones. This post details <em>tnet</em>&#8216;s capabilities related to weighted networks. </p>
<p><strong>Motivation</strong></p>
<p>Almost all of the ideas posted on this blog are related to weighted networks as, I believe, taking into consideration tie weights enable us to uncover and study interesting network properties. Not only are few network measures applicable to weighted networks, but there is also a lack of software programmes that can analyse this type of networks. To the best of my knowledge, there are no programmes that can both deal with weighted networks and allow users to create their own functions. On the one hand, programmes like <a href="http://www.analytictech.com/" target="_blank"><em>UCINET</em><img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> and <a href="http://pajek.imfm.si/" target="_blank"><em>Pajek</em><img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> have a small set of functions for weighted networks, but they do not allow users to programme additional functions (Batagelj and Mrvar, 2007; Borgatti et al., 2002). Therefore, researchers proposing new measures must create stand-alone programmes to deal with a single aspect of weighted networks (e.g., Brandes, 2001; Newman, 2001; <a href="http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/">Opsahl et al., 2008<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>; <a href="http://toreopsahl.com/2009/04/03/article-clustering-in-weighted-networks/">Opsahl and Panzarasa, 2009<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>). On the other, a number of packages for analysing networks has been created within the open-source statistical programme <a href="http://www.r-project.org/" target="_blank"><em>R</em><img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>, notably the <a href="http://erzuli.ss.uci.edu/R.stuff/" target="_blank"><em>sna</em><img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> and <a href="http://csde.washington.edu/statnet/" target="_blank"><em>statnet</em><img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>-packages (Butts, 2006; Handcock et al., 2003). These packages allow researchers to create additional functions on top of existing ones. This ability reduces the time spent on programming greatly, and let researchers focus on the contribution to the literature instead. For example, if someone has already written a function for identifying the shortest paths in a network, a researcher that would like to extend this measure can simply work on this code without programming the function from scratch. However, the <a href="http://erzuli.ss.uci.edu/R.stuff/" target="_blank"><em>sna</em><img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> and <a href="http://csde.washington.edu/statnet/" target="_blank"><em>statnet</em><img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>-packages rely on the basic <a href="http://erzuli.ss.uci.edu/R.stuff/" target="_blank"><em>network</em><img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>-package for data structures to represent networks (Butts et al., 2008). This basic package does not have a data class for weighted networks. Therefore, to ease the development of new functions for weighted networks, a new platform is needed. <em>tnet</em> represent an attempt to create such a platform. Although it is a user-written package in <a href="http://www.r-project.org/" target="_blank"><em>R</em><img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> similar to the <a href="http://erzuli.ss.uci.edu/R.stuff/" target="_blank"><em>sna</em><img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> and <a href="http://csde.washington.edu/statnet/" target="_blank"><em>statnet</em><img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>-packages, it does not rely on the <a href="http://erzuli.ss.uci.edu/R.stuff/" target="_blank"><em>network</em><img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>-package. It utilises its own data structures, one of which can handle weighted networks. </p>
<p><strong>Data Structures</strong></p>
<p>Since most networks are sparse (i.e., the number of ties (<em>A</em>) is much lower than the squared number of nodes (<img src='http://l.wordpress.com/latex.php?latex=N%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=-2' alt='N^2' title='N^2' class='latex' />), I opted for an edgelist format instead of a matrix one. A binary edgelist consists of two columns that represent the pairs of nodes that are tied together in a network (e.g., the edgelist1-format in <a href="http://www.analytictech.com/networks/dataentry.htm" target="_blank"><em>UCINET&#8217;s dl files</em><img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>; Borgatti et al., 2002). When a directed network is represented, the first column represents the nodes that create the ties, whereas the second column represents the target nodes. Thus, an edgelist is an <img src='http://l.wordpress.com/latex.php?latex=A+%5Ctimes+2&#038;bg=ffffff&#038;fg=000000&#038;s=-2' alt='A \times 2' title='A \times 2' class='latex' /> table. This type of list has been extended to cover weighted networks by adding a third column representing the weight of the ties (an <img src='http://l.wordpress.com/latex.php?latex=A+%5Ctimes+3&#038;bg=ffffff&#038;fg=000000&#038;s=-2' alt='A \times 3' title='A \times 3' class='latex' /> table; Borgatti et al., 2002). While the matrix format records the weight of all possible ties (a non-established tie would get a weight of 0), this format records the sender, receiver, and weight of established ties only. Thus, the space needed to store a network is proportional <img src='http://l.wordpress.com/latex.php?latex=3+%5Ctimes+A&#038;bg=ffffff&#038;fg=000000&#038;s=-2' alt='3 \times A' title='3 \times A' class='latex' /> instead of <img src='http://l.wordpress.com/latex.php?latex=N%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=-2' alt='N^2' title='N^2' class='latex' />. The main advantage of this format is that it can scale to networks with many nodes as it is the number of ties, not nodes, that determine the size of the data object. Although many programmes can read edgelists, most network analysis programmes rely on an internal matrix representation, e.g. <a href="http://www.analytictech.com/" target="_blank"><em>UCINET</em><img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> and the <a href="http://erzuli.ss.uci.edu/R.stuff/" target="_blank"><em>network</em><img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>-package (Borgatti et al., 2002). Conversely, <a href="http://pajek.imfm.si/" target="_blank"><em>Pajek</em><img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>, which was designed to analyse large-scale sparse networks, specifically uses an internal edgelist representation (Batagelj and Mrvar, 2007). Thus, <em>tnet</em> can efficiently be applied to large-scale sparse networks.</p>
<p>In an effort to stay consistent with existing data structures, this three column table is also the structure used by <em>tnet</em>. The object class of an edgelist in <a href="http://www.r-project.org/" target="_blank"><em>R</em><img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> should be <code>data.frame</code> class. This class allows the different columns of a table to be of different classes, such as integer and numeric (i.e., real numbers, which takes more space than integers). Thus, the <code>data.frame</code> class is more efficient at storing data than the <code>matrix</code> class, which requires all columns to be numeric. The first two columns of the edgelist are assumed integers (i.e., the identification number of the node creating the tie and the identification number of the node receiving the tie, respectively). The third column can be integers or numeric that represents the weights attached to the ties. </p>
<p>To illustrate the edgelist structure, the figure below shows two networks with two ties each. The ties in the first network are directed, whereas in the second one they are undirected.<br />
<div id="attachment_1702" class="wp-caption aligncenter" style="width: 610px"><a href="http://thetore.files.wordpress.com/2009/06/thesis-fig16.gif"><img src="http://thetore.files.wordpress.com/2009/06/thesis-fig16_300.gif?w=300&#038;h=78" alt="Figure 16" title="Figure 16" width="300" height="78" class="size-full wp-image-1702" /></a><p class="wp-caption-text">Example of a directed (a) and an undirected (b) network with weighted ties. This figure is based on Figure&nbsp;16 in my thesis.</p></div></p>
<p>The directed network in the above figure should be represented by using the following table:</p>
<pre>1   2   4
1   3   2</pre>
<p>A network is deemed as undirected if all ties are included twice&nbsp;&#8211;&nbsp;one in each direction. The undirected network in the above figure should be represented by the following table:</p>
<pre>1   2   4
2   1   4
1   3   2
3   1   2</pre>
<p>There are a number of functions that help users to convert data in other formats into the weighted edgelist format. For example, if a dataset is undirected, but there is only one entry for each tie in the edgelist, the <code>symmetrise</code>-function adds a second entry of the edge with the identification numbers of the creator and target nodes reversed. Moreover, if a dataset is similar to an edgelist, but with only two columns (representing the identification numbers of the creator and target nodes) and multiple entries of the same tie refer to the weight of that tie (e.g., if a tie has a weight of 3, it is included three times), then the <code>shrink_to_weighted_network</code>-function allows the users to convert the edgelist into the correct format. To allow for a comparison between weighted and binary network measures, the <code>dichotomise</code>-function creates a binary network from a weighted one. It does so by removing the ties in a weighted edgelist that fall below a certain cut-off and sets the weight to 1 for the remaining ones. </p>
<p><strong>Implemented Network Measures</strong></p>
<p>The main measures that are implemented in <em>tnet</em> are generalised versions of the global clustering coefficient or transitivity (<a href="http://toreopsahl.com/2009/04/03/article-clustering-in-weighted-networks/">Opsahl and Panzarasa, 2009<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>), local clustering coefficient (Barrat et al., 2004), and centrality measures (degree, closeness, and betweenness; Barrat et al., 2004; Brandes, 2001; Newman, 2001) as well as the weighted rich-club effect framework (<a href="http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/">Opsahl et al., 2008<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>).</p>
<p>First, in <a href="http://toreopsahl.com/2009/04/03/article-clustering-in-weighted-networks/">Opsahl and Panzarasa (2009)<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>, we generalised the global clustering coefficient or transitivity, which measures the fraction of triplets that are part of triangles (or closed), to weighted networks. We generalised this measure by assigning a value to the triplets, and then taking the ratio between the total value of closed triplets and the total value of all triplets. This measure has been implemented in the function <code>clustering_w</code>. </p>
<p>Second, the local clustering coefficient has also been generalised to weighted networks (Barrat et al., 2004). This coefficient measures the density of nodes&#8217; ego networks by taking the ratio between the number of ties that exist among a node&#8217;s contacts over the total possible number. The number of possible ties is equal to the number of triplets where the focal node is the middle node. Barrat et al. (2004) generalised the measure by assigning a value to each triplet, which was defined as the mean tie weight of the two ties that make up a triplet, and then took the ratio between the total value of closed triplets and the total value of all triplet values. In a similar spirit as the global clustering coefficient, I have proposed to use <a href="http://toreopsahl.com/2009/01/23/weighted-local-clustering-coefficient/">three additional methods for defining triplet values<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>. This coefficient with all the four methods for defining triplet values is implemented in the function <code>clustering_w_barrat</code>. </p>
<p>Third, generalisations of Freeman&#8217;s (198) centrality measures, namely degree, closeness and betweenness, have been implemented as <code>degree_w</code>, <code>closeness_w</code>, and <code>betweenness_w</code>. The generalisations of <a href="http://toreopsahl.com/2008/11/28/network-weighted-network/">degree<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> and <a href="http://toreopsahl.com/2009/02/20/betweenness-in-weighted-networks/">betweenness<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> have previously been highlighted in posts on this blog, but closeness has yet to be mentioned. The original measure was formalised as the inverse distance to all other reachable nodes in the network. Since it relies on the calculation of shortest distances in the network, a first step towards extending it to weighted networks is to generalise how shortest distances are defined. I have covered one such method in the post <a href="http://toreopsahl.com/2009/01/09/average-shortest-distance-in-weighted-networks/">average shortest distance in weighted networks<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>, which was based on work by Dijkstra (1959) and Newman (2001). In fact, Newman (2001) has already extended the closeness measure using this method. He applied both the binary and weighted measures to a coauthorship network, and found that different authors had the highest scores. This highlights the importance of considering the weights, and suggests that a binary measure might not be a good proxy of a possible weighted one. </p>
<p>Fourth, I have included the weighted rich-club effect framework (<a href="http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/">Opsahl et al., 2008<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>). However, due to the very general nature of the framework, I have only implemented two prominence parameters (degree and node strength), and two reshuffling procedures (weight reshuffling and weight &amp; link reshuffling) in the function <code>weighted_richclub</code>. For code with additional reshuffling procedures and significance (error bars), see details in the post: <a href="http://toreopsahl.com/2009/05/29/weighted-rich-club-effect-a-more-appropriate-null-model-for-scientific-collaboration-networks/">Weighted Rich-club Effect: A more appropriate null model for scientific collaboration networks<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>.</p>
<p>In addition, there are two functions that create random weighted networks. First, <code>rg_w</code> takes a set of properties, and according to these properties, it produces a random network. These properties included the number of nodes and ties, the range of weights, and whether the resulting network should be directed. Second, <code>rg_reshuffling_w</code> takes an observed network and randomises certain properties, such as the creator node, target node, or the location of the weights. In fact, the latter function is used by <code>weighted_richclub</code> when creating corresponding random networks.</p>
<p><strong>Availability and Licensing</strong></p>
<p>Compiled versions and the source code of <em>tnet</em> are available through the CRAN servers. If you are using the Windows version of <a href="http://www.r-project.org/" target="_blank"><em>R</em><img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>, you should be able to install <em>tnet</em> by going to the &#8216;Package&#8217;-menu in <a href="http://www.r-project.org/" target="_blank"><em>R</em><img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>, opening &#8216;Install package(s)&#8217;, selecting a server close to you, and then, choosing &#8216;tnet&#8217; from the list. For specific details on how to install if you are unfamiliar with the CRAN system, see <a href="http://opsahl.co.uk/tnet/content/view/30/25/" target="_blank"><em>tnet</em>&#8216;s supporting website<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>. </p>
<p><em>tnet</em>, as the information on this blog, is published under the <a href="http://creativecommons.org/licenses/by-nc/3.0/" target="_blank">Creative Commons Attribution-Noncommercial 3.0-lisence<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>. This means that you are free to:<br />
· share (copy, distribute and transmit)<br />
· remix (adapt)<br />
under the following conditions:<br />
· attribution (you must cite or link to it)<br />
· noncommercial (you may not use it for commercial purposes).</p>
<p>The current citation for <em>tnet</em> is: </p>
<p>Opsahl, T., 2009. Structure and Evolution of Weighted Networks. University of London (Queen Mary College), London, UK, pp. 104-122. Available at <a href="http://toreopsahl.com/publications/thesis/">http://toreopsahl.com/publications/thesis/</a>.</p>
<p><strong>References</strong></p>
<p>Barrat, A., Barthelemy, M., Pastor-Satorras, R., Vespignani, A., 2004. The architecture of complex weighted networks. Proceedings of the National Academy of Sciences 101 (11), 3747-3752. <a href="http://arxiv.org/abs/cond-mat/0311416" target="_blank">arXiv:cond-mat/0311416</a></p>
<p>Batagelj, V., Mrvar, A., 2007. Pajek: Program for Large Network Analysis: version 1.20. <a href="http://pajek.imfm.si/" target="_blank"><em>http://pajek.imfm.si/</em></a>. </p>
<p>Borgatti, S. P., Everett, M. G., Freeman, L. C., 2002. Ucinet for Windows: Software for Social Network Analysis. Analytic Technologies, Harvard, MA. </p>
<p>Brandes, U., 2001. A Faster Algorithm for Betweenness Centrality. Journal of Mathematical Sociology 25, 163-177.</p>
<p>Butts, C. T., 2006. sna-package: Package for Social Network Analysis. R package version 1.4. </p>
<p>Butts, C. T., Handcock, M. S., Hunter, D. R., 2008. network: Classes for Relational Data. <a href="http://statnet.org/" target="_blank">http://statnet.org/</a>. </p>
<p>Dijkstra, E. W., 1959. A note on two problems in connexion with graphs. Numerische Mathematik 1, 269-271. </p>
<p>Freeman, L. C., 1978. Centrality in social networks: Conceptual clarification. Social Networks 1, 215-239. </p>
<p>Handcock, M. S., Hunter, D. R., Butts, C. T., Goodreau, S. M., Morris, M., 2003. statnet: Software Tools for the Statistical Modeling of Network Data. <a href="http://statnetproject.org" target="_blank">http://statnetproject.org</a>. </p>
<p>Newman, M. E. J., 2001. Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality. Physical Review E 64, 016132. </p>
<p>Opsahl, T., Colizza, V., Panzarasa, P., Ramasco, J. J., 2008. <a href="http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/">Prominence and control: The weighted rich-club effect</a>. Physical Review Letters 101 (168702). <a href="http://arxiv.org/abs/0804.0417" target="_blank">arXiv:0804.0417</a>. </p>
<p>Opsahl, T., Panzarasa, P., 2009. <a href="http://toreopsahl.com/2009/04/03/article-clustering-in-weighted-networks/">Clustering in weighted networks</a>. Social Networks 31 (2), 155-163. </p>
<div class="knobcite">Please cite or link to this post if you use it.</div>
<br />Posted in Network thoughts Tagged: actors, arcs, betweenness, centrality, closeness, clustering coefficient, complex networks, degree, directed networks, edges, embeddedness, global, graphs, hubs, Links, local, network, nodes, r-package, reciprocation, reinforcement, shortest distance, shortest path, social network analysis, software, strength of nodes, strength of ties, ties, undirected networks, valued networks, vertices, weighted networks, weighted-richclub <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/thetore.wordpress.com/1105/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/thetore.wordpress.com/1105/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/thetore.wordpress.com/1105/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/thetore.wordpress.com/1105/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/thetore.wordpress.com/1105/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/thetore.wordpress.com/1105/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/thetore.wordpress.com/1105/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/thetore.wordpress.com/1105/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/thetore.wordpress.com/1105/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/thetore.wordpress.com/1105/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/thetore.wordpress.com/1105/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/thetore.wordpress.com/1105/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/thetore.wordpress.com/1105/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/thetore.wordpress.com/1105/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=1105&subd=thetore&ref=&feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/06/12/tnet-software-for-analysing-weighted-networks/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://thetore.files.wordpress.com/2009/06/thesis-fig16_300.gif" medium="image">
			<media:title type="html">Figure 16</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />
	</item>
		<item>
		<title>Weighted Rich-club Effect: A more appropriate null model for scientific collaboration networks</title>
		<link>http://toreopsahl.com/2009/05/29/weighted-rich-club-effect-a-more-appropriate-null-model-for-scientific-collaboration-networks/</link>
		<comments>http://toreopsahl.com/2009/05/29/weighted-rich-club-effect-a-more-appropriate-null-model-for-scientific-collaboration-networks/#comments</comments>
		<pubDate>Fri, 29 May 2009 00:00:20 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[weighted-richclub]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[weighted networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[richclub]]></category>
		<category><![CDATA[global]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[strength of nodes]]></category>
		<category><![CDATA[undirected networks]]></category>
		<category><![CDATA[embeddedness]]></category>
		<category><![CDATA[centrality]]></category>
		<category><![CDATA[reinforcement]]></category>
		<category><![CDATA[popularity]]></category>
		<category><![CDATA[hubs]]></category>
		<category><![CDATA[two-mode networks]]></category>
		<category><![CDATA[affiliation networks]]></category>
		<category><![CDATA[bipartite networks]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[graphs]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=1158</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2009/05/29/weighted-rich-club-effect-a-more-appropriate-null-model-for-scientific-collaboration-networks/"><img src="http://thetore.wordpress.com/files/2009/06/fig1_twomode_reshuffling_sw.gif" alt="Two-mode reshuffling" title="Two-mode reshuffling" width="127" height="200" class="alignright size-full wp-image-1388" /></a>In this post, I extend the <a href="http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/">Weighted Rich-club paper<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> by suggesting and testing a different null model for the scientific collaboration network (Newman, 2001). This network is a two-mode network, which becomes an undirected one-mode network when <a href="http://toreopsahl.com/2009/05/01/projecting-two-mode-networks-onto-weighted-one-mode-networks/">projected<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>. In the paper, we compared the observed weighted rich-club coefficient with the one found on random networks. The random networks were constructed by a null model defined for directed networks when prominence was based on node strength. Therefore, we created a directed network from the undirected scientific collaboration network by linking connected nodes with two directed ties that had the same weight. The null model consisted in reshuffling the tie weights attached to out-going ties for each node. However, this local reshuffling broke the weight symmetry of the two directed ties between connected nodes. The null model proposed in this post is based on the randomisation of the two-mode network before projecting it onto a one-mode network. By randomising before projecting, we are able to randomise a network while keeping the symmetry of weights. Technical:&#160;10/10.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=1158&subd=thetore&ref=&feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>In the <a href="http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/">Weighted Rich-club paper<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>, we proposed a measure to study whether, and the extent to which, of &#8220;prominent&#8221; nodes (e.g., the ones with high <a href="http://toreopsahl.com/2008/11/28/network-weighted-network/">(out-)degree or node (out-)strength for (directed) networks<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>) exchange among themselves the strongest ties in a network. Since certain ways of defining the prominent nodes can be associated with having strong ties, we divided the measure calculated on the observed network by the measure calculated on an ensemble of random networks (similar to the topological rich-club measure that we built on; Colizza, 2006). To ensure that the random networks were comparable to the observed network, we constrained the random networks so that each node maintained its prominence (e.g., degree or node strength). When prominence was defined as degree, this was straight-forward by either globally reshuffling the weights in the network (the topology remained, and hence, the degree of nodes) or reshuffling ties while maintaining nodes’ degree using Molloy and Reed&#8217;s (1995) method. </p>
<p><img src="http://thetore.files.wordpress.com/2009/05/fig1_directed_weight_reshuffle2.png?w=271&#038;h=372" alt="Directed weight reshuffle" title="Directed weight reshuffle" width="271" height="372" class="alignright size-full wp-image-1440" />However, when prominence was defined as node strength, we could not use either of these methods for undirect networks (For directed networks, it is possible to redirect a node&#8217;s out-going ties randomly in the network, thus maintaining the node strength). Therefore, for undirected networks, we created directed networks by linking connected nodes with two directed ties. The weight of an undirected tie is duplicated to the two directed ones. A transformed network based on this process is is illustrated in the first panel of this figure. Then we reshuffled the weights attached to the outgoing ties of a node, thus, maintaining the sum of weights on outgoing ties. For example, in the second panel of the figure (the weight of a directed tie is placed close to the origin of the tie in this figure) we have randomly the weights of node E&#8217;s outgoing ties, 1 and 5. This method has been refered to as the directed weight reshuffling method (Serrano et al., 2007). However, this method breaks the weight symmetry of ties. </p>
<p>The appropriateness of this method for undirected networks depends on the research setting and how tie weights are defined (I would like to thank <a href="http://stat.gamma.rug.nl/snijders/">Tom Snijders<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> for highlighting this during my viva). For example, its applicability to undirected transportation networks is justified by the typically directed nature of traffic flows (although the US airport network displays a high symmetry; Barrat et al., 2004). Conversely, in an undirected collaboration network this might not be appropriate. More generally, for <a href="http://toreopsahl.com/2009/05/01/projecting-two-mode-networks-onto-weighted-one-mode-networks/">one-mode projections of two-mode networks<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>, it might be more appropriate to reshuffle the two-mode network before projecting it onto a one-mode network (see randomisation without structural zeros in Rao et al., 1996; Snijders, 1991). </p>
<p>To test this method, I have applied it to the scientific collaboration network used in the <a href="http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/">Weighted Rich-club paper<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>. Although the two-mode structure of this network is not publically available, Mark Newman sent me a copy of it (Thanks!). </p>
<p>To calculate <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_%7B%5Cmathrm%7Bnull%7D%7D%5Ew%28s%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\phi_{\mathrm{null}}^w(s)' title='\phi_{\mathrm{null}}^w(s)' class='latex' /> using this null model, I first randomised the two-mode network by reshuffling ties while maintaining authors&#8217; and papers&#8217; degree (similarly to the method proposed by Molloy and Reed, 1995, for one-mode networks). One step of this method is exemplified in the diagram below. I then projected the randomised two-mode network onto a weighted one-mode network using the same method in the paper (for more details, see my <a href="http://toreopsahl.com/2009/05/01/projecting-two-mode-networks-onto-weighted-one-mode-networks/">post on projecting two-mode networks<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>). Finally, I calculated the weighted-rich club effect, <img src='http://l.wordpress.com/latex.php?latex=%5Cphi%5Ew&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\phi^w' title='\phi^w' class='latex' />, on the one-mode projections. The <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_%7B%5Cmathrm%7Bnull%7D%7D%5Ew%28s%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\phi_{\mathrm{null}}^w(s)' title='\phi_{\mathrm{null}}^w(s)' class='latex' /> is the average <img src='http://l.wordpress.com/latex.php?latex=%5Cphi%5Ew&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\phi^w' title='\phi^w' class='latex' /> over many random networks.<br />
<img src="http://thetore.files.wordpress.com/2009/06/fig1_twomode_reshuffling.gif?w=455&#038;h=173" alt="Two-mode reshuffling" title="Two-mode reshuffling" width="455" height="173" class="aligncenter size-full wp-image-1387" /></p>
<p>The result from this procedure is the displayed in the following diagram (the bars refer to the 95% confidence interval, see <a href="http://toreopsahl.com/publications/thesis/thesis-3-2-the-weighted-rich-club-effect/">Section&nbsp;3.2.2 of my thesis<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> for more details):</p>
<div id="attachment_1649" class="wp-caption aligncenter" style="width: 610px"><a href="http://thetore.files.wordpress.com/2009/05/wrc_twomode_scn.gif"><img src="http://thetore.files.wordpress.com/2009/06/wrc_twomode_scn_600.gif?w=600&#038;h=418" alt="Weighted rich-club coefficient for Newman&#39;s (2001) scientific collaboration network." title="Weighted rich-club coefficient for Newman&#39;s (2001) scientific collaboration network" width="600" height="418" class="size-full wp-image-1649" /></a><p class="wp-caption-text">Weighted rich-club coefficient for Newman's (2001) scientific collaboration network. <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_%7B%5Cmathrm%7Bnull%7D%7D%5Ew%28s%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='phi_{mathrm{null}}^w(s)' title='phi_{mathrm{null}}^w(s)' class='latex' /> is the average <img src='http://l.wordpress.com/latex.php?latex=%5Cphi%5Ew&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='phi^w' title='phi^w' class='latex' /> measured on 1,000 one-mode projections of randomised two-mode network.</p></div>
<p>The diagram suggest that, unlike what we found using the Directed weight reshuffle (see <a href="http://toreopsahl.com/publications/thesis/thesis-3-3-empirical-tests/">Section&nbsp;3.3 of my thesis<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>), when prominence is defined as having co-authored more than 20 articles, there is a negative and significant weighted rich-club effect. Thus, this finding is a substantiation of the speculation that in certain social networks a negative effect exists among the very prominent people due to some form of competition. This might account for the reluctance of highly productive authors to establish strong ties among themselves, as is suggested by the lack of interaction among the two most productive authors in the Figure below: Barabasi and Newman are not connected.¹</p>
<div id="attachment_1651" class="wp-caption aligncenter" style="width: 610px"><a href="http://thetore.files.wordpress.com/2009/05/thesis-fig08b.gif"><img src="http://thetore.files.wordpress.com/2009/06/thesis-fig08b_600.gif?w=600&#038;h=361" alt="Subset of the prominent nodes (s &gt;= 5) in the network science collaboration network (Newman, 2006)." title="The Network Science Collaboration Network (Newman, 2006)." width="600" height="361" class="size-full wp-image-1651" /></a><p class="wp-caption-text">Subset of the prominent nodes (<img src='http://l.wordpress.com/latex.php?latex=s+%5Cgeq+5&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='s geq 5' title='s geq 5' class='latex' />) in the network science collaboration network (Newman, 2006). Only ties among the prominent nodes are shown. The size of the nodes corresponds to the strength or number of co-authored papers, and the width of each tie is proportional to its weight. This figure is adapted from Figure&nbsp;8B in my thesis, which in turn was based on Figure&nbsp;3B in Opsahl et al. (2008).</p></div>
<p>The null model used in this post could also be used in a number of other frameworks. For example, it could be used in the topological rich-club (Colizza, 2006; Zlatic et al., 2008). In particular, it might be more appropriate for Zlatic et al. (2008) as they defined prominence as node strength.<br />
_____________________<br />
¹ Although Albert-László Barabási and Mark Newman have written a book together with Duncan Watts, it did not form part of this dataset. Nevertheless, this collaboration would add only a weak tie between these authors.</p>
<p><strong>Want to test it with your data?</strong></p>
<p>The following code was used to produced the above diagrams:</p>
<pre class="brush: plain;">
# Load tnet
library(tnet)

# Download data
data(Newman.Condmat.95.99)

# Run function
weighted_richclub_tm(Newman.Condmat.95.99.net.2mode)
</pre>
<p><strong>References</strong></p>
<p>Barrat, A., Barthelemy, M., Pastor-Satorras, R., Vespignani, A., 2004. The architecture of complex weighted networks. Proceedings of the National Academy of Sciences 101 (11), 3747-3752. <a href="http://arxiv.org/abs/cond-mat/0311416" target="_blank">arXiv:cond-mat/0311416</a></p>
<p>Colizza, V., Flammini, A., Serrano, M. A., Vespignani, A., 2006. Detecting rich-club ordering in complex networks. Nature Physics 2, 110-115. <a href="http://arxiv.org/abs/physics/0602134" target="_blank">arXiv:physics/0602134</a></p>
<p>Molloy, M., Reed, B., 1995. A critical point for random graphs with a given degree sequence. Random Structures and Algorithms 6, 161-180. </p>
<p>Newman, M. E. J., 2001. Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality. Physical Review E 64, 016132. </p>
<p>Newman, M. E. J., 2006. Finding community structure in networks using the eigenvectors of matrices. Physical Review E 76 (036104). <a href="http://arxiv.org/abs/physics/0605087" target="_blank">arXiv:physics/0605087</a></p>
<p>Opsahl, T., Colizza, V., Panzarasa, P., Ramasco, J. J., 2008. <a href="http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/">Prominence and control: The weighted rich-club effect</a>. Physical Review Letters 101 (168702). <a href="http://arxiv.org/abs/0804.0417" target="_blank">arXiv:0804.0417</a>. </p>
<p>Rao, A. R., Jana, R., Bandyopadhyay, S., 1996. A markov chain monte carlo method for generating random (0, 1)-matrices with given marginals. Sankhya A 58, 225- 242. </p>
<p>Serrano, M. A., Boguna, M., Vespignani, A., 2007. Patterns of dominant flows in the world trade web. Journal of Economic Interaction and Coordination 2, 111-124. <a href="http://arxiv.org/abs/0704.1225" target="_blank">arXiv:0704.1225</a>. </p>
<p>Snijders, T. A. B., 1991. Enumeration and simulation methods for 0-1 matrices with given marginals. Psychometrika 56 (3), 397-417.</p>
<p>Zlatic, V., Bianconi, G., Diaz-Guilera, A., Garlaschelli, D., Rao, F., Caldarelli, G., 2008. On the rich-club effect in dense and weighted networks. <a href="http://arxiv.org/abs/0807.0793" target="_blank">arXiv:0807.0793</a>.</p>
<div class="knobcite">Please cite or link to this post if you use it.</div>
<br />Posted in Network thoughts Tagged: actors, affiliation networks, arcs, bipartite networks, centrality, complex networks, edges, embeddedness, global, graphs, hubs, Links, network, nodes, popularity, reinforcement, richclub, social network analysis, strength of nodes, strength of ties, two-mode networks, undirected networks, valued networks, vertices, weighted networks, weighted-richclub <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/thetore.wordpress.com/1158/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/thetore.wordpress.com/1158/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/thetore.wordpress.com/1158/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/thetore.wordpress.com/1158/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/thetore.wordpress.com/1158/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/thetore.wordpress.com/1158/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/thetore.wordpress.com/1158/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/thetore.wordpress.com/1158/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/thetore.wordpress.com/1158/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/thetore.wordpress.com/1158/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/thetore.wordpress.com/1158/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/thetore.wordpress.com/1158/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/thetore.wordpress.com/1158/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/thetore.wordpress.com/1158/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=1158&subd=thetore&ref=&feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/05/29/weighted-rich-club-effect-a-more-appropriate-null-model-for-scientific-collaboration-networks/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://thetore.files.wordpress.com/2009/05/fig1_directed_weight_reshuffle2.png" medium="image">
			<media:title type="html">Directed weight reshuffle</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://thetore.files.wordpress.com/2009/06/fig1_twomode_reshuffling.gif" medium="image">
			<media:title type="html">Two-mode reshuffling</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://thetore.files.wordpress.com/2009/06/wrc_twomode_scn_600.gif" medium="image">
			<media:title type="html">Weighted rich-club coefficient for Newman&#39;s (2001) scientific collaboration network</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://thetore.files.wordpress.com/2009/06/thesis-fig08b_600.gif" medium="image">
			<media:title type="html">The Network Science Collaboration Network (Newman, 2006).</media:title>
		</media:content>
	</item>
		<item>
		<title>Thesis: Structure and Evolution of Weighted Networks</title>
		<link>http://toreopsahl.com/2009/05/15/thesis-structure-and-evolution-of-weighted-networks/</link>
		<comments>http://toreopsahl.com/2009/05/15/thesis-structure-and-evolution-of-weighted-networks/#comments</comments>
		<pubDate>Fri, 15 May 2009 00:00:27 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[weighted-richclub]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[weighted networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[richclub]]></category>
		<category><![CDATA[local]]></category>
		<category><![CDATA[global]]></category>
		<category><![CDATA[clustering coefficient]]></category>
		<category><![CDATA[shortest distance]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[degree]]></category>
		<category><![CDATA[strength of nodes]]></category>
		<category><![CDATA[directed networks]]></category>
		<category><![CDATA[undirected networks]]></category>
		<category><![CDATA[embeddedness]]></category>
		<category><![CDATA[betweenness]]></category>
		<category><![CDATA[centrality]]></category>
		<category><![CDATA[shortest path]]></category>
		<category><![CDATA[closeness]]></category>
		<category><![CDATA[online]]></category>
		<category><![CDATA[communication]]></category>
		<category><![CDATA[evolution]]></category>
		<category><![CDATA[reciprocation]]></category>
		<category><![CDATA[reinforcement]]></category>
		<category><![CDATA[gregariousness]]></category>
		<category><![CDATA[popularity]]></category>
		<category><![CDATA[hubs]]></category>
		<category><![CDATA[social networking site]]></category>
		<category><![CDATA[community]]></category>
		<category><![CDATA[longitudinal networks]]></category>
		<category><![CDATA[online communication]]></category>
		<category><![CDATA[online social networks]]></category>
		<category><![CDATA[two-mode networks]]></category>
		<category><![CDATA[affiliation networks]]></category>
		<category><![CDATA[bipartite networks]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[graphs]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=1143</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2009/05/15/thesis-structure-and-evolution-of-weighted-networks/"><img src="http://thetore.wordpress.com/files/2009/02/fig1_evolving_s.gif" alt="Evolving network" title="Evolving network" width="156" height="100" class="alignright size-full wp-image-764" /></a>I have now completed my Ph.D. at the School of Business and Management of Queen Mary College, University of London. My Ph.D. programme was defined around a number of projects, which drew on, and extended, recent theoretical and methodological advances in network science. The projects that were concerned with weighted networks and longitudinal networks were outlined and critically discussed in my thesis (Structure and Evolution of Weighted Networks). It is my hope and intention that the contributions made in the thesis have an impact on the community of researchers interested in networks. Although many parts have been published, the thesis contained a number of extensions. Therefore, in an effort to disseminate some of these, I have made most of the thesis available through this blog. <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=1143&subd=thetore&ref=&feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>I have now completed my Ph.D. at the School of Business and Management of Queen Mary College, University of London. My Ph.D. programme was defined around a number of projects, which drew on, and extended, recent theoretical and methodological advances in network science. The projects that were concerned with weighted networks and longitudinal networks were outlined and critically discussed in my thesis (Structure and Evolution of Weighted Networks). The entire thesis, except Chapter&nbsp;4 (Evolution of Networks; still to be published) and Appendix C (Source code of tnet 0.1.0; available through <a href="http://opsahl.co.uk/tnet/">tnet&#8217;s support website<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>), is available on the <a href="http://toreopsahl.com/publications/thesis/">Publication &gt; Thesis-page<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>.</p>
<p><strong>Acknowledgements</strong></p>
<p>The theme of this thesis is interdependence among elements. In fact, this thesis is not just a product of myself, but also of my interdependence with others. Without the support of a number of people, it would not have been possible to write. It is my pleasure to have the opportunity to express my gratitude to many of them here.</p>
<p>For my academic achievements, I would like to acknowledge the constant support from my supervisors. In particular, I thank Pietro Panzarasa for taking an active part of all the projects I have worked on. I have also had the pleasure to collaborate with people other than my supervisors. I worked with Vittoria Colizza and Jose J. Ramasco on the analysis and method presented in Chapter&nbsp;2, Kathleen M. Carley on an empirical analysis of the online social network used throughout this thesis, and Martha J. Prevezer on a project related to knowledge transfer in emerging countries. In addition to these direct collaborations, I would also like to thank Filip Agneessens, Sinan Aral, Steve Borgatti, Ronald Burt, Mauro Faccioni Filho, Thomas Friemel, John Skvoretz, and Vanina Torlo for encouragement and helpful advice. In particular, I would like to thank Tom A. B. Snijders and Klaus Nielsen for insightful reading of this thesis and many productive remarks and suggestions. I have also received feedback on my work at a number of conferences and workshops. I would like to express my gratitude to the participants at these.</p>
<p>On a social note, I would like to thank John, Claudius, and my family for their continuing support. Without them I would have lost focus. My peers and the administrative staff have also been a great source of support. In particular, I would like to extend my acknowledgements to Mariusz Jarmuzek, Geraldine Marks, Roland Miller, Jenny Murphy, Cathrine Seierstad, Lorna Soar, Steven Telford, and Eshref Trushin.</p>
<br />Posted in Network thoughts Tagged: actors, affiliation networks, arcs, betweenness, bipartite networks, centrality, closeness, clustering coefficient, communication, community, complex networks, degree, directed networks, edges, embeddedness, evolution, global, graphs, gregariousness, hubs, Links, local, longitudinal networks, network, nodes, online, online communication, online social networks, popularity, reciprocation, reinforcement, richclub, shortest distance, shortest path, social network analysis, social networking site, strength of nodes, strength of ties, ties, two-mode networks, undirected networks, valued networks, vertices, weighted networks, weighted-richclub <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/thetore.wordpress.com/1143/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/thetore.wordpress.com/1143/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/thetore.wordpress.com/1143/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/thetore.wordpress.com/1143/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/thetore.wordpress.com/1143/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/thetore.wordpress.com/1143/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/thetore.wordpress.com/1143/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/thetore.wordpress.com/1143/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/thetore.wordpress.com/1143/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/thetore.wordpress.com/1143/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/thetore.wordpress.com/1143/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/thetore.wordpress.com/1143/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/thetore.wordpress.com/1143/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/thetore.wordpress.com/1143/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=1143&subd=thetore&ref=&feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/05/15/thesis-structure-and-evolution-of-weighted-networks/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />
	</item>
		<item>
		<title>Projecting two-mode networks onto weighted one-mode networks</title>
		<link>http://toreopsahl.com/2009/05/01/projecting-two-mode-networks-onto-weighted-one-mode-networks/</link>
		<comments>http://toreopsahl.com/2009/05/01/projecting-two-mode-networks-onto-weighted-one-mode-networks/#comments</comments>
		<pubDate>Fri, 01 May 2009 00:00:06 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[weighted networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[global]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[strength of nodes]]></category>
		<category><![CDATA[undirected networks]]></category>
		<category><![CDATA[two-mode networks]]></category>
		<category><![CDATA[affiliation networks]]></category>
		<category><![CDATA[bipartite networks]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[graphs]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=782</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2009/05/01/projecting-two-mode-networks-onto-weighted-one-mode-networks/"><img src="http://thetore.wordpress.com/files/2009/03/fig1_twomode_s.png" alt="Projection of binary and weighted two-mode networks onto one-mode weighted networks" title="Projection of binary and weighted two-mode networks onto one-mode weighted networks" width="136" height="100" class="alignright size-full wp-image-798" /></a>This post highlights a number of methods for projecting both binary and weighted two-mode networks (also known as affiliation or bipartite networks) onto weighted one-mode networks. Although I would prefer to analyse two-mode networks in their original form, few methods exist for that purpose. These networks can be transformed into one-mode networks by projecting them (i.e., selecting one set of nodes, and linking two nodes if they are connected to the same node of the other set). Traditionally, ties in the one-mode networks are without weights. By carefully considering multiple ways of projecting two-mode networks onto weighted one-mode networks, we can maintain some of the richness contained within the two-mode structure. This enables researchers to conduct a deeper analysis than if the two-mode structure was completely ignored. (Technical:&#160;6/10)
<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=782&subd=thetore&ref=&feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><img src="http://thetore.files.wordpress.com/2009/04/fig1_twomode_half.png?w=271&#038;h=204" alt="Two-mode network" title="Two-mode network" width="271" height="204" class="alignright size-full wp-image-1012" />Many network dataset are by definition two-mode networks (also known as affiliation or bipartite networks). These are a particular type of networks with two sets of nodes and ties are only established between nodes belonging to different sets. This diagram illustrates a binary two-mode network where the colour represent the node set to which a node belongs. One of the first two-mode datasets to be analysed was the Davis Southern Club Women dataset (Davis et al., 1941), which recorded the attendance of a group of women (node set 1) to a series of events (set 2). A woman would be linked to an event if she attended it. Another type of two-mode dataset that has become popular in recent years is scientific collaboration networks. In this type of networks, a tie is established between a scientist (node set 1) and a paper (node set 2) if the scientist authored that paper (e.g., Newman, 2001)</p>
<p><img src="http://thetore.files.wordpress.com/2009/04/fig1_twomode_binary.png?w=271&#038;h=183" alt="Binary one-mode projection" title="Binary one-mode projection" width="271" height="183" class="alignright size-full wp-image-1014" />Few network measures exist for two-mode networks, and therefore, these networks are often projected onto a one-mode network (only one type of nodes). This is done by selecting one of the sets of nodes and linking two nodes from that set if they were connected to the same node (of the other kind). This process is illustrated for the blue nodes of the above diagram in this diagram. For example, node E and node F are connected to the same red node, therefore, in the one-mode projection they are tied together.</p>
<p><img src="http://thetore.files.wordpress.com/2009/04/fig1_twomode_simple.png?w=271&#038;h=183" alt="Simple weighted one-mode projection" width="271" height="183" class="alignright size-full wp-image-1010" />Traditionally, the ties in projected one-mode networks do not have weights attached to them. However, recent empirical studies of two-mode networks has created a weighted one-mode network by defining the weights as the number of co-occurrences (e.g., the number of events two individuals have co-attended or the number of papers that two authors had collaborated on). To formally describe this method and ease the comparison among the different methods introduced in this post, this method can be formalised as: <img src='http://l.wordpress.com/latex.php?latex=w_%7Bij%7D+%3D+%5Csum_p+1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='w_{ij} = \sum_p 1' title='w_{ij} = \sum_p 1' class='latex' /> where where <img src='http://l.wordpress.com/latex.php?latex=w_%7Bij%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='w_{ij}' title='w_{ij}' class='latex' /> is the weight between node <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' /> and node <img src='http://l.wordpress.com/latex.php?latex=j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='j' title='j' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' /> is the nodes of the other kind that node <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' /> and node <img src='http://l.wordpress.com/latex.php?latex=j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='j' title='j' class='latex' /> are connected to, their co-occurrences (e.g., the red nodes in the above diagram). In the sample network, the tie between node A and node B has a weight of 2 as these two nodes have connections to two common red nodes, whereas the tie between node A and node C has only a weight of 1 as these nodes have connections to merely one common red node.</p>
<p><img src="http://thetore.files.wordpress.com/2009/04/fig1_twomode_newman2001.png?w=271&#038;h=183" alt="One-mode projection using Newman's (2001) method for defining tie weights" width="271" height="183" class="alignright size-full wp-image-1013" />Newman (2001) extended this procedure while working with scientific collaboration networks. He argued that the social bonds among scientist collaborating with few others on a paper were stronger than the bonds among scientists collaborating with many on a paper. He proposed to discount for the size of the collaboration by defining the weights among the nodes using the following formula: <img src='http://l.wordpress.com/latex.php?latex=w_%7Bij%7D+%3D+%5Csum_p+%5Cfrac%7B1%7D%7BN_p+-+1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='w_{ij} = \sum_p \frac{1}{N_p - 1}' title='w_{ij} = \sum_p \frac{1}{N_p - 1}' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=N_p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N_p' title='N_p' class='latex' /> is the number of authors on paper <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' /> (e.g., the number of blue nodes connected to the red node <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' />). In the context of scientific collaboration networks, this implies that if two scientists who only write a single paper together with no other co-authors get a weight of 1 (e.g., node B and node D). Moreover, if two scientists have written two papers together without any co-author, the weight of their tie would be 2 (e.g., node B and E). However, if the two scientists have a co-author, the weight on the tie between them is 0.5 (e.g., node A and node C). A more complicated example is the tie between node A and node B in the diagram. They have written two papers together: one without any other co-author and one with node C as a co-author. The first paper would give their tie a weight of 1, and the second tie would add 0.5 to the weight of this tie. Therefore, the weight attached to their tie is 1.5. By discounting for the number of blue nodes attached to the same red node, this methods creates a one-mode projection in which <a href="http://toreopsahl.com/2008/11/28/network-weighted-network/">the strength of a node<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> is equal to the number of ties originating from that node in the two-mode network (e.g. the sum of weights attached to ties originating from node A in the one-mode projection is 2, and node A is connected to two red nodes).</p>
<p><img src="http://thetore.files.wordpress.com/2009/04/fig1_twomode_forum.png?w=271&#038;h=204" alt="Weighted two-mode network" title="Weighted two-mode network" width="271" height="204" class="alignright size-full wp-image-1011" />So far this post has only dealt with binary two-mode networks. However, weighted two-mode networks also exist. For example, in online forums, where the node sets are users and threads (or topics), and a tie between two nodes is established if a user posts a message to a thread, it is possible to differentiate the two-mode ties based on the number of messages or characters posted by users to a specific thread. Thus creating a weighted two-mode network. By including the weights in the two-mode network, the data used for analysis is richer than if they were discarded.</p>
<p><div id="attachment_1109" class="wp-caption alignright" style="width: 281px"><img src="http://thetore.files.wordpress.com/2009/04/fig1_twomode_forum_simple1.png?w=271&#038;h=177" alt="Note: The weights of the directed ties are placed close to the node that the ties originate from. For example, the tie from node B to node D has a weight of 4, whereas the tie from node D to node B has a weight of 6." title="One-mode projection of a weighted two-mode network" width="271" height="177" class="size-full wp-image-1109" /><p class="wp-caption-text"><em>Note</em>: The weights of the directed ties are placed close to the node that the ties originate from. For example, the tie from node B to node D has a weight of 4, whereas the tie from node D to node B has a weight of 6.</p></div>In a similar spirit as simply the number of co-occurrence for a binary two-mode network, the one-mode projection of a weighted two-mode network could be based on the weights the two nodes have directed towards common nodes (of different kind). In addition, with such a method it is possible to differentiate how the two nodes interact with the common node, and to project it onto a directed weighted one-mode network. In this type of projected network, the weight of a tie from a node to another is not necessarily the same as the weight attached to the tie from the latter node to the former node. It is not a truly directed network as two directed ties exist between any connected node pair. More specifically, all dyads are composed of either two directed ties (mutual) or no directed ties (null), and no dyads are made of a single directed tie (asymmetric). For example, if the diagram above referred to an online forum, node B has posted 4 messages in to a thread that node D participate in, thus giving the directed tie between node B and node D in the one-mode projection a weight of 4. Conversely, node D posted 6 messages to that thread, and therefore, the weight attached to the tie from node D to node B is equal to 6. This method can be formalised as follows: <img src='http://l.wordpress.com/latex.php?latex=w_%7Bij%7D+%3D+%5Csum_p+w_%7Bi%2Cp%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='w_{ij} = \sum_p w_{i,p}' title='w_{ij} = \sum_p w_{i,p}' class='latex' />. </p>
<p><img src="http://thetore.files.wordpress.com/2009/04/fig1_twomode_forum_newman2001.png?w=271&#038;h=177" alt="Note: The weights of the directed ties are placed close to the node that the ties originate from. For example, the tie from node B to node D has a weight of 4, whereas the tie from node D to node B has a weight of 6." title="One-mode projection of a weighted two-mode network" width="271" height="177" class="alignright size-full wp-image-1114" />In a similar spirit as the method used by Newman (2001), it is also possible to discount for the number of nodes when projecting weighted two-mode networks. For example, it could be argued that if many online users post to a thread, their ties should be weaker than if there were few people posting to the thread. A straight forward generalisation is the following function: <img src='http://l.wordpress.com/latex.php?latex=w_%7Bij%7D+%3D+%5Csum_p+%5Cfrac%7Bw_%7Bi%2Cp%7D%7D%7BN_p+-+1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='w_{ij} = \sum_p \frac{w_{i,p}}{N_p - 1}' title='w_{ij} = \sum_p \frac{w_{i,p}}{N_p - 1}' class='latex' />. This formula would create a directed one-mode network in which <a href="http://toreopsahl.com/2008/11/28/network-weighted-network/">the out-strength of a node<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> is equal to the sum of the weights attached to the ties in the two-mode network that originated from that node. For example, node C has a tie with a weight of 5 in the two-mode network and an out-strength of 5 in the one-mode projection. </p>
<p><strong>Want to test it with your data?</strong></p>
<p><em>Binary two-mode networks</em></p>
<p>The following code requires a binary two-mode network to be listed in an edgelist format with two columns named i and p. The i column refers to the nodes you would like to keep in the one-mode projection (e.g., the blue nodes), and the p column refers to the nodes you would like to discard (e.g., the red nodes). The binary two-mode network in the diagram above can be loaded using the following function.</p>
<pre class="brush: plain;">
net &lt;- cbind(
i=c(1,1,2,2,2,2,2,3,4,5,5,5,6),
p=c(1,2,1,2,3,4,5,2,3,4,5,6,6))
</pre>
<p>The one-projections highlighted above can be created using the following code:</p>
<pre class="brush: plain;">
# Load tnet
library(tnet)

# Binary one-mode projection
projecting_tm(net, method=&quot;binary&quot;)

# Simply the number of common red nodes
projecting_tm(net, method=&quot;sum&quot;)

# Newman's method
projecting_tm(net, method=&quot;Newman&quot;)
</pre>
<p><em>Weighted two-mode networks</em></p>
<p>The following code requires a weighted two-mode network to be listed in an edgelist format with three columns named i, p, and w. The i column refers to the nodes you would like to keep in the one-mode projection (e.g., the blue nodes), the p column refers to the nodes you would like to discard (e.g., the red nodes), and the w column must be the weight of the ties. The weighted two-mode network in the diagram above can be loaded using the following function.</p>
<pre class="brush: plain;">
net.w &lt;- cbind(
i=c(1,1,2,2,2,2,2,3,4,5,5,5,6),
p=c(1,2,1,2,3,4,5,2,3,4,5,6,6),
w=c(4,2,2,1,4,3,2,5,6,2,4,1,1))
</pre>
<p>The one-projections highlighted above can be created using the following code:</p>
<pre class="brush: plain;">
# Load tnet
library(tnet)

# Simply the sum of weights towards common red nodes
projecting_tm(net.w, method=&quot;sum&quot;)

# Generalisation of Newman's method
projecting_tm(net.w, method=&quot;Newman&quot;)
</pre>
<p><strong>References</strong></p>
<p>Davis, A., Gardner, B. B., Gardner, M. R., 1941. Deep South. University of Chicago Press, Chicago, IL.</p>
<p>Newman, M. E. J., 2001. Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality. Physical Review E 64, 016132. </p>
<div class="knobcite">Please cite or link to this post if you use it.</div>
<br />Posted in Network thoughts Tagged: actors, affiliation networks, arcs, bipartite networks, complex networks, edges, global, graphs, Links, network, nodes, social network analysis, strength of nodes, strength of ties, ties, two-mode networks, undirected networks, valued networks, vertices, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/thetore.wordpress.com/782/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/thetore.wordpress.com/782/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/thetore.wordpress.com/782/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/thetore.wordpress.com/782/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/thetore.wordpress.com/782/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/thetore.wordpress.com/782/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/thetore.wordpress.com/782/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/thetore.wordpress.com/782/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/thetore.wordpress.com/782/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/thetore.wordpress.com/782/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/thetore.wordpress.com/782/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/thetore.wordpress.com/782/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/thetore.wordpress.com/782/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/thetore.wordpress.com/782/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=782&subd=thetore&ref=&feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/05/01/projecting-two-mode-networks-onto-weighted-one-mode-networks/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2009/04/fig1_twomode_half.png" medium="image">
			<media:title type="html">Two-mode network</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2009/04/fig1_twomode_binary.png" medium="image">
			<media:title type="html">Binary one-mode projection</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2009/04/fig1_twomode_simple.png" medium="image">
			<media:title type="html">Simple weighted one-mode projection</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2009/04/fig1_twomode_newman2001.png" medium="image">
			<media:title type="html">One-mode projection using Newman's (2001) method for defining tie weights</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://thetore.files.wordpress.com/2009/04/fig1_twomode_forum.png" medium="image">
			<media:title type="html">Weighted two-mode network</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2009/04/fig1_twomode_forum_simple1.png" medium="image">
			<media:title type="html">One-mode projection of a weighted two-mode network</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2009/04/fig1_twomode_forum_newman2001.png" medium="image">
			<media:title type="html">One-mode projection of a weighted two-mode network</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />
	</item>
		<item>
		<title>Are triangles made up by strong ties?</title>
		<link>http://toreopsahl.com/2009/04/17/are-triangles-made-up-by-strong-ties/</link>
		<comments>http://toreopsahl.com/2009/04/17/are-triangles-made-up-by-strong-ties/#comments</comments>
		<pubDate>Fri, 17 Apr 2009 00:00:40 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[weighted networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[clustering coefficient]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[embeddedness]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[graphs]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=429</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2009/04/17/are-triangles-made-up-by-strong-ties/"><img src="http://thetore.wordpress.com/files/2009/02/strongtiesembedded_s.png" alt="Are strong ties embedded?" title="Are strong ties embedded?" width="153" height="100" class="alignright size-full wp-image-768" /></a>A key assumption of Granovetter's (1973) Strength of Weak Ties theory is that strong ties are embedded by being part of triangles, whereas weak ties are not embedded by being created towards disconnected nodes. This assumption have been tested by calculating the traditional clustering coefficient on binary networks created with increasing cut-off parameters (i.e., creating a series of binary networks from a weighted network where ties with a weight greater than a cut-off parameter is set to present and the rest removed). Contrarily to theories of strong ties and embeddedness, these methods generally showed that the clustering coefficient decreased as the cut-off parameter increased. However, the binary networks were not comparable with each other as they had a different number of ties. Another way of testing this assumption is to take the ratio between the weighted global clustering coefficient and the traditional coefficient measured on networks where all ties are considered present. Thus, the number of ties is maintained. This post highlights this feature and empirically tests it on a number of publically available weighted network datasets. (Technical:&#160;5/10)<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=429&subd=thetore&ref=&feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Simmel (1923[1950]) argued that a strong social tie could not exist without being embedded among other ties. For example, two best friends would always have common friends and acquaintances.</p>
<p><img src="http://thetore.files.wordpress.com/2008/12/fig1.png?w=271&#038;h=177" alt="Sample network" title="Sample network" width="271" height="177" class="alignright size-full wp-image-160" />Simmel&#8217;s argument was used as a key assumption by Granovetter (1973) when he formulated the Strength of Weak Ties theory. The theory states that weak ties in social networks (e.g., acquaintances as opposed to best friends) are more valuable than strong ties as they bring novel information to a person (ego). This assumption is based on the argument that ego&#8217;s close contacts are more likely to know each other (Simmel, 1923[1950]). This argument is illustrated in this sample network for node B. The node has two strong ties to two nodes that are connected, and two weak ties to nodes that are disconnected. Since the close contacts are connected, they are more likely to move in the same social circles as ego and each other than ego&#8217;s acquaintances. Thus, close friends&#8217; knowledge is more likely to overlap with ego&#8217;s existing knowledge and each others&#8217;. </p>
<p>Following Granovetter, Burt (1992) and Coleman (1988) developed two sides of this theory. Burt developed the notion of structural holes, which states that a person benefits from being a broker among other disconnected contacts. On the contrary, Coleman argued that, although it might be beneficial to have many acquaintances when it comes to explicit knowledge, the same cannot be argued for tacit knowledge. A key factor in tacit knowledge transfer is close bonds, shared framework, and trust. These elements describe embedded relationships. Thus, ego must be embedded to optimise tacit knowledge transfer, and broker to optimise explicit knowledge transfer. </p>
<p>Simmel&#8217;s claim that strong social ties cannot exist without being embedded is key to this area of investigation; however, <strong>are triangles more likely to be constructed by strong ties than by weak ones?</strong> </p>
<p>Doreian (1969) and <a href="http://toreopsahl.com/2009/04/03/article-clustering-in-weighted-networks/">Opsahl and Panzarasa (2009)<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> studied how the clustering coefficient in weighted networks changed when only considering ties with at least a certain value. As the cut-off value increased, the clustering coefficient generally decreased. However, it is not accurate to argue that strong ties is less embedded in triangles than weak ties based on such an analysis. This is due to the fact that the network structure changes when applying an increasinly restrictive cut-off as ties are removed. This implies that the networks on which the clustering coefficient is measured is not comparable. </p>
<p><img src="http://thetore.files.wordpress.com/2009/02/strongtiesembedded.png?w=271&#038;h=177" alt="Are strong ties embedded?" title="Are strong ties embedded?" width="271" height="177" class="alignright size-full wp-image-769" />A possible way of appropriately testing the above question is to compare the <a href="http://toreopsahl.com/2009/04/03/article-clustering-in-weighted-networks/">weighted clustering coefficient<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> (<img src='http://l.wordpress.com/latex.php?latex=C_%7Bw%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='C_{w}' title='C_{w}' class='latex' />) to the clustering coefficient calculated on a binary network where all ties with a weight greater than 0 is set to present (<img src='http://l.wordpress.com/latex.php?latex=C_%7BGT0%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='C_{GT0}' title='C_{GT0}' class='latex' />). In so doing, the two coefficients are calculated on the same network structure. The only difference is that the former takes the weights of ties into consideration. This is possible due to the fact that if weights are randomly assigned in the network, the weighted clustering coefficient is equal to the binary coefficient. </p>
<p>If the ratio between the weighted clustering coefficient and the binary coefficient is higher than 1, it can be argued that triplets made up by strong ties are more likely to be closed than triplets made up by weak ties (<img src='http://l.wordpress.com/latex.php?latex=%5Cfrac%7BC_%7Bw%7D%7D%7BC_%7BGT0%7D%7D+%3E+1+%5CRightarrow+p%28X%29+%3E+p%28Y%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\frac{C_{w}}{C_{GT0}} &gt; 1 \Rightarrow p(X) &gt; p(Y)' title='\frac{C_{w}}{C_{GT0}} &gt; 1 \Rightarrow p(X) &gt; p(Y)' class='latex' />). On the contrary, it can be argued that triplets made up by weak ties are more likely to be closed than those made up by strong ties if the ratio is less than 1.</p>
<p><strong>Want to test it with your data?</strong></p>
<p>First, you need to ensure that your data confirm to <a href="http://opsahl.co.uk/tnet/content/view/22/25/" target="_blank">the tnet standard for weighted networks<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>. Then you need to load it into an <a href="http://www.r-project.org/" target="_blank">R<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> session. The sample network above can be loaded using the following command:</p>
<pre class="brush: plain;">
# Load network as an object called net:
net &lt;- cbind(
i=c(1,1,2,2,2,2,3,3,4,5,5,6),
j=c(2,3,1,3,4,5,1,2,2,2,6,5),
w=c(4,2,4,4,1,2,2,4,1,2,1,1))
</pre>
<p>Second, you need to <a href="http://opsahl.co.uk/tnet/content/view/30/25/" target="_blank">download and install<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> if you have not done so already for another function.</p>
<p>Third, by running the following commands, you will get the ratio between the weighted and binary clustering coefficient:</p>
<pre class="brush: plain;">
# Load tnet
library(tnet)

# Create a binary version of net
net.b &lt;- dichotomise(net, GT=0)

# Calculate the clustering coefficient of the weighted network and of the binary network, and then, divided the former by the latter:
clustering_w(net, measure=&quot;am&quot;)/
clustering_w(net.b, measure=&quot;am&quot;)
</pre>
<p><strong>Empirical tests</strong></p>
<p>To empirically investigate whether strong ties are more likely to be part of closed triplets than weak ties, I have relied upon a set of <a href="http://toreopsahl.com/datasets/">publicly available datasets<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> that I have collected. </p>
<table class="tore" cellspacing="1" cellpadding="3">
<tr align="center">
<th align="left" colspan="2">Network</th>
<th>Nodes</th>
<th>Ties</th>
<th><img src='http://l.wordpress.com/latex.php?latex=C_%7BGT0%7D&#038;bg=EEEEEE&#038;fg=000000&#038;s=0' alt='C_{GT0}' title='C_{GT0}' class='latex' /></th>
<th><img src='http://l.wordpress.com/latex.php?latex=C_%7Bw%7D&#038;bg=EEEEEE&#038;fg=000000&#038;s=0' alt='C_{w}' title='C_{w}' class='latex' /></th>
<th><img src='http://l.wordpress.com/latex.php?latex=%5Cfrac%7BC_%7Bw%7D%7D%7BC_%7BGT0%7D%7D&#038;bg=EEEEEE&#038;fg=000000&#038;s=0' alt='\frac{C_{w}}{C_{GT0}}' title='\frac{C_{w}}{C_{GT0}}' class='latex' /></th>
</tr>
<tr>
<td align="right">1</td>
<td>Freeman&#8217;s EIES network (time 1)</td>
<td align="right">48</td>
<td align="right">695</td>
<td align="center">0.7627</td>
<td align="center">0.7702</td>
<td align="center">1.0099</td>
</tr>
<tr>
<td align="right">2</td>
<td>Freeman&#8217;s EIES network (time 2)</td>
<td align="right">48</td>
<td align="right">830</td>
<td align="center">0.8131</td>
<td align="center">0.8214</td>
<td align="center">1.0102</td>
</tr>
<tr>
<td align="right">3</td>
<td>Freeman&#8217;s EIES network (messages)</td>
<td align="right">32</td>
<td align="right">460</td>
<td align="center">0.6386</td>
<td align="center">0.7378</td>
<td align="center">1.1555</td>
</tr>
<tr>
<td align="right">4</td>
<td>Consulting (advice)</td>
<td align="right">46</td>
<td align="right">879</td>
<td align="center">0.6932</td>
<td align="center">0.7130</td>
<td align="center">1.0285</td>
</tr>
<tr>
<td align="right">5</td>
<td>Consulting (value)</td>
<td align="right">46</td>
<td align="right">858</td>
<td align="center">0.6764</td>
<td align="center">0.6852</td>
<td align="center">1.0131</td>
</tr>
<tr>
<td align="right">6</td>
<td>Research team (advice)</td>
<td align="right">77</td>
<td align="right">2228</td>
<td align="center">0.6848</td>
<td align="center">0.7127</td>
<td align="center">1.0408</td>
</tr>
<tr>
<td align="right">7</td>
<td>Research team (awareness)</td>
<td align="right">77</td>
<td align="right">2326</td>
<td align="center">0.6785</td>
<td align="center">0.6957</td>
<td align="center">1.0253</td>
</tr>
<tr>
<td align="right">8</td>
<td>C.elegans&#8217; neural network</td>
<td align="right">306</td>
<td align="right">2345</td>
<td align="center">0.1818</td>
<td align="center">0.2364</td>
<td align="center">1.3009</td>
</tr>
<tr>
<td align="right">9</td>
<td>US airport network</td>
<td align="right">500</td>
<td align="right">2980</td>
<td align="center">0.3514</td>
<td align="center">0.4765</td>
<td align="center">1.3562</td>
</tr>
<tr>
<td align="right">10</td>
<td>Newman&#8217;s scientific collaboration network</td>
<td align="right">16730</td>
<td align="right">47594</td>
<td align="center">0.3596 </td>
<td align="center">0.3178 </td>
<td align="center">0.8838</td>
</tr>
</table>
<p>As it is possible to see from the above table, 9 of the 10 networks have a weighted clustering coefficient that is higher than the traditional clustering coefficient. This suggest that triangles generally are, in fact as suggested by Simmel (1923[1950]), made up by strong ties.</p>
<p>The tenth network, which shows the opposite result, is different from the other nine networks. It is a one-mode projection of a two-mode network, whereas the first nine networks are native one-mode networks. This feature could explain the observed result as the one-mode projection is constructed in a way that penalises the weight of ties within triangles, or more specifically, large fully connected cliques (Newman, 2001). See the <a href="http://toreopsahl.com/datasets/">description of the construction<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> for more details. The next post will describe some <a href="http://toreopsahl.com/2009/05/01/projecting-two-mode-networks-onto-weighted-one-mode-networks/">methods for creating weighted one-mode projections of two-mode networks<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>.</p>
<p>The code needed to created the above table was: </p>
<pre class="brush: plain;">
# Load tnet
library(tnet)

# Load networks
data(Freemans.EIES, Cross.Parker.Consulting, Cross.Parker.Manufacturing, celegans.n306, USairport.n500, Newman.Condmat.95.99)
net &lt;- list()
net[[1]] &lt;- Freemans.EIES.net.1.n48
net[[2]] &lt;- Freemans.EIES.net.2.n48
net[[3]] &lt;- Freemans.EIES.net.3.n32
net[[4]] &lt;- Cross.Parker.Consulting.net.info
net[[5]] &lt;- Cross.Parker.Consulting.net.value
net[[6]] &lt;- Cross.Parker.Manufacturing.net.info
net[[7]] &lt;- Cross.Parker.Manufacturing.net.aware
net[[8]] &lt;- celegans.n306.net
net[[9]] &lt;- USairport.n500.net
net[[10]] &lt;- Newman.Condmat.95.99.net.1mode.wNewman

# Calculate values
output &lt;- data.frame(CGT0=NaN, Cw=NaN, Ratio=NaN)
for(i in 1:length(net)) {
  output[i,&quot;CGT0&quot;] &lt;- clustering_w(dichotomise(net[[i]]), measure=&quot;am&quot;)
  output[i,&quot;Cw&quot;] &lt;- clustering_w(net[[i]], measure=&quot;am&quot;)
}
output[,&quot;Ratio&quot;] &lt;- output[,&quot;Cw&quot;]/output[,&quot;CGT0&quot;]
</pre>
<p><strong>References</strong></p>
<p>Burt, R. S., 1992. Structural Holes: The Social Structure of Competition. Harvard University Press, Cambridge, MA. </p>
<p>Coleman, J. S., 1988. Social capital in the creation of human capital. American Journal of Sociology 94, S95-S120. </p>
<p>Doreian, P., 1969. A note on the detection of cliques in valued graphs. Sociometry 32 (2), 237-242. </p>
<p>Granovetter, M., 1973. The strength of weak ties. American Journal of Sociology 78, 1360-1380. </p>
<p>Opsahl, T., Panzarasa, P., 2009. <a href="http://toreopsahl.com/2009/04/03/article-clustering-in-weighted-networks/">Clustering in weighted networks</a>. Social Networks 31 (2), 155-163. </p>
<p>Simmel, G., 1950. The Sociology of Georg Simmel (KH Wolff, trans.). Free Press, New York, NY. </p>
<div class="knobcite">Please cite or link to this post if you use it.</div>
<br />Posted in Network thoughts Tagged: arcs, clustering coefficient, complex networks, edges, embeddedness, graphs, Links, network, social network analysis, strength of ties, ties, valued networks, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/thetore.wordpress.com/429/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/thetore.wordpress.com/429/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/thetore.wordpress.com/429/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/thetore.wordpress.com/429/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/thetore.wordpress.com/429/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/thetore.wordpress.com/429/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/thetore.wordpress.com/429/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/thetore.wordpress.com/429/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/thetore.wordpress.com/429/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/thetore.wordpress.com/429/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/thetore.wordpress.com/429/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/thetore.wordpress.com/429/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/thetore.wordpress.com/429/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/thetore.wordpress.com/429/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=429&subd=thetore&ref=&feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/04/17/are-triangles-made-up-by-strong-ties/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2008/12/fig1.png" medium="image">
			<media:title type="html">Sample network</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://thetore.files.wordpress.com/2009/02/strongtiesembedded.png" medium="image">
			<media:title type="html">Are strong ties embedded?</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />
	</item>
		<item>
		<title>Article: Clustering in Weighted Networks</title>
		<link>http://toreopsahl.com/2009/04/03/article-clustering-in-weighted-networks/</link>
		<comments>http://toreopsahl.com/2009/04/03/article-clustering-in-weighted-networks/#comments</comments>
		<pubDate>Fri, 03 Apr 2009 00:00:38 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[weighted networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[global]]></category>
		<category><![CDATA[clustering coefficient]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[directed networks]]></category>
		<category><![CDATA[undirected networks]]></category>
		<category><![CDATA[embeddedness]]></category>
		<category><![CDATA[reinforcement]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[graphs]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=662</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2009/04/03/article-clustering-in-weighted-networks/"><img src="http://thetore.wordpress.com/files/2009/01/triplet_small21.png" alt="Triplet" title="Triplet" width="86" height="100" class="alignright size-full wp-image-534" /></a>A paper called "Clustering in Weighted Networks" that I have co-authored will be published in Social Networks. Although many social network measures exist for binary networks and many theories differentiate between strong and weak ties, few measures have been generalised so that they can be applied to weighted networks and retain the information encoded in the weights of ties. One of these measures is the global clustering coefficient, which measures embeddedness or, more specifically, the likelihood of a triplet being closed by a tie so that it forms a triangle. This article proposes a generalisation of this key network measure to weighted networks.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=662&subd=thetore&ref=&feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>A paper called &#8220;Clustering in Weighted Networks&#8221; that I have co-authored with <a href="http://www.busman.qmul.ac.uk/staff/staff.php?p.panzarasa@qmul.ac.uk" target="_blank">Pietro Panzarasa<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> will be published in <a href="http://www.sciencedirect.com/science/journal/03788733" target="_blank">Social Networks<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>. Unfortunately, the copyright agreement prevents me from uploading a pdf of the published paper to this blog. However, if you have access to Social Networks, you can <a href="http://dx.doi.org/10.1016/j.socnet.2009.02.002" target="_blank">download the paper directly<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>. Otherwise, a <a href="http://thetore.files.wordpress.com/2009/03/clustering.pdf" target="_blank">preprint with the exact same text<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> is available.</p>
<p><strong>Abstract</strong></p>
<p>In recent years, researchers have investigated a growing number of weighted networks where ties are differentiated according to their strength or capacity. Yet, most network measures do not take weights into consideration, and thus do not fully capture the richness of the information contained in the data. In this paper, we focus on a measure originally defined for unweighted networks: the global clustering coefficient. We propose a generalization of this coefficient that retains the information encoded in the weights of ties. We then undertake a comparative assessment by applying the standard and generalized coefficients to a number of network datasets.</p>
<p><strong>Motivation</strong></p>
<p><img src="http://thetore.files.wordpress.com/2008/12/fig1.png?w=271&#038;h=177" alt="Sample network" title="Sample network" width="271" height="177" class="alignright size-full wp-image-160" />In this sample network the binary clustering coefficient is 0.33 as a third of the triplets are closed by being part of a triangle. By looking at the weights, it is possible to see that the strongest ties are in part of the closed triplets. This is not reflected in the binary clustering coefficient.</p>
<p>By applying the proposed generalisation of the coefficient using the <a href="http://en.wikipedia.org/wiki/Arithmetic_mean" target="_blank">arithmetic mean<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> method for defining triplet value, the clustering coefficient increases to 0.42. This increase of this coefficient from the binary coefficient is a reflection of the fact that the strongest ties are part of the closed triplets.</p>
<p><strong>Want to test it with your data?</strong></p>
<p>The <a href="http://opsahl.co.uk/tnet/content/view/17/30/" target="_blank">clustering_w<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> function in <a href="http://opsahl.co.uk/tnet/" target="_blank">tnet<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> allows you to test the generalised clustering coefficient on your own dataset.</p>
<p>For example, to test the <a href="http://opsahl.co.uk/tnet/content/view/17/30/" target="_blank">clustering_w<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> function on the sample network above, you can run the following code in R:</p>
<pre class="brush: plain;">
# Load tnet
library(tnet)

# Load network
net &lt;- cbind(
i=c(1,1,2,2,2,2,3,3,4,5,5,6),
j=c(2,3,1,3,4,5,1,2,2,2,6,5),
w=c(4,2,4,4,1,2,2,4,1,2,1,1))

# Run function
clustering_w(net, measure=c(&quot;am&quot;, &quot;gm&quot;, &quot;ma&quot;, &quot;mi&quot;))

# The output is:
#       am        gm        ma        mi
#0.4166667 0.4361302 0.3750000 0.5000000
</pre>
<p>To test in on Freeman&#8217;s third EIES network from <a href="http://toreopsahl.com/datasets/">the datasets page<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>, you can do the following:</p>
<pre class="brush: plain;">
# Load tnet
library(tnet)

# Load network
data(Freemans.EIES)

# Run function
clustering_w(Freemans.EIES.net.3.n32, measure=c(&quot;am&quot;, &quot;gm&quot;, &quot;ma&quot;, &quot;mi&quot;))

# The output is:
#0.7378310 0.7331536 0.7410959 0.7249982</pre>
<br />Posted in Network thoughts Tagged: arcs, clustering coefficient, complex networks, directed networks, edges, embeddedness, global, graphs, Links, network, nodes, reinforcement, social network analysis, strength of ties, ties, undirected networks, valued networks, vertices, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/thetore.wordpress.com/662/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/thetore.wordpress.com/662/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/thetore.wordpress.com/662/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/thetore.wordpress.com/662/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/thetore.wordpress.com/662/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/thetore.wordpress.com/662/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/thetore.wordpress.com/662/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/thetore.wordpress.com/662/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/thetore.wordpress.com/662/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/thetore.wordpress.com/662/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/thetore.wordpress.com/662/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/thetore.wordpress.com/662/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/thetore.wordpress.com/662/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/thetore.wordpress.com/662/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=662&subd=thetore&ref=&feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/04/03/article-clustering-in-weighted-networks/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://thetore.files.wordpress.com/2008/12/fig1.png" medium="image">
			<media:title type="html">Sample network</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />
	</item>
		<item>
		<title>The importance of allowing ties to decay</title>
		<link>http://toreopsahl.com/2009/03/20/the-importance-of-allowing-ties-to-decay/</link>
		<comments>http://toreopsahl.com/2009/03/20/the-importance-of-allowing-ties-to-decay/#comments</comments>
		<pubDate>Fri, 20 Mar 2009 00:00:45 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[weighted networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[evolution]]></category>
		<category><![CDATA[reinforcement]]></category>
		<category><![CDATA[social networking site]]></category>
		<category><![CDATA[longitudinal networks]]></category>
		<category><![CDATA[online communication]]></category>
		<category><![CDATA[online social networks]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[graphs]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=436</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2009/03/20/the-importance-of-allowing-ties-to-decay/"><img src="http://thetore.wordpress.com/files/2009/02/fig1_evolving_s.gif" alt="Evolving network" title="Evolving network" width="156" height="100" class="alignright size-full wp-image-764" /></a>Recently, a number of network dataset have been constructed from archival data (e.g., email logs) with the aim to study human interaction. This has allowed researchers to study large-scale social networks. If the archival data does not included information about the severing or weakening of ties, non-relevant interaction among people, which occurred far in the past, might be deemed relevant. This post highlights this issue and suggests imposing a lifespan on interactions to record only relevant ties with the current strength. (Technical:&#160;2/10)<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=436&subd=thetore&ref=&feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>A great number of social network datasets have been, and are, collected through surveys and interviews. For example, an advice network could be collected by asking each individual within a group to designate the people they go to for advice. Another, more rigid, method is to give each individual a list of the other people in the group and let them select the people they go to for advice (roast surveys).</p>
<p>In addition to number of biases (e.g., the informant inaccuracy bias; Bernard et al., 1984; see my <a href="http://toreopsahl.com/publications/thesis/">thesis<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> for a critic), survey instruments and direct observation methods are generally labour-intensive and difficult to administer. As a result, most networks collected using these methods are of a fairly limited size, often comprising only a few tens (e.g., Bernard et al., 1988) or hundreds (e.g., Fararo and Sunshine, 1964) of people.</p>
<p>This limitation has been overcome by using archival data sources instead of surveys. For example, the online social network of 1,899 people used in <a href="http://toreopsahl.com/2009/03/06/article-patterns-and-dynamics-of-users-behaviour-and-interaction-network-analysis-of-an-online-community/">Patterns and Dynamics of Users’ Behaviour and Interaction: Network Analysis of an Online Community<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> could only reasonably be collected since the social interactions were automatically recorded. Other social network papers using archival data sources include Kossinets and Watts (2006) and Uzzi and Spiro (2005).</p>
<p>Although archival data sources allow for larger networks to be collected, and in turn, more robust statistical analysis to be applied, a bias might be introduced into the data if information about the severing of ties is not included: <strong>archival data sources have a much better memory than individuals</strong>.¹ For a social network, this could imply that social interactions that are no longer relevant to an individual are recorded as being relevant. Moreover, the weight of ties might be overestimated. These issues do not exist when data is collected through surveys as each individual would only list current or relevant friends with the current tie strength (if they are honest that is).</p>
<p><img src="http://thetore.files.wordpress.com/2009/02/fig1_evolving.gif?w=210&#038;h=135" alt="Evolving network" title="Evolving network" width="210" height="135" class="alignright size-full wp-image-765" />In <a href="http://toreopsahl.com/2009/03/06/article-patterns-and-dynamics-of-users-behaviour-and-interaction-network-analysis-of-an-online-community/">the empirical analysis of the online social network<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>, we studied the network in two ways. First, we assumed that social ties never decay (the cumulative perspective). This assumes that if a social interaction is recorded on, for example, day 12, it will become included in the analysis from that point, and it will always remain included. Second, we followed Kossinets and Watts (2006) and imposed lifespans to the social relationships. This ensured that, if two people do not continue to communicate over time, their tie will be severed. This also applied to the weighted network: if the rate of messages sent from one person to another decreases, the tie would be weakened. </p>
<p>The length of the lifespan is crucial in determining which past events are taken into account to generate the network structure at a given point in time. By analysing which past events are relevant to the current state of the network, the length of the lifespan can be defined (Kossinets and Watts, 2006). An ill-defined lifespan will have the effect of, either breaking continuous social interactions into independent sets of interactions, or combining two separate interactions into a single one.</p>
<p>To illustrate the difference between imposing a lifespan and not imposing one, the following figure shows results from the <a href="http://toreopsahl.com/2009/03/06/article-patterns-and-dynamics-of-users-behaviour-and-interaction-network-analysis-of-an-online-community/">the online social network<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> where networks are constructed both cumulatively and with smoothing windows of 2, 3, and 6 weeks. Both panels in the figure highlight the vulnerability of network measures to the use of a smoothing window. Panel a suggests that there is only a small core of users that actively use the virtual community at the end of the observation period. An analysis of the cumulative network at that point would be heavily influenced by the majority of users that only used the network in the first 6 weeks, and would not reflect the current activities that are occurring in the community. This could bias network measures and, ultimately, the analysis. Panel b shows the evolution of one possible measure, the clustering coefficient. In particular, the clustering coefficient measured on the active core is mostly below the value found in the cumulative network.</p>
<p><img src="http://thetore.files.wordpress.com/2009/02/use_of_windows1.png?w=450&#038;h=169" alt="Use of windows: (a) active users; (b) clustering coefficient" title="Use of windows: (a) active users; (b) clustering coefficient" width="450" height="169" class="alignnone size-full wp-image-750" /></p>
<p>The above figure also highlights the sensitivity of sampling time. By using shorter lifespans, the network measures become more unstable and dependent on the time at which the observation is taken. Kossinets and Watts (2006) argued that network measures would remain stable over time. As a result, the average of the measures in a given observation period can be generalised to a longer period of time. The figure, however, suggest that, when social relationships have a lifespan, network measures are not stable. Therefore, it is difficult to infer from network snapshots stable network measures that can reflect the  network structure over a longer period of time.</p>
<p>By allowing for the severing of ties and sampling the network structure at various times over a longer period (e.g., each day in the observation period as we did for <a href="http://toreopsahl.com/2009/03/06/article-patterns-and-dynamics-of-users-behaviour-and-interaction-network-analysis-of-an-online-community/">the online social network<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>), the validity and robustness of a network analysis could be improved.<br />
_____________________<br />
¹ A number of other limitations, notably validity issues, could also be introduced into the data when using archival data sources.</p>
<p><strong>Want to test it with your data?</strong></p>
<p>First, you need to ensure that your data confirm to <a href="http://opsahl.co.uk/tnet/content/view/22/25/">the tnet standard for longitudinal networks<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>. Then you need to load it into an <a href="http://www.r-project.org/">R<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> session. </p>
<p>Second, you need to <a href="http://opsahl.co.uk/tnet/content/view/30/25/">download, install, and load tnet<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>.</p>
<p>Third, by combining the add_window_to_longitudinal_data-function and the longitudinal_data_to_edgelist-function, an instantaneous structure of the network at any point in time can be created.</p>
<pre class="brush: plain;">
# Add the severing of ties after 21 days
net &lt;- add_window_to_longitudinal_data(net, window=21)

# Create the static network on February 20, 2009 at 7am
net0902200700 &lt;- longitudinal_data_to_edgelist(net[net[,1]&lt;=&quot;2009-02-20 07:00:00&quot;,])
</pre>
<p>Then you use the other functions to study the network. </p>
<pre class="brush: plain;">
# Average degree
tmp &lt;- degree_w(net0902200700)
sum(tmp[,&quot;degree&quot;])/length(which(tmp[,&quot;degree&quot;]!=0))

# The global clustering coefficient
clustering_w(dichotomise(net0902200700))
</pre>
<p><strong>References</strong></p>
<p>Bernard, H. R., Killworth, P. D., Kronenfeld, D., Sailer, L. D., 1984. The problem of informant accuracy: the validity of retrospective data. Annual Review of Anthropology 13, 495-517. </p>
<p>Bernard, H. R., Kilworth, P. D., Evans, M. J., McCarty, C., Selley, G. A., 1988. Studying social relations cross-culturally. Ethnology 27 (2), 155-179. </p>
<p>Fararo, T. J., Sunshine, M., 1964. A Study of a Biased Friendship Network. Syracuse University Press, Syracuse, NY. </p>
<p>Kossinets, G., Watts, D. J., 2006. Empirical analysis of an evolving social network. Science 311, 88-90. </p>
<p>Uzzi, B., Spiro, J., 2005. Collaboration and creativity: The small world problem. American Journal of Sociology 111, 447-504. </p>
<div class="knobcite">Please cite or link to this post if you use it.</div>
<br />Posted in Network thoughts Tagged: arcs, complex networks, edges, evolution, graphs, Links, longitudinal networks, network, online communication, online social networks, reinforcement, social network analysis, social networking site, strength of ties, ties, valued networks, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/thetore.wordpress.com/436/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/thetore.wordpress.com/436/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/thetore.wordpress.com/436/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/thetore.wordpress.com/436/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/thetore.wordpress.com/436/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/thetore.wordpress.com/436/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/thetore.wordpress.com/436/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/thetore.wordpress.com/436/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/thetore.wordpress.com/436/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/thetore.wordpress.com/436/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/thetore.wordpress.com/436/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/thetore.wordpress.com/436/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/thetore.wordpress.com/436/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/thetore.wordpress.com/436/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=436&subd=thetore&ref=&feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/03/20/the-importance-of-allowing-ties-to-decay/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://thetore.files.wordpress.com/2009/02/fig1_evolving.gif" medium="image">
			<media:title type="html">Evolving network</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://thetore.files.wordpress.com/2009/02/use_of_windows1.png" medium="image">
			<media:title type="html">Use of windows: (a) active users; (b) clustering coefficient</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />
	</item>
		<item>
		<title>Article: Patterns and Dynamics of Users&#8217; Behaviour and Interaction: Network Analysis of an Online Community</title>
		<link>http://toreopsahl.com/2009/03/06/article-patterns-and-dynamics-of-users-behaviour-and-interaction-network-analysis-of-an-online-community/</link>
		<comments>http://toreopsahl.com/2009/03/06/article-patterns-and-dynamics-of-users-behaviour-and-interaction-network-analysis-of-an-online-community/#comments</comments>
		<pubDate>Fri, 06 Mar 2009 00:00:19 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[weighted networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[clustering coefficient]]></category>
		<category><![CDATA[shortest distance]]></category>
		<category><![CDATA[embeddedness]]></category>
		<category><![CDATA[online]]></category>
		<category><![CDATA[communication]]></category>
		<category><![CDATA[evolution]]></category>
		<category><![CDATA[reciprocation]]></category>
		<category><![CDATA[reinforcement]]></category>
		<category><![CDATA[gregariousness]]></category>
		<category><![CDATA[popularity]]></category>
		<category><![CDATA[reachability]]></category>
		<category><![CDATA[hubs]]></category>
		<category><![CDATA[gender differences]]></category>
		<category><![CDATA[social networking site]]></category>
		<category><![CDATA[community]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[graphs]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=659</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2009/03/06/article-patterns-and-dynamics-of-users-behaviour-and-interaction-network-analysis-of-an-online-community/"><img src="http://thetore.wordpress.com/files/2009/11/af-page20-fixed_150.png" alt="Online Social Network" title="Online Social Network" width="150" height="151" class="alignright size-full wp-image-2059" /></a>A paper called "Patterns and Dynamics of Users' Behaviour and Interaction: Network Analysis of an Online Community" that I have co-authored will be published in the Journal of the American Society for Information Science and Technology (JASIST). In this paper, we studied the evolution of a variety of properties in an online community, including how users create, reciprocate, and deepen relationships with one another, variations in users’ gregariousness and popularity, reachability and typical distances among users, and the degree of local redundancy in the community. <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=659&subd=thetore&ref=&feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>A paper called Patterns and Dynamics of Users&#8217; Behaviour and Interaction: Network Analysis of an Online Community that I have co-authored with <a href="http://www.busman.qmul.ac.uk/staff/staff.php?p.panzarasa@qmul.ac.uk" target="_blank">Pietro Panzarasa<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> and <a href="http://www.casos.cs.cmu.edu/bios/carley/carley.html" target="_blank">Kathleen M. Carley<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> will be published in the May issue (estimated) of the <a href="http://www.asis.org/jasist.html" target="_blank">Journal of the American Society for Information Science and Technology (JASIST)<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>. Unfortunately, the copyright agreement prevents me from uploading the paper to this blog. However, if you have access to JASIST, you can <a href="http://www3.interscience.wiley.com/journal/117946195/grouphome/home.html" target="_blank">download the paper directly<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>. Otherwise, if you write me an <a href="http://toreopsahl.com/about/">email<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>, I can send it to you.</p>
<p><strong>Abstract</strong></p>
<p>This research draws on longitudinal network data from an online community to examine patterns of users’ behavior and social interaction and infer the processes underpinning dynamics of system use. The online community represents a prototypical example of a complex evolving social network in which connections between users are established over time by online messages. We study the evolution of a variety of properties since the inception of the system, including how users create, reciprocate, and deepen relationships with one another, variations in users’ gregariousness and popularity, reachability and typical distances among users, and the degree of local redundancy in the system. Results indicate that the system is a &#8220;small world&#8221;  characterized by the emergence, in its early stages, of a hub-dominated structure with highly heterogeneous users’ behavior. We investigate whether hubs are responsible for holding the system together and facilitating information flow, examine first-mover advantages underpinning users’ ability to rise to system prominence, and uncover gender differences in users’ gregariousness, popularity, and local redundancy. We discuss the implications of the results for research on system use and evolving social networks, and for a host of applications, including information diffusion, communities of practice, and the security and robustness of information systems. </p>
<br />Posted in Network thoughts Tagged: clustering coefficient, communication, community, complex networks, embeddedness, evolution, gender differences, graphs, gregariousness, hubs, network, online, popularity, reachability, reciprocation, reinforcement, shortest distance, social network analysis, social networking site, valued networks, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/thetore.wordpress.com/659/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/thetore.wordpress.com/659/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/thetore.wordpress.com/659/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/thetore.wordpress.com/659/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/thetore.wordpress.com/659/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/thetore.wordpress.com/659/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/thetore.wordpress.com/659/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/thetore.wordpress.com/659/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/thetore.wordpress.com/659/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/thetore.wordpress.com/659/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/thetore.wordpress.com/659/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/thetore.wordpress.com/659/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/thetore.wordpress.com/659/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/thetore.wordpress.com/659/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=659&subd=thetore&ref=&feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/03/06/article-patterns-and-dynamics-of-users-behaviour-and-interaction-network-analysis-of-an-online-community/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />
	</item>
		<item>
		<title>Betweenness in weighted networks</title>
		<link>http://toreopsahl.com/2009/02/20/betweenness-in-weighted-networks/</link>
		<comments>http://toreopsahl.com/2009/02/20/betweenness-in-weighted-networks/#comments</comments>
		<pubDate>Fri, 20 Feb 2009 00:00:13 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[weighted networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[local]]></category>
		<category><![CDATA[shortest distance]]></category>
		<category><![CDATA[betweenness]]></category>
		<category><![CDATA[centrality]]></category>
		<category><![CDATA[shortest path]]></category>
		<category><![CDATA[closeness]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[graphs]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=464</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2009/02/20/betweenness-in-weighted-networks/"><img src="http://thetore.wordpress.com/files/2009/01/fig1_betweenness_s3.gif" alt="Betweenness example" title="Betweenness example" width="146" height="100" class="alignright size-full wp-image-623" /></a>This post highlights a generalisation of Freeman's (1978) betweenness measure to weighted networks implicitly introduced by Brandes (2001) when he developed an algorithm for calculating betweenness faster. Betweenness is a measure of the extent to which a node funnels transactions among all the other nodes in the network. By funnelling the transactions, a node can broker. This could be by taking a cut (e.g. Ukraine controls most gas pipelines from Russia to Europe) or distorting the information being transmitted to its advantage. (Technical:&#160;8/10)<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=464&subd=thetore&ref=&feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>In a network, certain nodes occupy advantageous positions, whereas others must rely on these nodes to, for example, exchange knowledge. The extent to which a node is part of the transactions among other nodes can be studied using Freeman&#8217;s (1978) betweenness measure. This measure has been applied to a variety of settings, such as <a href="http://www.thenetworkthinker.com/2009/01/power-in-economic-networks.html" target="_blank">the Russian-Ukraine gas dispute<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>.</p>
<p>Formally Freeman (1978) defined the betweenness of a focal node <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' /> to be:</p>
<div align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cfrac%7Bg_%7Bjk%7D%28i%29%7D%7Bg_%7Bjk%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=3' alt='\frac{g_{jk}(i)}{g_{jk}}' title='\frac{g_{jk}(i)}{g_{jk}}' class='latex' /></div>
<p>where <img src='http://l.wordpress.com/latex.php?latex=j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='j' title='j' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> are all the other nodes in the network, <img src='http://l.wordpress.com/latex.php?latex=g_%7Bjk%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g_{jk}' title='g_{jk}' class='latex' /> is the number of shortest paths between node <img src='http://l.wordpress.com/latex.php?latex=j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='j' title='j' class='latex' /> and node <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=g_%7Bjk%7D%28i%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g_{jk}(i)' title='g_{jk}(i)' class='latex' /> is the number of those paths that go through the focal node.</p>
<p><img src="http://thetore.files.wordpress.com/2009/01/fig1_betweenness3.gif?w=242&#038;h=166" alt="Betweenness example" title="Betweenness example" width="242" height="166" class="alignright size-full wp-image-624" />In the sample network on the right, if the ties did not have a weight assigned to them, the grey lines represent the 9 shortest paths in the network that pass through intermediate nodes. The highlighted node is an intermediate on 8 of these paths. This will give this node a betweenness score of 8.</p>
<p>Brandes (2001) proposed a new algorithm for calculating betweenness faster. In addition to reducing the time, this algorithm also relaxed the assumption that ties had to be either present or absent (i.e. a binary network), and allowed betweenness to be calculated on weighted networks. This generalisation takes into account, that in weighted networks, the transaction between two nodes might be quicker along paths with more intermediate nodes that are strongly connected than paths with fewer weakly-connected intermediate nodes. This is due to the fact that the strongly connected intermediate nodes have, for example, more frequent contact than the weakly connected ones. For example, the tie between the top-left node and the focal node in the sample network above has four times the strength of the tie between the bottom-left node and the focal node. This could mean that top-left node has more frequent contact with the focal node than the bottom-left node has. In turn, this could imply that top-left node might give the focal node a piece of information (or a disease) four times quicker than the bottom-left node. If we are studying the nodes that are most likely to be funnelling information or diseases in a network, then the speed at which it travels, and routes that it takes, are clearly affected by the weights.</p>
<p><strong>This post highlights the generalisation of Freeman&#8217;s (1978) betweenness measure to weighted networks by Brandes (2001).</strong> This generalisation is separate from the flow measure proposed by Freeman et al. (1991), which might be more appropriate in certain settings. The generalisation highlighted in this post follows the generalisation of closeness to weighted networks by Newman (2001) who built on work by Dijkstra (1959). Newman&#8217;s (2001) generalisation was introduced in detail in my previous post: <a href="http://toreopsahl.com/2009/01/09/average-shortest-distance-in-weighted-networks/">a method for calculating the average shortest distance in weighted networks<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>. Nevertheless, I will quickly reiterate Dijkstra&#8217;s (1959) and Newman&#8217;s (2001) work here:
<ol>
<li>Dijkstra (1959) proposed an algorithm to find the shortest paths in a network where the weights could be considered <em>costs</em>. The least costly path connecting two nodes was the shortest path between them (e.g. a network of roads where each leg of road has a time-cost assign to it). </li>
<li>Newman (2001)  transformed the positive weights in a collaboration network into costs by inverting them (dividing 1 by the weight). </li>
<li>Based on the inverted weights, Newman (2001) applied Dijkstra&#8217;s algorithm and found the least-costly paths among all nodes.</li>
<li>The total cost of the paths from a node to all others was a measure of farness: the higher the number, the more it cost a node to reach all other nodes. To create a closeness measure, Newman (2001) followed Freeman (1978) and inverted the numbers (1 divided by the farness). Thus, a high farness was transformed into a low closeness, and a low farness was transformed into a high closeness. </li>
</ol>
<p>The identification of the shortest paths in weighted networks outlined in the first three steps above can also be used when identifying the nodes which funnel transactions among other nodes in weighted networks. If we assume that transactions in a weighted network follow the shortest paths identified by Dijkstra&#8217;s algorithm instead of the one with the least number of intermediate nodes, then the number of shortest paths that pass through a node might change.</p>
<p><strong>Want to test it with your data?</strong></p>
<p><img src="http://thetore.files.wordpress.com/2008/12/geodesic-n1.png?w=271&#038;h=177" alt="geodesic-n1" title="geodesic-n1" width="271" height="177" class="alignright size-full wp-image-194" />First, you need to download, install, and load <a href="http://opsahl.co.uk/tnet/" target="_blank">tnet<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> in <em>R</em>. Then, you need to load your data accordingly to the tnet standard. The standard is <a href="http://opsahl.co.uk/tnet/content/view/22/25/" target="_blank">described in detail<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> on tnet&#8217;s website along with examples on how to transfer data from other software packages, such as <a href="http://opsahl.co.uk/tnet/content/view/29/25/" target="_blank">UCINET and Pajek<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>. In addition, you can manually enter the weighted edgelist. For example, this sample network can be loaded using the following command:</p>
<pre class="brush: plain;">
net &lt;- cbind(
i=c(1,1,2,2,2,2,3,3,4,5,5,6),
j=c(2,3,1,3,4,5,1,2,2,2,6,5),
w=c(4,2,4,1,4,2,2,1,4,2,1,1))
</pre>
<p>Freeman&#8217;s (1978) binary betweenness measure can be calculated by running the betweenness_w and dichotomise-functions: </p>
<pre class="brush: plain;">
betweenness_w(dichotomise(net))[[1]]
     vertex betweenness
[1,]      1           0
[2,]      2           8
[3,]      3           0
[4,]      4           0
[5,]      5           4
[6,]      6           0
</pre>
<p>To see the outcome of the proposed algorithm in this post, you simply need to run the betweenness_w-function alone: </p>
<pre class="brush: plain;">
betweenness_w(net)[[1]]
     vertex betweenness
[1,]      1           4
[2,]      2           8
[3,]      3           0
[4,]      4           0
[5,]      5           4
[6,]      6           0
</pre>
<p>Now, node 1 (A) has gotten betweenness score of 4 as well. This is because the indirect path from node B to node C through A is used instead of the direct connection. </p>
<p>Note: The betweenness_w-function relies upon the sp.between-function in the RBGL package. The sp.between-function currently does not find more than one shortest path connecting two nodes even if an equally short path exists. This limitation affects the betweenness_w-function. </p>
<p><em>UPDATE</em>: There is now a <a href="http://www.boost.org/doc/libs/1_37_0/libs/graph/doc/betweenness_centrality.html" target="_blank">boost C++ implementation of Brandes&#8217; (2001) algorithm<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>. I have not tested it, but as far as I can tell, it should be more efficient and without this limitation. </p>
<p><em>UPDATE</em>: The <a href="http://www.boost.org/doc/libs/1_37_0/libs/graph/doc/betweenness_centrality.html" target="_blank">boost C++ implementation of Brandes&#8217; (2001) algorithm<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> will be included in <a href="http://opsahl.co.uk/tnet" target="_blank">tnet<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> from version 0.0.6. This algorithm finds multiple paths if they have <em>exactly</em> the same distance. For example, if one path is found over the direct tie with a weight of 1 (distance = 1/1 = 1) and a second path is through an intermediary node with two ties with weights of 2 (distance = 1/2 + 1/2 = 1), the two paths have <em>exactly</em> the same distance. However, if there is a third path through two intermediaries with three ties with weights of 3 (distance = 1/3 + 1/3 + 1/3), it does not <em>exactly</em> equal 1 as computers read these values as 0.3333333 and the sum of these values is 0.9999999. Therefore, this path is considered shorter than the other two paths (distance = 1). </p>
<p><strong>References</strong></p>
<p>Brandes, U., 2001. A Faster Algorithm for Betweenness Centrality. Journal of Mathematical Sociology 25, 163-177.</p>
<p>Dijkstra, E. W., 1959. A note on two problems in connexion with graphs. Numerische Mathematik 1, 269-271.</p>
<p>Freeman, L. C., 1978. Centrality in social networks: Conceptual clarification. Social Networks 1, 215-239.</p>
<p>Freeman, L. C., Borgatti, S. P., White, D. R., 1991. Centrality in valued graphs: A measure of betweenness based on network flow. Social Networks 13 (2), 141-154. </p>
<p>Newman, M. E. J., 2001. Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality. Physical Review E 64, 016132. </p>
<div class="knobcite">Please cite or link to this post if you use it.</div>
<br />Posted in Network thoughts Tagged: betweenness, centrality, closeness, complex networks, graphs, local, network, shortest distance, shortest path, social network analysis, strength of ties, valued networks, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/thetore.wordpress.com/464/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/thetore.wordpress.com/464/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/thetore.wordpress.com/464/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/thetore.wordpress.com/464/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/thetore.wordpress.com/464/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/thetore.wordpress.com/464/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/thetore.wordpress.com/464/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/thetore.wordpress.com/464/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/thetore.wordpress.com/464/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/thetore.wordpress.com/464/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/thetore.wordpress.com/464/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/thetore.wordpress.com/464/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/thetore.wordpress.com/464/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/thetore.wordpress.com/464/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=464&subd=thetore&ref=&feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/02/20/betweenness-in-weighted-networks/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://thetore.files.wordpress.com/2009/01/fig1_betweenness3.gif" medium="image">
			<media:title type="html">Betweenness example</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://thetore.files.wordpress.com/2008/12/geodesic-n1.png" medium="image">
			<media:title type="html">geodesic-n1</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />
	</item>
		<item>
		<title>Operationalisation of tie strength in social networks</title>
		<link>http://toreopsahl.com/2009/02/06/operationalisation-of-tie-strength/</link>
		<comments>http://toreopsahl.com/2009/02/06/operationalisation-of-tie-strength/#comments</comments>
		<pubDate>Fri, 06 Feb 2009 00:00:47 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[strength of nodes]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[weighted networks]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=427</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2009/02/06/operationalisation-of-tie-strength/"><img src="http://thetore.wordpress.com/files/2009/01/triangle_strength_weight1_s.png" alt="Ties&#39; strength and weight" title="Ties&#39; strength and weight" width="190" height="100" class="alignright size-full wp-image-536" /></a>The method used to operationalise ties' strength into weights affects the outcomes of weighted networks measures. Simply assigning 1, 2, and 3 to three different levels of tie strength might not be appropriate as this scale might misrepresent the actually difference among the three levels (using an ordinal scale). In this post, I highlight issues with collecting weighted social network data from surveys. (Technical:&#160;1/10)
<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=427&subd=thetore&ref=&feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Ties in many empirical networks have naturally a strength associated with them. For example, in social networks, some contacts are friends, whereas others are simply acquaintances. Granovetter (1973, pg. 1361) argued that the strength of a social tie is a function of its duration, emotional intensity, intimacy, and exchange of services. For non-social networks, the strength often reflects the function performed by the ties, e.g. carbon flow (mg/m²/day) between species in food webs (Luczkowich et al., 2003; Nordlund, 2007) or the number of synapses and gap junctions in a neural networks (Watts and Strogatz, 1998). In infrastructure and information networks, variations in the strength of a tie depend on the flow of information, energy, people, and goods along that tie (Barrat et al., 2004; Guimera et al., 2005; Pastor-Satorras and Vespignani, 2004).</p>
<p><img src="http://thetore.files.wordpress.com/2009/01/triangle_strength_weight1.png?w=268&#038;h=141" alt="Tie strength and weight" title="Tie strength and weight" width="268" height="141" class="alignright size-full wp-image-487" />The way tie strength is recorded as weights (<a href="http://en.wikipedia.org/wiki/Operationalization" target="_blank">operationalisation<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> of tie strength) can introduce subjective biases in the analysis of weighted networks that might invalidate the analysis. Therefore, a method that retains elements from the research question and setting should be applied as it affects the outcomes of weighted network measures. </p>
<p>A precise definition of tie strength often exists in non-social networks (e.g. the number of synapses and gap junctions), making the operationalisation of tie strength into weights straight forward. Conversely, in social networks, this tend not to be the case. Most social networks are collected through survey instruments (Wasserman and Faust, 1994). For example, a friendship network could be collected by asking people to designate others from a list or roaster as &#8220;know this person&#8217;s name&#8221;; &#8220;acquaintace&#8221;, or &#8220;friend&#8221;. By collecting social relations in such a way, two issues exist. </p>
<p>First, <strong>what is considered an &#8220;acquaintance&#8221; or a &#8220;friend&#8221; can differ considerably from one person to another</strong>. This bias is usually referred to as the informant inaccuracy bias (Bernard et al., 1984). A possible way of overcoming this bias is to design questions that are targeting the elements of a social tie (duration, emotional intensity, intimacy, and exchange of services; Granovetter, 1973). A question that does this is: </p>
<blockquote><p>Please indicate how often you have turned to this person for information or advice on work-related topics in the past three months.</p></blockquote>
<p>with the ordinal scale: 0, Do not know this person; 1, Never; 2, Seldom; 3, Sometimes; 4, Often; 5, Very Often. (This question was used by Cross and Parker, 2004, to collect an advice network.)</p>
<p>Second, by using an ordinal scale (associating 0, 1, 2, 3, 4, and 5 to the six levels of relationship), <strong>the differences among the levels might be misrepresented</strong>. In addition, with the ordinal scale used in the above question, answers are subjected to the inevitable bias that comes from the different ways in which different people assess duration. One possible way to overcome the scale issues is to design a scale that is closer to a ratio scale and reflects duration more accurately. For example, a better scale for the above question could be: 0, Do not know this person/Never; 1, Once; 3, Monthly; 6, Bi-weekly; 12, Weekly. </p>
<p>Researchers should carefully design questions and, in turn, the scale of weights, to overcome these issues. Marsden and Campbell (1984) conducted a comparative analysis of Granovetter&#8217;s (1973) four criteria for defining tie weights. They found that emotional intensity was a better indicator of strength of friendship than the other three criteria. Nevertheless, I believe that researchers should choose the appropriate measures of tie strength depending on the nature of the nodes and ties and, more generally, on the context of the research setting. In turn, such a scale is likely to yield a network dataset that is richer in information, more robust against potential inaccuracies emanating from subjective judgments, and more suitable to investigations that rely on weighted network measures.</p>
<p><strong>References</strong></p>
<p>Barrat, A., Barthelemy, M., Pastor-Satorras, R., Vespignani, A., 2004. The architecture of complex weighted networks. Proceedings of the National Academy of Sciences 101 (11), 3747-3752. <a href="http://arxiv.org/abs/cond-mat/0311416" target="_blank">arXiv:cond-mat/0311416</a> </p>
<p>Bernard, H. R., Killworth, P. D., Kronenfeld, D., Sailer, L. D., 1984. The problem of informant accuracy: the validity of retrospective data. Annual Review of Anthropology 13, 495-517. </p>
<p>Cross, R., Parker, A., 2004. The Hidden Power of Social Networks. Harvard Business School Press, Boston, MA. </p>
<p>Granovetter, M., 1973. The strength of weak ties. American Journal of Sociology 78, 1360-1380.</p>
<p>Guimera, R., Mossa, S., Turtschi, A., Amaral, L. A. N., 2005. The worldwide air transportation network: Anomalous centrality, community structure, and cities&#8217; global roles. Proceedings of the National Academy of Sciences 102, 7794-7799. <a href="http://arxiv.org/abs/cond-mat/0312535" target="_blank">arXiv:cond-mat/0312535</a></p>
<p>Luczkowich, J. J., Borgatti, S. P., Johnson, J. C., Everett, M. G., 2003. Defining and measuring trophic role similarity in food webs using regular equivalence. Journal of Theoretical Biology 220, 303321. </p>
<p>Marsden, P.V., Campbell, K.E., 1984. Measuring Tie Strength. Social Forces 63, 482-501.</p>
<p>Nordlund, C., 2007. Identifying regular blocks in valued networks: A heuristic applied to the St. Marks carbon flow data, and international trade in cereal products. Social Networks 29 (1), 59-69. </p>
<p>Pastor-Satorras, R., Vespignani, A., 2004. Evolution and Structure of the Internet. Cambridge University Press, New York, NY. </p>
<p>Wasserman, S., Faust, K., 1994. Social Network Analysis. Cambridge University Press, Cambridge, MA. </p>
<p>Watts, D. J., Strogatz, S. H., 1998. Collective dynamics of &#8220;small-world&#8221; networks. Nature 393, 440-442. </p>
<div class="knobcite">Please cite or link to this post if you use it.</div>
<br />Posted in Network thoughts Tagged: arcs, complex networks, edges, graphs, Links, network, social network analysis, strength of nodes, ties, valued networks, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/thetore.wordpress.com/427/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/thetore.wordpress.com/427/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/thetore.wordpress.com/427/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/thetore.wordpress.com/427/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/thetore.wordpress.com/427/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/thetore.wordpress.com/427/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/thetore.wordpress.com/427/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/thetore.wordpress.com/427/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/thetore.wordpress.com/427/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/thetore.wordpress.com/427/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/thetore.wordpress.com/427/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/thetore.wordpress.com/427/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/thetore.wordpress.com/427/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/thetore.wordpress.com/427/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=427&subd=thetore&ref=&feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/02/06/operationalisation-of-tie-strength/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2009/01/triangle_strength_weight1.png" medium="image">
			<media:title type="html">Tie strength and weight</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />
	</item>
		<item>
		<title>Weighted local clustering coefficient</title>
		<link>http://toreopsahl.com/2009/01/23/weighted-local-clustering-coefficient/</link>
		<comments>http://toreopsahl.com/2009/01/23/weighted-local-clustering-coefficient/#comments</comments>
		<pubDate>Fri, 23 Jan 2009 00:00:22 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[clustering coefficient]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[embeddedness]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[local]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[weighted networks]]></category>

		<guid isPermaLink="false">http://thetore.wordpress.com/?p=312</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2009/01/23/weighted-local-clustering-coefficient/"><img src="http://thetore.wordpress.com/files/2009/01/triplet_small21.png" alt="Triplet" title="Triplet" width="86" height="100" class="alignright size-full wp-image-534" /></a>The generalisation of the local clustering coefficient to weighted networks by Barrat et al. (2004) considers the value of a triplet to be the average of the weights attached to the two ties that make up the triplet. In this post, I suggest three additional methods for defining the triplet value. (Technical:&#160;6/10)
<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=312&subd=thetore&ref=&feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><img src="http://thetore.files.wordpress.com/2008/12/triplet.gif?w=118&#038;h=140" alt="Triplet" title="Triplet" width="118" height="140" class="alignright size-full wp-image-351" />A fundamental measure that has long received attention in both theoretical and empirical research is the clustering coefficient. This measure assesses the degree to which nodes tend to cluster together. Evidence suggests that in most real-world networks, and in particular social networks, nodes tend to create tightly knit groups characterised by a relatively high density of ties (Feld, 1981; Friedkin, 1984; Heider, 1946; Holland and Leinhardt, 1970; Louch, 2000; Snijders, 2001; Snijders et al., 2006; Watts and Strogatz, 1998). In real-world networks, this likelihood tends to be greater than the average probability of a tie randomly established between two nodes (Holland and Leinhardt, 1971; Wasserman and Faust, 1994; Watts and Strogatz, 1998).</p>
<p>Two versions of this measure exist: the global and the local. The global version was designed to give an overall indication of the clustering in the network, whereas the local gives an indication of the embeddedness of single nodes. This post is concerned with the latter version, but I will briefly introduce the former quickly here.</p>
<p>The global clustering coefficient is based on triplets of nodes. A triplet is three nodes that are connected by either two (open triplet) or three (closed triplet) undirected ties. A triangle consists of three closed triplets, one centred on each of the nodes. The global clustering coefficient is the number of closed triplets (or <em>3  x triangles</em>) over the total number of triplets (both open and closed). The first attempt to measure it was made by Luce and Perry (1949). This measure gives an indication of the clustering in the whole network (global), and can be applied to both undirected and directed networks (often called transitivity, see Wasserman and Faust, 1994, page 243). However, it cannot be applied to weighted networks. </p>
<p><img src="http://thetore.files.wordpress.com/2008/12/fig1.png?w=271&#038;h=177" alt="Sample network" title="Sample network" width="271" height="177" class="alignright size-full wp-image-160" />For this sample network, the binary global clustering coefficient would be 3 over 9, or 0.33. The closed triplets are B&rarr;A&larr;C; A&rarr;B&larr;C; A&rarr;C&larr;B; whereas the open triplets are A&rarr;B&larr;D; A&rarr;B&larr;E; C&rarr;B&larr;D; C&rarr;B&larr;E; D&rarr;B&larr;E; B&rarr;E&larr;F. As can be seen from the sample network, the strongest ties are inside the triangle. This is not captured by the binary coefficient as the weights are not considered. In the second chapter of my thesis, I proposed a generalisation of the global clustering coefficient to weighted networks (both undirected and directed). For this network, the outcome would be 0.44. The increase from the binary coefficient (0.33) reflects the tendency of strong ties to be part of closed triplets.</p>
<p>The local clustering coefficient is based on ego network density or local density (Scott, 2000; Uzzi and Spiro, 2005; Watts and Strogatz, 1998). For a node, this is the fraction of the number of present ties over the total number of possible ties between the node&#8217;s neighbours. Therefore, the outcome ranges between 0 and 1: 0 if no ties exist between the neighbours, and 1 if all possible ties exists. For undirected networks, the <em>local</em> clustering coefficient is formally defined as:</p>
<p>
<div align="center"><img src='http://l.wordpress.com/latex.php?latex=C+%3D+%5Cfrac%7B%5Cmbox%7Bactual+ties+between+a+node%27s+neighbours%7D%7D%7B%5Cmbox%7Bpossible+ties+between+a+node%27s+neighbours%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=1' alt='C = \frac{\mbox{actual ties between a node&#039;s neighbours}}{\mbox{possible ties between a node&#039;s neighbours}}' title='C = \frac{\mbox{actual ties between a node&#039;s neighbours}}{\mbox{possible ties between a node&#039;s neighbours}}' class='latex' /></div>
</p>
<p>The number of possible ties between a node&#8217;s neighbours can be calculated by multiplying the number of neighbours by the number of neighbours minus 1. </p>
<p>This formula can be re-written in terms of the adjacency matrix <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> whose entries <img src='http://l.wordpress.com/latex.php?latex=a_%7Bi%2Cj%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a_{i,j}' title='a_{i,j}' class='latex' /> is 1 if node <em>i</em> is connected with node <em>j</em> and 0 otherwise:</p>
<p><img src="http://thetore.files.wordpress.com/2008/12/lc_math.gif?w=250&#038;h=69" alt="Local clustering coefficient (adjacency matrix)" title="Local clustering coefficient (adjacency matrix)" width="250" height="69" class="aligncenter size-full wp-image-322" /></p>
<p>The main advantage of this version of the clustering coefficient is that a score is assigned to each node (local measure). This enables researchers to study associations between the coefficient and other nodal properties (e.g. Panzarasa et al., 2009) and perform regression analyses with the observations being the nodes of a network (e.g. Uzzi and Lancaster, 2004). However, this version of the clustering coefficient suffers from three major limitations. First, its outcome does not take into consideration the weight of the ties in the network. Second, the local clustering coefficient cannot be calculated on directed networks. Third, a negatively correlation with degree is often found in real-world networks. This is due to the fact that it is &#8220;easier&#8221; for a node with two neighbours to get a score of 1 (only one tie is need) than for a node with 10 neighbours (45 ties must be present). </p>
<p><img src="http://thetore.files.wordpress.com/2008/12/fig1.png?w=271&#038;h=177" alt="Sample network" title="Sample network" width="271" height="177" class="alignright size-full wp-image-160" />For the sample network, nodes A and C would get a score of 1 since all possible ties among their neighbours are present, whereas node E would get a score of 0 since none of the possible ties among its neighbours are present. Nodes D and F would not get a value since the number of possible ties among their neighbours is 0 (if a node has less than 2 neighbours, the coefficient is undefined). For node B, one out of six possible ties is present, so the coefficient is 1/6 or 0.1667.</p>
<p>Barrat et al. (2004) proposed a generalisation of the local clustering coefficient to weighted networks by taking the weights explicitly into account. First, they assigned a triplet value to each triplet in the network based on the <a href="http://en.wikipedia.org/wiki/Arithmetic_mean" target="_blank">arithmetic mean<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> (normal average). Then for each node, they summed the value of the closed triplets that were centred on the node and divided it by the total value of all triplets centred on the node. Formally, they proposed the following equation:</p>
<p><img src="http://thetore.files.wordpress.com/2008/12/lcw_math.gif?w=424&#038;h=70" alt="Weighted local clustering coefficient" title="Weighted local clustering coefficient" width="424" height="70" class="aligncenter size-full wp-image-321" /></p>
<p>For the above sample network, all the nodes, except node B, would get the same outcome as they get the extreme values. Node B would get a value of 0.242. The difference between this value and the one obtain in the binary analysis (0.167) is a reflection of node B&#8217;s strong ties being directed towards neighbours that are themselves connected.</p>
<p><strong>However, the arithmetic mean is insensitive to differences between the two tie weights</strong> as an extreme value can have a major impact on the triplet value. Therefore, I propose three additional methods for defining the triplet value. These were originally used in my proposed generalisation of the global clustering coefficient. </p>
<p>First, it can be defined as the <a href="http://en.wikipedia.org/wiki/Arithmetic-geometric_mean" target="_blank">geometric mean<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> of the weights attached to the two ties. This method overcomes some of the sensitivity issues as a triplet made up by a tie with a low value and a tie with a high value will have a lower value than if the arithmetic mean were used. </p>
<p>Second, it can be defined as the maximum value of the two weights. This method offsets a low tie weight and makes a triplet with a strong tie and a weak tie equal to a triplet with two strong ties. </p>
<p>Third, it can be defined as the minimum value of the two weights. This method offsets a high tie weight by making triplets with a strong tie and a weak tie equal to triplets with two weak ties. </p>
<p>The table below highlights the differences between the methods of defining the triplet value (adopted from Chapter 2 of my thesis). </p>
<p><img src="http://thetore.files.wordpress.com/2008/12/triplet_value.gif?w=455&#038;h=219" alt="Triplet value" title="Triplet value" width="455" height="219" class="aligncenter size-full wp-image-327" /></p>
<p>It is vital to use an appropriate method for defining the value of a triplet as this impacts on the outcome of the coefficient. The method should be chosen based on the research question as well as the way in which the strength of the ties are operationalised into weights. For example, in a network where the weights correspond to the level of flow, and a weak tie would act as a bottleneck, the minimum method might be most appropriate to use. </p>
<p><strong>Want to test it with your data?</strong></p>
<p>First, you need to create an edgelist of your network (<a href="http://opsahl.co.uk/tnet" target="_blank">see the tnet documentation<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>). The edgelist for the sample network above can be entered into <em>R</em> using the following command:</p>
<pre class="brush: plain;">
net &lt;- cbind(
i=c(1,1,2,2,2,2,3,3,4,5,5,6),
j=c(2,3,1,3,4,5,1,2,2,2,6,5),
w=c(4,2,4,4,1,2,2,4,1,2,1,1))
</pre>
<p>Then you can run the following code to calculate the local clustering coefficient using the four definitions for triplet value.</p>
<pre class="brush: plain;">
# Load tnet
library(tnet)

# Calculated all measures (including the binary one)
clustering_w_local(net, measure=c(&quot;am&quot;,&quot;gm&quot;,&quot;ma&quot;,&quot;mi&quot;,&quot;bi&quot;))
</pre>
<p>The output should be something along these lines (am is the weighted local clustering coefficient (wlcc) using the arithmetic mean; gm is the wlcc using the geometric mean; ma is the wlcc using the maximum method; mi is the wlcc using the minimum method; and bi is the binary version of the wlcc):</p>
<pre class="brush: plain;">
     node        am        gm        ma        mi        bi
[1,]    1 1.0000000 1.0000000 1.0000000 1.0000000 1.0000000
[2,]    2 0.2424242 0.2654092 0.1818182 0.3636364 0.1666667
[3,]    3 1.0000000 1.0000000 1.0000000 1.0000000 1.0000000
[4,]    4       NaN       NaN       NaN       NaN       NaN
[5,]    5 0.0000000 0.0000000 0.0000000 0.0000000 0.0000000
[6,]    6       NaN       NaN       NaN       NaN       NaN
</pre>
<p>Nodes 4 and 6 do not have a value as they have less than 2 neighbours. Nodes 1 and 3 get a value of 1 because all their neighbours are connected, whereas node 5 get a value of 0 because none of its neighbours are connected. </p>
<p>Node 2 is the interesting node as it does not get the extreme values. The local clustering coefficient obtained by using the four definitions of triplet differ. The choice of which definition to use should be based on the research question. Moreover, within a regression framework, the different values might be used as robustness checks.</p>
<p><strong>Empirical test</strong></p>
<p>I have also tested this extension on the network of the <a href="http://cxnets.googlepages.com/usairtransportationnetwork" target="_blank">500 busiest commercial airports in the US linked together by scheduled flights<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> (Thanks <a href="http://vcolizza.googlepages.com/" target="_blank">Vittoria<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> for uploading this network!).</p>
<p>The code for downloading and calculating the measures is:</p>
<pre class="brush: plain;">
# Load tnet
library(tnet)

# Load network
## Please cite Colizza et al. (2007) doi:10.1038/nphys560
data(USairport.n500)
net &lt;- USairport.n500.net

# Since this network has are very large tie weights in this network, the geometric method in R fails. This limitation in R can be overcome by transforming the values.
# Note: This step is not required unless you receive warnings when running the function.
net[,3] &lt;- (net[,3]/min(net[,3]))

# Calculate measure
output &lt;- clustering_w_local(net, measure=c(&quot;am&quot;,&quot;gm&quot;,&quot;ma&quot;,&quot;mi&quot;,&quot;bi&quot;))

# Find the average value for nodes with a degree greater than 1
colMeans(output[,c(&quot;am&quot;,&quot;gm&quot;,&quot;ma&quot;,&quot;mi&quot;,&quot;bi&quot;)], na.rm=T)
       am        gm        ma        mi        bi
0.7660205 0.7751321 0.7610040 0.7828600 0.7264579
</pre>
<p><strong>References</strong></p>
<p>Barrat, A., Barthelemy, M., Pastor-Satorras, R., Vespignani, A., 2004. The architecture of complex weighted networks. Proceedings of the National Academy of Sciences 101 (11), 3747-3752. <a href="http://arxiv.org/abs/cond-mat/0311416" target="_blank">arXiv:cond-mat/0311416</a></p>
<p>Colizza, V., Pastor-Satorras, R., Vespignani, A., 2007. Reaction-diffusion processes and metapopulation models in heterogeneous networks. Nature Physics 3, 276-282. <a href="http://arxiv.org/abs/cond-mat/0703129" target="_blank">arXiv:cond-mat/0703129</a></p>
<p>Feld, S. L., 1981. The focused organization of social ties. American Journal of Sociology 86, 1015-1035.</p>
<p>Friedkin, N. E., 1984. Structural cohesion and equivalence explanations of social homogeneiety. Sociological Methods and Research 12, 235-261. </p>
<p>Heider, F., 1946. Attitudes and cognitive organization. Journal of Psychology 21, 107-112. </p>
<p>Holland, P. W., Leinhardt, S., 1970. A method for detecting structure in sociometric data. American Journal of Sociology 76, 492-513. </p>
<p>Holland, P. W., Leinhardt, S., 1971. Transitivity in structural models of small groups. Comparative Group Studies 2, 107-124. </p>
<p>Louch, H., 2000. Personal network integration: Transitivity and homophily in strong-tie relations. Social Networks 22, 45-64. </p>
<p>Luce, R. D., Perry, A. D., 1949. A method of matrix analysis of group structure. Psychometrika 14 (1), 95-116. </p>
<p>Panzarasa, P., Opsahl, T., Carley, K. M., 2009. <a href="http://toreopsahl.com/2009/03/06/article-patterns-and-dynamics-of-users-behaviour-and-interaction-network-analysis-of-an-online-community/">Patterns and dynamics of users&#8217; behavior and interaction: Network analysis of an online community</a>. Journal of the American Society for Information Science and Technology 60 (5), 911-932. </p>
<p>Scott, J., 2000. Social Network Analysis: A Handbook. Sage Publications, London, UK. </p>
<p>Snijders, T. A. B., 2001. The statistical evaluation of social network dynamics. Sociological Methodology 31, 361-395. </p>
<p>Snijders, T. A. B., Pattison, P. E., Robins, G. L., Handcock, M. S., 2006. New specifications for exponential random graph models. Sociological Methodology 35, 99-153. </p>
<p>Uzzi, B., Lancaster, R., 2004. Embeddedness and price formation in the corporate law market. American Sociological Review 69, 319-344. </p>
<p>Uzzi, B., Spiro, J., 2005. Collaboration and creativity: The small world problem. American Journal of Sociology 111, 447-504. </p>
<p>Wasserman, S., Faust, K., 1994. Social Network Analysis. Cambridge University Press, Cambridge, MA. </p>
<p>Watts, D. J., Strogatz, S. H., 1998. Collective dynamics of &#8220;small-world&#8221; networks. Nature 393, 440-442. </p>
<div class="knobcite">Please cite or link to this post if you use it.</div>
<br />Posted in Network thoughts Tagged: clustering coefficient, complex networks, embeddedness, graphs, local, network, social network analysis, valued networks, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/thetore.wordpress.com/312/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/thetore.wordpress.com/312/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/thetore.wordpress.com/312/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/thetore.wordpress.com/312/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/thetore.wordpress.com/312/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/thetore.wordpress.com/312/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/thetore.wordpress.com/312/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/thetore.wordpress.com/312/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/thetore.wordpress.com/312/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/thetore.wordpress.com/312/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/thetore.wordpress.com/312/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/thetore.wordpress.com/312/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/thetore.wordpress.com/312/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/thetore.wordpress.com/312/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=312&subd=thetore&ref=&feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/01/23/weighted-local-clustering-coefficient/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2008/12/triplet.gif" medium="image">
			<media:title type="html">Triplet</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2008/12/fig1.png" medium="image">
			<media:title type="html">Sample network</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2008/12/lc_math.gif" medium="image">
			<media:title type="html">Local clustering coefficient (adjacency matrix)</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2008/12/fig1.png" medium="image">
			<media:title type="html">Sample network</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://thetore.files.wordpress.com/2008/12/lcw_math.gif" medium="image">
			<media:title type="html">Weighted local clustering coefficient</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://thetore.files.wordpress.com/2008/12/triplet_value.gif" medium="image">
			<media:title type="html">Triplet value</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />
	</item>
		<item>
		<title>Average shortest distance in weighted networks</title>
		<link>http://toreopsahl.com/2009/01/09/average-shortest-distance-in-weighted-networks/</link>
		<comments>http://toreopsahl.com/2009/01/09/average-shortest-distance-in-weighted-networks/#comments</comments>
		<pubDate>Fri, 09 Jan 2009 00:00:23 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[centrality]]></category>
		<category><![CDATA[closeness]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[global]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[shortest distance]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[weighted networks]]></category>

		<guid isPermaLink="false">http://thetore.wordpress.com/?p=174</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2009/01/09/average-shortest-distance-in-weighted-networks/"><img src="http://thetore.wordpress.com/files/2009/01/distance.png" alt="Distance between nodes" title="Distance between nodes" width="300" height="134" class="alignright size-full wp-image-530" /></a>The average distance that separate nodes in a network became a famous measure following Milgram's six-degrees of separation experiment in 1967 that found that people in the US were on average 6-steps from each other. This post proposes a generalisation of this measure to weighted networks by building on work by Dijkstra (1959) and Newman (2001). (Technical:&#160;4/10)
<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=174&subd=thetore&ref=&feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Shortest distances has long been a key element of network research. First, it has been used a global measure: what is the average shortest distance among nodes in a network? The global perspective became famous following Milgram&#8217;s six-degrees of separation experiment in 1967 that found that people in the US were on average 6-steps from any other person (see <a href="http://toreopsahl.com/publications/thesis/">the introduction of my thesis<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> for a more complete story and criticism).</p>
<p>Second, the shortest distances have been used as an underlying metric in a number of measures, such as centrality ones (Freeman, 1978). One of these measures is closeness, which is inversely proportional to the distance from a node to all other nodes in the network. A number of researchers has found an association between the closeness to others and performance &#8211; but this post is not about applications, but about the global average shortest distances.</p>
<p><img src="http://thetore.files.wordpress.com/2008/12/geodesic-n1.png?w=271&#038;h=177" alt="geodesic-n1" title="geodesic-n1" width="271" height="177" class="alignright size-full wp-image-194" />The shortest distance among nodes in a network is quite easy to calculate if you only have present or absent ties: you simply count the ties along the shortest path. If two nodes are directly connected: <em>distance=1</em>; and if they are not directly connected, but are connected through intermediaries, then it is the lowest number of intermediary nodes +1. For example, if the ties in this network did not have weights, the distance between node A and F would be 3. Below is a matrix showing the shortest distances among the nodes in the sample network.</p>
<pre class="brush: plain;">
   A  B  C  D  E  F
A NA  1  1  2  2  3
B  1 NA  1  1  1  2
C  1  1 NA  2  2  3
D  2  1  2 NA  2  3
E  2  1  2  2 NA  1
F  3  2  3  3  1 NA
</pre>
<p>The average shortest distance in this network is <strong>1.8</strong> (the mean of the matrix, diagonal excluded).</p>
<p>Difficulty occurs when ties are differentiated, as they are in a weighted network. For example, the tie between node A and B has twice the strength of the tie between node A and node C. This could mean that node A has more frequent contact with node B than with node C. In turn, this could imply that node A might give node B a piece of information (or a disease) twice as likely as node C. If we are looking at the <a href="http://woodlandownernetworks.wordpress.com/2008/12/15/strong-ties-weak-ties-innov/" target="_blank">diffusion of information<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> or <a href="http://www.gleamviz.org/" target="_blank">diseases<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> in a network, then the speed that it travels, and routes that it takes, are clearly affected by the weights.</p>
<p>Let us look at the shortest path from node C to node B. The direct connection carries a weight of 1 only; however, the indirect connection through node A is composed of stronger ties. Therefore, a piece of information (or a disease) might be transmitted quicker through node A than directly.</p>
<p>Dijkstra (1959) proposed an algorithm that sum the <em>cost</em> of connections and find the path of least resistance. For example, GPS devices uses this algorithm by assigning a time-cost to each leg of the road, and then find the route that cost least in terms of time. This algorithm can also be used in social network analysis. Newman (2001) applied it to a collaboration network of scientists by inversing the tie weights (dividing 1 by the weight). This implies that a stronger tie gets a lower cost than a weaker tie. Below is the network from above with the inverted weights:<br />
<img src="http://thetore.files.wordpress.com/2008/12/geodesic-m2.png?w=271&#038;h=177" alt="geodesic-m2" title="geodesic-m2" width="271" height="177" class="aligncenter size-full wp-image-195" /></p>
<p>By applying Dijkstra&#8217;s algorithm, we find that the direct connection between node C and node B has a cost of 1, whereas the indirect connection via node A has a cost of 0.75 (1/2 + 1/4). Therefore, according to this algorithm, the information will travel faster through the indirect connection. Below is a matrix showing the cost of the least costly routes among the nodes in the sample network.</p>
<pre class="brush: plain;">
     A    B    C    D    E    F
A   NA 0.25 0.50 0.50 0.75 1.75
B 0.25   NA 0.75 0.25 0.50 1.50
C 0.50 0.75   NA 1.00 1.25 2.25
D 0.50 0.25 1.00   NA 0.75 1.75
E 0.75 0.50 1.25 0.75   NA 1.00
F 1.75 1.50 2.25 1.75 1.00   NA
</pre>
<p>The average of this matrix is <strong>0.9833</strong>.</p>
<p>This application of Dijkstra&#8217;s algorithm is fine to use if we wish to rank the nodes in terms of closeness to others (node A is closer to the others than node C because it has a stronger tie to node B); however, <strong>what does an average distance of 0.98 mean?</strong> </p>
<p>To this end, I suggest to &#8220;normalise&#8221; the weights by the average weight in the network. The weights would then be:<br />
<img src="http://thetore.files.wordpress.com/2008/12/geodesic-navg1.png?w=271&#038;h=177" alt="geodesic-navg1" title="geodesic-navg1" width="271" height="177" class="aligncenter size-full wp-image-223" /></p>
<p>By calculating Dijkstra&#8217;s algorithm using this normalisation, we get the following shortest distances among nodes:</p>
<pre class="brush: plain;">
       A      B      C      D      E      F
A     NA 0.5825 1.1650 1.1650 1.7475 4.0775
B 0.5825     NA 1.7475 0.5825 1.1650 3.4950
C 1.1650 1.7475     NA 2.3300 2.9125 5.2425
D 1.1650 0.5825 2.3300     NA 1.7475 4.0775
E 1.7475 1.1650 2.9125 1.7475     NA 2.3300
F 4.0775 3.4950 5.2425 4.0775 2.3300     NA
</pre>
<p>The average of this matrix is <strong>2.29</strong>.</p>
<p>A unit of distance then refers to one step with the average weight in the network. The average of the matrix suggests that on average nodes are 2.29 steps with average tie weight away from each other. This measure would be comparable across networks with different ranges of tie weights.</p>
<p>If anyone has any suggestions for improvements, let me know!</p>
<p><strong>Want to test it with your data?</strong></p>
<p>First, you need to download <a href="http://opsahl.co.uk/tnet" target="_blank">tnet<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> and install it in <em>R</em>.</p>
<p>You need to create an edgelist of your network (see the <a href="http://opsahl.co.uk/tnet" target="_blank">tnet documentation<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>). The edgelist for the sample network here can be entered into <em>R</em> using the following command:</p>
<pre class="brush: plain;">
net  &lt;- cbind(
i=c(1,1,2,2,2,2,3,3,4,5,5,6),
j=c(2,3,1,3,4,5,1,2,2,2,6,5),
w=c(4,2,4,1,4,2,2,1,4,2,1,1))
</pre>
<p>Then you can write the following commands:</p>
<pre class="brush: plain;">
# Load tnet
library(tnet)

# Create a binary version of the network
bnet &lt;- dichotomise(net)

# Calculate the average distance in the binary network
mean(distance_w(bnet),na.rm=T)

# Calculate the average distance in the weighted network
mean(distance_w(net),na.rm=T)
</pre>
<p><strong>References</strong></p>
<p>Dijkstra, E. W., 1959. A note on two problems in connexion with graphs. Numerische Mathematik 1, 269-271. </p>
<p>Freeman, L. C., 1978. Centrality in social networks: Conceptual clarification. Social Networks 1, 215-239.</p>
<p>Milgram, S., 1967. The small world problem. Psychology Today 2, 60-67. </p>
<p>Newman, M. E. J., 2001. Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality. Physical Review E 64, 016132. </p>
<div class="knobcite">Please cite or link to this post if you use it.</div>
<br />Posted in Network thoughts Tagged: centrality, closeness, complex networks, global, graphs, network, shortest distance, social network analysis, valued networks, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/thetore.wordpress.com/174/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/thetore.wordpress.com/174/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/thetore.wordpress.com/174/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/thetore.wordpress.com/174/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/thetore.wordpress.com/174/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/thetore.wordpress.com/174/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/thetore.wordpress.com/174/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/thetore.wordpress.com/174/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/thetore.wordpress.com/174/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/thetore.wordpress.com/174/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/thetore.wordpress.com/174/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/thetore.wordpress.com/174/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/thetore.wordpress.com/174/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/thetore.wordpress.com/174/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=174&subd=thetore&ref=&feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/01/09/average-shortest-distance-in-weighted-networks/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://thetore.files.wordpress.com/2008/12/geodesic-n1.png" medium="image">
			<media:title type="html">geodesic-n1</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://thetore.files.wordpress.com/2008/12/geodesic-m2.png" medium="image">
			<media:title type="html">geodesic-m2</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2008/12/geodesic-navg1.png" medium="image">
			<media:title type="html">geodesic-navg1</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />
	</item>
		<item>
		<title>Local weighted rich-club measure</title>
		<link>http://toreopsahl.com/2008/12/26/local-weighted-rich-club-measure/</link>
		<comments>http://toreopsahl.com/2008/12/26/local-weighted-rich-club-measure/#comments</comments>
		<pubDate>Fri, 26 Dec 2008 00:31:20 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[local]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[richclub]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[weighted networks]]></category>
		<category><![CDATA[weighted-richclub]]></category>

		<guid isPermaLink="false">http://thetore.wordpress.com/?p=5</guid>
		<description><![CDATA[This post proposes a local (node-level) version of the <a href="http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/">Weighted Rich-club Effect (PRL 101, 168702)</a>. By incorporating this measure into a regression analysis, the impact of targeting efforts towards prominent nodes on performance can be studied. (Technical:&#160;10/10)<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=5&subd=thetore&ref=&feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>The other day I was asked about an application of a method proposed in a recent PRL paper called <a href="http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/">the Weighted Rich-club Effect<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> to a social science article. In this paper, we proposed a method of analysing whether a subset of prominent nodes directed their strongest ties towards each other to a greater extent than randomly expected. This method was global (gave one results for the entire network); however, in social science papers, a regression is commonly used where each node is an observation and a number of measures are created for each node. So, the problem was whether I could redefine the global weighted rich-club measure into a local one (for each node).</p>
<p>So to reiterate the paper quickly:<br />
1) we define a set of prominent nodes<br />
2) we measure the total sum of weights attached to ties among the prominent nodes <img src='http://l.wordpress.com/latex.php?latex=W_%7B%3Er%7D&#038;bg=ffffff&#038;fg=000000&#038;s=-2' alt='W_{&gt;r}' title='W_{&gt;r}' class='latex' /> (see panel a of the figure below)<br />
3) we sum the <img src='http://l.wordpress.com/latex.php?latex=E_%7B%3Er%7D&#038;bg=ffffff&#038;fg=000000&#038;s=-2' alt='E_{&gt;r}' title='E_{&gt;r}' class='latex' /> strongest ties in the network, where <img src='http://l.wordpress.com/latex.php?latex=E_%7B%3Er%7D&#038;bg=ffffff&#038;fg=000000&#038;s=-2' alt='E_{&gt;r}' title='E_{&gt;r}' class='latex' /> is the number of ties among the prominent nodes  (see panel b of the figure below)<br />
4) we take the ratio between the sum from point 2 to the sum from point 3, <img src='http://l.wordpress.com/latex.php?latex=%5Cphi%5Ew%28r%29&#038;bg=ffffff&#038;fg=000000&#038;s=-2' alt='\phi^w(r)' title='\phi^w(r)' class='latex' />. As a result, if the prominent nodes share the strongest ties in the network, <img src='http://l.wordpress.com/latex.php?latex=%5Cphi%5Ew%28r%29%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=-2' alt='\phi^w(r)=1' title='\phi^w(r)=1' class='latex' />. If we the prominent nodes do not share all of the strongest nodes, we get a value lower than 1.</p>
<div id="attachment_104" class="wp-caption aligncenter" style="width: 610px"><img src="http://thetore.files.wordpress.com/2008/12/wrc_fig1ab.gif?w=600&#038;h=166" alt="Weighted Rich-club Effect" title="Weighted Rich-club Effect" width="600" height="166" class="size-full wp-image-104" /><p class="wp-caption-text"> Schematic representation of a weighted network, with size of nodes proportional to their prominence, and width of links to their weight indicated by the corresponding numbers. Several definitions of prominence can be considered. (a)&nbsp;The prominent nodes and the ties among them are highlighted, giving <img src='http://l.wordpress.com/latex.php?latex=E_%7B%3Er%7D%3D6&#038;bg=ffffff&#038;fg=000000&#038;s=-3' alt='E_{&gt;r}=6' title='E_{&gt;r}=6' class='latex' /> ties and <img src='http://l.wordpress.com/latex.php?latex=W_%7B%3Er%7D+%3D+4+%2B+2+%2B+2+%2B+3+%2B+1+%2B+2+%3D+14&#038;bg=ffffff&#038;fg=000000&#038;s=-3' alt='W_{&gt;r} = 4 + 2 + 2 + 3 + 1 + 2 = 14' title='W_{&gt;r} = 4 + 2 + 2 + 3 + 1 + 2 = 14' class='latex' />. (b)&nbsp;The strongest <img src='http://l.wordpress.com/latex.php?latex=E_%7B%3Er%7D&#038;bg=ffffff&#038;fg=000000&#038;s=-3' alt='E_{&gt;r}' title='E_{&gt;r}' class='latex' /> ties of the network are highlighted (6), yielding the following value for the denominator of <img src='http://l.wordpress.com/latex.php?latex=%5Cphi%5Ew%28r%29&#038;bg=ffffff&#038;fg=000000&#038;s=-3' alt='phi^w(r)' title='phi^w(r)' class='latex' />: <img src='http://l.wordpress.com/latex.php?latex=4+%2B+4+%2B+4+%2B+3+%2B+3+%2B+3+%3D+21&#038;bg=ffffff&#038;fg=000000&#038;s=-3' alt='4 + 4 + 4 + 3 + 3 + 3 = 21' title='4 + 4 + 4 + 3 + 3 + 3 = 21' class='latex' />. We thus obtain <img src='http://l.wordpress.com/latex.php?latex=%5Cphi%5Ew%28r%29+%3D+%5Cfrac%7B14%7D%7B21%7D&#038;bg=ffffff&#038;fg=000000&#038;s=-3' alt='phi^w(r) = rac{14}{21}' title='phi^w(r) = rac{14}{21}' class='latex' />. Adapted from Opsahl et al. (2008).</p></div>
<p>However, a problem might exist. Some definitions of prominence could be associated with the strength of ties, such as the average weight of the ties originating from a node (if a set of nodes has on average stronger ties, of course it would have stronger ties among them &#8211; everything else being equal). So therefore, we divide <img src='http://l.wordpress.com/latex.php?latex=%5Cphi%5Ew%28r%29&#038;bg=ffffff&#038;fg=000000&#038;s=-2' alt='\phi^w(r)' title='\phi^w(r)' class='latex' /> obtained in the real network by <img src='http://l.wordpress.com/latex.php?latex=%5Cphi%5Ew%28r%29&#038;bg=ffffff&#038;fg=000000&#038;s=-2' alt='\phi^w(r)' title='\phi^w(r)' class='latex' /> obtained on simulated corresponding random networks, <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_%7B%5Cmathrm%7Bnull%7D%7D%5Ew%28r%29&#038;bg=ffffff&#038;fg=000000&#038;s=-2' alt='\phi_{\mathrm{null}}^w(r)' title='\phi_{\mathrm{null}}^w(r)' class='latex' /> (see <a href="http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/">the paper<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> for this part and Colizza et al., 2006). </p>
<p>Now, in an effort to create a local measure, we would have to retain the research question: do prominent nodes direct their strongest ties to one another?, but rewrite it in terms of a single node. So, I thought:<br />
1) designate prominent nodes<br />
2) for each node, sum the weights towards prominent nodes (if a node has three ties with weights of 1, 2 and 6, and the two ties with weight of 1 and 6 are directed towards prominent nodes, then the sum is 7).<br />
3) In case the definition of prominence is associated with the weight of ties, we need to discount for the randomly expected value. To this end, we could divide the value obtained in point 2, to the randomly expected value (average weight of ties (which is 3 in this example) multiplied with the number of ties towards prominent nodes): 6.</p>
<p>Let me know what you guys think?</p>
<p><img src="http://thetore.files.wordpress.com/2008/12/fig1.png?w=271&#038;h=177" alt="Sample network" title="Sample network" width="271" height="177" class="alignright size-full wp-image-160" /><strong>Want to test it with your data?</strong></p>
<p>First, you need to load your network in R in tnet format, and ensure that it complies with the required standard (see the <em><a href="http://opsahl.co.uk/tnet/">tnet documentation<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a></em> for more information on the network structure required). The documentation also contains information on exporting network data from other programmes. Then you need to define a promience vector consisting on 1&#8242;s and 0&#8242;s where 1 signify prominence and 0 non-prominence. This vector must be of the same length as the number of nodes in the network. </p>
<pre class="brush: plain;">
# Load tnet
library(tnet)

# Load the above sample network
net &lt;- cbind(
i=c(1,1,2,2,2,2,3,3,4,5,5,6),
j=c(2,3,1,3,4,5,1,2,2,2,6,5),
w=c(4,2,4,4,1,2,2,4,1,2,1,1))

# Define prominence parameter (node 1 (A), 2 (B) and 3 (C) are designated as prominent)
prominence &lt;- c(1,1,1,0,0,0)

# Run function
weighted_richclub_w_local(net, prominence)
</pre>
<p>The output table is like this:</p>
<pre class="brush: plain;">
     node    ratio
[1,]    1 1.000000
[2,]    2 1.454545
[3,]    3 1.000000
[4,]    4 1.000000
[5,]    5 1.333333
[6,]    6 1.000000
</pre>
<p>Nodes 1, 3, 4 and 6 get a value of 1 as they do not have a choice in their behaviour because they are <em>not</em> connected to <em>both</em> prominent and non-prominent nodes. However, node 2 and 5 do have a choice in how to distributed their efforts. For example, node 5 have two ties, one with a weight of 2 to a prominent node and one with a weight of 1 to a non-prominent node. Therefore, it can be said that node 5 preferentially direct attention to prominent nodes.</p>
<p><strong>References</strong></p>
<p>Colizza, V., Flammini, A., Serrano, M. A., Vespignani, A., 2006. Detecting rich-club ordering in complex networks. Nature Physics 2, 110-115. <a href="http://arxiv.org/abs/physics/0602134" target="_blank">arXiv:physics/0602134</a></p>
<p>Opsahl, T., Colizza, V., Panzarasa, P., Ramasco, J. J., 2008. <a href="http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/">Prominence and control: The weighted rich-club effect</a>. Physical Review Letters 101 (168702). <a href="http://arxiv.org/abs/0804.0417" target="_blank">arXiv:0804.0417</a>. </p>
<div class="knobcite">Please cite or link to this post if you use it.</div>
<br />Posted in Network thoughts Tagged: complex networks, graphs, local, network, richclub, social network analysis, valued networks, weighted networks, weighted-richclub <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/thetore.wordpress.com/5/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/thetore.wordpress.com/5/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/thetore.wordpress.com/5/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/thetore.wordpress.com/5/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/thetore.wordpress.com/5/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/thetore.wordpress.com/5/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/thetore.wordpress.com/5/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/thetore.wordpress.com/5/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/thetore.wordpress.com/5/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/thetore.wordpress.com/5/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/thetore.wordpress.com/5/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/thetore.wordpress.com/5/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/thetore.wordpress.com/5/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/thetore.wordpress.com/5/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=5&subd=thetore&ref=&feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2008/12/26/local-weighted-rich-club-measure/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://thetore.files.wordpress.com/2008/12/wrc_fig1ab.gif" medium="image">
			<media:title type="html">Weighted Rich-club Effect</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://thetore.files.wordpress.com/2008/12/fig1.png" medium="image">
			<media:title type="html">Sample network</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />
	</item>
		<item>
		<title>Article: Prominence and control: The weighted rich-club effect</title>
		<link>http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/</link>
		<comments>http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/#comments</comments>
		<pubDate>Fri, 12 Dec 2008 00:00:08 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[directed networks]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[embeddedness]]></category>
		<category><![CDATA[global]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[hubs]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[richclub]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[strength of nodes]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[undirected networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[weighted networks]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=779</guid>
		<description><![CDATA[A paper called Prominence and control: The weighted rich-club effect that I have co-authored was published in Physical Review Letters (PRL). In this paper, we proposed a new general framework for studying the tendency of prominent nodes to direct their strongest ties toward each other.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=779&subd=thetore&ref=&feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>A paper called Prominence and control: The weighted rich-club effect that I have co-authored with <a href="http://vcolizza.googlepages.com/" target="_blank">Vittoria Colizza<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>, <a href="http://www.busman.qmul.ac.uk/staff/staff.php?p.panzarasa@qmul.ac.uk" target="_blank">Pietro Panzarasa<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>, and <a href="http://isiosf.isi.it/~jramasco/" target="_blank">José Javier Ramasco<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> was published in the October 17 issue of <a href="http://prl.aps.org/" target="_blank">Physical Review Letters (PRL)<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>. A pdf version of <a href="http://vcolizza.googlepages.com/PhysRevLett_101_168702.pdf" target="_blank">the paper is available<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>.</p>
<p><strong>Abstract</strong></p>
<p>Complex systems are often characterized by large-scale hierarchical organizations.  Whether the prominent elements, at the top of the hierarchy, share and control resources or avoid one another lies at the heart of the global organization and  functioning. Inspired by network perspectives, we propose a new general framework for studying the tendency of prominent elements to form clubs with exclusive control over the majority of a system&#8217;s resources. We explore associations between prominence and control in the fields of transportation, scientific collaboration, and online communication. </p>
<div class="knobinfo"><em>UPDATE</em>: I have suggested an additional null model in <a href="http://toreopsahl.com/2009/05/29/weighted-rich-club-effect-a-more-appropriate-null-model-for-scientific-collaboration-networks/" target="_blank">Weighted Rich-club Effect: A more appropriate null model for scientific collaboration networks<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a></div>
<p><strong>Want to test it with your data?</strong></p>
<p>The <a href="http://opsahl.co.uk/tnet/content/view/41/30/" target="_blank">weighted_richclub<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> function in <a href="http://opsahl.co.uk/tnet/" target="_blank">tnet<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a> allows you to test the framework on your own dataset.</p>
<br />Posted in Network thoughts Tagged: actors, arcs, complex networks, directed networks, edges, embeddedness, global, graphs, hubs, Links, network, nodes, richclub, social network analysis, strength of nodes, strength of ties, ties, undirected networks, valued networks, vertices, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/thetore.wordpress.com/779/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/thetore.wordpress.com/779/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/thetore.wordpress.com/779/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/thetore.wordpress.com/779/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/thetore.wordpress.com/779/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/thetore.wordpress.com/779/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/thetore.wordpress.com/779/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/thetore.wordpress.com/779/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/thetore.wordpress.com/779/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/thetore.wordpress.com/779/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/thetore.wordpress.com/779/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/thetore.wordpress.com/779/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/thetore.wordpress.com/779/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/thetore.wordpress.com/779/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=779&subd=thetore&ref=&feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />
	</item>
		<item>
		<title>Network? Weighted network?</title>
		<link>http://toreopsahl.com/2008/11/28/network-weighted-network/</link>
		<comments>http://toreopsahl.com/2008/11/28/network-weighted-network/#comments</comments>
		<pubDate>Fri, 28 Nov 2008 00:00:46 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[degree]]></category>
		<category><![CDATA[directed networks]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[strength of nodes]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[undirected networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[weighted networks]]></category>

		<guid isPermaLink="false">http://thetore.wordpress.com/?p=376</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2008/11/28/network-weighted-network/"><img src="http://thetore.wordpress.com/files/2008/12/social-network_weighted.png?w=300" alt="social-network_weighted" title="social-network_weighted" width="300" height="136" class="alignright size-medium wp-image-144" /></a>Networks are the cornerstone of this blog, therefore I have decided to make the first post my definition of a network and a few basic network measures. (Technical:&#160;1/10)<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=376&subd=thetore&ref=&feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><div id="attachment_160" class="wp-caption alignright" style="width: 260px"><img src="http://thetore.files.wordpress.com/2008/12/social-network.png?w=250&#038;h=151" alt="social-network" title="social-network" width="250" height="151" class="alignright size-full wp-image-145" /><p class="wp-caption-text"> </p></div>A system whose elements are somehow connected can be represented as a network. The elements of a system is represented as nodes (also known as actors or vertices) and the connections among interacting elements are known as ties, edges, arcs, or links. The nodes might be <a href="http://cdg.columbia.edu/cdg/datasets" target="_blank">neurons<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>, <a href="http://orgnet.com/" target="_blank">individuals<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>, groups, organisations, <a href="http://cxnets.googlepages.com/usairtransportationnetwork" target="_blank">airports<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>, or even a countries, whereas ties can take the form of friendship, communication, collaboration, alliance, flow, or <a href="http://www.intracen.org/tradstat/welcome.htm" target="_blank">trade<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif"></a>, to name only a few.</p>
<p><div id="attachment_160" class="wp-caption alignright" style="width: 310px"><img src="http://thetore.files.wordpress.com/2008/12/social-network_weighted.png?w=300&#038;h=136" alt="social-network_weighted" title="social-network_weighted" width="300" height="136" class="alignright size-medium wp-image-144" /><p class="wp-caption-text"> </p></div>Not all ties in a network have the same capacity. For example, in a social network some contacts are friends, whereas others are simply acquaintances. It would be grossly unfair to treat everyone in the same manner. </p>
<p>By recording the strength of ties, we can create a weighted network. However, these networks are more difficult to analyse than if ties were simply present or absent. I devoted a large part my <a href="http://toreopsahl.com/publications/thesis/">Ph.D. thesis<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a> to the analysis of these networks.</p>
<p><strong>Basic network definitions</strong></p>
<p>Throughout this blog I make use of some network terminology beyond nodes and ties, and I would like to clarify my definition of these terms.</p>
<p><em>Undirected / directed network</em><br />
<div id="attachment_160" class="wp-caption alignright" style="width: 281px"><img src="http://thetore.files.wordpress.com/2009/01/fig1-directed.png?w=271&#038;h=177" alt="Directed network" title="Directed network" width="271" height="177" class="alignright size-full wp-image-448" /><img src="http://thetore.files.wordpress.com/2008/12/fig1.png?w=271&#038;h=177" alt="t" title="Undirected network" width="271" height="177" class="size-full wp-image-160" /><p class="wp-caption-text"> </p></div>Networks can be directed or undirected. This refers to whether ties are created from one node and directed towards another, or simply tie two nodes together. For example, a network where ties are created when people ask for advice from others are generally recorded as directed (Lazega, 2001). This is because asking for advice refers to social interactions in which knowledge flows from one node to another in a specific direction. In these networks, the weight of the tie from a node to another might be different from the weight from the other node to the initial node (e.g. the two ties between nodes A and B in the first sample network). Conversely, a network where ties are formed when two people collaborate on a project tends to be undirected. This is due to the fact that collaboration usually implies a two-way interaction between the nodes. In these network, a tie is not &#8220;produced&#8221; by one node and directed towards another, but is a joint product of both the nodes.</p>
<p><em>Degree and Strength of nodes</em></p>
<p>The degree of a node is equal to the number of other nodes the node is connected to. For example, node E in the above undirected network has a degree of 2 since it is connected to nodes B and F. In a directed network, each node can have an out-degree and in-degree. Out-degree is the number of nodes that the node connect to, whereas in-degree is the number of nodes that connect to the node. In the directed network, node E would have an out-degree of 2, but only an in-degree of 1 since only node B has directed a tie towards it.</p>
<p>The strength of a node is equal to the sum of weights attached to the ties that connect a node to others. For node E in the undirected network, this would be equal to 3. In a similar fashion as degree, in a directed network, each node has an out-strength and an in-strength. Out-strength is the sum of weights attached to ties originating from the node, whereas in-strength is the sum of weights attached to ties directed toward the node. Therefore, node E would have an out-strength of 3, but only an in-strength of 2.</p>
<p><em>One-/two-mode networks</em><br />
<div id="attachment_796" class="wp-caption alignright" style="width: 281px"><img src="http://thetore.files.wordpress.com/2009/03/fig1_twomode.png?w=271&#038;h=396" alt="Two-mode network projected onto a one-mode weighted network" title="Two-mode network projected onto a one-mode weighted network" width="271" height="396" class="size-full wp-image-796" /><p class="wp-caption-text">Two-mode network projected onto a one-mode weighted network</p></div>Most networks are analysed as one-mode networks. These are simple networks with one set of nodes and ties among these nodes. For example, an inter-organisational network where the nodes are employees and ties are formed between two nodes when one employee asks other one for advice. However, many network dataset are by definition two-mode networks (also known as affiliation or bipartite networks). These are a particular type of networks with two sets of nodes and ties are only established between nodes belonging to different sets. The first network in this diagram illustrates a binary two-mode network where the colour represent the node set to which a node belongs. One of the first two-mode datasets to be analysed was the Davis Southern Club Women dataset (Davis et al., 1941), which recorded the attendance of a group of women (node set 1) to a series of events (set 2). A woman would be linked to an event if she attended it. Another type of two-mode dataset that has become popular in recent years is scientific collaboration networks. In this type of networks, a tie is established between a scientist (node set 1) and a paper (node set 2) if the scientist authored that paper (e.g., Newman, 2001). </p>
<p>Although the two-mode structure contains a number of details of the network, few network measures exist for two-mode networks. Therefore, these networks are often projected onto a one-mode network (only one type of nodes). This is done by selecting one of the sets of nodes and linking two nodes from that set if they were connected to the same node (of the other kind). This process is illustrated for the blue nodes of the second part of the diagram above. For example, node E and node F are connected to the same red node, therefore, in the one-mode projection they are tied together.</p>
<p><strong>References</strong></p>
<p>Davis, A., Gardner, B. B., Gardner, M. R., 1941. Deep South. University of Chicago Press, Chicago, IL.</p>
<p>Lazega, E., 2001. The Collegial Phenomenon: The Social Mechanisms of Cooperation Among Peers in a Corporate Law Partnership. Oxford University Press, Oxford, UK. </p>
<p>Newman, M. E. J., 2001. Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality. Physical Review E 64, 016132. </p>
<div class="knobcite">Please cite or link to this post if you use it.</div>
<br />Posted in Network thoughts Tagged: actors, arcs, complex networks, degree, directed networks, edges, graphs, Links, network, nodes, social network analysis, strength of nodes, strength of ties, ties, undirected networks, valued networks, vertices, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/thetore.wordpress.com/376/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/thetore.wordpress.com/376/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/thetore.wordpress.com/376/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/thetore.wordpress.com/376/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/thetore.wordpress.com/376/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/thetore.wordpress.com/376/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/thetore.wordpress.com/376/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/thetore.wordpress.com/376/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/thetore.wordpress.com/376/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/thetore.wordpress.com/376/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/thetore.wordpress.com/376/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/thetore.wordpress.com/376/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/thetore.wordpress.com/376/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/thetore.wordpress.com/376/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&blog=5878280&post=376&subd=thetore&ref=&feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2008/11/28/network-weighted-network/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2008/12/social-network.png" medium="image">
			<media:title type="html">social-network</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/links_yellow.gif" medium="image" />

		<media:content url="http://thetore.files.wordpress.com/2008/12/social-network_weighted.png?w=300" medium="image">
			<media:title type="html">social-network_weighted</media:title>
		</media:content>

		<media:content url="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif" medium="image" />

		<media:content url="http://thetore.files.wordpress.com/2009/01/fig1-directed.png" medium="image">
			<media:title type="html">Directed network</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2008/12/fig1.png" medium="image">
			<media:title type="html">Undirected network</media:title>
		</media:content>

		<media:content url="http://thetore.files.wordpress.com/2009/03/fig1_twomode.png" medium="image">
			<media:title type="html">Two-mode network projected onto a one-mode weighted network</media:title>
		</media:content>
	</item>
	</channel>
</rss>