<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Tore Opsahl</title>
	<atom:link href="http://toreopsahl.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://toreopsahl.com</link>
	<description></description>
	<lastBuildDate>Sat, 25 May 2013 12:45:59 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='toreopsahl.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://1.gravatar.com/blavatar/991ed75bf373cc160b2fb9141206f110?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>Tore Opsahl</title>
		<link>http://toreopsahl.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://toreopsahl.com/osd.xml" title="Tore Opsahl" />
	<atom:link rel='hub' href='http://toreopsahl.com/?pushpress=hub'/>
		<item>
		<title>Article: Triadic closure in two-mode networks: Redefining the global and local clustering coefficients</title>
		<link>http://toreopsahl.com/2011/12/21/article-triadic-closure-in-two-mode-networks-redefining-the-global-and-local-clustering-coefficients/</link>
		<comments>http://toreopsahl.com/2011/12/21/article-triadic-closure-in-two-mode-networks-redefining-the-global-and-local-clustering-coefficients/#comments</comments>
		<pubDate>Wed, 21 Dec 2011 09:00:03 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[affiliation networks]]></category>
		<category><![CDATA[bipartite networks]]></category>
		<category><![CDATA[clustering coefficient]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[embeddedness]]></category>
		<category><![CDATA[global]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[local]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[two-mode networks]]></category>
		<category><![CDATA[valued networks]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=3487</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2011/12/21/article-triadic-closure-in-two-mode-networks-redefining-the-global-and-local-clustering-coefficients/"><img src="http://toreopsahl.files.wordpress.com/2011/06/triadicclosurefig3a.png" alt="" title="Clustering in Two-mode Networks" width="224" height="145" class="alignright size-full wp-image-4033" /></a>A paper called "Triadic closure in two-mode networks: Redefining the global and local clustering coefficients" that I have authored will be published in the special issue of Social Networks on two-mode networks (2012). 

As the vast majority of network measures are defined for one-mode networks, two-mode networks often have to be projected onto one-mode networks to be analyzed. A number of issues arise in this transformation process, especially when analyzing ties among nodes' contacts. For example, the values attained by the global and local clustering coefficients on projected random two-mode networks deviate from the expected values in corresponding classical one-mode networks. Moreover, both the local clustering coefficient and constraint (structural holes) are inversely associated to nodes' two-mode degree. To overcome these issues, this paper proposes redefinitions of the clustering coefficients for two-mode networks.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=3487&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>A paper called &#8220;Triadic closure in two-mode networks: Redefining the global and local clustering coefficients&#8221; that I have authored will be published in the special issue of Social Networks on two-mode networks (2013). Unfortunately, the copyright agreement prevents me from uploading a pdf of the published paper to this blog. However, if you have access to Social Networks, you can <a href="http://dx.doi.org/10.1016/j.socnet.2011.07.001" target="_blank">download the paper directly</a>. Otherwise, a preprint with the exact same text is available (<a href="http://arxiv.org/abs/1006.0887" target="_blank">arXiv:1006.0887</a>).</p>
<p><strong>Abstract</strong></p>
<p>As the vast majority of network measures are defined for one-mode networks, two-mode networks often have to be projected onto one-mode networks to be analyzed. A number of issues arise in this transformation process, especially when analyzing ties among nodes&#8217; contacts. For example, the values attained by the global and local clustering coefficients on projected random two-mode networks deviate from the expected values in corresponding classical one-mode networks. Moreover, both the local clustering coefficient and constraint (structural holes) are inversely associated to nodes&#8217; two-mode degree. To overcome these issues, this paper proposes redefinitions of the clustering coefficients for two-mode networks.</p>
<p><strong>Motivation</strong></p>
<p>The clustering coefficients for one-mode networks are a measure of cohesion or group formation. These measures are defined around triplets (i.e., three nodes with at least two ties among them) and whether or not these triplets are closed (i.e., they form part of a triangle). Two-mode networks are often projected onto one-mode networks to be analysed. These networks often contain many more triangles than prototypical networks, and thus overestimates the level of clustering in a network. Methodological issues exist at a local level as well. Specifically, when calculating the local clustering coefficient (Watts and Strogatz, 1998) or the structural holes measure constraint (Burt, 1992) on projected two-mode networks, the measures are inversely correlated with nodes&#8217; two-mode degree on a randomly tie reshuffled two-mode network (each node maintains their degree). Below is the average (a) local clustering coefficient and (b) constraint scores for nodes in a random version of the Scientific Collaboration Network (Newman, 2001) for various levels of two-mode degree.</p>
<p><img src="http://toreopsahl.files.wordpress.com/2011/10/localclusteringconstraintvstwomodedegree1.png?w=455" alt="" title="Local Clustering and Constraint plotted against Two-mode Degree"   class="aligncenter size-full wp-image-4053" /></p>
<p><img src="http://toreopsahl.files.wordpress.com/2011/06/reinforcement.png?w=455" alt="Reinforcement: 4-cycles and 3-paths" title="Reinforcement"   class="alignright size-full wp-image-3960" />As a result, a host of clustering measures for two-mode networks has been developed. For example, Robin and Alexander (2004) defined a coefficient as the number of four-cycles divided by the number of three-paths. Four-cycles in two-mode networks are the smallest possible cycle (like triangles are the smallest possible cycle in one-mode networks). However, this measure is distinctly different from the idea of triadic closure as the measure only include two primary nodes. In fact, a four-cycle is an indication of reinforcement or agreement between two-nodes and not cohesion or group formation.</p>
<p>The paper proposes redefinitions of the global and local clustering coefficients for two-mode networks. The measures are defined around 4-paths or triplets of primary nodes in two-mode networks. Specifically, the global coefficient is defined as the number of 4-paths that are closed divided by the total number, while the local is similar but focused on 4-paths centred on the focal node. For more details, see the paper (<a href="http://dx.doi.org/10.1016/j.socnet.2011.07.001" target="_blank">Social Networks</a>; <a href="http://arxiv.org/abs/1006.0887" target="_blank">arXiv</a>) or the tnet documentation (<a href="http://toreopsahl.com/tnet/two-mode-networks/clustering/" title="Clustering in Two-mode Networks">tnet » Two-mode Networks » Clustering</a>).</p>
<p><strong>Want to test it with your data?</strong></p>
<p>The clustering_tm and clustering_local_tm-functions in <a href="http://toreopsahl.com/tnet/two-mode-networks/clustering/" title="Clustering in Two-mode Networks">tnet</a> allows you to calculate the global and local clustering coefficients for two-mode networks (both binary and weighted) on your own dataset.</p>
<pre class="brush: plain; title: ; notranslate">
# Load tnet
library(tnet)

# Load a sample network (Figure 3A of the paper)
net &lt;- rbind(
c(1,1),
c(1,2),
c(2,1),
c(2,3),
c(3,2),
c(3,3),
c(4,3))

# Calculate global clustering coefficient
clustering_tm(net)

# Calculate local clustering coefficient
clustering_local_tm(net)
</pre>
<p><strong>References</strong></p>
<p>Burt, R.S., 1992. Structural holes. Harvard University Press, Cambridge, MA.</p>
<p>Newman, M. E. J., 2001. Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality. Physical Review E 64, 016132.</p>
<p>Opsahl, T., 2013. Triadic closure in two-mode networks: Redefining the global and local clustering coefficients. Social Networks 35, doi: 10.1016/j.socnet.2011.07.001.</p>
<p>Robins, G., Alexander, M., 2004. Small worlds among interlocking Directors: Network structure and distance in bipartite graphs. Computational and Mathematical Organization Theory 10 (1), 69–94.</p>
<p>Watts, D. J., Strogatz, S. H., 1998. Collective dynamics of “small-world” networks. Nature 393, 440-442.</p>
<div class="knobcite">If you use any of the information in this post, please cite: Opsahl, T., 2013. Triadic closure in two-mode networks: Redefining the global and local clustering coefficients. Social Networks 35, doi: 10.1016/j.socnet.2011.07.001.</div>
<br />Filed under: <a href='http://toreopsahl.com/category/articles/'>Articles</a> Tagged: <a href='http://toreopsahl.com/tag/affiliation-networks/'>affiliation networks</a>, <a href='http://toreopsahl.com/tag/bipartite-networks/'>bipartite networks</a>, <a href='http://toreopsahl.com/tag/clustering-coefficient/'>clustering coefficient</a>, <a href='http://toreopsahl.com/tag/complex-networks/'>complex networks</a>, <a href='http://toreopsahl.com/tag/embeddedness/'>embeddedness</a>, <a href='http://toreopsahl.com/tag/global/'>global</a>, <a href='http://toreopsahl.com/tag/graphs/'>graphs</a>, <a href='http://toreopsahl.com/tag/links/'>Links</a>, <a href='http://toreopsahl.com/tag/local/'>local</a>, <a href='http://toreopsahl.com/tag/network/'>network</a>, <a href='http://toreopsahl.com/tag/nodes/'>nodes</a>, <a href='http://toreopsahl.com/tag/social-network-analysis/'>social network analysis</a>, <a href='http://toreopsahl.com/tag/strength-of-ties/'>strength of ties</a>, <a href='http://toreopsahl.com/tag/ties/'>ties</a>, <a href='http://toreopsahl.com/tag/two-mode-networks/'>two-mode networks</a>, <a href='http://toreopsahl.com/tag/valued-networks/'>valued networks</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/3487/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/3487/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=3487&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2011/12/21/article-triadic-closure-in-two-mode-networks-redefining-the-global-and-local-clustering-coefficients/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2011/10/localclusteringconstraintvstwomodedegree1.png" medium="image">
			<media:title type="html">Local Clustering and Constraint plotted against Two-mode Degree</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2011/06/reinforcement.png" medium="image">
			<media:title type="html">Reinforcement</media:title>
		</media:content>
	</item>
		<item>
		<title>Securely using R and RStudio on Amazon&#8217;s EC2</title>
		<link>http://toreopsahl.com/2011/10/17/securely-using-r-and-rstudio-on-amazons-ec2/</link>
		<comments>http://toreopsahl.com/2011/10/17/securely-using-r-and-rstudio-on-amazons-ec2/#comments</comments>
		<pubDate>Mon, 17 Oct 2011 15:32:49 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[EC2]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[RStudio]]></category>
		<category><![CDATA[security]]></category>
		<category><![CDATA[SSH]]></category>
		<category><![CDATA[SSH tunnel]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=3874</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2011/10/17/securely-using-r-and-rstudio-on-amazons-ec2/"><img src="http://toreopsahl.files.wordpress.com/2011/11/rstudio-server.png" alt="Securely using R and RStudio on Amazon's EC2" title="Securely using R and RStudio on Amazon's EC2" width="228" height="255" class="alignright size-full wp-image-3928" /></a>R is a great tool for analysing data with an intuitive and interactive programming language. There are a number of limitations with an interactive programming language compared to compiled languages, such as higher memory and processing requirements. One way of overcoming these requirements is to use cloud computing, such as Amazon EC2. The Bioconductor group has an Amazon Machine Image with the latest version of R and RStudio; however, there is a major security hole in the default setup that allows others to "borrow" the resources you are paying for as well as being able to steal your data. This post highlights how to close this hole and securely use R and RStudio on Amazon EC2. <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=3874&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>R is a great tool for analysing data. It consists of an intuitive and interactive programming language with a vast number of extension packages (such as <a href="http://toreopsahl.com/tnet/">tnet</a>) that allow analysts to take advantage of functions created by individuals outside the R core team. As such, it is rapidly becoming (or has it already become?) the de facto tool for data scientists. There are a number of limitations with an interactive programming language compared to compiled languages, such as higher memory (&#8220;Error: cannot allocate vector of size 762.9 Mb&#8221;) and processing requirements. There are two ways to overcome these limitations :</p>
<ol>
<li>Reprogramme everything in (and learn) C++</li>
<li>Get more resources</li>
</ol>
<p>While the first solution might be the most appropriate one for repetitive tasks or production code, the second one might be quicker and easier for data scientists doing one-off analyses. It is possible to get more resources without buying a new server or more memory chips by using cloud computing. In essence, cloud computing allows analysts to rent resources when they need them. Amazon is one provider with the <a href="http://aws.amazon.com/ec2/" target="_blank">Elastic Compute Cloud</a> (EC2). For example, it is possible to rent a server with 8 cores and 68.4 GB of memory for $2 per hour instead of buying a $5,000+ server.</p>
<p>The downside to using cloud computing is security. When I first started using R on Amazon EC2, I followed the instructions <a href="http://bioconductor.org/help/bioconductor-cloud-ami/" target="_blank">Bioconductor in the cloud</a> (Thanks guys for maintaining an updated AMI and a great tutorial!). This tutorial shows how to set up an account, creating public/private keys, changing the firewall, launching a machine image with the latest version of R pre-installed, connecting to the command line using Secure Shell or SSH, using R through the command line interface, and using R through a web browser (<a href="http://rstudio.org/" target="_blank">RStudio</a>). While the command line interface is secure using SSH and the private key, the web interface is not secure (standard username, password, and port as well as non-encrypted traffic). This means that anyone who knows the hostname or ip address could login to an R session. In fact, a port scan of the standard port across the range of ip addresses could allow a hacker to detect vulnerable servers and get access to their computing resources and data. In the rest of this post, I will borrow from <a href="http://bioconductor.org/help/bioconductor-cloud-ami/" target="_blank">Bioconductor in the cloud</a> but suggest points to increase the security of using their machine image on Amazon EC2.</p>
<p>This tutorial is basic and highlights all the steps needed to get up and running. It is written using Windows and Putty, but should be applicable to people using other software with a bit of fudging. Before you start, please download <a href="http://www.chiark.greenend.org.uk/~sgtatham/putty/download.html" target="_blank">PuTTY and PuTTYgen</a> and save them in a convenient location. Note: These programmes do not need to be installed.</p>
<p><strong>Getting to the Management Console</strong></p>
<p>The first thing to do is to set up an Amazon account (yes, the same one as you buy books with), and register for <a href="http://aws.amazon.com/ec2/" target="_blank">Amazon Web Services (AWS)</a>. Then you need to go to the AWS Management Console. The link will be on the very top of the page when you are signed in with the Amazon id. The Management Console controls a number of AWS products, so you want to go to Elastic Compute Cloud by clicking on &#8220;Amazon EC2&#8243; on the horizontal menu.</p>
<div id="attachment_3893" class="wp-caption aligncenter" style="width: 590px"><img class="size-full wp-image-3893" title="Amazon Web Service's Management Console: The EC2 Dashboard" src="http://toreopsahl.files.wordpress.com/2011/10/aws_580.gif?w=455" alt=""   /><p class="wp-caption-text">Amazon Web Service&#8217;s Management Console: The EC2 Dashboard</p></div>
<p><strong>Public/private key pair</strong></p>
<p>The first thing to do in the Management Console is to create a public/private key pair. This is important because the private key will be the &#8220;username and password&#8221; when accessing the server. This is done by clicking on &#8220;0 Key Pairs&#8221; under &#8220;My Resources&#8221; on the right, and then &#8220;Create Key Pair&#8221;. You need to give it an arbitrary name and then save the file with that-name.pem somewhere safe on your computer.</p>
<p>Unlike other SSH programmes, PuTTY cannot read pem-files directly. They must be converted to ppk-files. This can be done using <a href="http://www.chiark.greenend.org.uk/~sgtatham/putty/download.html" target="_blank">PuTTYgen</a>. When running the programme, click on the Load button in the middle of the screen (do not use the top menu!). Change the file-type drop-down menu from &#8220;PuTTY Private Key Files (*.ppk)&#8221; to &#8220;All Files (*.*), and select the pem-file with the private key downloaded from the Management Console. Then, click on the &#8220;Save private key&#8221; button to save the private key as a ppk-file. You do not need to password protect the file if you store it in a location only you have access to.</p>
<p><strong>Firewall settings</strong></p>
<p>Communication to a server occurs through ports. A firewall is a mechanism for opening some ports and closing others. A part of maintain a secure server is to only open the ones you need. The <a href="http://bioconductor.org/help/bioconductor-cloud-ami/" target="_blank">Bioconductor in the cloud</a>-guide suggests that you open ports 22 and 8787. Port 22 is used to connect to the command line of the server using SSH. Port 8787 is the web interface of <a href="http://rstudio.org/" target="_blank">RStudio</a>. While SSH traffic is encrypted, http traffic towards port 8787 is not. As such, this is a potential security vulnerability. I suggest that you only open port 22. Later on in this post, I will show how you can reach the web interface securely over port 22.</p>
<p>The firewall protecting servers on Amazon EC2 is controlled through the EC2 Dashboard&#8217;s Security Groups. A security group is a collection of instructions or rules. By default, there should be one security group called default. To see the details of this group, click on &#8220;1 Security Group&#8221; on the EC2 Dashboard and then click on &#8220;default&#8221;. The rules are listed under the &#8220;Inbound&#8221;-tab. If &#8220;22 (SSH)&#8221; is not listed, you need to open it. This is done by selecting SSH from the drop-down menu, clicking &#8220;+Add Rule&#8221;, and then clicking &#8220;Apply Rule Changes&#8221;. The servers with the default security settings will now be reachable on port 22.</p>
<div id="attachment_3894" class="wp-caption aligncenter" style="width: 590px"><img class="size-full wp-image-3894" title="Amazon EC2 Default Firewall Settings with SSH" src="http://toreopsahl.files.wordpress.com/2011/10/aws-firewall_580.gif?w=455" alt=""   /><p class="wp-caption-text">Amazon EC2 Default Firewall Settings with SSH</p></div>
<p><strong>Running a server</strong></p>
<p>Now you have completed all the one-off set-up tasks, and you are ready to launch a server or instance. By clicking on &#8220;Instances&#8221; on the left-side, you should see the instances running as well as being able to start new one. Click on &#8220;Launch Instance&#8221; to get started, and select &#8220;Launch Classic Wizard&#8221;. There are five parts to this process:</p>
<p><em>1: Choose an AMI</em></p>
<p>The first question you are asked is which kind of software system or machine image (AMI) you want. There are a number of standard ones, but to save time and many lines of code, I will show you how to make use of <a href="http://bioconductor.org/help/bioconductor-cloud-ami/" target="_blank">Bioconductor in the cloud</a>&#8216;s 64-bit Linux system with the latest version of R installed. To load this, select the Community AMIs-tab, enter ami-b5a079dc in the search box (R-2.15; check their website for a new AMI id when a new version of R is released), and then click the Select-button.</p>
<p><em>2: Instance Details</em></p>
<p>The next question you are asked is the resources you would like, and where you would like the server to be located (note that prices vary based on location with &#8220;US East (Virginia)&#8221; often being the cheapest). Please refer to <a href="http://aws.amazon.com/ec2/pricing/" target="_blank">Amazon&#8217;s current pricing table</a>. At the time of writing, the Hi-Memory On-Demand Instances (Linux) cost the following:</p>
<table class="tore" align="center">
<tbody>
<tr>
<th>Instance</th>
<th>Processor Units</th>
<th>Memory</th>
<th>Price per hour</th>
</tr>
<tr>
<td>Extra Large</td>
<td>2 cores / 6.5 ECUs</td>
<td>17.1 GB</td>
<td>$0.50</td>
</tr>
<tr>
<td>Double Extra Large</td>
<td>4 cores/ 13 ECUs</td>
<td>34.2 GB</td>
<td>$1.00</td>
</tr>
<tr>
<td>Quadruple Extra Large</td>
<td>8 cores / 26 ECUs</td>
<td>68.4 GB</td>
<td>$2.00</td>
</tr>
</tbody>
</table>
<p>By clicking continue, you will be offered a number of more advanced options. The default values are ok. On the third screen, you are asked to give the instance a name (e.g., R-server).</p>
<p><em>3: Create Key Pair</em></p>
<p>You should already have completed this part, so you should see the arbitrary name chosen in a drop-down box and be able to just click Continue.</p>
<p><em>4: Configure Firewall</em></p>
<p>We have also already completed this step, so make sure the default group is selected and click Continue.</p>
<p><em>5: Review</em></p>
<p>On the final page, you are able to review all the settings. Below is an example of a 68.4 memory instance.</p>
<div id="attachment_3891" class="wp-caption aligncenter" style="width: 590px"><img class="size-full wp-image-3891" title="Amazon EC2 Launching An AMI" src="http://toreopsahl.files.wordpress.com/2011/10/aws-launchinganaim_580.gif?w=455" alt=""   /><p class="wp-caption-text">Launching an AMI</p></div>
<p>After hitting launch, a server will be allocated and the AMI will be loaded onto it. When it is complete, the status light will turn green and state &#8220;running&#8221;. Do note that you are being charged from this moment. See the final section for information on how to stop being charged.</p>
<p><strong>Connecting to the command line using SSH</strong></p>
<p>To control the server, you need to use SSH. The first thing we need to find out is the address of the server. This information is found by click on a running instance in the Instance-page of the EC2 Dashboard under &#8220;Public DNS&#8221;. An address will be similar to &#8220;ec2-184-72-187-196.compute-1.amazonaws.com&#8221;. Write this address down, or more easily, copy it to your clipboard. I will use this example address in the rest of the post, remember to change it to the address of your instance!</p>
<p>In this post, I am showing how <a href="http://www.chiark.greenend.org.uk/~sgtatham/putty/download.html" target="_blank">PuTTY</a> can be used for this; however, there are a number of other programmes out there that does the same thing. In PuTTY, we need to enter the address of the server and load the ppk-file created earlier with the private key. The screenshots below show the server&#8217;s address entered under Session and the private key loaded under SSH &gt; Auth.</p>
<div id="attachment_3897" class="wp-caption aligncenter" style="width: 590px"><img class="size-full wp-image-3897" title="PuTTY with private key as authentication" src="http://toreopsahl.files.wordpress.com/2011/10/putty_1_address_2_privatekey.gif?w=455" alt=""   /><p class="wp-caption-text">PuTTY with private key as authentication</p></div>
<p>By clicking on Open, PuTTY connects to the server. The first time you connect to a server, you will have to accept the public key. You can check the finger print against the one listed on the Key Pairs-page of the EC2 Dashboard. When asked &#8220;login as:&#8221;, simply enter <code>root</code> to get full privileges on the server.</p>
<p><strong>Running R and installing tnet using SSH</strong></p>
<p>When you have access to the command line, you can start R by simply typing <code>R</code> and hitting enter. This version of R comes with the Bioconductor-packages. To install the latest version of tnet, you need to type <code>install.packages("tnet")</code></p>
<p>After downloading and compiling tnet and its dependencies, you can load tnet by typing <code>library(tnet)</code></p>
<p><strong>Connecting securely to RStudio Server</strong></p>
<p>There are certain limitations to using the command line interface with R. First, it does not allow for graphical representations. Second, it is more cumbersome than the standard R for Windows GUI. To overcome these limitations, <a href="http://rstudio.org/" target="_blank">RStudio Server</a> can be used. This software is a nice GUI for Linux servers running R, and is pre-installed on the Bioconductor AMI. If you opened port 8787 that the <a href="http://bioconductor.org/help/bioconductor-cloud-ami/" target="_blank">Bioconductor in the cloud</a>-tutorial suggests, you could reach this interface by typing <code><a href="http://ec2-184-72-187-196.compute-1.amazonaws.com:8787" rel="nofollow">http://ec2-184-72-187-196.compute-1.amazonaws.com:8787</a></code> in a web browser (remember to replace the address with the one of your instance). However, as mentioned above, this leaves a large security hole open and allows others to &#8220;borrow&#8221; the resources you are paying for as well as being able to steal your data.</p>
<p>It is possible to communicate securely with the RStudio using an SSH tunnel. An SSH tunnel is an encrypted wrapper for other internet traffic. It is possibly best described using a diagram:</p>
<div id="attachment_3899" class="wp-caption aligncenter" style="width: 590px"><img class="size-full wp-image-3899" title="SSH tunnel" src="http://toreopsahl.files.wordpress.com/2011/10/ssh_tunnel.gif?w=455" alt=""   /><p class="wp-caption-text">Using an SSH tunnel to encrypt the connection between a web browser and RStudio server</p></div>
<p>In a standard connection, the web browser connects directly to the RStudio Server on port 8787 (e.g., <code><a href="http://ec2-184-72-187-196.compute-1.amazonaws.com:8787" rel="nofollow">http://ec2-184-72-187-196.compute-1.amazonaws.com:8787</a></code>). This traffic can be intercepted. Conversely, when using an SSH tunnel, the web browser connects to PuTTY, which encrypts the traffic and sends it to the SSH server, which decrypts it and sends it to RStudio Server. By not opening port 8787 in the Firewall, RStudio Server is only available to people logged on to the server.</p>
<p>To configure PuTTY to run an SSH tunnel, you need to follow the instructions for connecting to the command line. Additionally, you need to enter the following details under SSH &gt; Tunnels:</p>
<ul>
<li>Source port: 8787</li>
<li>Destination: localhost:8787</li>
</ul>
<p>Do remember to click &#8220;Add&#8221;. The panel should look similar to this:</p>
<div id="attachment_3904" class="wp-caption aligncenter" style="width: 471px"><img class="size-full wp-image-3904" title="SSH tunnel in PuTTY" src="http://toreopsahl.files.wordpress.com/2011/10/putty_3_tunnel1.gif?w=455" alt=""   /><p class="wp-caption-text">Setting up an SSH tunnel in PuTTY for RStudio Server</p></div>
<p>When you then connect to the instance (click Open and login), the SSH tunnel will be active. Congratulations: You can then open a web browser and type <code><a href="http://localhost:8787" rel="nofollow">http://localhost:8787</a></code> to securely connect to the instance. The default username and password are unbuntu and bioc. In RStudio, you can install tnet by selecting it from the &#8220;Packages&#8221;-tab in the lower-right panel.</p>
<div id="attachment_3905" class="wp-caption aligncenter" style="width: 583px"><img class="size-full wp-image-3905" title="RStudio Server login screen" src="http://toreopsahl.files.wordpress.com/2011/10/rstudioserver.gif?w=455" alt=""   /><p class="wp-caption-text">The login screen of RStudio Server. The default username and password are unbuntu and bioc.</p></div>
<p><strong>A second less-secure alternative</strong></p>
<p>By looking at the length of this post, I do realise that there are quite a few steps to achieve a secure http connection with an Amazon EC2 instance. Although the above solution ensures that it is not possible to eavesdrop on the traffic between your computer and the EC2 instance, there is a simpler trick that should stop most people trying to log into your session: change the default password. You still need to connect to the command line using SSH. When you are there, you should write <code>passwd ubuntu</code> to be prompted to enter a new password. Note that this procedure would require you to open port 8787 in addition to port 22 in the firewall (instead of selecting SSH from the drop-down menu, select &#8220;Custom TCP rule&#8221; and enter 8787 in the port range). Having said that, I do strongly encourage taking the extra step and using an SSH tunnel to ensure that your data and resources are safe.</p>
<p><strong>Stopping and Terminating</strong></p>
<p>As a final note, it is important to stop instances when you are done with them. Otherwise, you will continue to be charged! This is done by selecting an instance on the Instance-page of the EC2 Dashboard, and selecting Stop or Terminate from the Instance Actions drop-down box. Stop means that the server will be shut-down, but all the data and programmes on it will be saved on the Amazon Elastic Block Store (EBS; not free but quite cheap). A stopped instance can easily be restarted by choosing it and selecting Start from the Instance Actions drop-down box. Conversely, Terminate stops an instance without saving it to EBS. It is not possible to restart a terminated instance.</p>
<br />Filed under: <a href='http://toreopsahl.com/category/articles/'>Articles</a> Tagged: <a href='http://toreopsahl.com/tag/ec2/'>EC2</a>, <a href='http://toreopsahl.com/tag/r/'>R</a>, <a href='http://toreopsahl.com/tag/rstudio/'>RStudio</a>, <a href='http://toreopsahl.com/tag/security/'>security</a>, <a href='http://toreopsahl.com/tag/ssh/'>SSH</a>, <a href='http://toreopsahl.com/tag/ssh-tunnel/'>SSH tunnel</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/3874/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/3874/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=3874&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2011/10/17/securely-using-r-and-rstudio-on-amazons-ec2/feed/</wfw:commentRss>
		<slash:comments>17</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2011/10/aws_580.gif" medium="image">
			<media:title type="html">Amazon Web Service&#039;s Management Console: The EC2 Dashboard</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2011/10/aws-firewall_580.gif" medium="image">
			<media:title type="html">Amazon EC2 Default Firewall Settings with SSH</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2011/10/aws-launchinganaim_580.gif" medium="image">
			<media:title type="html">Amazon EC2 Launching An AMI</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2011/10/putty_1_address_2_privatekey.gif" medium="image">
			<media:title type="html">PuTTY with private key as authentication</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2011/10/ssh_tunnel.gif" medium="image">
			<media:title type="html">SSH tunnel</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2011/10/putty_3_tunnel1.gif" medium="image">
			<media:title type="html">SSH tunnel in PuTTY</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2011/10/rstudioserver.gif" medium="image">
			<media:title type="html">RStudio Server login screen</media:title>
		</media:content>
	</item>
		<item>
		<title>Why Anchorage is not (that) important: Binary ties and Sample selection</title>
		<link>http://toreopsahl.com/2011/08/12/why-anchorage-is-not-that-important-binary-ties-and-sample-selection/</link>
		<comments>http://toreopsahl.com/2011/08/12/why-anchorage-is-not-that-important-binary-ties-and-sample-selection/#comments</comments>
		<pubDate>Fri, 12 Aug 2011 01:39:36 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[betweenness]]></category>
		<category><![CDATA[centrality]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[directed networks]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[global]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[hubs]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[shortest distance]]></category>
		<category><![CDATA[shortest path]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[weighted networks]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=3566</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2011/08/12/why-anchorage-is-not-that-important-binary-ties-and-sample-selection/" rel="attachment wp-att-3913"><img src="http://toreopsahl.files.wordpress.com/2011/08/airport_bts_300.png" alt="" title="BTS airport network" width="300" height="121" class="alignright size-full wp-image-3913" /></a>A surprising finding when analysing airport networks is the importance of Anchorage airport in Alaska. In fact, it is the most central airport in the network when applying betweenness! I do not believe this finding is completely accurate due to two reasons: (1) there is a potential for measurement error when not including tie weights (i.e., assigning the same importance to the connection between London Heathrow and New York's JFK as to the connection between Pack Creek Airport and Sitka Harbor Sea Plane Base in Alaska), and (2) relying on US data only leads to sample selection as the airport network is a global system. This post highlights how to use a weighted betweenness measure as well as the extent of the sample selection issue.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=3566&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>A surprising finding when analysing airport networks is the importance of Anchorage airport in Alaska. In fact, it is the most central airport in the network when applying betweenness! Betweenness for a node is defined as the number of shortest paths among all others that passes through the node (see <a href="http://toreopsahl.com/tnet/weighted-networks/node-centrality/" title="Node Centrality in Weighted Networks">Opsahl et al., 2010</a>, for a review). A host of explanations have been offered to account for the high betweenness of Anchorage given its relatively few connections. For example, Guimera et al (2004, pg. 7797) reasoned that:</p>
<blockquote><p>Alaska is a sparsely populated, isolated region with a disproportionately large, for its population size, number of airports. Most Alaskan airports have connections only to other Alaskan airports. This fact makes sense geographically. However, distance-wise, it also would make sense for some Alaskan airports to be connected to airports in Canada’s Northern Territories. These connections are, however, absent. Instead, a few Alaskan airports, singularly Anchorage, are connected to the continental U.S. The reason is clear: the Alaskan population needs to be connected to the political centers, which are located in the continental U.S., whereas there are political constraints making it difficult to have connections to cities in Canada, even to ones that are close geographically ([Guimera and Amaral, 2004]). It is now obvious why Anchorage’s centrality is so large. Indeed, the existence of nodes with anomalous centrality is related to the existence of regions with a high density of airports but few connections to the outside. The degree-betweenness anomaly is therefore ultimately related to the existence of communities in the network.</p></blockquote>
<p>While many researchers and practitioners highlight this finding, I do not believe it is completely accurate. There are two reasons for this:</p>
<p><strong>Issue 1: Binary ties</strong></p>
<p>Admittedly this might be a personal bias as most of my work has been on <a href="http://toreopsahl.com/tnet/weighted-networks/" title="Weighted Networks">weighted networks</a>. Without going into much detail in this blog post, I actually strongly believe that if you assign the same importance to the connection between London Heathrow and New York&#8217;s JFK as you do to the connection between Pack Creek Airport and Sitka Harbor Sea Plane Base in Alaska (<a href="http://maps.google.co.uk/maps/ms?msid=208094715026309109028.0004aa42253f58e0c3772&amp;msa=0&amp;ll=55.825973,-131.748047&amp;spn=21.031216,57.084961" target="_blank">map</a>), then there is a potential for measurement error. The table below lists the top ten airports in terms of betweenness when analyzing the binary and weighted (by passengers) versions of the Bureau of Transportation Statistics (BTS) Transtats data (Brandes, 2001). The code to replicate these results can be found at the end of this page.</p>
<table class="tore" align="center">
<tr>
<th rowspan="3">Rank</th>
<th colspan="4" align="center">Betweenness</th>
</tr>
<tr>
<th colspan="2" align="center">Binary Analysis</th>
<th colspan="2" align="center">Weighted Analysis</th>
</tr>
<tr>
<th align="center">Airport</th>
<th align="center">Score</th>
<th align="center">Airport</th>
<th align="center">Score</th>
</tr>
<tr>
<td>1</td>
<td>ANC (Anchorage, AK, USA)</td>
<td>465272</td>
<td>SEA (Seattle/Tacoma, WA, USA)</td>
<td>834217</td>
</tr>
<tr>
<td>2</td>
<td>FAI (Fairbanks, AK, USA)</td>
<td>215503</td>
<td>ANC (Anchorage, AK, USA)</td>
<td>761834</td>
</tr>
<tr>
<td>3</td>
<td>YYZ (Toronto, Canada, Canada)</td>
<td>131562</td>
<td>ATL (Atlanta, GA, USA)</td>
<td>735628</td>
</tr>
<tr>
<td>4</td>
<td>LAX (Los Angeles, CA, USA)</td>
<td>129246</td>
<td>LAX (Los Angeles, CA, USA)</td>
<td>531980</td>
</tr>
<tr>
<td>5</td>
<td>SEA (Seattle/Tacoma, WA, USA)</td>
<td>125151</td>
<td>ORD (Chicago, IL, USA)</td>
<td>409001</td>
</tr>
<tr>
<td>6</td>
<td>JFK (New York, NY, USA)</td>
<td>124927</td>
<td>DEN (Denver, CO, USA)</td>
<td>314764</td>
</tr>
<tr>
<td>7</td>
<td>HPN (White Plains, NY, USA)</td>
<td>121096</td>
<td>JFK (New York, NY, USA)</td>
<td>247791</td>
</tr>
<tr>
<td>8</td>
<td>MIA (Miami, FL, USA)</td>
<td>120643</td>
<td>MIA (Miami, FL, USA)</td>
<td>206547</td>
</tr>
<tr>
<td>9</td>
<td>DEN (Denver, CO, USA)</td>
<td>120342</td>
<td>BOS (Boston, MA, USA)</td>
<td>168140</td>
</tr>
<tr>
<td>10</td>
<td>MSP (Minneapolis, MN, USA)</td>
<td>111188</td>
<td>FAI (Fairbanks, AK, USA)</td>
<td>157491</td>
</tr>
</table>
<p>This table demonstrates that Anchorage has twice the betweenness of the runner-up, Fairbanks Alaska, in the binary analysis. In the weighted analysis, Anchorage loses the first place to Seattle and Fairbanks moves to 10th place. It is also worth noticing that only US airports are in the top ten lists using both analyses, which leads me on to the second issues with using the BTS data: sample selection.</p>
<p><strong>Issue 2: Sample selection</strong></p>
<p>This issue affects all network studies, and something I have been interested in for a while. We define a population and analyse the connections among them. For example, I have analysed the scientific collaboration network based on the papers uploaded to the arXiv preprint server (e.g., <a href="http://toreopsahl.com/tnet/two-mode-networks/weighted-rich-club-effect/" title="Weighted Rich-club Effect">Opsahl et al., 2008</a>) with the full knowledge that there are many more scientific publications out there as well as other forms of collaboration and channels for knowledge flow among scientists, such as grant proposals and conference attendance. By simply restricting ourselves to data that is easy to collect (often stored in a central location / repository), the research is vulnerable to <a href="http://en.wikipedia.org/wiki/Selection_bias" target="_blank">sample selection bias</a>. </p>
<p>When it comes to airport networks, the Bureau of Transportation Statistics (BTS) Transtats data is straight forward to collect: Go <a href="http://www.transtats.bts.gov/DL_SelectFields.asp?Table_ID=292" target="_blank">here</a>, select what you want (Origin, Destination), and click Download! However, there is a small note on another page explaining the dataset: &#8220;<em>This table combines domestic and international market data reported by U.S. and foreign air carriers, and contains market data by carrier, origin and destination, and service class for enplaned passengers, freight, and mail. For a uniform end date for the combined databases, the last 3 months U.S. carrier domestic data released in T-100 Domestic Market (U.S. Carriers Only) are not included. Flights with both origin and destination in a foreign country are not included.</em>&#8221; It is the last line of this description that highlights the potential sample selection bias. While the data contain all US airports and all domestic flights, it only contains Non-US flights that leave or terminate at a US airport and the Non-US airports on the other end of these flights. As such, a section of the square adjacency matrix is missing (flights from Non-US to Non-US airports in the dataset) as well as the entire rows and columns for airports without flights to the US. To exemplify this bias, I have plotted the routes on a world map below. </p>
<div id="attachment_3666" class="wp-caption aligncenter" style="width: 560px"><a href="http://toreopsahl.files.wordpress.com/2011/08/airport_bts_plot_reduced1.pdf"><img src="http://toreopsahl.files.wordpress.com/2011/08/airport_bts_plot_550.png?w=455" alt="" title="The BTS airport network"   class="size-full wp-image-3666" /></a><p class="wp-caption-text">The BTS airport network in 2010 where the line colour is based on the number of passengers. The code to replicate this image can be found at the end of this page. If you click on the image, a vector graphic version of it is available (pdf; 5.95mb).</p></div>
<p>While it is possible to see a concentration in the US on the above picture, the sample selection becomes much more apparent when highlighting Europe. In the picture to below, it is possible to see that no flights are between any pair of European cities nor any other point on the map. Would you have to transit at New York&#8217;s JFK to get from London to Barcelona? This gap highlights the need for looking for more complete data sources than the Bureau of Transportation Statistics when analysing airport networks.</p>
<div align="center"><div id="attachment_3668" class="wp-caption aligncenter" style="width: 410px"><img src="http://toreopsahl.files.wordpress.com/2011/08/airport_bts_plot_europe_400.png?w=455" alt="" title="The European part of the BTS airport network"   class="size-full wp-image-3668" /><p class="wp-caption-text">The European part of the BTS airport network</p></div></div>
<p><strong>Alternative data-source</strong></p>
<p>There are a couple of authoritative databases with world-wide airline routes. However, most of them are proprietary as they have enormous business intelligence potential and, as a consequence, are difficult to collect. OAG Worldwide is one such database, and it should be noted that Guimera et al (2004) went through the hoops by getting this data, and therefore, had a much more complete view of the airport network than if they had used the BTS Transtats data. While I do not have access to such a database, <a href="http://openflights.org/" target="_blank">Openflights.org</a> is a crowdsourced alternative. Although using this data comes without any guarantee, it has the potential to showcase the limitations of the BTS Transtats data. As a first step, I mapped the data to ensure there were no obvious pockets of missing data.</p>
<div id="attachment_3667" class="wp-caption aligncenter" style="width: 560px"><a href="http://toreopsahl.files.wordpress.com/2011/08/airport_of_plot_reduced1.pdf"><img src="http://toreopsahl.files.wordpress.com/2011/08/airport_of_plot_550.png?w=455" alt="" title="The Openflight.org airport network"   class="size-full wp-image-3667" /></a><p class="wp-caption-text">The Openflight.org airport network where the line colour is based on the number of routes (accessed on August 12, 2011). The code to replicate this image can be found at the end of this page. If you click on the image, a vector graphic version of it is available (pdf; 5.25mb).</p></div>
<p><strong>Conclusion 1: Anchorage is not the most important airport</strong></p>
<p>As can be seen from this picture, there are no obvious areas without any form of airline traffic. To show how this data impacts on a betweenness analysis, I have computed betweenness on both the binary and weighted (by number of routes as the passenger numbers were not available) versions of the network. As can be seen in the table below, major airports located around the globe get the highest scores in these analyses instead of only US airports. Specifically, Anchorage is only the third most central in the binary analysis, and the 14th most central in the weighted analysis. As such, it is still an important airport in the networks, but maybe not the most important. </p>
<table class="tore" align="center">
<tr>
<th rowspan="3">Rank</th>
<th colspan="4" align="center">Betweenness</th>
</tr>
<tr>
<th colspan="2" align="center">Binary Analysis</th>
<th colspan="2" align="center">Weighted Analysis</th>
</tr>
<tr>
<th align="center">Airport</th>
<th align="center">Score</th>
<th align="center">Airport</th>
<th align="center">Score</th>
</tr>
<tr>
<td>1</td>
<td>FRA (Frankfurt, Germany)</td>
<td>587531</td>
<td>LHR (London, United Kingdom)</td>
<td>1858349</td>
</tr>
<tr>
<td>2</td>
<td>CDG (Paris, France)</td>
<td>520707</td>
<td>LAX (Los Angeles, United States)</td>
<td>1310287</td>
</tr>
<tr>
<td>3</td>
<td>ANC (Anchorage, United States)</td>
<td>481044</td>
<td>JFK (New York, United States)</td>
<td>1084392</td>
</tr>
<tr>
<td>4</td>
<td>DXB (Dubai, United Arab Emirates)</td>
<td>443314</td>
<td>BKK (Bangkok, Thailand)</td>
<td>797785</td>
</tr>
<tr>
<td>5</td>
<td>GRU (Sao Paulo, Brazil)</td>
<td>402882</td>
<td>SIN (Singapore)</td>
<td>739981</td>
</tr>
<tr>
<td>6</td>
<td>YYZ (Toronto, Canada)</td>
<td>398869</td>
<td>SEA (Seattle, United States)</td>
<td>723145</td>
</tr>
<tr>
<td>7</td>
<td>LHR (London, United Kingdom)</td>
<td>389846</td>
<td>MAD (Madrid, Spain)</td>
<td>707354</td>
</tr>
<tr>
<td>8</td>
<td>LAX (Los Angeles, United States)</td>
<td>356600</td>
<td>GRU (Sao Paulo, Brazil)</td>
<td>684057</td>
</tr>
<tr>
<td>9</td>
<td>DME (Moscow, Russia)</td>
<td>353902</td>
<td>NRT (Tokyo, Japan)</td>
<td>639074</td>
</tr>
<tr>
<td>10</td>
<td>BKK (Bangkok, Thailand)</td>
<td>352682</td>
<td>DXB (Dubai, United Arab Emirates)</td>
<td>610765</td>
</tr>
<tr>
<td>&#8230;</td>
<td>&#8230;</td>
<td>&#8230;</td>
<td>&#8230;</td>
<td>&#8230;</td>
</tr>
<tr>
<td>14</td>
<td>&#8230;</td>
<td>&#8230;</td>
<td>ANC (Anchorage, United States)</td>
<td>469203</td>
</tr>
<tr>
<td>18</td>
<td>&#8230;</td>
<td>&#8230;</td>
<td>FRA (Frankfurt, Germany)</td>
<td>392418</td>
</tr>
</table>
<p><font color="white">_</font><br /><strong>Conclusion 2: Finding the global superhub using a weighted approached</strong></p>
<p>London Heathrow is the most central airport when considering both tie weights and the global airport network. And this, unlike Anchorage, is not a surprising finding as it is the airport with most international passengers (<a href="http://www.aci.aero/cda/aci_common/display/main/aci_content07_c.jsp?zn=aci&amp;cp=1-5-212-1376-1379_666_2__" target="_blank">Airports Council International, 2011</a>). </p>
<p>To further investigate the effects on the ranking when considering tie weights in the global airport networks, I considered the change in ranking of the two airports ranked first in the binary and weighted analyses, Frankfurt and London Heathrow. Frankfurt went from having the highest betweenness in the binary analysis to only having 18th highest betweenness in the weighted analysis. Conversely, London Heathrow went from having the seventh highest to the highest betweenness score. To look into this cross-over of rankings, I compared the degree (number of airports with direct flights) and strength (number of direct routes) from these two airports:</p>
<table class="tore" align="center">
<tr>
<th align="center" rowspan="2">Airport</th>
<th align="center" rowspan="2">Degree</th>
<th align="center" rowspan="2">Node Strength</th>
<th align="center" colspan="5">Strength distribution</th>
</tr>
<tr>
<th>1</th>
<th>2</th>
<th>3</th>
<th>4</th>
<th>5</th>
</tr>
<tr>
<td>FRA (Frankfurt, Germany)</td>
<td>237</td>
<td>349</td>
<td>142</td>
<td>82</td>
<td>9</td>
<td>4</td>
<td>0</td>
</tr>
<tr>
<td>LHR (London, United Kingdom)</td>
<td>157</td>
<td>288</td>
<td>71</td>
<td>55</td>
<td>22</td>
<td>4</td>
<td>5</td>
</tr>
</table>
<p>This table shows that Frankfurt has direct flights to 51% more airports than London Heathrow, but only 21% more routes. The variation in tie weights can be further investigated by looking at the weight distribution. While there are only four airports with four direct routes from Frankfurt, there are nine airports with four or five direct routes from London Heathrow. </p>
<p>Moreover, by looking at which airports have the strong ties (i.e., with tie weights greater or equal to 4) with Frankfurt and London Heathrow, it is possible to see that the geographical distribution is strikingly different. Frankfurt has four direct routes to Antalya (Turkey), Madrid (Spain), Mallorca (Spain), and Vienna (Austria), which are 5,597 kilometres long (average: 1,399km). Conversely, London Heathrow has five routes to Delhi (India), Dubai (UAE), Hong Kong (China), Los Angeles (LAX, USA), and New York City (JFK, USA) and four routes to Bangkok (Thailand), Mumbai (India), Boston (USA), and Miami (USA), which are 65,376 kilometres long (average 7,264km). By having strong ties to geographically distant instead of close airports, London Heathrow acts as a intercontinental hub instead of a continental hub. Additionally, the airports with strong ties to London Heathrow have high betweenness, and therfore, act as hubs in their respective regions. As such, London Heathrow can be seen as the global hub of the world-wide airport network.</p>
<p><strong>References</strong></p>
<p>Airports Council International, 2011. <a href="http://www.aci.aero/cda/aci_common/display/main/aci_content07_c.jsp?zn=aci&amp;cp=1-5-212-1376-1379_666_2__" target="_blank">Year to date International Passenger Traffic, Apr-2011</a>, accessed August 12, 2011.</p>
<p>Brandes, U., 2001. A Faster Algorithm for Betweenness Centrality. Journal of Mathematical Sociology 25, 163-177.</p>
<p>Bureau of Transportation Statistics, 2011. <a href="http://www.transtats.bts.gov/DL_SelectFields.asp?Table_ID=292" target="_blank">Air Carrier Statistics (Form 41 Traffic): T-100 Market (All Carriers)</a>, accessed August 12, 2011.</p>
<p>Guimera, R., Amaral, L. A. N., 2004. <a href="http://www.etseq.urv.cat/seeslab/media/publication_pdfs/Guimera-2004-Eur.Phys.J.B-38-381.pdf" target="_blank">Modeling the world-wide airport network</a>. The European Physical Journal B 38, 381–385.</p>
<p>Guimera, R., Mossa, S., Turtschi, A., Amaral, L. A. N., 2004. <a href="http://www.pnas.org/content/102/22/7794" target="_blank">The worldwide air transportation network: Anomalous centrality, community structure, and cities’ global roles</a>. Proceedings of the National Academy of Sciences 102(22), 7794-7799. </p>
<p>Openflights.org, 2011. <a href="http://openflights.org/data.html" target="_blank">Airport, airline and route data</a>, accessed August 12, 2011.</p>
<p>Opsahl, T., Agneessens, F., Skvoretz, J., 2010. <a href="http://toreopsahl.com/2010/04/21/article-node-centrality-in-weighted-networks-generalizing-degree-and-shortest-paths/" title="Article: Node centrality in weighted networks: Generalizing degree and shortest paths">Node centrality in weighted networks: Generalizing degree and shortest paths</a>. Social Networks 32 (3), 245-251.</p>
<p>Opsahl, T., Colizza, V., Panzarasa, P., Ramasco, J. J., 2008. <a href="http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/">Prominence and control: The weighted rich-club effect</a>. Physical Review Letters 101 (168702).</p>
<div class="knobcite">If you use any of the information in this post, please cite: Opsahl, T., Agneessens, F., Skvoretz, J., 2010. Node centrality in weighted networks: Generalizing degree and shortest paths. Social Networks 32 (3), 245-251.</div>
<div class="knobinfo">I would like to acknowledge <a href="http://people.oii.ox.ac.uk/hogan/" target="_blank">Bernie Hogan</a> in helping to develop the idea behind this post.</div>
<p><strong>Code used to create the results in this blog post</strong></p>
<p>Below is the code to redo the analysis in this post. You need to have the R-packages geosphere, maps, and tnet installed before to run the code. You also need to download the Bureau of Transportation Statistics (BTS) Transtats data. Please see notes in the code.</p>
<pre class="brush: plain; title: ; notranslate">
###################################
## US Airport network (BTS data) ##
###################################

# Load BTS Transtats data
# Downloaded from http://www.transtats.bts.gov/DL_SelectFields.asp?Table_ID=292 using
# Filters: Geography=all; Year=2010; Months=all
# Columns: Passengers, Origin, OriginCountryName, Dest, DestCountryName
BTS &lt;- read.csv(&quot;data/344989982_T_T100_MARKET_ALL_CARRIER.csv&quot;, header=TRUE, stringsAsFactors=FALSE)
BTS &lt;- BTS[,c(&quot;ORIGIN&quot;, &quot;ORIGIN_COUNTRY_NAME&quot;, &quot;DEST&quot;, &quot;DEST_COUNTRY_NAME&quot;, &quot;PASSENGERS&quot;)]

# Load airport information (incl. geolocations)
# Downloaded from http://www.transtats.bts.gov/DL_SelectFields.asp?Table_ID=288 (select all columns)
BTSairports &lt;- read.csv(&quot;data/344990073_T_MASTER_CORD.csv&quot;, stringsAsFactors=FALSE)

# Replace airport codes with id numbers (net1)
net1 &lt;- BTS
net1.labels &lt;- unique(c(net1[,&quot;ORIGIN&quot;], net1[,&quot;DEST&quot;]))
net1.labels &lt;- net1.labels[order(net1.labels)]
net1[,&quot;ORIGIN&quot;] &lt;- factor(x=net1[,&quot;ORIGIN&quot;], levels=net1.labels)
net1[,&quot;DEST&quot;]   &lt;- factor(x=net1[,&quot;DEST&quot;], levels=net1.labels)
net1 &lt;- data.frame(i=as.integer(net1[,&quot;ORIGIN&quot;]), j=as.integer(net1[,&quot;DEST&quot;]), w=net1[,&quot;PASSENGERS&quot;])

# Add up duplicated entries (multiple routes)
net1 &lt;- net1[order(net1[,&quot;i&quot;], net1[,&quot;j&quot;]),]
index &lt;- !duplicated(net1[,c(&quot;i&quot;,&quot;j&quot;)])
net1 &lt;- data.frame(net1[index,c(&quot;i&quot;,&quot;j&quot;)], w=tapply(net1[,&quot;w&quot;], cumsum(index), sum))

# Take out routes with no passengers (cargo)
net1 &lt;- net1[net1[,&quot;w&quot;]&gt;0,]

# Take out routes from an airport to itself
net1 &lt;- net1[net1[,&quot;i&quot;]!=net1[,&quot;j&quot;],]

# Load tnet and the network as a tnet object
library(tnet)
net1 &lt;- as.tnet(net1, type=&quot;weighted one-mode tnet&quot;)

# Calculate binary and weighted betweenness
tmp0 &lt;- betweenness_w(net1, alpha=0)
tmp1 &lt;- betweenness_w(net1, alpha=1)

# Create output object with top x airports
x &lt;- 10
out &lt;- data.frame(
  tmp0[order(-tmp0[,&quot;betweenness&quot;]),][1:x,],
  tmp1[order(-tmp1[,&quot;betweenness&quot;]),][1:x,])
dimnames(out)[[2]] &lt;- c(&quot;BTS.bb.node&quot;, &quot;BTS.bb.score&quot;, &quot;BTS.wb.node&quot;, &quot;BTS.wb.score&quot;)
BTSairports[BTSairports[,&quot;TR_COUNTRY_NAME&quot;]==&quot;United States of America&quot;,&quot;TR_COUNTRY_NAME&quot;]  &lt;- &quot;USA&quot;
for(i in 1:x) {
  # Insert label of airport ID (binary)
  tmp2 &lt;- net1.labels[as.integer(out[i,&quot;BTS.bb.node&quot;])][1]
  tmp2 &lt;- BTSairports[BTSairports[,&quot;AIRPORT&quot;]==tmp2,][1,]
  out[i,&quot;BTS.bb.node&quot;] &lt;- paste(tmp2[&quot;AIRPORT&quot;], &quot; (&quot;, tmp2[&quot;TR_CITY_NAME&quot;], &quot;, &quot;, tmp2[&quot;TR_COUNTRY_NAME&quot;], &quot;)&quot;, sep=&quot;&quot;)
  # Insert label of airport ID (weighted)
  tmp2 &lt;- net1.labels[as.integer(out[i,&quot;BTS.wb.node&quot;])][1]
  tmp2 &lt;- BTSairports[BTSairports[,&quot;AIRPORT&quot;]==tmp2,][1,]
  out[i,&quot;BTS.wb.node&quot;] &lt;- paste(tmp2[&quot;AIRPORT&quot;], &quot; (&quot;, tmp2[&quot;TR_CITY_NAME&quot;], &quot;, &quot;, tmp2[&quot;TR_COUNTRY_NAME&quot;], &quot;)&quot;, sep=&quot;&quot;)
}


#########################
## Plot the US network ##
#########################

# Based on FlowingData's blog post (http://flowingdata.com/2011/05/11/how-to-map-connections-with-great-circles/) Thanks Nathan! 
# Load required packages (type ?install.packages if you get an error)
library(maps)
library(geosphere)

# Symmetrise the network to get the correct tie weight for visualisation as the two directed ties are plotted on top of each other
net1s &lt;- as.data.frame(symmetrise_w(net1, method=&quot;SUM&quot;))
net1s &lt;- net1s[net1s[,&quot;i&quot;]&lt;net1s[,&quot;j&quot;],]

# Put labels back in summed up network (i.e, no duplicates) 
net1s[,&quot;i&quot;] &lt;- net1.labels[net1s[,&quot;i&quot;]]
net1s[,&quot;j&quot;] &lt;- net1.labels[net1s[,&quot;j&quot;]]

# Sort data so that weak ties are plotted first
net1s &lt;- net1s[order(net1s[,&quot;w&quot;]),]

# Set up world map and colors for lines
pdf(&quot;airport_BTS_plot.pdf&quot;, width=11, height=7)
map(&quot;world&quot;, col=&quot;#eeeeee&quot;, fill=TRUE, bg=&quot;white&quot;, lwd=0.05)
pal &lt;- colorRampPalette(c(&quot;#cccccc&quot;, &quot;black&quot;))
colors &lt;- pal(length(unique(net1s[,&quot;w&quot;])))
colors &lt;- rep(colors, times=as.integer(table(net1s[,&quot;w&quot;])))

# Plot ties
for(i in 1:nrow(net1s)) {
  # Get longitude and latitude of the two airports
  tmp1 &lt;- BTSairports[BTSairports[&quot;AIRPORT&quot;]==net1s[i,&quot;i&quot;],c(&quot;LONGITUDE&quot;,&quot;LATITUDE&quot;)][1,]
  tmp2 &lt;- BTSairports[BTSairports[&quot;AIRPORT&quot;]==net1s[i,&quot;j&quot;],c(&quot;LONGITUDE&quot;,&quot;LATITUDE&quot;)][1,]
  # Get the geographical distance to see how many points on the Great Circle to plot
  tmp3 &lt;- 10*ceiling(as.numeric(log(3963.1 * acos((sin(tmp1[2]/(180/pi))*sin(tmp2[2]/(180/pi)))+(cos(tmp1[2]/(180/pi))*cos(tmp2[2]/(180/pi))*cos(tmp1[1]/(180/pi)-tmp2[1]/(180/pi)))))))
  # Line coordinates
  inter &lt;- gcIntermediate(tmp1, tmp2, n=round(tmp3), addStartEnd=TRUE, breakAtDateLine=TRUE)
  # Plot one line if the line does not cross the date line; two if so
  if(is.matrix(inter)) {
    lines(inter, col=colors[i], lwd=0.6)
  } else {
    for(j in 1:length(inter))
      lines(inter[[j]], col=colors[i], lwd=0.6)
  }
}
dev.off()


##########################
## Openflights.org data ##
##########################

# Download airport geolocations from openflights.org/data.html, and set column headings
# &quot;http://openflights.svn.sourceforge.net/viewvc/openflights/openflights/data/airports.dat&quot;
OFairports &lt;- read.csv(&quot;data/airports.dat&quot;, header=FALSE, stringsAsFactors=FALSE)
dimnames(OFairports)[[2]] &lt;- c(&quot;Airport ID&quot;, &quot;Name&quot;, &quot;City&quot;, &quot;Country&quot;, &quot;IATA/FAA&quot;, &quot;ICAO&quot;, &quot;Latitude&quot;, &quot;Longitude&quot;, &quot;Altitude&quot;, &quot;Timezone&quot;, &quot;DST&quot;)

# Download routes from openflights.org/data.html
# &quot;http://openflights.svn.sourceforge.net/viewvc/openflights/openflights/data/routes.dat&quot;
OF &lt;- read.csv(&quot;data/routes.dat&quot;, header=FALSE, stringsAsFactors=FALSE)
dimnames(OF)[[2]] &lt;- c(&quot;Airline&quot;, &quot;Airline ID&quot;, &quot;Source airport&quot;, &quot;Source airport ID&quot;, &quot;Destination airport&quot;, &quot;Destination airport ID&quot;, &quot;Codeshare&quot;, &quot;Stops&quot;, &quot;Equipment&quot;)

# Remove code-shares as these are duplicated entries
net2 &lt;- OF[OF[,&quot;Codeshare&quot;]==&quot;&quot;,c(&quot;Source airport ID&quot;, &quot;Destination airport ID&quot;)]

# Take out routes from an airport to itself and the missing cases (~1%)
net2 &lt;- net2[net2[,&quot;Source airport ID&quot;]!=net2[,&quot;Destination airport ID&quot;],]
net2 &lt;- net2[net2[,&quot;Source airport ID&quot;]!=&quot;\\N&quot;,]
net2 &lt;- net2[net2[,&quot;Destination airport ID&quot;]!=&quot;\\N&quot;,]

# As passengers per route is not available, create a weighted network with the weight equal to number of routes
net2 &lt;- data.frame(i=as.integer(net2[,&quot;Source airport ID&quot;]), j=as.integer(net2[,&quot;Destination airport ID&quot;]))
net2 &lt;- shrink_to_weighted_network(net2)


##################################
## Plot the OpenFlights network ##
##################################

# Symmetrise data for visualisation
net2s &lt;- as.data.frame(symmetrise_w(net2, method=&quot;SUM&quot;))
net2s &lt;- net2s[net2s[,&quot;i&quot;]&lt;net2s[,&quot;j&quot;],]

# Sort data so that weak ties are plotted first
net2s &lt;- net2s[order(net2s[,&quot;w&quot;]),]

# Set up world map and colors for lines
pdf(&quot;airport_OF_plot.pdf&quot;, width=11, height=7)
map(&quot;world&quot;, col=&quot;#eeeeee&quot;, fill=TRUE, bg=&quot;white&quot;, lwd=0.05)
pal &lt;- colorRampPalette(c(&quot;#cccccc&quot;, &quot;black&quot;))
colors &lt;- pal(length(unique(net2s[,&quot;w&quot;])))
colors &lt;- rep(colors, times=as.integer(table(net2s[,&quot;w&quot;])))

# Plot ties
for(i in 1:nrow(net2s)) {
  # Get longitude and latitude of the two airports
  tmp1 &lt;- as.numeric(OFairports[OFairports[&quot;Airport ID&quot;]==net2s[i,&quot;i&quot;],c(&quot;Longitude&quot;,&quot;Latitude&quot;)][1,])
  tmp2 &lt;- as.numeric(OFairports[OFairports[&quot;Airport ID&quot;]==net2s[i,&quot;j&quot;],c(&quot;Longitude&quot;,&quot;Latitude&quot;)][1,])
  # Get the geographical distance to see how many points on the Great Circle to plot
  tmp3 &lt;- 10*ceiling(as.numeric(log(3963.1 * acos((sin(tmp1[2]/(180/pi))*sin(tmp2[2]/(180/pi)))+(cos(tmp1[2]/(180/pi))*cos(tmp2[2]/(180/pi))*cos(tmp1[1]/(180/pi)-tmp2[1]/(180/pi)))))))
  # Line coordinates
  inter &lt;- gcIntermediate(tmp1, tmp2, n=round(tmp3), addStartEnd=TRUE, breakAtDateLine=TRUE)
  # Plot one line if the line does not cross the date line; two if so
  if(is.matrix(inter)) {
    lines(inter, col=colors[i], lwd=0.6)
  } else {
    for(j in 1:length(inter))
      lines(inter[[j]], col=colors[i], lwd=0.6)
  }
}
dev.off()


#####################################
## Analyse the OpenFlights network ##
#####################################

# Calculate binary and weighted betweenness (on the directed network, net2)
tmp0 &lt;- betweenness_w(net2, alpha=0)
tmp1 &lt;- betweenness_w(net2, alpha=1)

# Create output object with top x airports
out &lt;- data.frame(out,
  tmp0[order(-tmp0[,&quot;betweenness&quot;]),][1:x,],
  tmp1[order(-tmp1[,&quot;betweenness&quot;]),][1:x,])
dimnames(out)[[2]][5:8] &lt;- c(&quot;OF.bb.node&quot;, &quot;OF.bb.score&quot;, &quot;OF.wb.node&quot;, &quot;OF.wb.score&quot;)
for(i in 1:x) {
  # Insert label of airport ID (binary)
  tmp2 &lt;- OFairports[OFairports[,&quot;Airport ID&quot;]==out[i,&quot;OF.bb.node&quot;],]
  out[i,&quot;OF.bb.node&quot;] &lt;- paste(tmp2[&quot;IATA/FAA&quot;], &quot; (&quot;, tmp2[&quot;City&quot;], &quot;, &quot;, tmp2[&quot;Country&quot;], &quot;)&quot;, sep=&quot;&quot;)
  # Insert label of airport ID (weighted)
  tmp2 &lt;- OFairports[OFairports[,&quot;Airport ID&quot;]==out[i,&quot;OF.wb.node&quot;],]
  out[i,&quot;OF.wb.node&quot;] &lt;- paste(tmp2[&quot;IATA/FAA&quot;], &quot; (&quot;, tmp2[&quot;City&quot;], &quot;, &quot;, tmp2[&quot;Country&quot;], &quot;)&quot;, sep=&quot;&quot;)
}


###########################
## Comparing FRA and LHR ##
###########################

# Get FRA and LHR's airport ids
ids &lt;- sapply(c(&quot;FRA&quot;, &quot;LHR&quot;), function(a) OFairports[OFairports[,&quot;IATA/FAA&quot;]==a,&quot;Airport ID&quot;])
# Rank and Score of FRA
tmp1 &lt;- as.data.frame(tmp1[order(-tmp1[,&quot;betweenness&quot;]),])
tmp1[tmp1[,&quot;node&quot;]==ids[&quot;FRA&quot;],]
# Degree and Node strength
tmp3 &lt;- degree_w(net2)
tmp3[ids,]
# Weight distribution
sapply(ids, function(a) table(net2[net2[,&quot;i&quot;]==a,3]))
# Airports with strong ties (w&gt;=4)
tmp4 &lt;- lapply(ids, function(a) data.frame(net2[net2[,&quot;i&quot;]==a &amp; net2[,&quot;w&quot;]&gt;=4,], label=&quot;&quot;, geo.dist=NaN, stringsAsFactors=FALSE))
# Insert labels
for(a in 1:2) {
  for(b in 1:nrow(tmp4[[a]])) {
    tmp2 &lt;- OFairports[OFairports[,&quot;Airport ID&quot;]==tmp4[[a]][b,&quot;j&quot;],][1,]
    tmp4[[a]][b, &quot;label&quot;] &lt;- paste(tmp2[&quot;IATA/FAA&quot;], &quot; (&quot;, tmp2[&quot;City&quot;], &quot;, &quot;, tmp2[&quot;Country&quot;], &quot;)&quot;, sep=&quot;&quot;)
  }
}
# Geographical distance
for(a in 1:2) {
  tmp5 &lt;- as.numeric(OFairports[OFairports[&quot;Airport ID&quot;]==ids[a],c(&quot;Longitude&quot;,&quot;Latitude&quot;)][1,])
  for(b in 1:nrow(tmp4[[a]])) {
    tmp6 &lt;- as.numeric(OFairports[OFairports[&quot;Airport ID&quot;]==tmp4[[a]][b,&quot;j&quot;],c(&quot;Longitude&quot;,&quot;Latitude&quot;)][1,])
    tmp4[[a]][b, &quot;geo.dist&quot;] &lt;- 6378.7 * acos((sin(tmp5[2]/(180/pi))*sin(tmp6[2]/(180/pi)))+(cos(tmp5[2]/(180/pi))*cos(tmp6[2]/(180/pi))*cos(tmp5[1]/(180/pi)-tmp6[1]/(180/pi))))
  }
}
sapply(1:2, function(a) mean(tmp4[[a]][,&quot;geo.dist&quot;]))
sapply(1:2, function(a) sum(tmp4[[a]][,&quot;geo.dist&quot;]))
</pre>
<br />Filed under: <a href='http://toreopsahl.com/category/network-thoughts/'>Network thoughts</a> Tagged: <a href='http://toreopsahl.com/tag/arcs/'>arcs</a>, <a href='http://toreopsahl.com/tag/betweenness/'>betweenness</a>, <a href='http://toreopsahl.com/tag/centrality/'>centrality</a>, <a href='http://toreopsahl.com/tag/complex-networks/'>complex networks</a>, <a href='http://toreopsahl.com/tag/directed-networks/'>directed networks</a>, <a href='http://toreopsahl.com/tag/edges/'>edges</a>, <a href='http://toreopsahl.com/tag/global/'>global</a>, <a href='http://toreopsahl.com/tag/graphs/'>graphs</a>, <a href='http://toreopsahl.com/tag/hubs/'>hubs</a>, <a href='http://toreopsahl.com/tag/links/'>Links</a>, <a href='http://toreopsahl.com/tag/network/'>network</a>, <a href='http://toreopsahl.com/tag/nodes/'>nodes</a>, <a href='http://toreopsahl.com/tag/shortest-distance/'>shortest distance</a>, <a href='http://toreopsahl.com/tag/shortest-path/'>shortest path</a>, <a href='http://toreopsahl.com/tag/social-network-analysis/'>social network analysis</a>, <a href='http://toreopsahl.com/tag/strength-of-ties/'>strength of ties</a>, <a href='http://toreopsahl.com/tag/ties/'>ties</a>, <a href='http://toreopsahl.com/tag/valued-networks/'>valued networks</a>, <a href='http://toreopsahl.com/tag/vertices/'>vertices</a>, <a href='http://toreopsahl.com/tag/weighted-networks/'>weighted networks</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/3566/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/3566/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=3566&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2011/08/12/why-anchorage-is-not-that-important-binary-ties-and-sample-selection/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2011/08/airport_bts_plot_550.png" medium="image">
			<media:title type="html">The BTS airport network</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2011/08/airport_bts_plot_europe_400.png" medium="image">
			<media:title type="html">The European part of the BTS airport network</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2011/08/airport_of_plot_550.png" medium="image">
			<media:title type="html">The Openflight.org airport network</media:title>
		</media:content>
	</item>
		<item>
		<title>Degree Centrality and Variation in Tie Weights</title>
		<link>http://toreopsahl.com/2011/08/08/degree-centrality-and-variation-in-tie-weights/</link>
		<comments>http://toreopsahl.com/2011/08/08/degree-centrality-and-variation-in-tie-weights/#comments</comments>
		<pubDate>Mon, 08 Aug 2011 23:28:56 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[centrality]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[degree]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[gregariousness]]></category>
		<category><![CDATA[hubs]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[local]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[popularity]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[strength of nodes]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[weighted networks]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=3494</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2011/08/08/degree-centrality-and-variation-in-tie-weights/"><img src="http://toreopsahl.files.wordpress.com/2011/08/degree_variation1.png" alt="" title="Variation in Tie Weights" width="274" height="149" class="alignright size-full wp-image-3543" /></a>A central metric in network research is the number of ties each node has, degree. Degree has been generalised to weighted networks as the sum of tie weights (Barrat et al., 2004), and as a function of the number of ties and the sum of their weights (Opsahl et al., 2010). However, all these measures are insensitive to variation in the tie weights. As such, the two nodes in this diagram would always have the same degree score. This post showcases a new measure that uses a tuning parameter to control whether variation should be taken favourable or discount the degree centrality score of a focal node.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=3494&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Node centrality, or the detection and identification of the central nodes in a network, has been a key issue in network studies. The basic node centrality measure is degree, which is defined as the number of connections or ties a focal node has (Freeman, 1978). Degree is a basic indicator and often used as a first step when studying networks (Wasserman and Faust, 1994). To formally describe this measure and ease the comparison among the different measures introduced in this post, this measure can be formalised for a focal node <em>i</em> as:</p>
<div align="center"><img src='http://s0.wp.com/latex.php?latex=k%28i%29+%3D+%5Csum_j%5EN+x_%7Bij%7D&amp;bg=ffffff&amp;fg=414141&amp;s=0' alt='k(i) = &#92;sum_j^N x_{ij}' title='k(i) = &#92;sum_j^N x_{ij}' class='latex' /></div>
<p>where <em>j</em> represents all other nodes, <em>N</em> is the total number of nodes, and <em>x</em> is the adjacency matrix, in which the cell <img src='http://s0.wp.com/latex.php?latex=x_%7Bij%7D&amp;bg=ffffff&amp;fg=414141&amp;s=0' alt='x_{ij}' title='x_{ij}' class='latex' /> is defined as 1 if node <em>i</em> is connected to node <em>j</em>, and 0 otherwise.</p>
<p>Degree has generally been extended to the sum of weights when analysing weighted networks, and labeled node strength (Barrat et al., 2004). This measure can be formalised as follows:</p>
<div align="center"><img src='http://s0.wp.com/latex.php?latex=k%5Ew%28i%29+%3D+s%28i%29+%3D+%5Csum_j%5EN+w_%7Bij%7D&amp;bg=ffffff&amp;fg=414141&amp;s=0' alt='k^w(i) = s(i) = &#92;sum_j^N w_{ij}' title='k^w(i) = s(i) = &#92;sum_j^N w_{ij}' class='latex' /></div>
<p>where <em>w</em> is the weighted adjacency matrix, in which <img src='http://s0.wp.com/latex.php?latex=w_%7Bij%7D&amp;bg=ffffff&amp;fg=414141&amp;s=0' alt='w_{ij}' title='w_{ij}' class='latex' /> is greater than 0 if the node <em>i</em> is connected to node <em>j</em>, and the value represents the weight of the tie. This is equal to the definition of degree if the network is binary, i.e. each tie has a weight of 1. Conversely, in weighted networks, the outcomes of these two measures are different. Since node strength takes into consideration the weights of ties, this has been the preferred measure for analyzing weighted networks (e.g., Barrat et al., 2004; Opsahl et al., 2008). </p>
<p><div id="attachment_3545" class="wp-caption alignright" style="width: 284px"><img src="http://toreopsahl.files.wordpress.com/2011/08/degree_variation3.png?w=455" alt="" title="Degree and Strength"   class="size-full wp-image-3545" /><p class="wp-caption-text">Degree and Strength: Two nodes with the same node strength, but different number of ties.</p></div>Nevertheless, node strength is a blunt measure as it only takes into consideration a node&#8217;s total level of involvement in the network, and not the number of other nodes to which it connected. To exemplify this, node A and node B have the same strength, but node A is connected to three times as many nodes as node A, and is therefore, involved in more parts of the network. As degree and strength can be both indicators of the level of involvement of a node in the surrounding network, a second generalisation was proposed by Opsahl et al. (2010) that incorporated both the number of ties and the sum of tie weights. Their measure can be formalised as:</p>
<div align="center"><img src='http://s0.wp.com/latex.php?latex=k%5E%7B%5Calpha%7D+%3D+k%28i%29+%5Ctimes+%5Cleft%28%5Cfrac%7Bs%28i%29%7D%7Bk%28i%29%7D%5Cright%29%5E%5Calpha&amp;bg=ffffff&amp;fg=414141&amp;s=0' alt='k^{&#92;alpha} = k(i) &#92;times &#92;left(&#92;frac{s(i)}{k(i)}&#92;right)^&#92;alpha' title='k^{&#92;alpha} = k(i) &#92;times &#92;left(&#92;frac{s(i)}{k(i)}&#92;right)^&#92;alpha' class='latex' /></div>
<p>where <img src='http://s0.wp.com/latex.php?latex=%5Calpha&amp;bg=ffffff&amp;fg=414141&amp;s=0' alt='&#92;alpha' title='&#92;alpha' class='latex' /> is a positive tuning parameter that controls the relative importance of the number of ties and the sum of ties. Specifically, there are two benchmark values (0 and 1), and if the parameter is set to either of these values, the existing measure is reproduced. If the parameter is set to the benchmark value of 0, the outcomes of the measure is solely based on the number of ties, and are equal to the ones found when applying Freeman&#8217;s (1978) measure to a binary version of a network where all the ties with a weight greater than 0 are set to present. Conversely, if the value of the parameter is 1, the outcomes of the measure is based on tie weights only, and are identical to the already proposed generalization of degree (Barrat et al., 2004). For other values of <img src='http://s0.wp.com/latex.php?latex=%5Calpha&amp;bg=ffffff&amp;fg=414141&amp;s=0' alt='&#92;alpha' title='&#92;alpha' class='latex' />, alternative outcomes are attained, which are based on both the number of ties and tie weights. In particular, two ranges of values can be distinguished. First, a parameter set between 0 and 1 would positively value both the number of ties and tie weights. This implies that both increments in node degree and node strength will then increase the outcome. Second, if the value of the parameter is above 1, the measures would positively value tie strength and negatively value the number of ties. Nodes with on average stronger ties will get a higher score.</p>
<p><div id="attachment_3543" class="wp-caption alignright" style="width: 284px"><img src="http://toreopsahl.files.wordpress.com/2011/08/degree_variation1.png?w=455" alt="" title="Variation in Tie Weights"   class="size-full wp-image-3543" /><p class="wp-caption-text">Variation in Tie Weights: Two nodes with the same scores using Freeman&#039;s (1978), Barrat et al.&#039;s (2004), and Opsahl et al.&#039;s (2010) degree measures.</p></div>All of the above measures are insensitive to variation in tie weights. For example, the two nodes, A and B, in this diagram have the same number of connections, the same node strength, and attains the same score using the second generalisation as that it is a product of the degree and node strength. While the closeness and betweenness measures proposed in Opsahl et al. (2010) are sensitive to variation in tie weights, the degree measure was designed not to be. However, a measure closely related to the closeness and betweenness measures that is sensitive to tie weight differences can be defined as follows:</p>
<div align="center"><img src='http://s0.wp.com/latex.php?latex=k%5E%7B%5Calpha2%7D%28i%29+%3D+%5Csum_j%5EN+w_%7Bij%7D%5E%5Calpha&amp;bg=ffffff&amp;fg=414141&amp;s=0' alt='k^{&#92;alpha2}(i) = &#92;sum_j^N w_{ij}^&#92;alpha' title='k^{&#92;alpha2}(i) = &#92;sum_j^N w_{ij}^&#92;alpha' class='latex' /></div>
<p>By exponenting the tie weight instead of the average tie weight, the measure becomes sensitive to variation in tie weights. For example, node A and node B would get the following score using the various measures:</p>
<table class="tore" align="center">
<tr>
<th rowspan="2" align="center">Measure</th>
<th colspan="2" align="center">Node</th>
</tr>
<tr>
<th align="center">A</th>
<th align="center">B</th>
</tr>
<tr>
<td>Freeman&#8217;s</td>
<td>2</td>
<td>2</td>
</tr>
<tr>
<td>Barrat et al.&#8217;s</td>
<td>4</td>
<td>4</td>
</tr>
<tr>
<td>Opsahl et al.&#8217;s, alpha=0.5</td>
<td>2.83</td>
<td>2.83</td>
</tr>
<tr>
<td>Opsahl et al.&#8217;s, alpha=1.5</td>
<td>5.66</td>
<td>5.66</td>
</tr>
<tr>
<td>New measure, alpha=0.5</td>
<td>2.83</td>
<td>2.73</td>
</tr>
<tr>
<td>New measure, alpha=1.5</td>
<td>5.66</td>
<td>6.20</td>
</tr>
</table>
<p>As it is possible to see from the above table, the new measure is closely linked to generalisation proposed by Opsahl et al. (2010); however, when the tie weights are different, the measure vary between the two nodes. Similarly as the other centrality measures using a tuning parameter, the tuning parameter in these measures control the relative importance of the number of ties and the sum of ties. In addition, it also controls whether variation in tie weights should be discounted or taken favourable. A parameter between 0 and 1 discounts, whereas a parameter above 1, increase the outcome of the measure when tie weights are different.</p>
<p><strong>What to try it with your data?</strong></p>
<p>Below is the code to calculate the proposed degree measure. You need to have the R-package tnet installed before to run the code.</p>
<pre class="brush: plain; title: ; notranslate">
# Load tnet
library(tnet)

# Load a function to calculate the new measures
degree2_w &lt;- function (net, type=&quot;out&quot;, alpha = 1) {
    net &lt;- as.tnet(net, type=&quot;weighted one-mode tnet&quot;)
    if (type == &quot;in&quot;) {
        net &lt;- data.frame(i = net[, 2], j = net[, 1], w = net[,3])
        net &lt;- net[order(net[, &quot;i&quot;], net[, &quot;j&quot;]), ]
    }
    index &lt;- cumsum(!duplicated(net[, 1]))
    k.list &lt;- cbind(unique(net[, 1]), NaN, NaN, NaN)
    dimnames(k.list)[[2]] &lt;- c(&quot;node&quot;, &quot;degree&quot;, &quot;output&quot;, &quot;alpha&quot;)
    k.list[, &quot;degree&quot;] &lt;- tapply(net[, &quot;w&quot;], index, length)
    k.list[, &quot;output&quot;] &lt;- tapply(net[, &quot;w&quot;], index, sum)
    net[,&quot;w&quot;] &lt;- net[,&quot;w&quot;]^alpha
    k.list[, &quot;alpha&quot;] &lt;- tapply(net[, &quot;w&quot;], index, sum)
    if (max(net[, c(&quot;i&quot;, &quot;j&quot;)]) != nrow(k.list)) {
        k.list &lt;- rbind(k.list, cbind(1:max(net[, c(&quot;i&quot;, &quot;j&quot;)]), 0, 0, 0))
        k.list &lt;- k.list[order(k.list[, &quot;node&quot;]), ]
        k.list &lt;- k.list[!duplicated(k.list[, &quot;node&quot;]), ]
    }
    return(k.list)
}

# Load a sample network
net &lt;- cbind(
i=c(1,1,2,2),
j=c(2,3,1,3),
w=c(2,2,1,3))

# Calculate the measures
degree_w(net, measure=c(&quot;degree&quot;,&quot;output&quot;,&quot;alpha&quot;), alpha=1.5)
degree_w(net, measure=c(&quot;degree&quot;,&quot;output&quot;,&quot;alpha&quot;), alpha=0.5)
degree2_w(net, alpha=0.5)
degree2_w(net, alpha=1.5)
</pre>
<p><strong>References</strong></p>
<p>Barrat, A., Barthelemy, M., Pastor-Satorras, R., Vespignani, A., 2004. The architecture of complex weighted networks. Proceedings of the National Academy of Sciences 101 (11), 3747-3752.</p>
<p>Freeman, L. C., 1978. Centrality in social networks: Conceptual clarification. Social Networks 1, 215-239.</p>
<p>Opsahl, T., Agneessens, F., Skvoretz, J. (2010). <a href="http://toreopsahl.com/2010/04/21/article-node-centrality-in-weighted-networks-generalizing-degree-and-shortest-paths/" title="Article: Node centrality in weighted networks: Generalizing degree and shortest paths">Node centrality in weighted networks: Generalizing degree and shortest paths</a>. Social Networks 32, 245-251.</p>
<p>Opsahl, T., Colizza, V., Panzarasa, P., Ramasco, J. J., 2008. <a href="http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/">Prominence and control: The weighted rich-club effect</a>. Physical Review Letters 101 (168702). </p>
<p>Wasserman, S., Faust, K., 1994. Social Network Analysis: Methods and Applications. Cambridge University Press, New York, NY.</p>
<div class="knobcite">If you use any of the information in this post, please cite: Opsahl, T., Agneessens, F., Skvoretz, J., 2010. Node centrality in weighted networks: Generalizing degree and shortest paths. Social Networks 32 (3), 245-251</div>
<br />Filed under: <a href='http://toreopsahl.com/category/network-thoughts/'>Network thoughts</a> Tagged: <a href='http://toreopsahl.com/tag/actors/'>actors</a>, <a href='http://toreopsahl.com/tag/centrality/'>centrality</a>, <a href='http://toreopsahl.com/tag/complex-networks/'>complex networks</a>, <a href='http://toreopsahl.com/tag/degree/'>degree</a>, <a href='http://toreopsahl.com/tag/edges/'>edges</a>, <a href='http://toreopsahl.com/tag/gregariousness/'>gregariousness</a>, <a href='http://toreopsahl.com/tag/hubs/'>hubs</a>, <a href='http://toreopsahl.com/tag/links/'>Links</a>, <a href='http://toreopsahl.com/tag/local/'>local</a>, <a href='http://toreopsahl.com/tag/network/'>network</a>, <a href='http://toreopsahl.com/tag/nodes/'>nodes</a>, <a href='http://toreopsahl.com/tag/popularity/'>popularity</a>, <a href='http://toreopsahl.com/tag/social-network-analysis/'>social network analysis</a>, <a href='http://toreopsahl.com/tag/strength-of-nodes/'>strength of nodes</a>, <a href='http://toreopsahl.com/tag/strength-of-ties/'>strength of ties</a>, <a href='http://toreopsahl.com/tag/ties/'>ties</a>, <a href='http://toreopsahl.com/tag/valued-networks/'>valued networks</a>, <a href='http://toreopsahl.com/tag/vertices/'>vertices</a>, <a href='http://toreopsahl.com/tag/weighted-networks/'>weighted networks</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/3494/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/3494/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=3494&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2011/08/08/degree-centrality-and-variation-in-tie-weights/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2011/08/degree_variation3.png" medium="image">
			<media:title type="html">Degree and Strength</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2011/08/degree_variation1.png" medium="image">
			<media:title type="html">Variation in Tie Weights</media:title>
		</media:content>
	</item>
		<item>
		<title>Article: For the few not the many? The effects of affirmative action on presence, prominence, and social capital of women directors in Norway</title>
		<link>http://toreopsahl.com/2010/09/30/article-for-the-few-not-the-many-the-effects-of-affirmative-action-on-presence-prominence-and-social-capital-of-women-directors-in-norway/</link>
		<comments>http://toreopsahl.com/2010/09/30/article-for-the-few-not-the-many-the-effects-of-affirmative-action-on-presence-prominence-and-social-capital-of-women-directors-in-norway/#comments</comments>
		<pubDate>Thu, 30 Sep 2010 16:02:09 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[affiliation networks]]></category>
		<category><![CDATA[betweenness]]></category>
		<category><![CDATA[bipartite networks]]></category>
		<category><![CDATA[centrality]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[global]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[hubs]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[shortest distance]]></category>
		<category><![CDATA[shortest path]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[two-mode networks]]></category>
		<category><![CDATA[undirected networks]]></category>
		<category><![CDATA[vertices]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=2303</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2010/09/30/article-for-the-few-not-the-many-the-effects-of-affirmative-action-on-presence-prominence-and-social-capital-of-women-directors-in-norway/"><img src="http://toreopsahl.files.wordpress.com/2010/09/bodsmall.png" alt="" title="For the few not the many? The effects of affirmative action on presence, prominence, and social capital of women directors in Norway" width="133" height="96" class="alignright size-full wp-image-2307" /></a>A paper called "For the few not the many? The effects of affirmative action on presence, prominence, and social capital of women directors in Norway" that I have co-authored will be published in the Scandinavian Journal of Management. Governments have implemented various affirmative action policies to address vertical sex segregation in organizations. A gender representation law was introduced in Norway, which required public limited companies’ boards to have at least 40 percent representation of each sex by 2008. This law acted as an external shock, and this paper aims to explore its effects. In particular, it explores the gender bias, the emergence and sex of prominent directors, and directors’ social capital. We utilize data from May 2002 to August 2009 to analyze these aspects. The implied intention of the law was to create a larger pool of women acting as directors on boards, and the law has had the effect of increasing the representation of women on boards. However, it has also created a small elite of women directors who rank among the top on a number of proxies of influence. <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=2303&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.boardsandgender.com/" target="_blank"><img src="http://toreopsahl.files.wordpress.com/2010/09/bod.png?w=455" alt="Average proportion of women and men directors in Norway" title="Average proportion of women and men directors in Norway"   class="alignright size-full wp-image-2305" /></a>A paper called &#8220;For the few not the many? The effects of affirmative action on presence, prominence, and social capital of women directors in Norway&#8221; that I have co-authored with <a href="http://cathrineseierstad.com/" target="_blank">Cathrine Seierstad</a> will be published in the Scandinavian Journal of Management. Unfortunately, the copyright agreement prevents me from uploading a pdf of the published paper to this blog. However, if you have access to the Scandinavian Journal of Management, you can <a href="http://dx.doi.org/10.1016/j.scaman.2010.10.002" target="_blank">download the paper</a> directly. Otherwise, a preprint of the paper is available on its supporting website: <a href="http://www.boardsandgender.com/" target="_blank">www.boardsandgender.com</a></p>
<p>The main foundation of the paper is a gender representation law that required all public limited companies to compose their boards with at least 40% of each gender by January 2008. The paper attempted to stike a balance between the urgency of studying the gender represention law and the amount of data available (the analysis relied on data from August 2009). <a href="http://www.boardsandgender.com/" target="_blank">The supporting website</a> tries to alleviate this tension by providing up-to-date data for the analysis conducted in the paper.</p>
<p>The paper gathered news coverage in the major Norwegian newspapers. Online versions are available through <a href="http://e24.no/jobb/article3852441.ece" target="_blank">e24.no</a> and <a href="http://www.dn.no/forsiden/naringsliv/article1994376.ece" target="_blank">dn.no</a>  and <a href="http://www.spiegel.de/international/zeitgeist/0,1518,745664,00.html" target="_blank">Der Spiegel</a>.</p>
<p><strong>Abstract</strong></p>
<p>Governments have implemented various affirmative action policies to address vertical sex segregation in organizations. A gender representation law was introduced in Norway, which required public limited companies’ boards to have at least 40 percent representation of each sex by 2008. This law acted as an external shock, and this paper aims to explore its effects. In particular, it explores the gender bias, the emergence and sex of prominent directors, and directors’ social capital. We utilize data from May 2002 to August 2009 to analyze these aspects. The implied intention of the law was to create a larger pool of women acting as directors on boards, and the law has had the effect of increasing the representation of women on boards. However, it has also created a small elite of women directors who rank among the top on a number of proxies of influence. </p>
<div class="knobcite">If you use any of the information in this post, please cite: Seierstad, C., Opsahl, T., 2011. For the few not the many? The effects of affirmative action on presence, prominence, and social capital of women directors in Norway. Scandinavian Journal of Management 27 (1), 44-54</div>
<br />Filed under: <a href='http://toreopsahl.com/category/articles/'>Articles</a> Tagged: <a href='http://toreopsahl.com/tag/actors/'>actors</a>, <a href='http://toreopsahl.com/tag/affiliation-networks/'>affiliation networks</a>, <a href='http://toreopsahl.com/tag/betweenness/'>betweenness</a>, <a href='http://toreopsahl.com/tag/bipartite-networks/'>bipartite networks</a>, <a href='http://toreopsahl.com/tag/centrality/'>centrality</a>, <a href='http://toreopsahl.com/tag/complex-networks/'>complex networks</a>, <a href='http://toreopsahl.com/tag/edges/'>edges</a>, <a href='http://toreopsahl.com/tag/global/'>global</a>, <a href='http://toreopsahl.com/tag/graphs/'>graphs</a>, <a href='http://toreopsahl.com/tag/hubs/'>hubs</a>, <a href='http://toreopsahl.com/tag/links/'>Links</a>, <a href='http://toreopsahl.com/tag/network/'>network</a>, <a href='http://toreopsahl.com/tag/nodes/'>nodes</a>, <a href='http://toreopsahl.com/tag/shortest-distance/'>shortest distance</a>, <a href='http://toreopsahl.com/tag/shortest-path/'>shortest path</a>, <a href='http://toreopsahl.com/tag/social-network-analysis/'>social network analysis</a>, <a href='http://toreopsahl.com/tag/ties/'>ties</a>, <a href='http://toreopsahl.com/tag/two-mode-networks/'>two-mode networks</a>, <a href='http://toreopsahl.com/tag/undirected-networks/'>undirected networks</a>, <a href='http://toreopsahl.com/tag/vertices/'>vertices</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/2303/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/2303/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=2303&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2010/09/30/article-for-the-few-not-the-many-the-effects-of-affirmative-action-on-presence-prominence-and-social-capital-of-women-directors-in-norway/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2010/09/bod.png" medium="image">
			<media:title type="html">Average proportion of women and men directors in Norway</media:title>
		</media:content>
	</item>
		<item>
		<title>Article: Node centrality in weighted networks: Generalizing degree and shortest paths</title>
		<link>http://toreopsahl.com/2010/04/21/article-node-centrality-in-weighted-networks-generalizing-degree-and-shortest-paths/</link>
		<comments>http://toreopsahl.com/2010/04/21/article-node-centrality-in-weighted-networks-generalizing-degree-and-shortest-paths/#comments</comments>
		<pubDate>Wed, 21 Apr 2010 10:37:10 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[betweenness]]></category>
		<category><![CDATA[centrality]]></category>
		<category><![CDATA[closeness]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[degree]]></category>
		<category><![CDATA[directed networks]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[gregariousness]]></category>
		<category><![CDATA[hubs]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[local]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[popularity]]></category>
		<category><![CDATA[reinforcement]]></category>
		<category><![CDATA[shortest distance]]></category>
		<category><![CDATA[shortest path]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[strength of nodes]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[undirected networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[weighted networks]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=2204</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2010/04/21/article-node-centrality-in-weighted-networks-generalizing-degree-and-shortest-paths/"><img src="http://toreopsahl.wordpress.com/files/2009/01/fig1_betweenness_s3.gif" alt="Betweenness example" title="Betweenness example" width="146" height="100" class="alignright size-full wp-image-623" /></a>A paper called "Node centrality in weighted networks: Generalizing degree and shortest paths" that I have co-authored will be published in Social Networks. Ties often have a strength naturally associated with them that differentiate them from each other. Tie strength has been operationalized as weights. A few network measures have been proposed for weighted networks, including three common measures of node centrality: degree, closeness, and betweenness. However, these generalizations have solely focused on tie weights, and not on the number of ties, which was the central component of the original measures. This paper proposes generalizations that combine both these aspects. We illustrate the benefits of this approach by applying one of them to Freeman's EIES dataset.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=2204&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>A paper called &#8220;Node centrality in weighted networks: Generalizing degree and shortest paths&#8221; that I have co-authored with <a href="http://home.fsw.vu.nl/f.agneessens/index.htm" target="_blank">Filip Agneessens</a> and <a href="http://www.insna.org/member/profiles/32.html" target="_blank">John Skvoretz</a> will be published in <a href="http://www.sciencedirect.com/science/journal/03788733" target="_blank">Social Networks</a>. Unfortunately, the copyright agreement prevents me from uploading a pdf of the published paper to this blog. However, if you have access to Social Networks, you can <a href="http://dx.doi.org/10.1016/j.socnet.2010.03.006" target="_blank">download the paper directly</a>. Otherwise, a <a href="http://toreopsahl.files.wordpress.com/2010/04/node_centrality_in_weighted_networks1.pdf" target="_blank">preprint with the exact same text</a> is available.</p>
<p><strong>Abstract</strong></p>
<p>Ties often have a strength naturally associated with them that differentiate them from each other. Tie strength has been operationalized as weights. A few network measures have been proposed for weighted networks, including three common measures of node centrality: degree, closeness, and betweenness. However, these generalizations have solely focused on tie weights, and not on the number of ties, which was the central component of the original measures. This paper proposes generalizations that combine both these aspects. We illustrate the benefits of this approach by applying one of them to Freeman&#8217;s EIES dataset.</p>
<p><strong>Motivation</strong></p>
<p><div id="attachment_2215" class="wp-caption alignright" style="width: 260px"><img src="http://toreopsahl.files.wordpress.com/2010/04/freeman3_egos.gif?w=455" alt="" title="Ego networks"   class="size-full wp-image-2215" /><p class="wp-caption-text">Ego networks of Phipps Arabie (A), John Boyd (B), and Maureen Hallinan (C) from Freeman's third EIES network. The width of a tie corresponds to the number of messages sent from the focal node to their contacts. Adopted from the paper.</p></div> Centrality is the concept of being &#8220;in the thick of things.&#8221; In 1978, Freeman reviewed and clarified a growing field of research on centrality of nodes for binary networks in an article published in the first issue of <em>Social Networks</em>. Three measures were formalised: degree, closeness, and betweenness. Degree was the number of ties or neighbours of a node; closeness was the inverse of the sum of all shortest paths to others or the smallest number of ties to go through to reach all others individually; and betweeness was the number of shortest paths on which a node was on. </p>
<p>The three measures have already been generalised to weighted networks. Barrat et al. (2004) generalised degree to weighted networks by taking the sum of weights instead of the number ties, while Newman (2001) and Brandes (2001) utilised Dijkstra&#8217;s (1959) algorithm of shortest paths for generalising closeness and betweenness to weighted networks, respectiviely. Dijkstra&#8217;s algorithm defined the length of paths as the sum of cost (e.g., time in GPS calculations), which is generally only defined as the sum of the inversed tie weights. All these generalisations fail to take into account the main feature of the original measures formalised by Freeman (1978): the number of ties. </p>
<p>This limitation is highlighted for degree centrality by the three ego networks from Freeman&#8217;s third EIES network. The three nodes have roughly sent the same amount of messages; however, to a quite different number of others. If Freeman&#8217;s (1978) original measure was applied, the centrality score of the node in panel A is almost five times as high as the node in panel C attains. However, when using Barrat et al.&#8217;s generalisation, they get roughly the same score. </p>
<p>This articles proposes a new generation of node centrality measures for weighted networks. The second generation of measures takes into consideration both the weight of ties and the number of ties. The relative importance of these two aspects are controlled by a tuning parameter.  </p>
<p><strong>Want to test it with your data?</strong></p>
<p><img src="http://toreopsahl.files.wordpress.com/2008/12/fig1.png?w=455" alt="Sample network" title="Sample network"   class="alignright size-full wp-image-160" />The degree_w, closeness_w, and betweenness_w-functions in <a href="http://toreopsahl.com/tnet/weighted-networks/node-centrality/">tnet</a> allows you to calculate the binary, weighted, and the measures that combine these two aspects on your own dataset.</p>
<p>For example, to calculate second generation node centrality measures (alpha = 0.5) on the sample network above, you can run the code below in R. The degree function easily calculates the binary and first generation measures as well; however, this is not the case for the closeness and betweenness-functions. If you would like the binary version, you can either use the dichotomise function or set alpha=0. If you would like the first generation weighted measures, you can set alpha=1 (default value). </p>
<pre class="brush: plain; title: ; notranslate">
# Load tnet
library(tnet)

# Load network
net &lt;- cbind(
i=c(1,1,2,2,2,2,3,3,4,5,5,6),
j=c(2,3,1,3,4,5,1,2,2,2,6,5),
w=c(4,2,4,4,1,2,2,4,1,2,1,1))

# Calculate degree centrality (note that alpha is included in the list of measures)
degree_w(net, measure=c(&quot;degree&quot;, &quot;output&quot;, &quot;alpha&quot;), alpha=0.5)

# Calculate closeness centrality
closeness_w(net, alpha=0.5)

# Calculate betweenness centrality
betweenness_w(net, alpha=0.5)
</pre>
<p>To test it on Freeman&#8217;s third EIES network from the <a href="http://toreopsahl.com/datasets/">datasets-page</a> and recreate Table 3 of the paper, you can do the following:</p>
<pre class="brush: plain; title: ; notranslate">
# Load tnet
library(tnet)

# Load network
data(Freemans.EIES)
net &lt;- Freemans.EIES.net.3.n32

# Calculate measures
tmp &lt;- data.frame(
  Freemans.EIES.node.Name.n32,
  degree_w(net, measure=c(&quot;degree&quot;, &quot;output&quot;, &quot;alpha&quot;), alpha=0.5), 
  degree_w(net, measure=&quot;alpha&quot;, alpha=1.5)[,&quot;alpha&quot;], stringsAsFactors=FALSE)
dimnames(tmp )[[2]] &lt;- c(&quot;name&quot;, &quot;node&quot;, &quot;a00&quot;, &quot;a10&quot;, &quot;a05&quot;, &quot;a15&quot;)
tmp &lt;- tmp[,c(&quot;name&quot;,&quot;a00&quot;,&quot;a05&quot;,&quot;a10&quot;,&quot;a15&quot;)]

# Merge names and order table
out &lt;- data.frame(
  seq.int(nrow(tmp)),
  tmp[order(-tmp[,&quot;a00&quot;], -tmp[,&quot;a10&quot;]),c(&quot;name&quot;, &quot;a00&quot;)],
  tmp[order(-tmp[,&quot;a05&quot;], -tmp[,&quot;a10&quot;]),c(&quot;name&quot;, &quot;a05&quot;)],
  tmp[order(-tmp[,&quot;a10&quot;], -tmp[,&quot;a10&quot;]),c(&quot;name&quot;, &quot;a10&quot;)],
  tmp[order(-tmp[,&quot;a15&quot;], -tmp[,&quot;a10&quot;]),c(&quot;name&quot;, &quot;a15&quot;)])
dimnames(out)[[2]] &lt;- c(&quot;Rank&quot;,
  &quot;a00.name&quot;,&quot;a00&quot;,
  &quot;a05.name&quot;,&quot;a05&quot;,
  &quot;a10.name&quot;,&quot;a10&quot;,
  &quot;a15.name&quot;,&quot;a15&quot;)

# Display table
out
</pre>
<p><strong>References</strong></p>
<p>Brandes, U., 2001. A Faster Algorithm for Betweenness Centrality. Journal of Mathematical Sociology 25, 163-177.</p>
<p>Dijkstra, E. W., 1959. A note on two problems in connexion with graphs. Numerische Mathematik 1, 269-271.</p>
<p>Freeman, L. C., 1978. Centrality in social networks: Conceptual clarification. Social Networks 1, 215-239.</p>
<p>Newman, M. E. J., 2001. Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality. Physical Review E 64, 016132. </p>
<p>Opsahl, T., Agneessens, F., Skvoretz, J. (2010). <a href="http://toreopsahl.com/2010/04/21/article-node-centrality-in-weighted-networks-generalizing-degree-and-shortest-paths/">Node centrality in weighted networks: Generalizing degree and shortest paths</a>. Social Networks 32, 245-251.</p>
<div class="knobcite">If you use any of the information in this post, please cite: Opsahl, T., Agneessens, F., Skvoretz, J., 2010. Node centrality in weighted networks: Generalizing degree and shortest paths. Social Networks 32 (3), 245-251</div>
<br />Filed under: <a href='http://toreopsahl.com/category/articles/'>Articles</a> Tagged: <a href='http://toreopsahl.com/tag/actors/'>actors</a>, <a href='http://toreopsahl.com/tag/arcs/'>arcs</a>, <a href='http://toreopsahl.com/tag/betweenness/'>betweenness</a>, <a href='http://toreopsahl.com/tag/centrality/'>centrality</a>, <a href='http://toreopsahl.com/tag/closeness/'>closeness</a>, <a href='http://toreopsahl.com/tag/complex-networks/'>complex networks</a>, <a href='http://toreopsahl.com/tag/degree/'>degree</a>, <a href='http://toreopsahl.com/tag/directed-networks/'>directed networks</a>, <a href='http://toreopsahl.com/tag/edges/'>edges</a>, <a href='http://toreopsahl.com/tag/graphs/'>graphs</a>, <a href='http://toreopsahl.com/tag/gregariousness/'>gregariousness</a>, <a href='http://toreopsahl.com/tag/hubs/'>hubs</a>, <a href='http://toreopsahl.com/tag/links/'>Links</a>, <a href='http://toreopsahl.com/tag/local/'>local</a>, <a href='http://toreopsahl.com/tag/network/'>network</a>, <a href='http://toreopsahl.com/tag/nodes/'>nodes</a>, <a href='http://toreopsahl.com/tag/popularity/'>popularity</a>, <a href='http://toreopsahl.com/tag/reinforcement/'>reinforcement</a>, <a href='http://toreopsahl.com/tag/shortest-distance/'>shortest distance</a>, <a href='http://toreopsahl.com/tag/shortest-path/'>shortest path</a>, <a href='http://toreopsahl.com/tag/social-network-analysis/'>social network analysis</a>, <a href='http://toreopsahl.com/tag/strength-of-nodes/'>strength of nodes</a>, <a href='http://toreopsahl.com/tag/strength-of-ties/'>strength of ties</a>, <a href='http://toreopsahl.com/tag/ties/'>ties</a>, <a href='http://toreopsahl.com/tag/undirected-networks/'>undirected networks</a>, <a href='http://toreopsahl.com/tag/valued-networks/'>valued networks</a>, <a href='http://toreopsahl.com/tag/vertices/'>vertices</a>, <a href='http://toreopsahl.com/tag/weighted-networks/'>weighted networks</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/2204/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/2204/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=2204&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2010/04/21/article-node-centrality-in-weighted-networks-generalizing-degree-and-shortest-paths/feed/</wfw:commentRss>
		<slash:comments>55</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2010/04/freeman3_egos.gif" medium="image">
			<media:title type="html">Ego networks</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2008/12/fig1.png" medium="image">
			<media:title type="html">Sample network</media:title>
		</media:content>
	</item>
		<item>
		<title>Closeness centrality in networks with disconnected components</title>
		<link>http://toreopsahl.com/2010/03/20/closeness-centrality-in-networks-with-disconnected-components/</link>
		<comments>http://toreopsahl.com/2010/03/20/closeness-centrality-in-networks-with-disconnected-components/#comments</comments>
		<pubDate>Sat, 20 Mar 2010 13:03:15 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[centrality]]></category>
		<category><![CDATA[closeness]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[directed networks]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[global]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[hubs]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[local]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[shortest distance]]></category>
		<category><![CDATA[shortest path]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[undirected networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[weighted networks]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=2163</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2010/03/20/closeness-centrality-in-networks-with-disconnected-components/"><img src="http://toreopsahl.files.wordpress.com/2010/03/social-network_closeness_small.png" alt="Closeness in disconnected components" title="Disconnected components" width="300" height="139" class="alignright size-full wp-image-2187" /></a>A key node centrality measure in networks is closeness centrality (Freeman, 1978; Wasserman and Faust, 1994). It is defined as the inverse of farness, which in turn, is the sum of distances to all other nodes. As the distance between nodes in disconnected components of a network is infinite, this measure cannot be applied to networks with disconnected components (Opsahl et al., 2010; Wasserman and Faust, 1994). This post highlights a possible work-around, which allows the measure to be applied to these networks and at the same time maintain the original idea behind the measure.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=2163&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>A key node centrality measure in networks is closeness centrality (Freeman, 1978; Opsahl et al., 2010; Wasserman and Faust, 1994). It is defined as the inverse of farness, which in turn, is the sum of distances to all other nodes. As the distance between nodes in disconnected components of a network is infinite, this measure cannot be applied to networks with disconnected components (Opsahl et al., 2010; Wasserman and Faust, 1994). <strong>This post highlights a possible work-around, which allows the measure to be applied to these networks and at the same time maintain the original idea behind the measure.</strong></p>
<p><img src="http://toreopsahl.files.wordpress.com/2010/03/social-network_closeness.png?w=455" alt="Disconnected components" title="Disconnected components"   class="alignright size-full wp-image-2178" />This network gives a concrete example of the closeness measure. The distance between node G and node H is infinite as a direct or indirect path does not exist between them (i.e., they belong to separate components). As long as at least one node is unreachable by the others, the sum of distances to all other nodes is infinite. As a consequence, researchers have limited the closeness measure to the largest component of nodes (i.e., measured intra-component). The distance matrix for the nodes in the sample network is:</p>
<table class="tore" cellspacing="1" cellpadding="3" align="center">
<tr align="center">
<th></th>
<th colspan="11">Nodes</th>
<th></th>
<th colspan="2">All inclusive</th>
<th colspan="2">Intra-component</th>
</tr>
<tr align="center">
<th></th>
<th>A</th>
<th>B</th>
<th>C</th>
<th>D</th>
<th>E</th>
<th>F</th>
<th>G</th>
<th>H</th>
<th>I</th>
<th>J</th>
<th>K</th>
<th></th>
<th>Farness</th>
<th>Closeness</th>
<th>Farness</th>
<th>Closeness</th>
</tr>
<tr align="center">
<th>A</th>
<td>&#8230;</td>
<td>1</td>
<td>1</td>
<td>2</td>
<td>2</td>
<td>3</td>
<td>3</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<th></th>
<td>Inf</td>
<td>0</td>
<td>12</td>
<td>0.08</td>
</tr>
<tr align="center">
<th>B</th>
<td>1</td>
<td>&#8230;</td>
<td>1</td>
<td>2</td>
<td>1</td>
<td>2</td>
<td>3</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<th></th>
<td>Inf</td>
<td>0</td>
<td>10</td>
<td>0.10</td>
</tr>
<tr align="center">
<th>C</th>
<td>1</td>
<td>1</td>
<td>&#8230;</td>
<td>1</td>
<td>2</td>
<td>2</td>
<td>2</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<th></th>
<td>Inf</td>
<td>0</td>
<td>9</td>
<td>0.11</td>
</tr>
<tr align="center">
<th>D</th>
<td>2</td>
<td>2</td>
<td>1</td>
<td>&#8230;</td>
<td>2</td>
<td>1</td>
<td>1</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<th></th>
<td>Inf</td>
<td>0</td>
<td>9</td>
<td>0.11</td>
</tr>
<tr align="center">
<th>E</th>
<td>2</td>
<td>1</td>
<td>2</td>
<td>2</td>
<td>&#8230;</td>
<td>1</td>
<td>3</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<th></th>
<td>Inf</td>
<td>0</td>
<td>11</td>
<td>0.09</td>
</tr>
<tr align="center">
<th>F</th>
<td>3</td>
<td>2</td>
<td>2</td>
<td>1</td>
<td>1</td>
<td>&#8230;</td>
<td>2</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<th></th>
<td>Inf</td>
<td>0</td>
<td>11</td>
<td>0.09</td>
</tr>
<tr align="center">
<th>G</th>
<td>3</td>
<td>3</td>
<td>2</td>
<td>1</td>
<td>3</td>
<td>2</td>
<td>&#8230;</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<th></th>
<td>Inf</td>
<td>0</td>
<td>14</td>
<td>0.07</td>
</tr>
<tr align="center">
<th>H</th>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>&#8230;</td>
<td>1</td>
<td>2</td>
<td>Inf</td>
<th></th>
<td>Inf</td>
<td>0</td>
<td>3</td>
<td>0.33</td>
</tr>
<tr align="center">
<th>I</th>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>1</td>
<td>&#8230;</td>
<td>1</td>
<td>Inf</td>
<th></th>
<td>Inf</td>
<td>0</td>
<td>2</td>
<td>0.50</td>
</tr>
<tr align="center">
<th>J</th>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>2</td>
<td>1</td>
<td>&#8230;</td>
<td>Inf</td>
<th></th>
<td>Inf</td>
<td>0</td>
<td>3</td>
<td>0.33</td>
</tr>
<tr align="center">
<th>K</th>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>Inf</td>
<td>&#8230;</td>
<th></th>
<td>Inf</td>
<td>0</td>
<td>0</td>
<td>Inf</td>
</tr>
</table>
<p>Although the intra-component closeness scores are not infinite for all the nodes in the network, it would be inaccurate to use them as a closeness measure. This is due to the fact that the sum of distances would contain different number of paths (e.g., there are two distance from node H to other nodes in its component, while there are six distances from node G to other nodes in its component). In fact, nodes in smaller components would generally be seen as being closer to others than nodes in larger components. Thus, researchers has focused solely on the largest component. However, this leads to a number of methodological issues, including sample selection.</p>
<p>To develop this measure, I went back to the original equation:</p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cmbox%7Bcloseness%7D%28i%29+%3D+%5Csum_j+%5Cleft%5B+d_%7Bij%7D+%5Cright%5D%5E%7B-1%7D+%3D+%5Cfrac%7B1%7D%7B%5Csum_j+d_%7Bij%7D%7D&amp;bg=ffffff&amp;fg=414141&amp;s=0' alt='&#92;mbox{closeness}(i) = &#92;sum_j &#92;left[ d_{ij} &#92;right]^{-1} = &#92;frac{1}{&#92;sum_j d_{ij}}' title='&#92;mbox{closeness}(i) = &#92;sum_j &#92;left[ d_{ij} &#92;right]^{-1} = &#92;frac{1}{&#92;sum_j d_{ij}}' class='latex' /></p>
<p>where <img src='http://s0.wp.com/latex.php?latex=i&amp;bg=ffffff&amp;fg=414141&amp;s=0' alt='i' title='i' class='latex' /> is the focal node, <img src='http://s0.wp.com/latex.php?latex=j&amp;bg=ffffff&amp;fg=414141&amp;s=0' alt='j' title='j' class='latex' /> is another node in the network, and <img src='http://s0.wp.com/latex.php?latex=d_%7Bij%7D&amp;bg=ffffff&amp;fg=414141&amp;s=0' alt='d_{ij}' title='d_{ij}' class='latex' /> is the shortest distance between these two nodes. In this equation, the distances are inversed after they have been summed, and when summing an infinite number, the outcome is infinite. To overcome this issue while staying consistent with the existing measure of closeness, I took advantage of the fact that the limit of a number divided by infinity is zero. Although infinity is not an exact number, the inverse of a very high number is very close to 0. In fact, 0 is returned if you enter 1/Inf in the statistical programme <em>R</em>. By taking advantage of this feature, it is possible to rewrite the closeness equation as <em>the sum of inversed</em> distances to all other nodes instead of the <em>inversed of the sum </em>of distances to all other nodes. The equation would then be:</p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cmbox%7Bcloseness%7D%28i%29+%3D+%5Csum_j+%5Cfrac%7B1%7D%7Bd_%7Bij%7D%7D+&amp;bg=ffffff&amp;fg=414141&amp;s=0' alt='&#92;mbox{closeness}(i) = &#92;sum_j &#92;frac{1}{d_{ij}} ' title='&#92;mbox{closeness}(i) = &#92;sum_j &#92;frac{1}{d_{ij}} ' class='latex' /></p>
<p>To exemplify this change, for the example network above, the inversed distances and closeness scores are:</p>
<table class="tore" cellspacing="1" cellpadding="3" align="center">
<tr align="center">
<th></th>
<th colspan="11">Nodes</th>
<th></th>
<th colspan="2">Closeness</th>
</tr>
<tr align="center">
<th></th>
<th>A</th>
<th>B</th>
<th>C</th>
<th>D</th>
<th>E</th>
<th>F</th>
<th>G</th>
<th>H</th>
<th>I</th>
<th>J</th>
<th>K</th>
<th></th>
<th>Sum</th>
<th>Normalized</th>
</tr>
<tr align="center">
<th>A</th>
<td>&#8230;</td>
<td>1.00</td>
<td>1.00</td>
<td>0.50</td>
<td>0.50</td>
<td>0.33</td>
<td>0.33</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<th></th>
<td>3.67</td>
<td>0.37</td>
</tr>
<tr align="center">
<th>B</th>
<td>1.00</td>
<td>&#8230;</td>
<td>1.00</td>
<td>0.50</td>
<td>1.00</td>
<td>0.50</td>
<td>0.33</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<th></th>
<td>4.33</td>
<td>0.43</td>
</tr>
<tr align="center">
<th>C</th>
<td>1.00</td>
<td>1.00</td>
<td>&#8230;</td>
<td>1.00</td>
<td>0.50</td>
<td>0.50</td>
<td>0.50</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<th></th>
<td>4.50</td>
<td>0.45</td>
</tr>
<tr align="center">
<th>D</th>
<td>0.50</td>
<td>0.50</td>
<td>1.00</td>
<td>&#8230;</td>
<td>0.50</td>
<td>1.00</td>
<td>1.00</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<th></th>
<td>4.50</td>
<td>0.45</td>
</tr>
<tr align="center">
<th>E</th>
<td>0.50</td>
<td>1.00</td>
<td>0.50</td>
<td>0.50</td>
<td>&#8230;</td>
<td>1.00</td>
<td>0.33</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<th></th>
<td>3.83</td>
<td>0.38</td>
</tr>
<tr align="center">
<th>F</th>
<td>0.33</td>
<td>0.50</td>
<td>0.50</td>
<td>1.00</td>
<td>1.00</td>
<td>&#8230;</td>
<td>0.50</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<th></th>
<td>3.83</td>
<td>0.38</td>
</tr>
<tr align="center">
<th>G</th>
<td>0.33</td>
<td>0.33</td>
<td>0.50</td>
<td>1.00</td>
<td>0.33</td>
<td>0.50</td>
<td>&#8230;</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<th></th>
<td>3.00</td>
<td>0.30</td>
</tr>
<tr align="center">
<th>H</th>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>&#8230;</td>
<td>1.00</td>
<td>0.50</td>
<td>0</td>
<th></th>
<td>1.50</td>
<td>0.15</td>
</tr>
<tr align="center">
<th>I</th>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1.00</td>
<td>&#8230;</td>
<td>1.00</td>
<td>0</td>
<th></th>
<td>2</td>
<td>0.20</td>
</tr>
<tr align="center">
<th>J</th>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0.50</td>
<td>1.00</td>
<td>&#8230;</td>
<td>0</td>
<th></th>
<td>1.50</td>
<td>0.15</td>
</tr>
<tr align="center">
<th>K</th>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>&#8230;</td>
<th></th>
<td>0</td>
<td>0</td>
</tr>
</table>
<p>As can be seen from this table, a closeness score is attained for all nodes taking into consideration an equal number of distances for each node irrespective of the size of the nodes&#8217; component. Moreover, nodes belonging to a larger component generally attains a higher score. This is deliberate as these nodes can reach a greater number of others than nodes in smaller components. The normalized scores are bound between 0 and 1. It is 0 if a node is an isolate, and 1 if a node is directly connected all others. </p>
<p>This measure can easily be extended to weighted networks by introducing Dijkstra&#8217;s (1959) algorithm as proposed in <a href="http://toreopsahl.com/tnet/weighted-networks/shortest-paths/">Average shortest distance in weighted networks</a>.</p>
<p><strong>References</strong></p>
<p>Dijkstra, E. W., 1959. A note on two problems in connexion with graphs. Numerische Mathematik 1, 269-271.</p>
<p>Freeman, L. C., 1978. Centrality in social networks: Conceptual clarification. Social Networks 1, 215-239.</p>
<p>Opsahl, T., Agneessens, F., Skvoretz, J. (2010). <a href="http://toreopsahl.com/2010/04/21/article-node-centrality-in-weighted-networks-generalizing-degree-and-shortest-paths/">Node centrality in weighted networks: Generalizing degree and shortest paths</a>. Social Networks 32, 245-251.</p>
<p>Wasserman, S., Faust, K., 1994. Social Network Analysis: Methods and Applications. Cambridge University Press, New York, NY.</p>
<p><strong>What to try it with your data?</strong></p>
<p>Below is the code to calculate the closeness measure on the sample network above. </p>
<pre class="brush: plain; title: ; notranslate">
# Load tnet
library(tnet)

# Load network 
# Node K is assigned node id 8 instead of 10 as isolates at the end of id sequences are not recorded in edgelists
net &lt;- cbind(
  i=c(1,1,2,2,2,3,3,3,4,4,4,5,5,6,6,7,9,10,10,11),
  j=c(2,3,1,3,5,1,2,4,3,6,7,2,6,4,5,4,10,9,11,10),
  w=c(1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1))

# Calculate measures
closeness_w(net, gconly=FALSE)
</pre>
<div class="knobcite">This post is the explaination of a footnote the node centrality paper. If you use any of the information in this post, please cite: Opsahl, T., Agneessens, F., Skvoretz, J., 2010. Node centrality in weighted networks: Generalizing degree and shortest paths. Social Networks 32 (3), 245-251</div>
<br />Filed under: <a href='http://toreopsahl.com/category/network-thoughts/'>Network thoughts</a> Tagged: <a href='http://toreopsahl.com/tag/actors/'>actors</a>, <a href='http://toreopsahl.com/tag/arcs/'>arcs</a>, <a href='http://toreopsahl.com/tag/centrality/'>centrality</a>, <a href='http://toreopsahl.com/tag/closeness/'>closeness</a>, <a href='http://toreopsahl.com/tag/complex-networks/'>complex networks</a>, <a href='http://toreopsahl.com/tag/directed-networks/'>directed networks</a>, <a href='http://toreopsahl.com/tag/edges/'>edges</a>, <a href='http://toreopsahl.com/tag/global/'>global</a>, <a href='http://toreopsahl.com/tag/graphs/'>graphs</a>, <a href='http://toreopsahl.com/tag/hubs/'>hubs</a>, <a href='http://toreopsahl.com/tag/links/'>Links</a>, <a href='http://toreopsahl.com/tag/local/'>local</a>, <a href='http://toreopsahl.com/tag/network/'>network</a>, <a href='http://toreopsahl.com/tag/nodes/'>nodes</a>, <a href='http://toreopsahl.com/tag/shortest-distance/'>shortest distance</a>, <a href='http://toreopsahl.com/tag/shortest-path/'>shortest path</a>, <a href='http://toreopsahl.com/tag/social-network-analysis/'>social network analysis</a>, <a href='http://toreopsahl.com/tag/strength-of-ties/'>strength of ties</a>, <a href='http://toreopsahl.com/tag/ties/'>ties</a>, <a href='http://toreopsahl.com/tag/undirected-networks/'>undirected networks</a>, <a href='http://toreopsahl.com/tag/valued-networks/'>valued networks</a>, <a href='http://toreopsahl.com/tag/vertices/'>vertices</a>, <a href='http://toreopsahl.com/tag/weighted-networks/'>weighted networks</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/2163/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/2163/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=2163&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2010/03/20/closeness-centrality-in-networks-with-disconnected-components/feed/</wfw:commentRss>
		<slash:comments>37</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2010/03/social-network_closeness.png" medium="image">
			<media:title type="html">Disconnected components</media:title>
		</media:content>
	</item>
		<item>
		<title>Local clustering coefficient for two-mode networks</title>
		<link>http://toreopsahl.com/2010/01/06/local-clustering-coefficient-for-two-mode-networks/</link>
		<comments>http://toreopsahl.com/2010/01/06/local-clustering-coefficient-for-two-mode-networks/#comments</comments>
		<pubDate>Wed, 06 Jan 2010 18:29:35 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[affiliation networks]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[bipartite networks]]></category>
		<category><![CDATA[clustering coefficient]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[embeddedness]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[local]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[two-mode networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[weighted networks]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=2108</guid>
		<description><![CDATA[Similar to the motivation of the global clustering coefficient that I proposed in Clustering in two-mode networks, the local clustering coefficient is biased if applied to a projection of a two-mode network. It is biased in the sense that the randomly expected value is not obtained on the projection of a random two-mode network. To [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=2108&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://toreopsahl.com/tnet/two-mode-networks/clustering/"><img src="http://toreopsahl.files.wordpress.com/2009/09/abstract_picture.png?w=455" alt="" title="Local clustering coefficient for two-mode networks"   class="alignright size-full wp-image-1964" /></a>Similar to the motivation of the global clustering coefficient that I proposed in <a href="http://http://toreopsahl.com/2009/09/11/clustering-in-two-mode-networks/">Clustering in two-mode networks</a>, the local clustering coefficient is biased if applied to a projection of a two-mode network. It is biased in the sense that the randomly expected value is not obtained on the projection of a random two-mode network. To overcome this methodological bias, I redefine the local clustering coefficient for two-mode networks. The new coefficient is a mix between the global clustering coefficient for two-mode networks and Barrat&#8217;s (2004) local coefficient for a weighted one-mode network. The coefficient is tested on Davis&#8217; (1940) Southern Women dataset. </p>
<div class="knobcitenobg">The content of this post has been integrated in the <em>tnet</em> manual, see <a href="http://toreopsahl.com/tnet/two-mode-networks/clustering/">Clustering in Two-mode Networks</a>.</div>
<br />Posted in Network thoughts Tagged: actors, affiliation networks, arcs, bipartite networks, clustering coefficient, complex networks, edges, embeddedness, graphs, Links, local, network, nodes, social network analysis, strength of ties, ties, two-mode networks, valued networks, vertices, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/2108/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/2108/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=2108&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2010/01/06/local-clustering-coefficient-for-two-mode-networks/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2009/09/abstract_picture.png" medium="image">
			<media:title type="html">Local clustering coefficient for two-mode networks</media:title>
		</media:content>
	</item>
		<item>
		<title>Online Social Network-dataset now available</title>
		<link>http://toreopsahl.com/2009/11/10/online-social-network-dataset-now-available/</link>
		<comments>http://toreopsahl.com/2009/11/10/online-social-network-dataset-now-available/#comments</comments>
		<pubDate>Tue, 10 Nov 2009 17:46:53 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[communication]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[directed networks]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[online]]></category>
		<category><![CDATA[online communication]]></category>
		<category><![CDATA[online social networks]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[social networking site]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[weighted networks]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=2054</guid>
		<description><![CDATA[The Online Social Network-dataset used in my Ph.D. thesis is now available on the Dataset-page. This network has also been described in Patterns and Dynamics of Users’ Behaviour and Interaction: Network Analysis of an Online Community and used in Prominence and control: The weighted rich-club effect and Clustering in weighted networks. The network originate from [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=2054&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://toreopsahl.com/datasets/"><img src="http://toreopsahl.files.wordpress.com/2009/11/af-page20-fixed_150.png?w=455" alt="Online Social"   class="alignright size-full wp-image-2059" /></a>The Online Social Network-dataset used in my <a href="http://toreopsahl.com/publications/thesis/">Ph.D. thesis</a> is now available on <a href="http://toreopsahl.com/datasets/">the Dataset-page</a>. This network has also been described in <a href="http://toreopsahl.com/2009/03/06/article-patterns-and-dynamics-of-users-behaviour-and-interaction-network-analysis-of-an-online-community/">Patterns and Dynamics of Users’ Behaviour and Interaction: Network Analysis of an Online Community</a> and used in <a href="http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/">Prominence and control: The weighted rich-club effect</a> and <a href="http://toreopsahl.com/2009/04/03/article-clustering-in-weighted-networks/">Clustering in weighted networks</a>. The network originate from an online social network among students at University of California, Irvine. The edgelist includes the users that sent or received at least one message during that period (1,899). A total number of 59,835 online messages were sent among these over 20,296 directed ties.</p>
<div class="knobcitenobg">The content of this post has been integrated in the <em>tnet</em> manual, see <a href="http://toreopsahl.com/datasets/">Datasets</a>.</div>
<br />Posted in Network thoughts Tagged: arcs, communication, complex networks, directed networks, edges, graphs, Links, network, nodes, online, online communication, online social networks, social network analysis, social networking site, ties, valued networks, vertices, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/2054/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/2054/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=2054&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/11/10/online-social-network-dataset-now-available/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2009/11/af-page20-fixed_150.png" medium="image">
			<media:title type="html">Online Social</media:title>
		</media:content>
	</item>
		<item>
		<title>Similarity between node degree and node strength</title>
		<link>http://toreopsahl.com/2009/10/16/similarity-between-node-degree-and-node-strength/</link>
		<comments>http://toreopsahl.com/2009/10/16/similarity-between-node-degree-and-node-strength/#comments</comments>
		<pubDate>Fri, 16 Oct 2009 12:57:38 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[centrality]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[degree]]></category>
		<category><![CDATA[directed networks]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[global]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[gregariousness]]></category>
		<category><![CDATA[hubs]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[local]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[online communication]]></category>
		<category><![CDATA[online social networks]]></category>
		<category><![CDATA[popularity]]></category>
		<category><![CDATA[reinforcement]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[social networking site]]></category>
		<category><![CDATA[strength of nodes]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[weighted networks]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=1974</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2009/10/16/similarity-between-node-degree-and-node-strength"><img src="http://toreopsahl.wordpress.com/files/2009/10/in_small.png" alt="Correlation between node degree and node strength" title="Correlation between node degree and node strength" width="125" height="106" class="alignright size-full wp-image-2002" /></a>This post explores the relationship between node degree and node strength in an online social network. In the online social network, heterogeneity in nodes’ average tie weight across different levels of degree had been <a href="http://toreopsahl.com/2009/03/06/article-patterns-and-dynamics-of-users-behaviour-and-interaction-network-analysis-of-an-online-community/">reported<img src="http://s3.wordpress.com/wp-content/themes/pub/blix/images/spring_flavour/post_yellow.gif"></a>. Although degree and average tie weight are significantly correlated, this post argues for the similarity of degree and node strength.  In particular, high pair-wise correlation between degree and strength is found. In addition, power-law exponents of degree distributions and strength distribution are reported. The exponents are strikingly similar, in fact, they are almost identical.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=1974&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>In the paper <a href="http://toreopsahl.com/2009/03/06/article-patterns-and-dynamics-of-users-behaviour-and-interaction-network-analysis-of-an-online-community/">Patterns and Dynamics of Users’ Behaviour and Interaction: Network Analysis of an Online Community</a>, we found that those individuals with many connections (i.e., high degree) sent on average more messages to each of their contacts than those with fewer connections:</p>
<p><em>&#8220;we measured users’ average out-strength (instrength) as the average number of messages sent to (received from) others (Opsahl, Colizza, Panzarasa,&amp;Ramasco, 2008). We expected hubs to be weakly connected to others, based on the conjecture that all users are homogeneously limited by the same constraints of resources and time. In this case, having more contacts should reduce the amount of resources and time spent on each of them (Burt, 1992). We were surprised to find a positive and significant (p&lt;0.001) Pearson’s pairwise correlation coefficient between average out-strength (in-strength) and out-degree (in-degree) of 0.28 (0.44). This signals that hubs spend more time and resources with each of their contacts than the less connected users.&quot; </em> (excerpt from page 919).</p>
<p>The heterogeneity in average tie weight for users with different levels of gregariousness might indicate that node degree and node strength are not correlated. <strong>This post aims to test this for the online social network used in the paper and compare degree and strength distributions.</strong></p>
<p>Given that this is a directed network, each analysis is conducted twice &#8211; once for outgoing ties and once for incoming ties. The simplest way to test the association between two variables is to calculate the <a href="http://en.wikipedia.org/wiki/Correlation" target="_blank">Pearson pair-wise correlation coefficient </a>. This coefficient tests the linear relationship between two variables, and ranges from -1 to 1. If it is equal to 1, then there is perfect correlation between the two-variables, whereas if it is -1, the two variables are opposites of each other. A value of 0 is attained if there is no linear relationship between the two variables. For out-degree and out-strength, the coefficient is 0.90, and for in-degree and in-strength, the coefficient is 0.89. This indicates that degree and strength is highly correlated with each other (Cohen, 1988). </p>
<p>Since high correlation coefficients were found, it might be interesting to plot the relationships to ensure that extreme values are not distorting the coefficient. The relationships between the two types of degree and strength are: </p>
<p><img src="http://toreopsahl.files.wordpress.com/2009/09/out.png?w=455" alt="out-degree/strength" title="out-degree/strength"   class="alignnone size-full wp-image-1978" /><img src="http://toreopsahl.files.wordpress.com/2009/09/in.png?w=455" alt="in-degree/strength" title="in-degree/strength"   class="alignnone size-full wp-image-1977" /></p>
<p>As it is possible to see from the above plots, there are a number of nodes with extremely high values of degree and strength. However, there are clear trajectories at low values of degree and strength, which might indicate that the outliers are not distorting the correlation coefficients. The fact that there are nodes with extremely high values of degree is not surprising given that power-law degree distributions with exponents of 0.89 and 1.005 were found in the paper:</p>
<p><img src="http://toreopsahl.files.wordpress.com/2009/09/distko.png?w=455" alt="Out-degree distribution" title="Out-degree distribution"   class="alignnone size-full wp-image-1981" /><img src="http://toreopsahl.files.wordpress.com/2009/09/distki.png?w=455" alt="In-degree distribution" title="In-degree distribution"   class="alignnone size-full wp-image-1980" /></p>
<p>Given the similarity between degree and strength, it would be interesting to test whether the strength distributions also follow a power-law distribution, and if so, if the exponent is similar to the ones for the degree-distributions:</p>
<p><img src="http://toreopsahl.files.wordpress.com/2009/09/distso.png?w=455" alt="Out-strength distribution" title="Out-strength distribution"   class="alignnone size-full wp-image-1983" /><img src="http://toreopsahl.files.wordpress.com/2009/09/distsi.png?w=455" alt="In-strength distribution" title="In-strength distribution"   class="alignnone size-full wp-image-1982" /></p>
<p>The exponents of the strength distributions are 0.87 and 1.004. Although I expected some similarity between the degree distributions&#8217; exponents (0.89 and 1.005) and the strength distributions&#8217; exponents, the numerical similarity is striking.</p>
<p><strong>References</strong></p>
<p>Burt, R. S., 1992. Structural Holes: The Social Structure of Competition. Harvard University Press, Cambridge, MA.</p>
<p>Cohen, J., 1988. Statistical power analysis for the behavioral sciences (2nd edition). Hillsdale, NJ: Erlbaum.</p>
<p>Opsahl, T., Colizza, V., Panzarasa, P., Ramasco, J. J., 2008. <a href="http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/">Prominence and control: The weighted rich-club effect</a>. Physical Review Letters 101 (168702). <a href="http://arxiv.org/abs/0804.0417" target="_blank">arXiv:0804.0417</a>. </p>
<p>Panzarasa, P., Opsahl, T., Carley, K.M., 2009. <a href="http://toreopsahl.com/2009/03/06/article-patterns-and-dynamics-of-users-behaviour-and-interaction-network-analysis-of-an-online-community/">Patterns and dynamics of users’ behavior and interaction: Network analysis of an online community</a>. Journal of the American Society for Information Science and Technology 60 (5), 911-932, doi: 10.1002/asi.21015</p>
<p><strong>What to try it with your data?</strong></p>
<p>Below is the code to calculate the numbers and create the diagrams used in this post. If you also would like to calculate the power-law with exponential cut-off, then you should remove the # on line 41.</p>
<pre class="brush: plain; title: ; notranslate">
# Load tnet
library(tnet)

# Load network
data(OnlineSocialNetwork.n1899)

`script` &lt;-
function(net){
  output &lt;- list()
  # Calculate out/in-degree/strength
  k &lt;- cbind(degree_w(net), degree_w(net, type=&quot;in&quot;))
  dimnames(k)[[2]] &lt;- c(&quot;node&quot;,&quot;ko&quot;,&quot;so&quot;,&quot;node2&quot;,&quot;ki&quot;,&quot;si&quot;)
  if(sum(k[,&quot;node&quot;] == k[,&quot;node2&quot;])!=nrow(k))
    stop(&quot;Node ids does not match&quot;)
  k &lt;- k[,c(&quot;node&quot;,&quot;ko&quot;,&quot;ki&quot;,&quot;so&quot;,&quot;si&quot;)]
  output[[1]] &lt;- k

  # Get pair-wise correlation coefficients
  corro &lt;- cor.test(k[,&quot;ko&quot;], k[,&quot;so&quot;])
  corri &lt;- cor.test(k[,&quot;ki&quot;], k[,&quot;si&quot;])
  cat(paste(&quot;Pair-wise correlation between degree and strength:\n Out: &quot;, corro$estimate, &quot; (p-value: &quot;, corro$p.value, &quot;)\n In:  &quot;, corri$estimate, &quot; (p-value: &quot;, corri$p.value, &quot;)\n Note: If p-value equal 0, p-value is less than 2.2e-16\n&quot;, sep=&quot;&quot;))
  output[[2]] &lt;- corro
  output[[3]] &lt;- corri

  # Degree distributions
  cat(&quot;Degree distributions\n&quot;)
  looprange &lt;- c(&quot;ko&quot;,&quot;so&quot;,&quot;ki&quot;,&quot;si&quot;)
  for(j in 1:length(looprange)) {
    i &lt;- looprange[j]
    tmp &lt;- table(k[,i])
    tmp &lt;- tmp[which(rownames(tmp)!=&quot;0&quot;)]
    tmp &lt;- tmp/(sum(tmp))
    tmp &lt;- as.data.frame(cbind(k=as.numeric(rownames(tmp)), pk=tmp))
    plaw &lt;- nls(pk ~ C*k^(-t), data=tmp, start=list(C=1, t=1))
    plaweco &lt;- nls(pk ~ C*k^(-t)*exp(-k/K), data=tmp, start=list(C=1, t=1, K=30))
    cat(switch(i,
      &quot;ko&quot; = &quot; Out-degree&quot;,
      &quot;so&quot; = &quot; Out-strength&quot;,
      &quot;ki&quot; = &quot; In-degree&quot;,
      &quot;si&quot; = &quot; In-strength&quot;))
    cat(paste(&quot;\n  Powerlaw:  pk =&quot;, plaw$call$formula[3], &quot;\n   Coefficients:\n    Con =&quot;, coef(plaw)[&quot;C&quot;], &quot;\n    tau =&quot;, coef(plaw)[&quot;t&quot;]))
    # cat(paste(&quot;\n  Powerlaw with exponential cut-off: pk &quot;, plaweco$call$formula[3], &quot;\n   Coefficients:\n    Con =&quot;, coef(plaweco)[&quot;C&quot;], &quot;\n    tau =&quot;, coef(plaweco)[&quot;t&quot;], &quot;\n    cut =&quot;, coef(plaweco)[&quot;K&quot;]))
    cat(&quot;\n&quot;)
    output[[(length(output)+1)]] &lt;- tmp
    output[[(length(output)+1)]] &lt;- plaw
    output[[(length(output)+1)]] &lt;- plaweco
  }
  cat(&quot; Note: These regressions in the article were performed in Stata 9\n The value of the cut-off parameter varies slightly between R and Stata\n&quot;)
  return(output)
}
output &lt;- script(OnlineSocialNetwork.n1899.net)
k &lt;- output[[1]]
plot(k[,&quot;ko&quot;], k[,&quot;so&quot;], main=&quot;Outgoing ties&quot;, xlab=&quot;out-degree&quot;, ylab=&quot;out-strength&quot;)
plot(k[,&quot;ki&quot;], k[,&quot;si&quot;], main=&quot;Incoming ties&quot;, xlab=&quot;in-degree&quot;,  ylab=&quot;in-strength&quot; )

plot(output[[4]][,1], output[[4]][,2], main=&quot;Out-degree distribution&quot;, xlab=&quot;out-degree&quot;, ylab=&quot;p(out-degree)&quot;, log=&quot;xy&quot;)
lines(output[[4]][,1], fitted(output[[5]]))

plot(output[[7]][,1], output[[7]][,2], main=&quot;Out-strength distribution&quot;, xlab=&quot;out-strength&quot;, ylab=&quot;p(out-strength)&quot;, log=&quot;xy&quot;)
lines(output[[7]][,1], fitted(output[[8]]))

plot(output[[10]][,1], output[[10]][,2], main=&quot;In-degree distribution&quot;, xlab=&quot;in-degree&quot;, ylab=&quot;p(in-degree)&quot;, log=&quot;xy&quot;)
lines(output[[10]][,1], fitted(output[[11]]))

plot(output[[13]][,1], output[[13]][,2], main=&quot;In-strength distribution&quot;, xlab=&quot;in-strength&quot;, ylab=&quot;p(in-strength)&quot;, log=&quot;xy&quot;)
lines(output[[13]][,1], fitted(output[[14]]))
</pre>
<div class="knobinfo">I would like to acknowledge <a href="http://vcolizza.googlepages.com/" target="_blank">Vittoria Colizza </a> in helping to develop the idea behind this post.</div>
<div class="knobcite">If you use any of the information in this post, please cite: Opsahl, T., Panzarasa, P., 2009. Clustering in weighted networks. Social Networks 31 (2), 155-163</div>
<br />Posted in Network thoughts Tagged: actors, arcs, centrality, complex networks, degree, directed networks, edges, global, graphs, gregariousness, hubs, Links, local, network, nodes, online communication, online social networks, popularity, reinforcement, social network analysis, social networking site, strength of nodes, strength of ties, ties, valued networks, vertices, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/1974/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/1974/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=1974&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/10/16/similarity-between-node-degree-and-node-strength/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2009/09/out.png" medium="image">
			<media:title type="html">out-degree/strength</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2009/09/in.png" medium="image">
			<media:title type="html">in-degree/strength</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2009/09/distko.png" medium="image">
			<media:title type="html">Out-degree distribution</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2009/09/distki.png" medium="image">
			<media:title type="html">In-degree distribution</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2009/09/distso.png" medium="image">
			<media:title type="html">Out-strength distribution</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2009/09/distsi.png" medium="image">
			<media:title type="html">In-strength distribution</media:title>
		</media:content>
	</item>
		<item>
		<title>Clustering in two-mode networks</title>
		<link>http://toreopsahl.com/2009/09/11/clustering-in-two-mode-networks/</link>
		<comments>http://toreopsahl.com/2009/09/11/clustering-in-two-mode-networks/#comments</comments>
		<pubDate>Fri, 11 Sep 2009 00:00:29 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[affiliation networks]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[bipartite networks]]></category>
		<category><![CDATA[clustering coefficient]]></category>
		<category><![CDATA[community]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[degree]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[embeddedness]]></category>
		<category><![CDATA[global]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[local]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[reinforcement]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[two-mode networks]]></category>
		<category><![CDATA[undirected networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[vertices]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=1382</guid>
		<description><![CDATA[Many network dataset are by definition two-mode networks. Yet, few network measures can be directly applied to them. Therefore, two-mode networks are often projected onto one-mode networks by selecting a node set and linking two nodes if they were connected to common nodes in the two-mode network. This process has a major impact on the [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=1382&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://toreopsahl.com/tnet/two-mode-networks/clustering/"><img src="http://toreopsahl.files.wordpress.com/2009/09/abstract_picture.png?w=455" alt="Clustering in two-mode networks" title="Clustering in two-mode networks"   class="alignright size-full wp-image-1964" /></a>Many network dataset are by definition two-mode networks. Yet, few network measures can be directly applied to them. Therefore, two-mode networks are often projected onto one-mode networks by selecting a node set and linking two nodes if they were connected to common nodes in the two-mode network. This process has a major impact on the level of clustering in the network. If three or more nodes are connected to a common node in the two-mode network, the nodes form a fully-connected clique consisting of one or more triangles in the one-mode projection. Moreover, it produces a number of modeling issues. For example, even a one-mode projection of a random two-mode network with same number of nodes and ties will have a higher clustering coefficient than the randomly expected value. This post represents an attempt to overcome this issue by redefining the clustering coefficient so that it can be calculated directly on the two-mode structure. I illustrate the benefits of such an approach by applying it to two-mode networks from four different domains:  event attendance, scientific collaboration, interlocking directorates, and online communication.</p>
<div class="knobcitenobg">The content of this post has been integrated in the <em>tnet</em> manual, see <a href="http://toreopsahl.com/tnet/two-mode-networks/clustering/">Clustering in Two-mode Networks</a>.</div>
<br />Posted in Network thoughts Tagged: actors, affiliation networks, arcs, bipartite networks, clustering coefficient, community, complex networks, degree, edges, embeddedness, global, graphs, Links, local, network, nodes, reinforcement, social network analysis, strength of ties, ties, two-mode networks, undirected networks, valued networks, vertices <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/1382/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/1382/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=1382&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/09/11/clustering-in-two-mode-networks/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2009/09/abstract_picture.png" medium="image">
			<media:title type="html">Clustering in two-mode networks</media:title>
		</media:content>
	</item>
		<item>
		<title>tnet: Software for Analysing Weighted Networks</title>
		<link>http://toreopsahl.com/2009/06/12/tnet-software-for-analysing-weighted-networks/</link>
		<comments>http://toreopsahl.com/2009/06/12/tnet-software-for-analysing-weighted-networks/#comments</comments>
		<pubDate>Fri, 12 Jun 2009 00:00:54 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[betweenness]]></category>
		<category><![CDATA[centrality]]></category>
		<category><![CDATA[closeness]]></category>
		<category><![CDATA[clustering coefficient]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[degree]]></category>
		<category><![CDATA[directed networks]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[embeddedness]]></category>
		<category><![CDATA[global]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[hubs]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[local]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[r-package]]></category>
		<category><![CDATA[reciprocation]]></category>
		<category><![CDATA[reinforcement]]></category>
		<category><![CDATA[shortest distance]]></category>
		<category><![CDATA[shortest path]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[software]]></category>
		<category><![CDATA[strength of nodes]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[undirected networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[weighted networks]]></category>
		<category><![CDATA[weighted-richclub]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=1105</guid>
		<description><![CDATA[tnet is a package written in R that can calculate weighted social network measures. Almost all of the ideas posted on this blog are related to weighted networks as, I believe, taking into consideration tie weights enables us to uncover and study interesting network properties. Not only are few social network measures applicable to weighted [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=1105&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://toreopsahl.com/tnet/software/"><img src="http://toreopsahl.files.wordpress.com/2008/12/social-network_weighted.png?w=300&#038;h=136" alt="Weighted Social Network" title="Weighted Social Network" width="300" height="136" class="alignright size-medium wp-image-144" /></a><em>tnet</em> is a package written in <em>R</em> that can calculate weighted social network measures. Almost all of the ideas posted on this blog are related to weighted networks as, I believe, taking into consideration tie weights enables us to uncover and study interesting network properties. Not only are few social network measures applicable to weighted networks, but there is also a lack of software programmes that can analyse this type of networks. In fact, there are no open-source programmes. This hinders the use and development of weighted measures. <em>tnet</em> represents a first step towards creating such a programme. Through this platform, weighted network measures can easily be applied, and new measures easily implemented and distributed.</p>
<div class="knobcitenobg">The content of this post has been integrated in the <em>tnet</em> manual, see <a href="http://toreopsahl.com/tnet/software/">Software</a>.</div>
<br />Posted in Network thoughts Tagged: actors, arcs, betweenness, centrality, closeness, clustering coefficient, complex networks, degree, directed networks, edges, embeddedness, global, graphs, hubs, Links, local, network, nodes, r-package, reciprocation, reinforcement, shortest distance, shortest path, social network analysis, software, strength of nodes, strength of ties, ties, undirected networks, valued networks, vertices, weighted networks, weighted-richclub <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/1105/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/1105/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=1105&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/06/12/tnet-software-for-analysing-weighted-networks/feed/</wfw:commentRss>
		<slash:comments>14</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2008/12/social-network_weighted.png?w=300" medium="image">
			<media:title type="html">Weighted Social Network</media:title>
		</media:content>
	</item>
		<item>
		<title>Weighted Rich-club Effect: A more appropriate null model for scientific collaboration networks</title>
		<link>http://toreopsahl.com/2009/05/29/weighted-rich-club-effect-a-more-appropriate-null-model-for-scientific-collaboration-networks/</link>
		<comments>http://toreopsahl.com/2009/05/29/weighted-rich-club-effect-a-more-appropriate-null-model-for-scientific-collaboration-networks/#comments</comments>
		<pubDate>Fri, 29 May 2009 00:00:20 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[affiliation networks]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[bipartite networks]]></category>
		<category><![CDATA[centrality]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[embeddedness]]></category>
		<category><![CDATA[global]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[hubs]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[popularity]]></category>
		<category><![CDATA[reinforcement]]></category>
		<category><![CDATA[richclub]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[strength of nodes]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[two-mode networks]]></category>
		<category><![CDATA[undirected networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[weighted networks]]></category>
		<category><![CDATA[weighted-richclub]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=1158</guid>
		<description><![CDATA[In this post, I extend the Weighted Rich-club Effect by suggesting and testing a different null model for the scientific collaboration network (Newman, 2001). This network is a two-mode network, which becomes an undirected one-mode network when projected. In the paper, we compared the observed weighted rich-club coefficient with the one found on random networks. [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=1158&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://toreopsahl.com/tnet/two-mode-networks/weighted-rich-club-effect/"><img src="http://toreopsahl.files.wordpress.com/2009/06/fig1_twomode_reshuffling_sw.gif?w=455" alt="Two-mode reshuffling" title="Two-mode reshuffling"   class="alignright size-full wp-image-1388" /></a>In this post, I extend the <a href="http://toreopsahl.com/tnet/weighted-networks/weighted-rich-club-effect/">Weighted Rich-club Effect</a> by suggesting and testing a different null model for the scientific collaboration network (Newman, 2001). This network is a two-mode network, which becomes an undirected one-mode network when <a href="http://toreopsahl.com/tnet/two-mode-networks/projection/">projected</a>. In the paper, we compared the observed weighted rich-club coefficient with the one found on random networks. The random networks were constructed by a null model defined for directed networks when prominence was based on node strength. Therefore, we created a directed network from the undirected scientific collaboration network by linking connected nodes with two directed ties that had the same weight. The null model consisted in reshuffling the tie weights attached to out-going ties for each node. However, this local reshuffling broke the weight symmetry of the two directed ties between connected nodes. The null model proposed in this post is based on the randomisation of the two-mode network before projecting it onto a one-mode network. By randomising before projecting, we are able to randomise a network while keeping the symmetry of weights.</p>
<div class="knobcitenobg">The content of this post has been integrated in the <em>tnet</em> manual, see <a href="http://toreopsahl.com/tnet/two-mode-networks/weighted-rich-club-effect/">Weighted Rich-club Effect in Two-mode Networks</a>.</div>
<br />Posted in Network thoughts Tagged: actors, affiliation networks, arcs, bipartite networks, centrality, complex networks, edges, embeddedness, global, graphs, hubs, Links, network, nodes, popularity, reinforcement, richclub, social network analysis, strength of nodes, strength of ties, two-mode networks, undirected networks, valued networks, vertices, weighted networks, weighted-richclub <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/1158/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/1158/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=1158&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/05/29/weighted-rich-club-effect-a-more-appropriate-null-model-for-scientific-collaboration-networks/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2009/06/fig1_twomode_reshuffling_sw.gif" medium="image">
			<media:title type="html">Two-mode reshuffling</media:title>
		</media:content>
	</item>
		<item>
		<title>Thesis: Structure and Evolution of Weighted Networks</title>
		<link>http://toreopsahl.com/2009/05/15/thesis-structure-and-evolution-of-weighted-networks/</link>
		<comments>http://toreopsahl.com/2009/05/15/thesis-structure-and-evolution-of-weighted-networks/#comments</comments>
		<pubDate>Fri, 15 May 2009 00:00:27 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[affiliation networks]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[betweenness]]></category>
		<category><![CDATA[bipartite networks]]></category>
		<category><![CDATA[centrality]]></category>
		<category><![CDATA[closeness]]></category>
		<category><![CDATA[clustering coefficient]]></category>
		<category><![CDATA[communication]]></category>
		<category><![CDATA[community]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[degree]]></category>
		<category><![CDATA[directed networks]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[embeddedness]]></category>
		<category><![CDATA[evolution]]></category>
		<category><![CDATA[global]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[gregariousness]]></category>
		<category><![CDATA[hubs]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[local]]></category>
		<category><![CDATA[longitudinal networks]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[online]]></category>
		<category><![CDATA[online communication]]></category>
		<category><![CDATA[online social networks]]></category>
		<category><![CDATA[popularity]]></category>
		<category><![CDATA[reciprocation]]></category>
		<category><![CDATA[reinforcement]]></category>
		<category><![CDATA[richclub]]></category>
		<category><![CDATA[shortest distance]]></category>
		<category><![CDATA[shortest path]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[social networking site]]></category>
		<category><![CDATA[strength of nodes]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[two-mode networks]]></category>
		<category><![CDATA[undirected networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[weighted networks]]></category>
		<category><![CDATA[weighted-richclub]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=1143</guid>
		<description><![CDATA[I have now completed my Ph.D. at the School of Business and Management of Queen Mary College, University of London. My Ph.D. programme was defined around a number of projects, which drew on, and extended, recent theoretical and methodological advances in network science. The projects that were concerned with weighted networks and longitudinal networks were [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=1143&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>I have now completed my Ph.D. at the School of Business and Management of Queen Mary College, University of London. My Ph.D. programme was defined around a number of projects, which drew on, and extended, recent theoretical and methodological advances in network science. The projects that were concerned with weighted networks and longitudinal networks were outlined and critically discussed in my thesis (Structure and Evolution of Weighted Networks). The entire thesis, except Appendix C which is outdated, is available on the <a href="http://toreopsahl.com/publications/thesis/">Publication &gt; Thesis-page</a>.</p>
<p><strong>Acknowledgements</strong></p>
<p>The theme of this thesis is interdependence among elements. In fact, this thesis is not just a product of myself, but also of my interdependence with others. Without the support of a number of people, it would not have been possible to write. It is my pleasure to have the opportunity to express my gratitude to many of them here.</p>
<p>For my academic achievements, I would like to acknowledge the constant support from my supervisors. In particular, I thank Pietro Panzarasa for taking an active part of all the projects I have worked on. I have also had the pleasure to collaborate with people other than my supervisors. I worked with Vittoria Colizza and Jose J. Ramasco on the analysis and method presented in Chapter&nbsp;2, Kathleen M. Carley on an empirical analysis of the online social network used throughout this thesis, and Martha J. Prevezer on a project related to knowledge transfer in emerging countries. In addition to these direct collaborations, I would also like to thank Filip Agneessens, Sinan Aral, Steve Borgatti, Ronald Burt, Mauro Faccioni Filho, Thomas Friemel, John Skvoretz, and Vanina Torlo for encouragement and helpful advice. In particular, I would like to thank Tom A. B. Snijders and Klaus Nielsen for insightful reading of this thesis and many productive remarks and suggestions. I have also received feedback on my work at a number of conferences and workshops. I would like to express my gratitude to the participants at these.</p>
<p>On a social note, I would like to thank John, Claudius, and my family for their continuing support. Without them I would have lost focus. My peers and the administrative staff have also been a great source of support. In particular, I would like to extend my acknowledgements to Mariusz Jarmuzek, Geraldine Marks, Roland Miller, Jenny Murphy, Cathrine Seierstad, Lorna Soar, Steven Telford, and Eshref Trushin.</p>
<br />Posted in Articles Tagged: actors, affiliation networks, arcs, betweenness, bipartite networks, centrality, closeness, clustering coefficient, communication, community, complex networks, degree, directed networks, edges, embeddedness, evolution, global, graphs, gregariousness, hubs, Links, local, longitudinal networks, network, nodes, online, online communication, online social networks, popularity, reciprocation, reinforcement, richclub, shortest distance, shortest path, social network analysis, social networking site, strength of nodes, strength of ties, ties, two-mode networks, undirected networks, valued networks, vertices, weighted networks, weighted-richclub <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/1143/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/1143/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=1143&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/05/15/thesis-structure-and-evolution-of-weighted-networks/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>
	</item>
		<item>
		<title>Projecting two-mode networks onto weighted one-mode networks</title>
		<link>http://toreopsahl.com/2009/05/01/projecting-two-mode-networks-onto-weighted-one-mode-networks/</link>
		<comments>http://toreopsahl.com/2009/05/01/projecting-two-mode-networks-onto-weighted-one-mode-networks/#comments</comments>
		<pubDate>Fri, 01 May 2009 00:00:06 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[affiliation networks]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[bipartite networks]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[global]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[strength of nodes]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[two-mode networks]]></category>
		<category><![CDATA[undirected networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[weighted networks]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=782</guid>
		<description><![CDATA[This post highlights a number of methods for projecting both binary and weighted two-mode networks (also known as affiliation or bipartite networks) onto weighted one-mode networks. Although I would prefer to analyse two-mode networks in their original form, few methods exist for that purpose. These networks can be transformed into one-mode networks by projecting them [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=782&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://toreopsahl.com/tnet/two-mode-networks/projection/"><img src="http://toreopsahl.files.wordpress.com/2009/03/fig1_twomode_s.png?w=455" alt="Projection of binary and weighted two-mode networks onto one-mode weighted networks"   class="alignright size-full wp-image-798" /></a>This post highlights a number of methods for projecting both binary and weighted two-mode networks (also known as affiliation or bipartite networks) onto weighted one-mode networks. Although I would prefer to analyse two-mode networks in their original form, few methods exist for that purpose. These networks can be transformed into one-mode networks by projecting them (i.e., selecting one set of nodes, and linking two nodes if they are connected to the same node of the other set). Traditionally, ties in the one-mode networks are without weights. By carefully considering multiple ways of projecting two-mode networks onto weighted one-mode networks, we can maintain some of the richness contained within the two-mode structure. This enables researchers to conduct a deeper analysis than if the two-mode structure was completely ignored. </p>
<div class="knobcitenobg">The content of this post has been integrated in the <em>tnet</em> manual, see <a href="http://toreopsahl.com/tnet/two-mode-networks/projection/">Projecting Two-mode Networks</a>.</div>
<br />Posted in Network thoughts Tagged: actors, affiliation networks, arcs, bipartite networks, complex networks, edges, global, graphs, Links, network, nodes, social network analysis, strength of nodes, strength of ties, ties, two-mode networks, undirected networks, valued networks, vertices, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/782/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/782/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=782&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/05/01/projecting-two-mode-networks-onto-weighted-one-mode-networks/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2009/03/fig1_twomode_s.png" medium="image">
			<media:title type="html">Projection of binary and weighted two-mode networks onto one-mode weighted networks</media:title>
		</media:content>
	</item>
		<item>
		<title>Are triangles made up by strong ties?</title>
		<link>http://toreopsahl.com/2009/04/17/are-triangles-made-up-by-strong-ties/</link>
		<comments>http://toreopsahl.com/2009/04/17/are-triangles-made-up-by-strong-ties/#comments</comments>
		<pubDate>Fri, 17 Apr 2009 00:00:40 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[clustering coefficient]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[embeddedness]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[weighted networks]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=429</guid>
		<description><![CDATA[A key assumption of Granovetter&#8217;s (1973) Strength of Weak Ties theory is that strong ties are embedded by being part of triangles, whereas weak ties are not embedded by being created towards disconnected nodes. This assumption have been tested by calculating the traditional clustering coefficient on binary networks created with increasing cut-off parameters (i.e., creating [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=429&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://toreopsahl.com/tnet/weighted-networks/clustering/"><img src="http://toreopsahl.files.wordpress.com/2009/02/strongtiesembedded_s.png?w=455" alt="Are strong ties embedded?" title="Are strong ties embedded?"   class="alignright size-full wp-image-768" /></a>A key assumption of Granovetter&#8217;s (1973) Strength of Weak Ties theory is that strong ties are embedded by being part of triangles, whereas weak ties are not embedded by being created towards disconnected nodes. This assumption have been tested by calculating the traditional clustering coefficient on binary networks created with increasing cut-off parameters (i.e., creating a series of binary networks from a weighted network where ties with a weight greater than a cut-off parameter is set to present and the rest removed). Contrarily to theories of strong ties and embeddedness, these methods generally showed that the clustering coefficient decreased as the cut-off parameter increased. However, the binary networks were not comparable with each other as they had a different number of ties. Another way of testing this assumption is to take the ratio between the weighted global clustering coefficient and the traditional coefficient measured on networks where all ties are considered present. Thus, the number of ties is maintained. This post highlights this feature and empirically tests it on a number of publically available weighted network datasets.</p>
<div class="knobcitenobg">The content of this post has been integrated in the <em>tnet</em> manual, see <a href="http://toreopsahl.com/tnet/weighted-networks/clustering/">Clustering</a>.</div>
<br />Posted in Network thoughts Tagged: arcs, clustering coefficient, complex networks, edges, embeddedness, graphs, Links, network, social network analysis, strength of ties, ties, valued networks, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/429/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/429/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=429&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/04/17/are-triangles-made-up-by-strong-ties/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2009/02/strongtiesembedded_s.png" medium="image">
			<media:title type="html">Are strong ties embedded?</media:title>
		</media:content>
	</item>
		<item>
		<title>Article: Clustering in Weighted Networks</title>
		<link>http://toreopsahl.com/2009/04/03/article-clustering-in-weighted-networks/</link>
		<comments>http://toreopsahl.com/2009/04/03/article-clustering-in-weighted-networks/#comments</comments>
		<pubDate>Fri, 03 Apr 2009 00:00:38 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[clustering coefficient]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[directed networks]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[embeddedness]]></category>
		<category><![CDATA[global]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[reinforcement]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[undirected networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[weighted networks]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=662</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2009/04/03/article-clustering-in-weighted-networks/"><img src="http://toreopsahl.wordpress.com/files/2009/01/triplet_small21.png" alt="Triplet" title="Triplet" width="86" height="100" class="alignright size-full wp-image-534" /></a>A paper called "Clustering in Weighted Networks" that I have co-authored will be published in Social Networks. Although many social network measures exist for binary networks and many theories differentiate between strong and weak ties, few measures have been generalised so that they can be applied to weighted networks and retain the information encoded in the weights of ties. One of these measures is the global clustering coefficient, which measures embeddedness or, more specifically, the likelihood of a triplet being closed by a tie so that it forms a triangle. This article proposes a generalisation of this key network measure to weighted networks.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=662&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>A paper called &#8220;Clustering in Weighted Networks&#8221; that I have co-authored with <a href="http://www.busman.qmul.ac.uk/staff/staff.php?p.panzarasa@qmul.ac.uk" target="_blank">Pietro Panzarasa</a> will be published in <a href="http://www.sciencedirect.com/science/journal/03788733" target="_blank">Social Networks</a>. Unfortunately, the copyright agreement prevents me from uploading a pdf of the published paper to this blog. However, if you have access to Social Networks, you can <a href="http://dx.doi.org/10.1016/j.socnet.2009.02.002" target="_blank">download the paper directly</a>. Otherwise, a <a href="http://toreopsahl.files.wordpress.com/2009/03/clustering.pdf" target="_blank">preprint with the exact same text</a> is available.</p>
<p><strong>Abstract</strong></p>
<p>In recent years, researchers have investigated a growing number of weighted networks where ties are differentiated according to their strength or capacity. Yet, most network measures do not take weights into consideration, and thus do not fully capture the richness of the information contained in the data. In this paper, we focus on a measure originally defined for unweighted networks: the global clustering coefficient. We propose a generalization of this coefficient that retains the information encoded in the weights of ties. We then undertake a comparative assessment by applying the standard and generalized coefficients to a number of network datasets.</p>
<p><strong>Motivation</strong></p>
<p><img src="http://toreopsahl.files.wordpress.com/2008/12/fig1.png?w=455" alt="Sample network" title="Sample network"   class="alignright size-full wp-image-160" />In this sample network the binary clustering coefficient is 0.33 as a third of the triplets are closed by being part of a triangle. By looking at the weights, it is possible to see that the strongest ties are in part of the closed triplets. This is not reflected in the binary clustering coefficient.</p>
<p>By applying the proposed generalisation of the coefficient using the <a href="http://en.wikipedia.org/wiki/Arithmetic_mean" target="_blank">arithmetic mean</a> method for defining triplet value, the clustering coefficient increases to 0.42. This increase of this coefficient from the binary coefficient is a reflection of the fact that the strongest ties are part of the closed triplets.</p>
<p><strong>Want to test it with your data?</strong></p>
<p>The <a href="http://toreopsahl.com/tnet/weighted-networks/clustering/">clustering_w</a> function in <a href="http://toreopsahl.com/tnet/software/">tnet</a> allows you to test the generalised clustering coefficient on your own dataset.</p>
<p>For example, to test the <a href="http://toreopsahl.com/tnet/weighted-networks/clustering/">clustering_w</a> function on the sample network above, you can run the following code in R:</p>
<pre class="brush: plain; title: ; notranslate">
# Load tnet
library(tnet)

# Load network
net &lt;- cbind(
i=c(1,1,2,2,2,2,3,3,4,5,5,6),
j=c(2,3,1,3,4,5,1,2,2,2,6,5),
w=c(4,2,4,4,1,2,2,4,1,2,1,1))

# Run function
clustering_w(net, measure=c(&quot;am&quot;, &quot;gm&quot;, &quot;ma&quot;, &quot;mi&quot;))

# The output is:
#       am        gm        ma        mi
#0.4166667 0.4361302 0.3750000 0.5000000
</pre>
<p>To test in on Freeman&#8217;s third EIES network from <a href="http://toreopsahl.com/datasets/">the datasets page</a>, you can do the following:</p>
<pre class="brush: plain; title: ; notranslate">
# Load tnet
library(tnet)

# Load network
data(Freemans.EIES)

# Run function
clustering_w(Freemans.EIES.net.3.n32, measure=c(&quot;am&quot;, &quot;gm&quot;, &quot;ma&quot;, &quot;mi&quot;))

# The output is:
#0.7378310 0.7331536 0.7410959 0.7249982</pre>
<div class="knobcite">If you use any of the information in this post, please cite: Opsahl, T., Panzarasa, P., 2009. Clustering in weighted networks. Social Networks 31 (2), 155-163</div>
<br />Posted in Articles Tagged: arcs, clustering coefficient, complex networks, directed networks, edges, embeddedness, global, graphs, Links, network, nodes, reinforcement, social network analysis, strength of ties, ties, undirected networks, valued networks, vertices, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/662/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/662/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=662&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/04/03/article-clustering-in-weighted-networks/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2008/12/fig1.png" medium="image">
			<media:title type="html">Sample network</media:title>
		</media:content>
	</item>
		<item>
		<title>The importance of allowing ties to decay</title>
		<link>http://toreopsahl.com/2009/03/20/the-importance-of-allowing-ties-to-decay/</link>
		<comments>http://toreopsahl.com/2009/03/20/the-importance-of-allowing-ties-to-decay/#comments</comments>
		<pubDate>Fri, 20 Mar 2009 00:00:45 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[evolution]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[longitudinal networks]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[online communication]]></category>
		<category><![CDATA[online social networks]]></category>
		<category><![CDATA[reinforcement]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[social networking site]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[weighted networks]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=436</guid>
		<description><![CDATA[Recently, a number of network dataset have been constructed from archival data (e.g., email logs) with the aim to study human interaction. This has allowed researchers to study large-scale social networks. If the archival data does not included information about the severing or weakening of ties, non-relevant interaction among people, which occurred far in the [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=436&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://toreopsahl.com/tnet/longitudinal-networks/smoothing-window/"><img src="http://toreopsahl.files.wordpress.com/2009/02/fig1_evolving_s.gif?w=455" alt="Evolving network" title="Evolving network"   class="alignright size-full wp-image-764" /></a>Recently, a number of network dataset have been constructed from archival data (e.g., email logs) with the aim to study human interaction. This has allowed researchers to study large-scale social networks. If the archival data does not included information about the severing or weakening of ties, non-relevant interaction among people, which occurred far in the past, might be deemed relevant. This post highlights this issue and suggests imposing a lifespan on interactions to record only relevant ties with the current strength.</p>
<div class="knobcitenobg">The content of this post has been integrated in the <em>tnet</em> manual, see <a href="http://toreopsahl.com/tnet/longitudinal-networks/smoothing-window/">Smoothing Window</a>.</div>
<br />Posted in Network thoughts Tagged: arcs, complex networks, edges, evolution, graphs, Links, longitudinal networks, network, online communication, online social networks, reinforcement, social network analysis, social networking site, strength of ties, ties, valued networks, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/436/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/436/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=436&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/03/20/the-importance-of-allowing-ties-to-decay/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2009/02/fig1_evolving_s.gif" medium="image">
			<media:title type="html">Evolving network</media:title>
		</media:content>
	</item>
		<item>
		<title>Article: Patterns and Dynamics of Users’ Behaviour and Interaction: Network Analysis of an Online Community</title>
		<link>http://toreopsahl.com/2009/03/06/article-patterns-and-dynamics-of-users-behaviour-and-interaction-network-analysis-of-an-online-community/</link>
		<comments>http://toreopsahl.com/2009/03/06/article-patterns-and-dynamics-of-users-behaviour-and-interaction-network-analysis-of-an-online-community/#comments</comments>
		<pubDate>Fri, 06 Mar 2009 00:00:19 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[clustering coefficient]]></category>
		<category><![CDATA[communication]]></category>
		<category><![CDATA[community]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[embeddedness]]></category>
		<category><![CDATA[evolution]]></category>
		<category><![CDATA[gender differences]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[gregariousness]]></category>
		<category><![CDATA[hubs]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[online]]></category>
		<category><![CDATA[popularity]]></category>
		<category><![CDATA[reachability]]></category>
		<category><![CDATA[reciprocation]]></category>
		<category><![CDATA[reinforcement]]></category>
		<category><![CDATA[shortest distance]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[social networking site]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[weighted networks]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=659</guid>
		<description><![CDATA[<a href="http://toreopsahl.com/2009/03/06/article-patterns-and-dynamics-of-users-behaviour-and-interaction-network-analysis-of-an-online-community/"><img src="http://toreopsahl.wordpress.com/files/2009/11/af-page20-fixed_150.png" alt="Online Social Network" title="Online Social Network" width="150" height="151" class="alignright size-full wp-image-2059" /></a>A paper called "Patterns and Dynamics of Users' Behaviour and Interaction: Network Analysis of an Online Community" that I have co-authored will be published in the Journal of the American Society for Information Science and Technology (JASIST). In this paper, we studied the evolution of a variety of properties in an online community, including how users create, reciprocate, and deepen relationships with one another, variations in users’ gregariousness and popularity, reachability and typical distances among users, and the degree of local redundancy in the community. <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=659&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>A paper called Patterns and Dynamics of Users&#8217; Behaviour and Interaction: Network Analysis of an Online Community that I have co-authored with <a href="http://www.busman.qmul.ac.uk/staff/staff.php?p.panzarasa@qmul.ac.uk" target="_blank">Pietro Panzarasa</a> and <a href="http://www.casos.cs.cmu.edu/bios/carley/carley.html" target="_blank">Kathleen M. Carley</a> will be published in the May issue of the <a href="http://www.asis.org/jasist.html" target="_blank">Journal of the American Society for Information Science and Technology (JASIST)</a>. Unfortunately, the copyright agreement prevents me from uploading the paper to this blog. However, if you have access to JASIST, you can <a href="http://dx.doi.org/10.1002/asi.21015" target="_blank">download the paper directly</a>. Otherwise, if you write me an <a href="http://toreopsahl.com/about/">email</a>, I can send it to you.</p>
<p><strong>Abstract</strong></p>
<p>This research draws on longitudinal network data from an online community to examine patterns of users’ behavior and social interaction and infer the processes underpinning dynamics of system use. The online community represents a prototypical example of a complex evolving social network in which connections between users are established over time by online messages. We study the evolution of a variety of properties since the inception of the system, including how users create, reciprocate, and deepen relationships with one another, variations in users’ gregariousness and popularity, reachability and typical distances among users, and the degree of local redundancy in the system. Results indicate that the system is a &#8220;small world&#8221;  characterized by the emergence, in its early stages, of a hub-dominated structure with highly heterogeneous users’ behavior. We investigate whether hubs are responsible for holding the system together and facilitating information flow, examine first-mover advantages underpinning users’ ability to rise to system prominence, and uncover gender differences in users’ gregariousness, popularity, and local redundancy. We discuss the implications of the results for research on system use and evolving social networks, and for a host of applications, including information diffusion, communities of practice, and the security and robustness of information systems. </p>
<div class="knobcite">If you use any of the information in this post, please cite: Panzarasa, P., Opsahl, T., Carley, K.M., 2009. Patterns and dynamics of users’ behavior and interaction: Network analysis of an online community. Journal of the American Society for Information Science and Technology 60 (5), 911-932</div>
<br />Posted in Articles Tagged: clustering coefficient, communication, community, complex networks, embeddedness, evolution, gender differences, graphs, gregariousness, hubs, network, online, popularity, reachability, reciprocation, reinforcement, shortest distance, social network analysis, social networking site, valued networks, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/659/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/659/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=659&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/03/06/article-patterns-and-dynamics-of-users-behaviour-and-interaction-network-analysis-of-an-online-community/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>
	</item>
		<item>
		<title>Betweenness in weighted networks</title>
		<link>http://toreopsahl.com/2009/02/20/betweenness-in-weighted-networks/</link>
		<comments>http://toreopsahl.com/2009/02/20/betweenness-in-weighted-networks/#comments</comments>
		<pubDate>Fri, 20 Feb 2009 00:00:13 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[betweenness]]></category>
		<category><![CDATA[centrality]]></category>
		<category><![CDATA[closeness]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[local]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[shortest distance]]></category>
		<category><![CDATA[shortest path]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[weighted networks]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=464</guid>
		<description><![CDATA[This post highlights a generalisation of Freeman&#8217;s (1978) betweenness measure to weighted networks implicitly introduced by Brandes (2001) when he developed an algorithm for calculating betweenness faster. Betweenness is a measure of the extent to which a node funnels transactions among all the other nodes in the network. By funnelling the transactions, a node can [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=464&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://toreopsahl.com/tnet/weighted-networks/node-centrality/"><img src="http://toreopsahl.files.wordpress.com/2009/01/fig1_betweenness_s3.gif?w=455" alt="Betweenness example" title="Betweenness example"   class="alignright size-full wp-image-623" /></a>This post highlights a generalisation of Freeman&#8217;s (1978) betweenness measure to weighted networks implicitly introduced by Brandes (2001) when he developed an algorithm for calculating betweenness faster. Betweenness is a measure of the extent to which a node funnels transactions among all the other nodes in the network. By funnelling the transactions, a node can broker. This could be by taking a cut (e.g. Ukraine controls most gas pipelines from Russia to Europe) or distorting the information being transmitted to its advantage. </p>
<div class="knobcitenobg">The content of this post has been integrated in the <em>tnet</em> manual, see <a href="http://toreopsahl.com/tnet/weighted-networks/node-centrality/">Node Centrality in Weighted Networks</a>.</div>
<br />Posted in Network thoughts Tagged: betweenness, centrality, closeness, complex networks, graphs, local, network, shortest distance, shortest path, social network analysis, strength of ties, valued networks, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/464/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/464/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=464&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/02/20/betweenness-in-weighted-networks/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2009/01/fig1_betweenness_s3.gif" medium="image">
			<media:title type="html">Betweenness example</media:title>
		</media:content>
	</item>
		<item>
		<title>Operationalisation of tie strength in social networks</title>
		<link>http://toreopsahl.com/2009/02/06/operationalisation-of-tie-strength/</link>
		<comments>http://toreopsahl.com/2009/02/06/operationalisation-of-tie-strength/#comments</comments>
		<pubDate>Fri, 06 Feb 2009 00:00:47 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[strength of nodes]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[weighted networks]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=427</guid>
		<description><![CDATA[The method used to operationalise ties&#8217; strength into weights affects the outcomes of weighted networks measures. Simply assigning 1, 2, and 3 to three different levels of tie strength might not be appropriate as this scale might misrepresent the actually difference among the three levels (using an ordinal scale). In this post, I highlight issues [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=427&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://toreopsahl.com/tnet/weighted-networks/defining-one-mode-networks/"><img src="http://toreopsahl.files.wordpress.com/2009/01/triangle_strength_weight1_s.png?w=455" alt="Ties&#039; strength and weight" title="Ties&#039; strength and weight"   class="alignright size-full wp-image-536" /></a>The method used to operationalise ties&#8217; strength into weights affects the outcomes of weighted networks measures. Simply assigning 1, 2, and 3 to three different levels of tie strength might not be appropriate as this scale might misrepresent the actually difference among the three levels (using an ordinal scale). In this post, I highlight issues with collecting weighted social network data from surveys. </p>
<div class="knobcitenobg">The content of this post has been integrated in the <em>tnet</em> manual, see <a href="http://toreopsahl.com/tnet/weighted-networks/defining-one-mode-networks/">Defining Weighted Networks</a>.</div>
<br />Posted in Network thoughts Tagged: arcs, complex networks, edges, graphs, Links, network, social network analysis, strength of nodes, ties, valued networks, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/427/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/427/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=427&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/02/06/operationalisation-of-tie-strength/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2009/01/triangle_strength_weight1_s.png" medium="image">
			<media:title type="html">Ties&#039; strength and weight</media:title>
		</media:content>
	</item>
		<item>
		<title>Weighted local clustering coefficient</title>
		<link>http://toreopsahl.com/2009/01/23/weighted-local-clustering-coefficient/</link>
		<comments>http://toreopsahl.com/2009/01/23/weighted-local-clustering-coefficient/#comments</comments>
		<pubDate>Fri, 23 Jan 2009 00:00:22 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[clustering coefficient]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[embeddedness]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[local]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[weighted networks]]></category>

		<guid isPermaLink="false">http://toreopsahl.wordpress.com/?p=312</guid>
		<description><![CDATA[The generalisation of the local clustering coefficient to weighted networks by Barrat et al. (2004) considers the value of a triplet to be the average of the weights attached to the two ties that make up the triplet. In this post, I suggest three additional methods for defining the triplet value. &#160; &#160; The content [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=312&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://toreopsahl.com/tnet/weighted-networks/clustering/"><img src="http://toreopsahl.files.wordpress.com/2009/01/triplet_small21.png?w=455" alt="Triplet" title="Triplet"   class="alignright size-full wp-image-534" /></a>The generalisation of the local clustering coefficient to weighted networks by Barrat et al. (2004) considers the value of a triplet to be the average of the weights attached to the two ties that make up the triplet. In this post, I suggest three additional methods for defining the triplet value.<br />
&nbsp;<br />
&nbsp;</p>
<div class="knobcitenobg">The content of this post has been integrated in the <em>tnet</em> manual, see <a href="http://toreopsahl.com/tnet/weighted-networks/clustering/">Clustering in Weighted Networks</a>.</div>
<br />Posted in Network thoughts Tagged: clustering coefficient, complex networks, embeddedness, graphs, local, network, social network analysis, valued networks, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/312/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/312/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=312&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/01/23/weighted-local-clustering-coefficient/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2009/01/triplet_small21.png" medium="image">
			<media:title type="html">Triplet</media:title>
		</media:content>
	</item>
		<item>
		<title>Average shortest distance in weighted networks</title>
		<link>http://toreopsahl.com/2009/01/09/average-shortest-distance-in-weighted-networks/</link>
		<comments>http://toreopsahl.com/2009/01/09/average-shortest-distance-in-weighted-networks/#comments</comments>
		<pubDate>Fri, 09 Jan 2009 00:00:23 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[centrality]]></category>
		<category><![CDATA[closeness]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[global]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[shortest distance]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[weighted networks]]></category>

		<guid isPermaLink="false">http://toreopsahl.wordpress.com/?p=174</guid>
		<description><![CDATA[The average distance that separate nodes in a network became a famous measure following Milgram&#8217;s six-degrees of separation experiment in 1967 that found that people in the US were on average 6-steps from each other. This post proposes a generalisation of this measure to weighted networks by building on work by Dijkstra (1959) and Newman [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=174&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://toreopsahl.com/tnet/weighted-networks/shortest-paths/"><img src="http://toreopsahl.files.wordpress.com/2009/01/distance.png?w=455" alt="Distance between nodes" title="Distance between nodes"   class="alignright size-full wp-image-530" /></a>The average distance that separate nodes in a network became a famous measure following Milgram&#8217;s six-degrees of separation experiment in 1967 that found that people in the US were on average 6-steps from each other. This post proposes a generalisation of this measure to weighted networks by building on work by Dijkstra (1959) and Newman (2001).</p>
<div class="knobcitenobg">The content of this post has been integrated in the <em>tnet</em> manual, see <a href="http://toreopsahl.com/tnet/weighted-networks/shortest-paths/">Shortest Paths in Weighted Networks</a>.</div>
<br />Posted in Network thoughts Tagged: centrality, closeness, complex networks, global, graphs, network, shortest distance, social network analysis, valued networks, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/174/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/174/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=174&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2009/01/09/average-shortest-distance-in-weighted-networks/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>

		<media:content url="http://toreopsahl.files.wordpress.com/2009/01/distance.png" medium="image">
			<media:title type="html">Distance between nodes</media:title>
		</media:content>
	</item>
		<item>
		<title>Local weighted rich-club measure</title>
		<link>http://toreopsahl.com/2008/12/26/local-weighted-rich-club-measure/</link>
		<comments>http://toreopsahl.com/2008/12/26/local-weighted-rich-club-measure/#comments</comments>
		<pubDate>Fri, 26 Dec 2008 00:31:20 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Network thoughts]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[local]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[richclub]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[weighted networks]]></category>
		<category><![CDATA[weighted-richclub]]></category>

		<guid isPermaLink="false">http://toreopsahl.wordpress.com/?p=5</guid>
		<description><![CDATA[This post proposes a local (node-level) version of the Weighted Rich-club Effect (PRL 101, 168702). By incorporating this measure into a regression analysis, the impact of targeting efforts towards prominent nodes on performance can be studied. The content of this post has been integrated in the tnet manual, see The Weighted Rich-club Effect. Posted in [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=5&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>This post proposes a local (node-level) version of the <a href="http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/">Weighted Rich-club Effect (PRL 101, 168702)</a>. By incorporating this measure into a regression analysis, the impact of targeting efforts towards prominent nodes on performance can be studied. </p>
<div class="knobcitenobg">The content of this post has been integrated in the <em>tnet</em> manual, see <a href="http://toreopsahl.com/tnet/weighted-networks/weighted-rich-club-effect/">The Weighted Rich-club Effect</a>.</div>
<br />Posted in Network thoughts Tagged: complex networks, graphs, local, network, richclub, social network analysis, valued networks, weighted networks, weighted-richclub <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/5/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/5/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=5&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2008/12/26/local-weighted-rich-club-measure/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>
	</item>
		<item>
		<title>Article: Prominence and control: The weighted rich-club effect</title>
		<link>http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/</link>
		<comments>http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/#comments</comments>
		<pubDate>Fri, 12 Dec 2008 00:00:08 +0000</pubDate>
		<dc:creator>Tore Opsahl</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[actors]]></category>
		<category><![CDATA[arcs]]></category>
		<category><![CDATA[complex networks]]></category>
		<category><![CDATA[directed networks]]></category>
		<category><![CDATA[edges]]></category>
		<category><![CDATA[embeddedness]]></category>
		<category><![CDATA[global]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[hubs]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[richclub]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[strength of nodes]]></category>
		<category><![CDATA[strength of ties]]></category>
		<category><![CDATA[ties]]></category>
		<category><![CDATA[undirected networks]]></category>
		<category><![CDATA[valued networks]]></category>
		<category><![CDATA[vertices]]></category>
		<category><![CDATA[weighted networks]]></category>

		<guid isPermaLink="false">http://toreopsahl.com/?p=779</guid>
		<description><![CDATA[A paper called Prominence and control: The weighted rich-club effect that I have co-authored was published in Physical Review Letters (PRL). In this paper, we proposed a new general framework for studying the tendency of prominent nodes to direct their strongest ties toward each other.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=779&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>A paper called Prominence and control: The weighted rich-club effect that I have co-authored with <a href="http://vcolizza.googlepages.com/" target="_blank">Vittoria Colizza</a>, <a href="http://www.busman.qmul.ac.uk/staff/staff.php?p.panzarasa@qmul.ac.uk" target="_blank">Pietro Panzarasa</a>, and <a href="http://isiosf.isi.it/~jramasco/" target="_blank">José Javier Ramasco</a> was published in the October 17 issue of <a href="http://prl.aps.org/" target="_blank">Physical Review Letters (PRL)</a>. A pdf version of <a href="http://vcolizza.googlepages.com/PhysRevLett_101_168702.pdf" target="_blank">the paper is available</a>.</p>
<p><strong>Abstract</strong></p>
<p>Complex systems are often characterized by large-scale hierarchical organizations.  Whether the prominent elements, at the top of the hierarchy, share and control resources or avoid one another lies at the heart of the global organization and  functioning. Inspired by network perspectives, we propose a new general framework for studying the tendency of prominent elements to form clubs with exclusive control over the majority of a system&#8217;s resources. We explore associations between prominence and control in the fields of transportation, scientific collaboration, and online communication. </p>
<p><em>UPDATE</em>: I have suggested an additional null model in <a href="http://toreopsahl.com/tnet/two-mode-networks/weighted-rich-club-effect/">Weighted Rich-club Effect: A more appropriate null model for scientific collaboration networks</a>.</p>
<p><strong>Want to test it with your data?</strong></p>
<p>See the Weighted Rich-club Effect in the <a href="http://toreopsahl.com/tnet/weighted-networks/weighted-rich-club-effect/">tnet manual</a>.</p>
<div class="knobcite">If you use any of the information in this post, please cite: Opsahl, T., Opsahl, Colizza, V., Panzarasa, P., Ramasco, J.J., 2008. Prominence and control: The weighted rich-club effect. Physical Review Letters 101 (168702)</div>
<br />Posted in Articles Tagged: actors, arcs, complex networks, directed networks, edges, embeddedness, global, graphs, hubs, Links, network, nodes, richclub, social network analysis, strength of nodes, strength of ties, ties, undirected networks, valued networks, vertices, weighted networks <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/toreopsahl.wordpress.com/779/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/toreopsahl.wordpress.com/779/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=toreopsahl.com&#038;blog=5878280&#038;post=779&#038;subd=toreopsahl&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://toreopsahl.com/2008/12/12/article-prominence-and-control-the-weighted-rich-club-effect/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/52afd8a25dc5ae6cf390031b61953552?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">Tore</media:title>
		</media:content>
	</item>
	</channel>
</rss>
