Section 3.2
The Random Network Model

Network science aims to build models that reproduce the properties of real networks. Most networks we encounter do not have the comforting regularity of a crystal lattice or the predictable radial architecture of a spider web. Rather, at first inspection they look as if they were spun randomly (Image 2.4). Random network theory embraces this apparent randomness by constructing and characterizing networks that are truly random.

From a modeling perspective a network is a relatively simple object, consisting of only nodes and links. The real challenge, however, is to decide where to place the links between the nodes so that we reproduce the complexity of a real system. In this respect the philosophy behind a random network is simple: We assume that this goal is best achieved by placing the links randomly between the nodes. That takes us to the definition of a random network (BOX 3.1):

A random network consists of N nodes where each node pair is connected with probability p.

Box 3.1
Defining Random Networks

There are two definitions of a random network:

G(N, L) Model: N labeled nodes are connected with L randomly placed links. Erdős and Rényi used this definition in their string of papers on random networks [2-9]
G(N, p) Model: Each pair of N labeled nodes is connected with probability p, a model introduced by Gilbert [10].

Hence, the G(N, p) model fixes the probability p that two nodes are connected and the G(N, L)) model fixes the total number of links L. While in the G(N, L) model the average degree of a node is simply ‹k› = 2L/N, other network characteristics are easier to calculate in the G(N, p) model. Throughout this book we will explore the G(N, p) model, not only for the ease that it allows us to calculate key network characteristics, but also because in real networks the number of links rarely stays fixed.

To construct a random network we follow these steps:

Start with N isolated nodes.
Select a node pair and generate a random number between 0 and 1. If the number exceeds p, connect the selected node pair with a link, otherwise leave them disconnected.
Repeat step (2) for each of the N(N-1)/2 node pairs.

The network obtained after this procedure is called a random graph or a random network. Two mathematicians, Pál Erdős and Alfréd Rényi, have played an important role in understanding the properties of these networks. In their honor a random network is called the Erdős-Rényi network (BOX 3.2).

Box 3.2
Random Networks: a Brief History

Image 3.2
(a) Pál Erdős (1913-1996)

Hungarian mathematician known for both his exceptional scientific output and eccentricity. Indeed, Erdős published more papers than any other mathematician in the history of mathematics. He co-authored papers with over five hundred mathematicians, inspiring the concept of Erdős number. His legendary personality and profound professional impact has inspired two biographies [12, 13] and a documentary [14] (Video 3.1).

(b) Alfréd Rényi (1921-1970)

Hungarian mathematician with fundamental contributions to combinatorics, graph theory, and number theory. His impact goes beyond mathematics: The Rényi entropy is widely used in chaos theory and the random network theory he co-developed is at the heart of network science. He is remembered through the hotbed of Hungarian mathematics, the Alfréd Rényi Institute of Mathematics in Budapest.

Video 3.1
N is a Number: A Portrait of Paul Erdős
The 1993 biographical documentary of Pál Erdős, directed by George Paul Csicsery, offers a glimpse into Erdős' life and scientific impact [14].

Anatol Rapoport (1911-2007), a Russian immigrant to the United States, was the first to study random networks. Rapoport’s interests turned to mathematics after realizing that a successful career as a concert pianist would require a wealthy patron. He focused on mathematical biology at a time when mathematicians and biologists hardly spoke to each other. In a paper written with Ray Solomonoff in 1951 [11], Rapoport demonstrated that if we increase the average degree of a network, we observe an abrupt transition from disconnected nodes to a graph with a giant component.

The study of random networks reached prominence thanks to the fundamental work of Pál Erdős and Alfréd Rényi (Image 3.2). In a sequence of eight papers published between 1959 and 1968 [2-9], they merged probability theory and combinatorics with graph theory, establishing random graph theory, a new branch of mathematics [2].

The random network model was independently introduced by Edgar Nelson Gilbert (1923-2013) [10] the same year Erdős and Rényi published their first paper on the subject. Yet, the impact of Erdős and Rényi’s work is so overwhelming that they are rightly considered the founders of random graph theory.

“A mathematician is a device for turning coffee into theorems”

Alfréd Rényi (a quote often attributed to Erdős)

Section 3.3
Number of Links

Each random network generated with the same parameters N, p looks slightly different (Image 3.3). Not only the detailed wiring diagram changes between realizations, but so does the number of links L. It is useful, therefore, to determine how many links we expect for a particular realization of a random network with fixed Nand p.

The probability that a random network has exactly L links is the product of three terms:

The probability that L of the attempts to connect the N(N-1)/2 pairs of nodes have resulted in a link, which is p^L.
The probability that the remaining N(N-1)/2 - L attempts have not resulted in a link, which is (1-p)^N(N-1)/2-L.
A combinational factor, $\left( {\begin{array}{*{20}c} {\frac{{N(N - 1)}}{2}} \\ L \\ \end{array}} \right) \hspace{20 mm} (3 . 0)$ counting the number of different ways we can place L links among N(N-1)/2 node pairs.

We can therefore write the probability that a particular realization of a random network has exactly L links as

$p_L = \left( {\begin{array}{*{20}c} {\frac{{N(N - 1)}}{2}} \\ L \\ \end{array}} \right)p^L (1 - p)^{\frac{{N(N - 1)}}{2} - L} \hspace{20 mm} (3 . 1)$

As (3.1) is a binomial distribution (BOX 3.3), the expected number of links in a random graph is

\ $\left\langle L \right\rangle = \sum\limits_{L = 0}^{\frac{{N(N - 1)}}{2}} {Lp_L } = p\frac{{N(N - 1)}}{2} \hspace{20 mm} (3 . 2)$

Hence ‹L› is the product of the probability p that two nodes are connected and the number of pairs we attempt to connect, which is L_max = N(N - 1)/2 (CHAPTER 2).

Using (3.2) we obtain the average degree of a random network

$\left\langle k \right\rangle = \frac{{2\left\langle L \right\rangle }}{N} = p(N - 1) \hspace{20 mm} (3 . 3)$

Hence ‹k› is the product of the probability p that two nodes are connected and (N-1), which is the maximum number of links a node can have in a network of size N.

In summary the number of links in a random network varies between realizations. Its expected value is determined by N and p. If we increase p a random network becomes denser: The average number of links increase linearly from ‹L› = 0 to L_max and the average degree of a node increases from ‹k› = 0 to ‹k› = N-1.

Random Networks are Truly Random. — Image 3.3
Random Networks are Truly Random
Top Row
Three realizations of a random network generated with the same parameters p=1/6 and N=12. Despite the identical parameters, the networks not only look different, but they have a different number of links as well (L=10, 10, 8).

Bottom Row
Three realizations of a random network with p=0.03 and N=100. Several nodes have degree k=0, shown as isolated nodes at the bottom.

Box 3.3
Binomial Distribution: Mean and Variance

If we toss a fair coin N times, tails and heads occur with the same probability p = 1/2. The binomial distribution provides the probability p_x that we obtain exactly x heads in a sequence of N throws. In general, the binomial distribution describes the number of successes in N independent experiments with two possible outcomes, in which the probability of one outcome is p, and of the other is 1-p.

The binomial distribution has the form

$p_x = \left( {\begin{array}{*{20}c} N \\ x \\ \end{array}} \right)p^x (1 - p)^{N - x}$

The mean of the distribution (first moment) is

$\left\langle x \right\rangle = \sum\limits_{x = 0}^N {xp_x } = Np \hspace{20 mm} (3 . 4)$

Its second moment is

$\left\langle {x^2 } \right\rangle = \sum\limits_{x = 0}^N {x^2 p_x } = p(1 - p)N + p^2 N^2 \hspace{20 mm} (3 . 5)$

providing its standard deviation as

$\sigma _x = \left( {\left\langle {x^2 } \right\rangle - \left\langle x \right\rangle ^2 } \right)^{\frac{1}{2}} = \left[ {p(1 - p)N} \right]^{\frac{1}{2}} \hspace{20 mm} (3 . 6)$

Equations (3.4) - (3.6) are used repeatedly as we characterize random networks.

Section 3.4
Degree Distribution

In a given realization of a random network some nodes gain numerous links, while others acquire only a few or no links (Image 3.3). These differences are captured by the degree distribution, p_k, which is the probability that a randomly chosen node has degree k. In this section we derive p_k for a random network and discuss its properties.

Binomial vs. Poisson Degree Distribution. — Image 3.4
Binomial vs. Poisson Degree Distribution
The exact form of the degree distribution of a random network is the binomial distribution(left half). For N ›› ‹k› the binomial is well approximated by a Poisson distribution (right half). As both formulas describe the same distribution,they have the identical properties, but they are expressed in terms of different parameters: The binomial distribution depends on p and N, while the Poisson distribution has only one parameter, ‹k›. It is this simplicity that makes the Poisson form preferred in calculations.

Binomial Distribution

In a random network the probability that node i has exactly k links is the product of three terms [15]:

The probability that k of its links are present, or p^k.
The probability that the remaining (N-1-k) links are missing, or (1-p)^N-1-k
The number of ways we can select k links from N- 1 potential links a node can have, or

$\left( {\begin{array}{*{20}c} {N - 1} \\ k \\ \end{array}} \right)$

Consequently the degree distribution of a random network follows the binomial distribution

$p_k = \left( {\begin{array}{*{20}c} {N - 1} \\ k \\ \end{array}} \right)p^k (1 - p)^{N - 1 - k} \hspace{20 mm} (3 . 7)$

The shape of this distribution depends on the system size N and the probability p (Image 3.4). The binomial distribution (BOX 3.3) allows us to calculate the network’s average degree ‹k›, recovering (3.3), as well as its second moment ‹k²› and variance σ_k (Image 3.4).

Poisson Distribution

Most real networks are sparse, meaning that for them ‹k› ‹‹ N (Table 2.1). In this limit the degree distribution (3.7) is well approximated by the Poisson distribution (ADVANCED TOPICS 3.A)

$p_k = e^{ - \left\langle k \right\rangle } \frac{{\left\langle k \right\rangle ^k }}{{k!}} \hspace{20 mm} (3 . 8)$

which is often called, together with (3.7), the degree distribution of a random network.

The binomial and the Poisson distribution describe the same quantity, hence they have similar properties (Image 3.4):

Both distributions have a peak around ‹k›. If we increase p the network becomes denser, increasing ‹k› and moving the peak to the right.
The width of the distribution (dispersion) is also controlled by p or ‹k›. The denser the network, the wider is the distribution, hence the larger are the differences in the degrees.

When we use the Poisson form (3.8), we need to keep in mind that:

The exact result for the degree distribution is the binomial form (3.7), thus (3.8) represents only an approximation to (3.7) valid in the ‹k› ‹‹ N limit. As most networks of practical importance are sparse, this condition is typically satisfied.
The advantage of the Poisson form is that key network characteristics, like <‹k›, ‹k²› and σ_k , have a much simpler form (Image 3.4), depending on a single parameter, ‹k›.
The Poisson distribution in (3.8) does not explicitly depend on the number of nodes N. Therefore, (3.8) predicts that the degree distribution of networks of different sizes but the same average degree ‹k› are indistinguishable from each other (Image 3.5).

In summary, while the Poisson distribution is only an approximation to the degree distribution of a random network, thanks to its analytical simplicity, it is the preferred form for p_k. Hence throughout this book, unless noted otherwise, we will refer to the Poisson form (3.8) as the degree distribution of a random network. Its key feature is that its properties are independent of the network size and depend on a single parameter, the average degree ‹k›.

Degree Distribution is Independent of the Network Size. — Image 3.5
Degree Distribution is Independent of the Network Size

The degree distribution of a random network with ‹k› = 50 and N = 10², 10³, 10⁴.

Small Networks: Binomial
For a small network (N = 10²) the degree distribution deviates significantly from the Poisson form (3.8), as the condition for the Poisson approximation, N»‹k›, is not satisfied. Hence for small networks one needs to use the exact binomial form (3.7) (green line).

Large Networks: Poisson
For larger networks (N = 10³, 10⁴) the degree distribution becomes indistinguishable from the Poisson prediction (3.8), shown as a continuous grey line. Therefore for large N the degree distribution is independent of the network size. In the figure we averaged over 1,000 independently generated random networks to decrease the noise.

Section 3.5
Real Networks are Not Poisson

As the degree of a node in a random network can vary between 0 and N-1, we must ask, how big are the differences between the node degrees in a particular realization of a random network? That is, can high degree nodes coexist with small degree nodes? We address these questions by estimating the size of the largest and the smallest node in a random network.

Let us assume that the world’s social network is described by the random network model. This random society may not be as far fetched as it first sounds: There is significant randomness in whom we meet and whom we choose to become acquainted with.

Sociologists estimate that a typical person knows about 1,000 individuals on a first name basis, prompting us to assume that ‹k› ≈ 1,000. Using the results obtained so far about random networks, we arrive to a number of intriguing conclusions about a random society of N ≈ 7 x 10⁹ of individuals (ADVANCED TOPICS 3.B):

The most connected individual (the largest degree node) in a random society is expected to have k_max = 1,185 acquaintances.
The degree of the least connected individual is k_min = 816, not that different from k_max or ‹k›.
The dispersion of a random network is σ_k = ‹k›^1/2 , which for ‹k› = 1,000 is σ_k = 31.62. This means that the number of friends a typical individual has is in the ‹k› ± σ_k range, or between 968 and 1,032, a rather narrow window.

Taken together, in a random society all individuals are expected to have a comparable number of friends. Hence if people are randomly connected to each other, we lack outliers: There are no highly popular individuals, and no one is left behind, having only a few friends. This suprising conclusion is a consequence of an important property of random networks: in a large random network the degree of most nodes is in the narrow vicinity of ‹k›

This prediction blatantly conflicts with reality. Indeed, there is extensive evidence of individuals who have considerably more than 1,185 acquaintances. For example, US president Franklin Delano Roosevelt’s appointment book has about 22,000 names, individuals he met personally [16, 17]. Similarly, a study of the social network behind Facebook has documented numerous individuals with 5,000 Facebook friends, the maximum allowed by the social networking platform [18]. To understand the origin of these discrepancies we must compare the degree distribution of real and random networks.

Degree Distribution of Real Networks. — Image 3.6
Degree Distribution of Real Networks
The degree distribution of the (a) Internet, (b) science collaboration network, and (c) protein interaction network (Table 2.1). The green line corresponds to the Poisson prediction, obtained by measuring ‹k› for the real network and then plotting (3.8). The significant deviation between the data and the Poisson fit indicates that the random network model underestimates the size and the frequency of the high degree nodes, as well as the number of low degree nodes. Instead the random network model predicts a larger number of nodes in the vicinity of ‹k› than seen in real networks.

Section 3.6
The Evolution of a Random Network

The cocktail party we encountered at the beginning of this chapter captures a dynamical process: Starting with N isolated nodes, the links are added gradually through random encounters between the guests. This corresponds to a gradual increase of p, with striking consequences on the network topology (Video 3.2). To quantify this process, we first inspect how the size of the largest connected cluster within the network, N_G, varies with ‹k›. Two extreme cases are easy to understand:

For p = 0 we have ‹k› = 0, hence all nodes are isolated. Therefore the largest component has size N_G = 1 and N_G/N→0 for large N.
For p = 1 we have ‹k›= N-1, hence the network is a complete graph and all nodes belong to a single component. Therefore N_G = N and N_G/N = 1.

Video 3.2
Evolution of a Random Network
A video showing the change in the structure of a random network with increasing p. It vividly illustrates the absence of a giant component for small p and its sudden emergence once p reaches a critical value.

One would expect that the largest component grows gradually from N_G = 1 to N_G = N if ‹k› increases from 0 to N-1. Yet, as Image 3.7a indicates, this is not the case: N_G/N remains zero for small ‹k›, indicating the lack of a large cluster. Once ‹k› exceeds a critical value, N_G/N increases, signaling the rapid emergence of a large cluster that we call the giant component. Erdős and Rényi in their classical 1959 paper predicted that the condition for the emergence of the giant component is [2]

$\left\langle k \right\rangle = 1 \hspace{20 mm} (3 . 10)$

In other words, we have a giant component if and only if each node has on average more than one link (ADVANCED TOPICS 3.C).

The fact that we need at least one link per node to observe a giant component is not unexpected. Indeed, for a giant component to exist, each of its nodes must be linked to at least one other node. It is somewhat counterintuitive, however, that one link is sufficient for its emergence.

We can express (3.10) in terms of p using (3.3), obtaining

$p_c = \frac{1}{{N - 1}} \approx \frac{1}{N} \hspace{20 mm} (3 . 11)$

Therefore the larger a network, the smaller p is sufficient for the giant component.

The emergence of the giant component is only one of the transitions characterizing a random network as we change ‹k›. We can distinguish four topologically distinct regimes (Image 3.7a), each with its unique characteristics:

Subcritical Regime: 0 ‹ ‹k› ‹ 1 (p ‹ 1/N , Image 3.7b).

For ‹k› = 0 the network consists of N isolated nodes. Increasing ‹k› means that we are adding N‹k› = pN(N-1)/2 links to the network. Yet, given that ‹k› ‹ 1, we have only a small number of links in this regime, hence we mainly observe tiny clusters (Image 3.7b).

We can designate at any moment the largest cluster to be the giant component. Yet in this regime the relative size of the largest cluster, N_G/N, remains zero. The reason is that for ‹k› ‹ 1 the largest cluster is a tree with size N_G ~ lnN, hence its size increases much slower than the size of the network. Therefore N_G/N ≃ lnN/N→0 in the N→∞ limit.

In summary, in the subcritical regime the network consists of numerous tiny components, whose size follows the exponential distribution (3.35). Hence these components have comparable sizes, lacking a clear winner that we could designate as a giant component.

Critical Point: ‹k› = 1 (p = 1/N , Image 3.7c).

The critical point separates the regime where there is not yet a giant component (‹k› ‹ 1) from the regime where there is one (‹k› › 1). At this point the relative size of the largest component is still zero (Image 3.7c). Indeed, the size of the largest component is N_G ~ N^2/3. Consequently N_G grows much slower than the network’s size, so its relative size decreases as N_G/N ~ N ^-1/3 in the N→∞ limit.

Note, however, that in absolute terms there is a significant jump in the size of the largest component at ‹k› = 1. For example, for a random network with N = 7 ×10⁹ nodes, comparable to the globe’s social network, for ‹k› ‹ 1 the largest cluster is of the order of N_G ≃ lnN = ln (7 ×109) ≃ 22.7. In contrast at ‹k› = 1 we expect N_G ~ N^2/3 = (7 ×10⁹)^2/3 ≃ 3 ×10⁶, a jump of about five orders of magnitude. Yet, both in the subcritical regime and at the critical point the largest component contains only a vanishing fraction of the total number of nodes in the network.

In summary, at the critical point most nodes are located in numerous small components, whose size distribution follows (3.36). The power law form indicates that components of rather different sizes coexist. These numerous small components are mainly trees, while the giant component may contain loops. Note that many properties of the network at the critical point resemble the properties of a physical system undergoing a phase transition (ADVANCED TOPICS 3.F).

Evolution of a Random Network. — Image 3.7
Evolution of a Random Network

The relative size of the giant component in function of the average degree ‹k› in the Erdős-Rényi model. The figure illustrates the phase tranisition at ‹k› = 1, responsible for the emergence of a giant component with nonzero *N_G*

A sample network and its properties in the four regimes that characterize a random network.

Supercritical Regime: ‹k› › 1 (p › 1/N , Image 3.7d).

This regime has the most relevance to real systems, as for the first time we have a giant component that looks like a network. In the vicinity of the critical point the size of the giant component varies as

$\frac{{N_G }}{N} \sim \left\langle k \right\rangle - 1 \hspace{20 mm} (3 . 12)$

$N_G \sim (p - p_c )N \hspace{20 mm} (3 . 13)$

where p_c is given by (3.11). In other words, the giant component contains a finite fraction of the nodes. The further we move from the critical point, a larger fraction of nodes will belong to it. Note that (3.12) is valid only in the vicinity of ‹k› = 1. For large ‹k› the dependence between N_G and ‹k› is nonlinear (Image 3.7a).

In summary in the supercritical regime numerous isolated components coexist with the giant component, their size distribution following (3.35). These small components are trees, while the giant component contains loops and cycles. The supercritical regime lasts until all nodes are absorbed by the giant component.

Connected Regime: ‹k› › lnN (p › lnN/N , Image 3.7e).

For sufficiently large p the giant component absorbs all nodes and components, hence N_G≃ N. In the absence of isolated nodes the network becomes connected. The average degree at which this happens depends on N as (ADVANCED TOPIC 3.E)

$\left\langle k \right\rangle = \ln N \hspace{20 mm} (3 . 14)$

Note that when we enter the connected regime the network is still relatively sparse, as lnN / N → 0 for large N. The network turns into a complete graph only at ‹k› = N - 1.

In summary, the random network model predicts that the emergence of a network is not a smooth, gradual process: The isolated nodes and tiny components observed for small ‹k› collapse into a giant component through a phase transition (ADVANCED TOPICS 3.F). As we vary ‹k› we encounter four topologically distinct regimes (Image 3.7).

The discussion offered above follows an empirical perspective, fruitful if we wish to compare a random network to real systems. A different perspective, with its own rich behavior, is offered by the mathematical literature (BOX 3.5).

Box 3.5
Network Evolution in Graph Theory

In the random graph literature it is often assumed that the connection probability p(N) scales as N^z, where z is a tunable parameter between -∞ and 0 [15]. In this language Erdős and Rényi discovered that as we vary z, key properties of random graphs appear quite suddenly

A graph has a given property Q if the probability of having Q approaches 1 as N → ∞. That is, for a given z either almost every graph has the property Q or almost no graph has it. For example, for z less than -3/2 almost all graphs contain only isolated nodes and pairs of nodes connected by a link. Once z exceeds -3/2, most networks will contain paths connecting three or more nodes (Image 3.8).

Image 3.8
Evolution of a Random Graph
The threshold probabilities at which different subgraphs appear in a random graph, as defined by the exponent z in the p(N) ~ N^z relationship. For z ‹ -3/2 the graph consists of isolated nodes and edges. When z passes -3/2 trees of order 3 appear, while at z = -4/3 trees of order 4 appear. At z = 1 trees of all orders are present, together with cycles of all orders. Complete subgraphs of order 4 appear at z =-2/3, and as z increases further, complete subgraphs of larger and larger order emerge. After [19].

Section 3.7
Real Networks are Supercritical

Two predictions of random network theory are of direct importance for real networks:

Once the average degree exceeds ‹k› = 1, a giant component should emerge that contains a finite fraction of all nodes. Hence only for ‹k› › 1 the nodes organize themselves into a recognizable network.
For ‹k› › lnN all components are absorbed by the giant component, resulting in a single connected network.

Do real networks satisfy the criteria for the existence of a giant component, i.e. ‹k› › 1? And will this giant component contain all nodes for ‹k› › lnN, or will we continue to see some disconnected nodes and components? To answer these questions we compare the structure of a real network for a given ‹k› with the theoretical predictions discussed above.

The measurements indicate that real networks extravagantly exceed the ‹k› = 1 threshold. Indeed, sociologists estimate that an average person has around 1,000 acquaintances; a typical neuron is the human brain has about 7,000 synapses; in our cells each molecule takes part in several chemical reactions.

This conclusion is supported by Table 3.1, that lists the average degree of several undirected networks, in each case finding ‹k› › 1. Hence the average degree of real networks is well beyond the ‹k› = 1 threshold, implying that they all have a giant component. The same is true for the reference networks listed in Table 3.1.

Network	N	L	‹K›	lnN
Internet	192,244	609,066	6.34	12.17
Power Grid	4,941	6,594	2.67	8.51
Science Collaboration	23,133	94,437	8.08	10.05
Actor Network	702,388	29,397,908	83.71	13.46
Protein Interactions	2,018	2,930	2.90	7.61

Table 3.1
Are Real Networks Connected?
The number of nodes N and links L for the undirected networks of our reference network list of Table 3.1, shown together with ‹k› and lnN. A giant component is expected for ‹k› › 1 and all nodes should join the giant component for ‹k› › lnN. While for all networks ‹k› › 1, for most ‹k› is under the lnN threshold (see also Image 3.9).

Let us now turn to the second prediction, inspecting if we have single component (i.e. if ‹k› › lnN), or if the network is fragmented into multiple components (i.e. if ‹k› ‹ lnN). For social networks the transition between the supercritical and the fully connected regime should be at ‹k› › ln(7×10⁹) ≈ 22.7. That is, if the average individual has more than two dozens acquaintances, then a random society must have a single component, leaving no individual disconnected. With ‹k› ≈ 1,000 this condition is clearly satisfied. Yet, according to Table 3.1 many real networks do not obey the fully connected criteria. Consequently, according to random network theory these networks should be fragmented into several disconnected components. This is a disconcerting prediction for the Internet, indicating that some routers should be disconnected from the giant component, being unable to communicate with other routers. It is equally problematic for the power grid, indicating that some consumers should not get power. These predictions are clearly at odds with reality.

In summary, we find that most real networks are in the supercritical regime (Image 3.9). Therefore these networks are expected to have a giant component, which is in agreement with the observations. Yet, this giant component should coexist with many disconnected components, a prediction that fails for several real networks. Note that these predictions should be valid only if real networks are accurately described by the Erdős-Rényi model, i.e. if real networks are random. In the coming chapters, as we learn more about the structure of real networks, we will understand why real networks can stay connected despite failing the ‹k› › lnNcriteria.

Most Real Networks are Supercritical. — Image 3.9
Most Real Networks are Supercritical
The four regimes predicted by random network theory, marking with a cross the location (‹k›) of the undirected networks listed in Table 3.1. The diagram indicates that most networks are in the supercritical regime, hence they are expected to be broken into numerous isolated components. Only the actor network is in the connected regime, meaning that all nodes are part of a single giant component. Note that while the boundary between the subcritical and the supercritical regime is always at ‹k› = 1, the boundary between the supercritical and the connected regime is at lnN, which varies from system to system.

Section 3.8
Small Worlds

The small world phenomenon, also known as six degrees of separation, has long fascinated the general public. It states that if you choose any two individuals anywhere on Earth, you will find a path of at most six acquaintances between them (Image 3.10). The fact that individuals who live in the same city are only a few handshakes from each other is by no means surprising. The small world concept states, however, that even individuals who are on the opposite side of the globe can be connected to us via a few acquaintances.

Six Deegree of Separation. — Image 3.10
Six Deegree of Separation
According to six degrees of separation two individuals, anywhere in the world, can be connected through a chain of six or fewer acquaintances. This means that while Sarah does not know Peter, she knows Ralph, who knows Jane and who in turn knows Peter. Hence Sarah is three handshakes, or three degrees from Peter. In the language of network science six degrees, also called the small world property, means that the distance between any two nodes in a network is unexpectedly small.

In the language of network science the small world phenomenon implies that the distance between two randomly chosen nodes in a network is short. This statement raises two questions: What does short (or small) mean, i.e. short compared to what? How do we explain the existence of these short distances?

Both questions are answered by a simple calculation. Consider a random network with average degree ‹k›. A node in this network has on average:

‹k› nodes at distance one (d=1).
‹k›² nodes at distance two (d=2).
‹k›³ nodes at distance three (d =3).
...
‹k›^d nodes at distance d.

For example, if ‹k› ≈ 1,000, which is the estimated number of acquaintences an individual has, we expect 10⁶ individuals at distance two and about a billion, i.e. almost the whole earth’s population, at distance three from us.

To be precise, the expected number of nodes up to distance d from our starting node is

$N(d) \approx 1 + \left\langle k \right\rangle + \left\langle k \right\rangle ^2 + ... + \left\langle k \right\rangle ^d = \frac{{\left\langle k \right\rangle ^{d + 1} - 1}}{{\left\langle k \right\rangle - 1}} \hspace{20 mm} (3 . 15)$

N(d) must not exceed the total number of nodes, N, in the network. Therefore the distances cannot take up arbitrary values. We can identify the maximum distance, d_max, or the network’s diameter by setting

$N(d_{\max } ) \approx N \hspace{20 mm} (3 . 16)$

Assuming that ‹k› » 1, we can neglect the (-1) term in the nominator and the denominator of (3.15), obtaining

$\left\langle k \right\rangle ^{d_{\max } } \approx N \hspace{20 mm} (3 . 17)$

Therefore the diameter of a random network follows

$d_{\max } \approx \frac{{\ln N}}{{\ln \left\langle k \right\rangle }} \hspace{20 mm} (3 . 18)$

which represents the mathematical formulation of the small world phenomenon. The key, however is its interpretation:

As derived, (3.18) predicts the scaling of the network diameter, d_max, with the size of the system, N. Yet, for most networks (3.18) offers a better approximation to the average distance between two randomly chosen nodes, ‹d›, than to d_max (Table 3.2). This is because d_max is often dominated by a few extreme paths, while ‹d› is averaged over all node pairs, a process that supresses the fluctuations. Hence typically the small world property is defined by $\left\langle d \right\rangle \approx \frac{{\ln N}}{{\ln \left\langle k \right\rangle }} \hspace{20 mm} (3 . 19)$ describing the dependence of the average distance in a network on N and ‹k›.
In general lnN « N, hence the dependence of ‹d› on lnN implies that the distances in a random network are orders of magnitude smaller than the size of the network. Consequently by small in the "small world phenomenon" we mean that the average path length or the diameter depends logarithmically on the system size. Hence, “small” means that ‹d› is proportional to lnN, rather than N or some power of N (Image 3.11).
The 1/ln ‹k› term implies that the denser the network, the smaller is the distance between the nodes.
In real networks there are systematic corrections to (3.19), rooted in the fact that the number of nodes at distance d › ‹d› drops rapidly (ADVANCED TOPICS 3.F).

Image 3.11
Why are Small Worlds Surprising?

Much of our intuition about distance is based on our experience with regular lattices, which do not display the small world property:

**1D:** For a one-dimensional lattice (a line of length N) the diameter and the average path length scale linearly with N: *d_max*~‹d› ~N.

**2D:** For a square lattice *d_max*~‹d› ~ N^1/2.

**3D:** For a cubic lattice *d_max*~‹d› ~ N^1/3.

**4D:** In general, for a d-dimensional lattice *d_max*~‹d› ~ N^1/d.

These polynomial dependences predict a much faster increase with N than (3.19), indicating that in lattices the path lengths are significantly longer than in a random network. For example, if the social network would form a square lattice (2D), where each individual knows only its neighbors, the average distance between two individuals would be roughly (7 ×10⁹)^1/2 = 83,666. Even if we correct for the fact that a person has about 1,000 acquaintances, not four, the average separation will be orders of magnitude larger than predicted by (3.19).

The figure shows the predicted N-dependence of ‹d› for regular and random networks on a linear scale.

The same as in (a), but shown on a log-log scale.

Let us illustrate the implications of (3.19) for social networks. Using N ≈ 7 ×10⁹ and ‹k› ≈ 10³, we obtain

$\left\langle d \right\rangle \approx \frac{{\ln 7 \times 10^9 }}{{\ln (10^3 )}} = 3.28 \hspace{20 mm} (3 . 20)$

Therefore, all individuals on Earth should be within three to four handshakes of each other [20]. The estimate (3.20) is probably closer to the real value than the frequently quoted six degrees (BOX 3.7).

Much of what we know about the small world property in random networks, including the result (3.19), is in a little known paper by Manfred Kochen and Ithiel de Sola Pool [20], in which they mathematically formulated the problem and discussed in depth its sociological implications. This paper inspired the well known Milgram experiment (BOX 3.6), which in turn inspired the six-degrees of separation phrase.

While discovered in the context of social systems, the small world property applies beyond social networks (BOX 3.6). To demonstrate this in Table 3.2 we compare the prediction of (3.19) with the average path length ‹d› for several real networks, finding that despite the diversity of these systems and the significant differences between them in terms of N and ‹k›, (3.19) offers a good approximation to the empirically observed ‹d›.

Box 3.6
19 Degrees of Separation

How many clicks do we need to reach a randomly chosen document on the Web? The difficulty in addressing this question is rooted in the fact that we lack a complete map of the WWW—we only have access to small samples of the full map. We can start, however, by measuring the WWW’s average path length in samples of increasing sizes, a procedure called finite size scaling. The measurements indicate that the average path length of the WWW increases with the size of the network as [21]

$\left\langle d \right\rangle \approx 0.35 + 0.89\ln N$

In 1999 the WWW was estimated to have about 800 million documents [22], in which case the above equation predicts ‹d›≈18.69. In other words in 1999 two randomly chosen documents were on average 19 clicks from each other, a result that became known as 19 degrees of separation. Subsequent measurements on a sample of 200 million documents found ‹d›≈16 [23], in good agreement with the ‹d›≈17 prediction. Currently the WWW is estimated to have about trillion nodes (N~10¹²), in which case the formula predicts ‹d›≈25. Hence ‹d› is not fixed but as the network grows, so does the distance between two documents.

The average path length of 25 is much larger than the proverbial six degrees (BOX 3.7). The difference is easy to understand: The WWW has smaller average degree and larger size than the social network. According to (3.19) both of these differences increase the Web’s diameter.

In summary the small world property has not only ignited the public’s imagination (BOX 3.8), but plays an important role in network science as well. The small world phenomena can be reasonably well understood in the context of the random network model: It is rooted in the fact that the number of nodes at distance d from a node increases exponentially with d. In the coming chapters we will see that in real networks we encounter systematic deviations from (3.19), forcing us to replace it with more accurate predictions. Yet the intuition offered by the random network model on the origin of the small world phenomenon remains valid.

Network	N	L	‹k›	‹d›	d_max	lnN/ln‹k›
Internet	192,244	609,066	6.34	6.98	26	6.58
WWW	325,729	1,497,134	4.60	11.27	93	8.31
Power Grid	4,941	6,594	2.67	18.99	46	8.66
Mobile-Phone Calls	36,595	91,826	2.51	11.72	39	11.42
Email	57,194	103,731	1.81	5.88	18	18.4
Science Collaboration	23,133	93,437	8.08	5.35	15	4.81
Actor Network	702,388	29,397,908	83.71	3.91	14	3.04
Citation Network	449,673	4,707,958	10.43	11.21	42	5.55
E. Coli Metabolism	1,039	5,802	5.58	2.98	8	4.04
Protein Interactions	2,018	2,930	2.90	5.61	14	7.14

Table 3.2
Six Degrees of Separation
The average distance ‹d› and the maximum distance d_max for the ten reference networks. The last column provides ‹d› predicted by (3.19), indicating that it offers a reasonable approximation to the measured ‹d›. Yet, the agreement is not perfect - we will see in the next chapter that for many real networks (3.19) needs to be adjusted. For directed networks the average degree and the path lengths are measured along the direction of the links.

Box 3.7
Six Degrees: Experimental Confirmation

The first empirical study of the small world phenomena took place in 1967, when Stanley Milgram, building on the work of Pool and Kochen [20], designed an experiment to measure the distances in social networks [24, 25]. Milgram chose a stock broker in Boston and a divinity student in Sharon, Massachusetts as targets. He then randomly selected residents of Wichita and Omaha, sending them a letter containing a short summary of the study’s purpose, a photograph, the name, address and information about the target person. They were asked to forward the letter to a friend, relative or acquantance who is most likely to know the target person.

Within a few days the first letter arrived, passing through only two links. Eventually 64 of the 296 letters made it back, some, however, requiring close to a dozen intermediates [25]. These completed chains allowed Milgram to determine the number of individuals required to get the letter to the target (Image 3.12a). He found that the median number of intermediates was 5.2, a relatively small number that was remarkably close to Frigyes Karinthy’s 1929 insight (BOX 3.8).

Image 3.12
Six Degrees? From Milgram to Facebook

In Milgram's experiment 64 of the 296 letters made it to the recipient. The figure shows the length distribution of the completed chains, indicating that some letters required only one intermediary, while others required as many as ten. The mean of the distribution was 5.2, indicating that on average six ‘handshakes’ were required to get a letter to its recipient. The playwright John Guare renamed this ‘six degrees of separation’ two decades later. After [25].
The distance distribution, p_d , for all pairs of Facebook users worldwide and within the US only.Using Facebook’s N and L (3.19) predicts the average degree to be approximately 3.90, not far from the reported four degrees. After [18].

Milgram lacked an accurate map of the full acquaintance network, hence his experiment could not detect the true distance between his study’s participants. Today Facebook has the most extensive social network map ever assembled. Using Facebook’s social graph of May 2011, consisting of 721 million active users and 68 billion symmetric friendship links, researchers found an average distance 4.74 between the users (Image 3.12b). Therefore, the study detected only ‘four degrees of separation’ [18], closer to the prediction of (3.20) than to Milgram’s six degrees [24, 25].

“I asked a person of intelligence how many steps he thought it would take, and he said that it would require 100 intermediate persons, or more, to move from Nebraska to Sharon.”

Stanley Milgram, 1969

Section 3.9
Clustering Coefficient

The degree of a node contains no information about the relationship between a node's neighbors. Do they all know each other, or are they perhaps isolated from each other? The answer is provided by the local clustering coefficient C_i, that measures the density of links in node i’s immediate neighborhood: C_i = 0 means that there are no links between i’s neighbors; C_i = 1 implies that each of the i’s neighbors link to each other (SECTION 2.10).

To calculate C_i for a node in a random network we need to estimate the expected number of links L_i between the node’s k_i neighbors. In a random network the probability that two of i’s neighbors link to each other is p. As there are k_i(k_i - 1)/2 possible links between the k_i neighbors of node i, the expected value of L_i is

$\left\langle {L_i } \right\rangle = p\frac{{k_i (k_i - 1)}}{2} \hspace{20 mm} (3 . 20)$

Thus the local clustering coefficient of a random network is

$C_i = \frac{{2\left\langle {L_i } \right\rangle }}{{k_i (k_i - 1)}} = p = \frac{{\left\langle k \right\rangle }}{N} \hspace{20 mm} (3 . 21)$

Equation (3.21) makes two predictions:

For fixed ‹k›, the larger the network, the smaller is a node’s clustering coefficient. Consequently a node's local clustering coefficient C_i is expected to decrease as 1/N. Note that the network's average clustering coefficient, ‹C› also follows (3.21).
The local clustering coefficient of a node is independent of the node’s degree.

To test the validity of (3.21) we plot ‹C›/‹k› in function of N for several undirected networks (Image 3.13a). We find that ‹C›/‹k› does not decrease as N^-1, but it is largely independent of N, in violation of the prediction (3.21) and point (1) above. In Image 3.13b-d we also show the dependency of C on the node’s degree k_i for three real networks, finding that C(k) systematically decreases with the degree, again in violation of (3.21) and point (2).

In summary, we find that the random network model does not capture the clustering of real networks. Instead real networks have a much higher clustering coefficient than expected for a random network of similar N and L. An extension of the random network model proposed by Watts and Strogatz [29] addresses the coexistence of high ‹C› and the small world property (BOX 3.9). It fails to explain, however, why high-degree nodes have a smaller clustering coefficient than low-degree nodes. Models explaining the shape of C(k) are discussed in Chapter 9.

Clustering in Real Networks. — Image 3.13
Clustering in Real Networks

Comparing the average clustering coefficient of real networks with the prediction (3.21) for random networks. The circles and their colors correspond to the networks of Table 3.2. Directed networks were made undirected to calculate ‹C› and ‹k›. The green line corresponds to (3.21), predicting that for random networks the average clustering coefficient decreases as N^-1. In contrast, for real networks ‹C› appears to be independent of N.

The dependence of the local clustering coefficient, *C(k)*, on the node’s degree for (b) the Internet, (c) science collaboration network and (d) protein interaction network. *C(k)* is measured by averaging the local clustering coefficient of all nodes with the same degree k. The green horizontal line corresponds to ‹C›.

Box 3.9
Watts-Strogatz Model

Duncan Watts and Steven Strogatz proposed an extension of the random network model (Image 3.14) motivated by two observations [29]:

Small World Property
In real networks the average distance between two nodes depends logarithmically on N (3.18), rather than following a polynomial expected for regular lattices (Image 3.11).
High Clustering
The average clustering coefficient of real networks is much higher than expected for a random network of similar N and L (Image 3.13a).

The Watts-Strogatz model (also called the small-world model) interpolates between a regular lattice, which has high clustering but lacks the small-world phenomenon, and a random network, which has low clustering, but displays the small-world property (Image 3.14a-c). Numerical simulations indicate that for a range of rewiring parameters the model's average path length is low but the clustering coefficient is high, hence reproducing the coexistence of high clustering and small-world phenomena (Image 3.14d).

Being an extension of the random network model, the Watts- Strogatz model predicts a Poisson-like bounded degree distribution. Consequently high degree nodes, like those seen in Image 3.6, are absent from it. Furthermore it predicts a k-independent C(k), being unable to recover the k-dependence observed in Image 3.13b-d. As we show in the next chapters, understanding the coexistence of the small world property with high clustering must start from the network's correct degree distribution.

Image 3.14
The Watts-Strogatz Model

We start from a ring of nodes, each node being connected to their immediate and next neighbors. Hence initially each node has ‹C› = 3/4 (p = 0).
With probability p each link is rewired to a randomly chosen node. For small p the network maintains high clustering but the random long-range links can drastically decrease the distances between the nodes.
For p = 1 all links have been rewired, so the network turns into a random network.
The dependence of the average path length d(p) and clustering coefficient ‹C(p)› on the rewiring parameter p. Note that d(p) and ‹C(p)› have been normalized by d(0) and ‹C(0)› obtained for a regular lattice (i.e. for p=0 in (a)). The rapid drop in d(p) signals the onset of the small-world phenomenon. During this drop, ‹C(p)› remains high. Hence in the range 0.001‹p‹0.1 short path lengths and high clustering coexist in the network. All graphs have N=1000 and ‹k›=10. After [29].

Section 3.10
Summary: Real Networks are Not Random

Since its introduction in 1959 the random network model has dominated mathematical approaches to complex networks. The model suggests that the random-looking networks observed in complex systems should be described as purely random. With that it equated complexity with randomness. We must therefore ask:

Do we really believe that real networks are random?

The answer is clearly no. As the interactions between our proteins are governed by the strict laws of biochemistry, for the cell to function its chemical architecture cannot be random. Similarly, in a random society an American student would be as likely to have among his friends Chinese factory workers than one of her classmates.

In reality we suspect the existence of a deep order behind most complex systems. That order must be reflected in the structure of the network that describes their architecture, resulting in systematic deviations from a pure random configuration.

The degree to which random networks describe, or fail to describe, real systems, must not be decided by epistemological arguments, but by a systematic quantitative comparison. We can do this, taking advantage of the fact that random network theory makes a number of quantitative predictions:

Degree Distribution
A random network has a binomial distribution, well approximated by a Poisson distribution in the k « N limit. Yet, as shown in Image 3.5, the Poisson distribution fails to capture the degree distribution of real networks. In real systems we have more highly connected nodes than the random network model could account for.

Connectedness
Random network theory predicts that for ‹k› › 1 we should observe a giant component, a condition satisfied by all networks we examined. Most networks, however, do not satisfy the ‹k› › lnN condition, implying that they should be broken into isolated clusters (Table 3.1). Some networks are indeed fragmented, most are not.

Average Path Length
Random network theory predicts that the average path length follows (3.19), a prediction that offers a reasonable approximation for the observed path lengths. Hence the random network model can account for the emergence of small world phenomena.

Clustering Coefficient
In a random network the local clustering coefficient is independent of the node’s degree and ‹C› depends on the system size as 1/N. In contrast, measurements indicate that for real networks C(k) decreases with the node degrees and is largely independent of the system size (Image 3.13).

Taken together, it appears that the small world phenomena is the only property reasonably explained by the random network model. All other network characteristics, from the degree distribution to the clustering coefficient, are significantly different in real networks. The extension of the Erdős-Rényi model proposed by Watts and Strogatz successfully predicts the coexistence of high C and low ‹d›, but fails to explain the degree distribution and C(k). In fact, the more we learn about real networks, the more we will arrive at the startling conclusion that we do not know of any real network that is accurately described by the random network model.

This conclusion begs a legitimate question: If real networks are not random, why did we devote a full chapter to the random network model? The answer is simple: The model serves as an important reference as we proceed to explore the properties of real networks. Each time we observe some network property we will have to ask if it could have emerged by chance. For this we turn to the random network model as a guide: If the property is present in the model, it means that randomness can account for it. If the property is absent in random networks, it may represent some signature of order, requiring a deeper explanation. So, the random network model may be the wrong model for most real systems, but it remains quite relevant for network science (BOX 3.10).

Box 3.10
Random Networks and Network Science

The lack of agreement between random and real networks raises an important question: How could a theory survive so long given its poor agreement with reality? The answer is simple: Random network theory was never meant to serve as a model of real systems.

Erdős and Rényi write in their first paper [2] that random networks “may be interesting not only from a purely mathematical point of view. In fact, the evolution of graphs may be considered as a rather simplified model of the evolution of certain communication nets (railways, road or electric network systems, etc.) of a country or some unit.” Yet, in the string of eight papers authored by them on the subject [2-9], this is the only mention of the potential practical value of their approach. The subsequent development of random graphs was driven by the problem's inherent mathematical challenges, rather than its applications.

It is tempting to follow Thomas Kuhn and view network science as a paradigm change from random graphs to a theory of real networks [30]. In reality, there was no network paradigm before the end of 1990s. This period is characterized by a lack of systematic attempts to compare the properties of real networks with graph theoretical models. The work of Erdős and Rényi has gained prominence outside mathematics only after the emergence of network science (Image 3.15).

Network theory does not lessen the contributions of Erdős and Rényi, but celebrates the unintended impact of their work. When we discuss the disrepacies between random and real networks, we do so mainly for pedagogical reasons: to offer a proper foundation on which we can understand the properties of real systems.

Image 3.15
Network Science and Random Networks

While today we perceive the Erdős-Rényi model as the cornerstone of network theory, the model was hardly known outside a small subfield of mathematics. This is illustrated by the yearly citations of the first two papers by Erdős and Rényi, published in 1959 and 1960 [2,3]. For four decades after their publication the papers gathered less than 10 citations each year. The number of citations exploded after the first papers on scale-free networks [21, 31, 32] have turned Erdős and Rényi’s work into the reference model of network theory.

Section 3.11
Homework

Erdős-Rényi Networks
Consider an Erdős-Rényi network with N = 3,000 nodes, connected to each other with probability p = 10^–3.
1. What is the expected number of links, 〈L〉?
2. In which regime is the network?
3. Calculate the probability p_c so that the network is at the critical point
4. Given the linking probability p = 10^–3, calculate the number of nodes N^cr so that the network has only one component.
5. For the network in (d), calculate the average degree 〈k^cr〉 and the average distance between two randomly chosen nodes 〈d〉.
6. Calculate the degree distribution p_k of this network (approximate with a Poisson degree distribution).
Generating Erdős-Rényi Networks
Relying on the G(N, p) model, generate with a computer three networks with N = 500 nodes and average degree (a) 〈k〉 = 0.8, (b) 〈k〉 = 1 and (c) 〈k〉 = 8. Visualize these networks.
Circle Network
Consider a network with N nodes placed on a circle, so that each node connects to m neighbors on either side (consequently each node has degree 2m). Image 3.14(a) shows an example of such a network with m = 2 and N = 20. Calculate the average clustering coefficient 〈C〉 of this network and the average shortest path 〈d〉. For simplicity assume that N and m are chosen such that (n-1)/2m is an integer. What happens to 〈C〉 if N≫1? And what happens to 〈d〉?
Cayley Tree
A Cayley tree is a symmetric tree, constructed starting from a central node of degree k. Each node at distance d from the central node has degree k, until we reach the nodes at distance P that have degree one and are called leaves (see Image 3.16 for a Cayley tree with k = 3 and P = 5.)
1. Calculate the number of nodes reachable in t steps from the central node.
2. Calculate the degree distribution of the network.
3. Calculate the diameter d_max.
4. Find an expression for the diameter d_max in terms of the total number of nodes N.
5. Does the network display the small-world property?
Snobbish Network
Consider a network of N red and N blue nodes. The probability that there is a link between nodes of identical color is p and the probability that there is a link between nodes of different color is q. A network is snobbish if p › q, capturing a tendency to connect to nodes of the same color. For q = 0 the network has at least two components, containing nodes with the same color.
1. Calculate the average degree of the "blue" subnetwork made of only blue nodes, and the average degree in the full network.
2. Determine the minimal p and q required to have, with high probability, just one component.
3. Show that for large N even very snobbish networks (p≫q) display the small-world property.
Snobbish Social Networks
Consider the following variant of the model discussed above: We have a network of 2N nodes, consisting of an equal number of red and blue nodes, while an f fraction of the 2N nodes are purple. Blue and red nodes do not connect to each other (q = 0), while they connect with probability p to nodes of the same color. Purple nodes connect with the same probability p to both red and blue nodes
1. We call the red and blue communities interactive if a typical red node is just two steps away from a blue node and vice versa. Evaluate the fraction of purple nodes required for the communities to be interactive.
2. Comment on the size of the purple community if the average degree of the blue (or red) nodes is 〈k〉≫1.
3. What are the implications of this model for the structure of social (and other) networks?

Cayley Tree. — Image 3.16
Cayley Tree
A Cayley Tree With k = 3 and P = 5.

Section 3.13
Advanced Topic 3.B
Maximum and Minimum Degrees

To determine the expected degree of the largest node in a random network, called the network’s upper natural cutoff, we define the degree k_max such that in a network of N nodes we have at most one node with degree higher than k_max . Mathematically this means that the area behind the Poisson distribution p_k for k ≥ k_max should be approximately one (Image 3.17). Since the area is given by 1-P(k_max), where P(k) is the cumulative degree distribution of p_k, the network’s largest node satisfies:

$N\left[ {1 - P\left( {k_{\max } } \right)} \right] \approx 1 \hspace{20 mm} (3 . 26)$

We write ≈ instead of =, because k_max is an integer, so in general the exact equation does not have a solution. For a Poisson distribution

$1 - P(k_{\max } ) = 1 - e^{ - \left\langle k \right\rangle } \sum\limits_{k = 0}^{k_{\max } } {\frac{{\left\langle k \right\rangle ^k }}{{k!}}} = e^{ - \left\langle k \right\rangle } \sum\limits_{k = k_{\max } + 1}^\infty {\frac{{\left\langle k \right\rangle ^k }}{{k!}}} \approx e^{ - \left\langle k \right\rangle } \frac{{\left\langle k \right\rangle ^{k_{\max } + 1} }}{{(k_{\max } + 1)!}} \hspace{20 mm} (3 . 27)$

where in the last term we approximate the sum with its largest term.

For N = 10⁹ and 〈k〉 = 1,000, roughly the size and the average degree of the globe’s social network, (3.26) and (3.27) predict k_max = 1,185, indicating that a random network lacks extremely popular individuals, or hubs.

We can use a similar argument to calculate the expected degree of the smallest node, k_min. By requiring that there should be at most one node with degree smaller than k_min we can write

$NP(k_{\min } - 1) \simeq 1 \hspace{20 mm} (3 . 28)$

For the Erdős-Rényi network we have

$P(k_{\min } - 1) = e^{ - \left\langle k \right\rangle } \sum\limits_{k = 0}^{k_{\min } - 1} {\frac{{\left\langle k \right\rangle ^k }}{{k!}}} \hspace{20 mm} (3 . 29)$

Solving (3.28) with N = 10⁹ and 〈k〉 = 1,000 we obtain k_min = 816.

Minimum and Maximum Degree. — Image 3.17
Minimum and Maximum Degree
The estimated maximum degree of a network, k_max, is chosen so that there is at most one node whose degree is higher than k_max. This is often called the *natural upper cutoff* of a degree distribution. To calculate it, we need to set k_max such that the area under the degree distribution p_k for k > k_max equals 1/N, hence the total number of nodes expected in this region is exactly one. We follow a similar argument to determine the expected smallest degree, k_min.

Section 3.14
Advanced Topic 3.C
Giant Component

In this section we introduce the argument, proposed independently by Solomonoff and Rapoport [11], and by Erdős and Rényi [2], for the emergence of giant component at 〈k〉= 1 [33].

Let us denote with u = 1 - N_G/N the fraction of nodes that are not in the giant component (GC), whose size we take to be N_G. If node i is part of the GC, it must link to another node j, which must also be part of the GC. Hence if i is not part of the GC, that could happen for two reasons:

There is no link between i and j (probability for this is 1- p).
There is a link between i and j, but j is not part of the GC (probability for this is pu).

Therefore the total probability that i is not part of the GC via node j is 1 - p + pu. The probability that i is not linked to the GC via any other node is therefore (1 - p + pu)^{N - 1}, as there are N - 1 nodes that could serve as potential links to the GC for node i. As u is the fraction of nodes that do not belong to the GC, for any p and N the solution of the equation

$u = (1 - p + pu)^{N - 1} \hspace{20 mm} (3 . 30)$

provides the size of the giant component via N_G = N(1 - u). Using p = 〈k〉/(N - 1) and taking the logarithm of both sides, for 〈k〉 « N we obtain

$\ln u = (N - 1)\ln \left[ {1 - \frac{{\left\langle k \right\rangle }}{{N - 1}}(1 - u)} \right] \approx (N - 1)\left[ { - \frac{{\left\langle k \right\rangle }}{{N - 1}}(1 - u)} \right] = - \left\langle k \right\rangle (1 - u) \hspace{20 mm} (3 . 31)$

where we used the series expansion for ln(1+x).

Taking an exponential of both sides leads to u = exp[- 〈k〉(1 - u)]. If we denote with S the fraction of nodes in the giant component, S = N_G / N, then S = 1 - u and (3.31) results in

$S = 1 - e^{ - \left\langle k \right\rangle S} \hspace{20 mm} (3 . 32)$

This equation provides the size of the giant component S in function of 〈k〉 (Image 3.18). While (3.32) looks simple, it does not have a closed solution. We can solve it graphically by plotting the right hand side of (3.32) as a function of S for various values of 〈k〉. To have a nonzero solution, the obtained curve must intersect with the dotted diagonal, representing the left hand side of (3.32). For small 〈k〉 the two curves intersect each other only at S = 0, indicating that for small 〈k〉 the size of the giant component is zero. Only when 〈k〉 exceeds a threshold value, does a non-zero solution emerge.

To determine the value of 〈k〉 at which we start having a nonzero solution we take a derivative of (3.32), as the phase transition point is when the r.h.s. of (3.32) has the same derivative as the l.h.s. of (3.32), i.e. when

$\frac{d}{{dS}}\left( {1 - e^{ - \left\langle k \right\rangle S} } \right) = 1$ $\hspace{80 mm} (3 . 33)$ $\left\langle k \right\rangle e^{ - \left\langle k \right\rangle S} = 1$

Setting S = 0, we obtain that the phase transition point is at 〈k〉 = 1 (see also ADVANCED TOPICS 3.F).

Graphical Solution. — Image 3.18
Graphical Solution

The three purple curves correspond to y = 1-*exp*[ -*‹k›* S ] for *‹k›=0.5, 1, 1.5*. The green dashed diagonal corresponds y = S, and the intersection of the dashed and purple curves provides the solution to (3.32). For *‹k›=0.5* there is only one intersection at S = 0, indicating the absence of a giant component. The *‹k›=1.5* curve has a solution at S = 0.583 (green vertical line). The *‹k›=1* curve is precisely at the critical point, representing the separation between the regime where a nonzero solution for S exists and the regime where there is only the solution at S = 0.

The size of the giant component in function of *‹k›* as predicted by (3.32). After [33].

Section 3.15
Advanced Topic 3.D
Component Sizes

In Image 3.7 we explored the size of the giant component, leaving an important question open: How many components do we expect for a given ‹k›? What is their size distribution? The aim of this section is to discuss these topics.

Component Size Distribution
For a random network the probability that a randomly chosen node belongs to a component of size s (which is different from the giant component G) is [33]

$p_s \sim \frac{{\left( {s\left\langle k \right\rangle } \right)^{s - 1} }}{{s!}}e^{ - \left\langle k \right\rangle s} \hspace{20 mm} (3 . 34)$

Replacing ‹k›^s-1 with exp[(s-1) ln‹k›] and using the Stirling-formula

$s! = \sqrt {2\pi s} \left( {\frac{s}{e}} \right)^s$

for large s we obtain

$p_s \sim s^{ - 3/2} e^{ - (\left\langle k \right\rangle - 1)s + (s - 1)\ln \left\langle k \right\rangle } \hspace{20 mm} (3 . 35)$

Therefore the component size distribution has two contributions: a slowly decreasing power law term s^-3/2 and a rapidly decreasing exponential term e^{-(‹k›-1)s+(s-1)ln‹k›}. Given that the exponential term dominates for large s, (3.35) predicts that large components are prohibited. At the critical point, ‹k› = 1, all terms in the exponential cancel, hence p_s follows the power law

$p_s \sim s^{ - 3/2} \hspace{20 mm} (3 . 36)$

As a power law decreases relatively slowly, at the critical point we expect to observe clusters of widely different sizes, a property consistent with the behavior of a system during a phase transition (ADVANCED TOPICS 3.F). These predictions are supported by the numerical simulations shown in Image 3.19.

Component Size Distribution. — Image 3.19
Component Size Distribution
Component size distribution p_s in a random network, excluding the giant component.

p_s for different *‹k›* values and N, indicating that p_s converges for large N to the prediction (3.34).

p_s for N = 10⁴, shown for different *‹k›*. While for *‹k›* ‹ 1 and *‹k›* › 1 the p_s distribution has an exponential form, right at the critical point *‹k›* = 1 the distribution follows the power law (3.36). The continuous green lines correspond to (3.35). The first numerical study of the component size distribution in random networks was carried out in 1998 [34], preceding the exploding interest in complex networks.

Average Component Size
The calculations also indicate that the average component size (once again, excluding the giant component) follows [33]

$\left\langle s \right\rangle = \frac{1}{{1 - \left\langle k \right\rangle + \left\langle k \right\rangle N_G /N}} \hspace{20 mm} (3 . 37)$

For ‹k› ‹ 1 we lack a giant component (N_G = 0), hence (3.37) becomes

$\left\langle s \right\rangle = \frac{1}{{1 - \left\langle k \right\rangle }} \hspace{20 mm} (3 . 38)$

which diverges when the average degree approaches the critical point ‹k› = 1. Therefore as we approach the critical point, the size of the clusters increases, signaling the emergence of the giant component at ‹k› = 1. Numerical simulations support these predictions for large N (Image 3.20).

To determine the average component size for ‹k› › 1 using (3.37), we need to first calculate the size of the giant component. This can be done in a self-consistent manner, obtaining that the average cluster size decreases for ‹k› › 1, as most clusters are gradually absorbed by the giant component.

Note that (3.37) predicts the size of the component to which a randomly chosen node belongs. This is a biased measure, as the chance of belonging to a larger cluster is higher than the chance of belonging to a smaller one. The bias is linear in the cluster size s. If we correct for this bias, we obtain the average size of the small components that we would get if we were to inspect each cluster one by one and then measure their average size [33]

$\left\langle {s'} \right\rangle = \frac{2}{{2 - \left\langle k \right\rangle + \left\langle k \right\rangle N_G /N}} \hspace{20 mm} (3 . 39)$

Image 3.20 offers numerical support for (3.39).

Average Component Size. — Image 3.20
Average Component Size

The average size *‹s›* of a component to which a randomly chosen node belongs to as predicted by (3.39) (purple). The green curve shows the overall average size *‹s'›* of a component as predicted by (3.37). (After [33]).

The average cluster size in a random network. We choose a node and determined the size of the cluster it belongs to. This measure is biased, as each component of size s will be counted s times. The larger N becomes, the more closely the numerical data follows the prediction (3.37). As predicted, *‹s›* diverges at the *‹k›*=1 critical point, supporting the existence of a phase transition (ADVANCED TOPICS 3.F).

The average cluster size in a random network, where we corrected for the bias in (b) by selecting each component only once.The larger N becomes, the more closely the numerical data follows the prediction (3.39).

Section 3.17
Advanced Topic 3.F
Phase Transitions

The emergence of the giant component at ‹k›=1 in the random network model is reminiscent of a phase transition, a much studied phenomenon in physics and chemistry [35]. Consider two examples:

Water-Ice Transition (Image 3.21a): At high temperatures the H₂O molecules engage in a diffusive motion, forming small groups and then breaking apart to group up with other water molecules. If cooled, at 0˚C the molecules suddenly stop this diffusive dance, forming an ordered rigid ice crystal.
Magnetism (Image 3.21b): In ferromagnetic metals like iron at high temperatures the spins point in randomly chosen directions. Under some critical temperature T_c all atoms orient their spins in the same direction and the metal turns into a magnet.

The freezing of a liquid and the emergence of magnetization are examples of phase transitions, representing transitions from disorder to order. Indeed, relative to the perfect order of the crystalline ice, liquid water is rather disordered. Similarly, the randomly oriented spins in a ferromagnet take up the highly ordered common orientation under T_c.

Many properties of a system undergoing a phase transition are universal. This means that the same quantitative patterns are observed in a wide range of systems, from magma freezing into rock to a ceramic material turning into a superconductor. Furthermore, near the phase transition point, called the critical point, many quantities of interest follow power-laws.

The phenomena observed near the critical point ‹k› = 1 in a random network in many ways is similar to a phase transition:

The similarity between Image 3.7a and the magnetization diagram of Image 3.21b is not accidental: they both show a transition from disorder to order. In random networks this corresponds to the emergence of a giant component when ‹k› exceeds ‹k› = 1.
As we approach the freezing point, ice crystals of widely different sizes are observed, and so are domains of atoms with spins pointing in the same direction. The size distribution of the ice crystals or magnetic domains follows a power law. Similarly, while for ‹k› ‹ 1 and ‹k› › 1 the cluster sizes follow an exponential distribution, right at the phase transition point p_s follows the power law (3.36), indicating the coexistence of components of widely different sizes.
At the critical point the average size of the ice crystals or of the magnetic domains diverges, assuring that the whole system turns into a single frozen ice crystal or that all spins point in the same direction. Similarly in a random network the average cluster size ‹s› diverges as we approach ‹k› = 1 (Image 3.20).

Phase Transitions. — Image 3.21
Phase Transitions

**Water-Ice Phase Transition**
The hydrogen bonds that hold the water molecules together (dotted lines) are weak, constantly breaking up and re-forming, maintaining partially ordered local structures (left panel). The temperature-pressure phase diagram indicates (center panel) that by lowering the temperature, the water undergoes a phase transition, moving from a liquid (purple) to a frozen solid (green) phase. In the solid phase each water molecule binds rigidly to four other molecules, forming an ice lattice (right panel). After http://www.lbl.gov/Science-Articles/ Archive/sabl/2005/February/ water-solid.html.

**Magnetic Phase Transition**
In ferromagnetic materials the magnetic moments of the individual atoms (spins) can point in two different directions. At high temperatures they choose randomly their direction (right panel). In this *disordered state* the system’s total magnetization (m = ΔM/N, where ΔM is the number of up spins minus the number of down spins) is zero. The phase diagram (middle panel) indicates that by lowering the temperature T, the system undergoes a phase transition at T= T_c, when a nonzero magnetization emerges. Lowering T further allows m to converge to one. In this *ordered phase* all spins point in the same direction (left panel).

Section 3.14
Bibliography

[1] A.-L. Barabási. Linked: The new science of networks. Plume Books, 2003.

[2] P. Erdős and A. Rényi. On random graphs, I. Publicationes Mathematicae (Debrecen), 6:290-297, 1959.

[3] P. Erdős and A. Rényi. On the evolution of random graphs. Publ. Math. Inst. Hung. Acad. Sci., 5:17-61, 1960.

[4] P. Erdős and A. Rényi. On the evolution of random graphs. Bull. Inst. Internat. Statist., 38:343-347, 1961.

[5] P. Erdős and A. Rényi. On the Strength of Connectedness of a Random Graph, Acta Math. Acad. Sci. Hungary, 12: 261–267, 1961.

[6] P. Erdős and A. Rényi. Asymmetric graphs. Acta Mathematica Acad. Sci. Hungarica, 14:295-315, 1963.

[7] P. Erdős and A. Rényi. On random matrices. Publ. Math. Inst. Hung. Acad. Sci., 8:455-461, 1966.

[8] P. Erdős and A. Rényi. On the existence of a factor of degree one of a connected random graph. Acta Math. Acad. Sci. Hungary, 17:359-368, 1966.

[9] P. Erdős and A. Rényi. On random matrices II. Studia Sci. Math. Hungary, 13:459-464, 1968.

[10] E. N. Gilbert. Random graphs. The Annals of Mathematical Statistics, 30:1141-1144, 1959.

[11] R. Solomonoff and A. Rapoport. Connectivity of random nets. Bulletin of Mathematical Biology, 13:107-117, 1951.

[12] P. Hoffman. The Man Who Loved Only Numbers: The Story of Paul Erdős and the Search for Mathematical Truth. Hyperion Books, 1998.

[13] B. Schechter. My Brain is Open: The Mathematical Journeys of Paul Erdős. Simon and Schuster, 1998.

[14] G. P. Csicsery. N is a Number: A Portait of Paul Erdős, 1993

[15] B. Bollobás. Random Graphs. Cambridge University Press, 2001.

[16] L. C. Freeman and C. R. Thompson. Estimating Acquaintanceship. Volume, pg. 147-158, in The Small World, Edited by Manfred Kochen (Ablex, Norwood, NJ), 1989.

[17] H. Rosenthal. Acquaintances and contacts of Franklin Roosevelt. Unpublished thesis. Massachusetts Institute of Technology, 1960.

[18] L. Backstrom, P. Boldi, M. Rosa, J. Ugander, and S. Vigna. Four degrees of separation. In ACM Web Science 2012: Conference Proceedings, pages 45−54. ACM Press, 2012.

[19] R. Albert and A.-L. Barabási. Statistical mechanics of complex networks. Reviews of Modern Physics, 74:47-97, 2002.

[20] I. de Sola Pool and M. Kochen. Contacts and Influence. Social Networks, 1: 5-51, 1978.

[21] H. Jeong, R. Albert and A. L. Barabási. Internet: Diameter of the world-wide web. Nature, 401:130-131, 1999.

[22] S. Lawrence and C.L. Giles. Accessibility of information on the Web Nature, 400:107, 1999.

[23] A. Broder, R. Kumar, F. Maghoul, P. Raghavan, S. Rajagopalan, R. Stata, A. Tomkins, and J. Wiener. Graph structure in the web. Computer Networks, 33:309–320, 2000.

[24] S. Milgram. The Small World Problem. Psychology Today, 2: 60-67, 1967.

[25] J. Travers and S. Milgram. An Experimental Study of the Small World Problem. Sociometry, 32:425-443, 1969.

[26] K. Frigyes, “Láncszemek,” in Minden másképpen van (Budapest: Atheneum Irodai es Nyomdai R.-T. Kiadása, 1929), 85–90. English translation is available in [27].

[27] M. Newman, A.-L. Barabási, and D. J. Watts. The Structure and Dynamics of Networks. Princeton University Press, 2006.

[28] J. Guare. Six degrees of separation. Dramatist Play Service, 1992.

[29] D. J. Watts and S. H. Strogatz. Collective dynamics of ‘small-world’ networks. Nature, 393: 409–10, 1998.

[30] T. S. Kuhn. The Structure of Scientific Revolutions. University of Chicago Press, 1962.

[31] A.-L. Barabási and R. Albert. Emergence of scaling in random networks. Science, 286:509-512, 1999.

[32] A.-L. Barabási, R. Albert, and H. Jeong. Meanfield theory for scalefree random networks. Physica A, 272:173-187, 1999.

[33] M. Newman. Networks: An Introduction. Oxford University Press, 2010.

[34] K. Christensen, R. Donangelo, B. Koiller, and K. Sneppen. Evolution of Random Networks. Physical Review Letters, 81:2380-2383, 1998.

[35] H. E. Stanley. Introduction to Phase Transitions and Critical Phenomena. Oxford University Press, 1987.

[36] D. Fernholz and V. Ramachandran. The diameter of sparse random graphs. Random Structures and Algorithms, 31:482-516, 2007.

Section 3.1
Introduction

Section 3.2
The Random Network Model

Section 3.3
Number of Links

Section 3.4
Degree Distribution

Binomial Distribution

Poisson Distribution

Section 3.5
Real Networks are Not Poisson

Section 3.6
The Evolution of a Random Network

Section 3.7
Real Networks are Supercritical

Section 3.8
Small Worlds

Section 3.9
Clustering Coefficient

Section 3.10
Summary: Real Networks are Not Random

Section 3.11
Homework

Section 3.12
Advanced Topic 3.A
Deriving the Poisson Distribution

Section 3.13
Advanced Topic 3.B
Maximum and Minimum Degrees

Section 3.14
Advanced Topic 3.C
Giant Component

Section 3.15
Advanced Topic 3.D
Component Sizes

Section 3.16
Advanced Topic 3.E
Fully Connected Regime

Section 3.17
Advanced Topic 3.F
Phase Transitions

Section 3.18
Advanced Topic 3.G
Small World Corrections

Section 3.14
Bibliography

Section 3.1 Introduction

Section 3.2 The Random Network Model

Section 3.3 Number of Links

Section 3.4 Degree Distribution

Binomial Distribution

Poisson Distribution

Section 3.5 Real Networks are Not Poisson

Section 3.6 The Evolution of a Random Network

Section 3.7 Real Networks are Supercritical

Section 3.8 Small Worlds

Section 3.9 Clustering Coefficient

Section 3.10 Summary: Real Networks are Not Random

Section 3.11 Homework

Section 3.12 Advanced Topic 3.ADeriving the Poisson Distribution

Section 3.13 Advanced Topic 3.BMaximum and Minimum Degrees

Section 3.14 Advanced Topic 3.CGiant Component

Section 3.15 Advanced Topic 3.DComponent Sizes

Section 3.16 Advanced Topic 3.EFully Connected Regime

Section 3.17 Advanced Topic 3.FPhase Transitions

Section 3.18 Advanced Topic 3.GSmall World Corrections

Section 3.14 Bibliography

Section 3.1
Introduction

Section 3.2
The Random Network Model

Section 3.3
Number of Links

Section 3.4
Degree Distribution

Section 3.5
Real Networks are Not Poisson

Section 3.6
The Evolution of a Random Network

Section 3.7
Real Networks are Supercritical

Section 3.8
Small Worlds

Section 3.9
Clustering Coefficient

Section 3.10
Summary: Real Networks are Not Random

Section 3.11
Homework

Section 3.12
Advanced Topic 3.A
Deriving the Poisson Distribution

Section 3.13
Advanced Topic 3.B
Maximum and Minimum Degrees

Section 3.14
Advanced Topic 3.C
Giant Component

Section 3.15
Advanced Topic 3.D
Component Sizes

Section 3.16
Advanced Topic 3.E
Fully Connected Regime

Section 3.17
Advanced Topic 3.F
Phase Transitions

Section 3.18
Advanced Topic 3.G
Small World Corrections

Section 3.14
Bibliography