The spread of rumors is an important form of communication in society. There are two approaches to investigating the rumor spreading process: microscopic models and the macroscopic models. The macroscopic models propose a macro view about this process and are mainly based on the widely-used Daley-Kendall and Maki-Thompson models. Particularly, rumor spread can be viewed as a stochastic process in social networks. By contrast, the microscopic models are more interested on micro-level interactions between individuals.
In the last few years, there has been a growing interest in rumor propagation in online social networks problems where different approaches have been proposed.
The first category is mainly based on the epidemic models. Pioneering research on rumor propagation using these models started during the 1960s.[1]
A standard model of rumor spreading was introduced by Daley and Kendall.[1] Assume there are N people in total and those people in the network are categorized into three groups: ignorants, spreaders and stiflers, which are denoted as S, I, and R respectively hereinafter (in correspondance with the SIR model):
The rumor is propagated through the population by pair-wise contacts between spreaders and others in the population. Any spreader involved in a pair-wise meeting attempts to “infect” the other individual with the rumor. In the case this other individual is an ignorant, he or she becomes a spreader. In the other two cases, either one or both of those involved in the meeting learn that the rumor is known and decided not to tell the rumor anymore, thereby turning into stiflers.
One variant is the Maki-Thompson model.[2] In this model, rumor is spread by directed contacts of the spreaders with others in the population. Furthermore, when a spreader contacts another spreader only the initiating spreader becomes a stifler. Therefore, three types of interactions can happen with certain rates.
Of course we always have conservation of individuals:
The change in each class in a small time interval is:
Since we know S {\displaystyle S} , I {\displaystyle I} and R {\displaystyle R} sum up to N {\displaystyle N} , we can reduce one equation from the above, which leads to a set of differential equations using relative variable x = I / N {\displaystyle x=I/N} and y = S / N {\displaystyle y=S/N} as follows
which we can write
Compared with the ordinary SIR model, we see that the only difference to the ordinary SIR model is that we have a factor α + β {\displaystyle \alpha +\beta } in the first equation instead of just α {\displaystyle \alpha } . We immediately see that the ignorants can only decrease since x , y ≥ 0 {\displaystyle x,y\geq 0} and d y d t ≤ 0 {\displaystyle {dy \over dt}\leq 0} . Also, if
which means
the rumor model exhibits an “epidemic” even for arbitrarily small rate parameters.
We model the process introduced above on a network in discrete time, that is, we can model it as a DTMC. Say we have a network with N nodes, then we can define X i ( t ) {\displaystyle X_{i}(t)} to be the state of node i at time t. Then X ( t ) {\displaystyle X(t)} is a stochastic process on S = { S , I , R } N {\displaystyle S=\{S,I,R\}^{N}} . At a single moment, some node i and node j interact with each other, and then one of them will change its state. Thus we define the function f {\displaystyle f} so that for x {\displaystyle x} in S {\displaystyle S} , f ( x , i , j ) {\displaystyle f(x,i,j)} is when the state of network is x {\displaystyle x} , node i and node j interact with each other, and one of them will change its state. The transition matrix depends on the number of ties of node i and node j, as well as the state of node i and node j. For any y = f ( x , i , j ) {\displaystyle y=f(x,i,j)} , we try to find P ( x , y ) {\displaystyle P(x,y)} . If node i is in state I and node j is in state S, then P ( x , y ) = α A j i / k i {\displaystyle P(x,y)=\alpha A_{ji}/k_{i}} ; if node i is in state I and node j is in state I, then P ( x , y ) = β A j i / k i {\displaystyle P(x,y)=\beta A_{ji}/k_{i}} ; if node i is in state I and node j is in state R, then P ( x , y ) = β A j i / k i {\displaystyle P(x,y)=\beta A_{ji}/k_{i}} . For all other y {\displaystyle y} , P ( x , y ) = 0 {\displaystyle P(x,y)=0} .
The procedure on a network is as follows:[3]
p j = A j i k i {\displaystyle p_{j}={A_{ji} \over k_{i}}}
We would expect that this process spreads the rumor throughout a considerable fraction of the network. Note however that if we have a strong local clustering around a node, what can happen is that many nodes become spreaders and have neighbors who are spreaders. Then, every time we pick one of those, they will recover and can extinguish the rumor spread. On the other hand, if we have a network that is small world, that is, a network in which the shortest path between two randomly chosen nodes is much smaller than that one would expect, we can expect the rumor spread far away.
Also we can compute the final number of people who once spread the news, this is given by r ∞ = 1 − e − ( α + β β ) r ∞ {\displaystyle r_{\infty }=1-e^{-({\alpha +\beta \over \beta })r_{\infty }}} In networks the process that does not have a threshold in a well mixed population, exhibits a clear cut phase-transition in small worlds. The following graph illustrates the asymptotic value of r ∞ {\displaystyle r_{\infty }} as a function of the rewiring probability p {\displaystyle p} .
The microscopic approaches are more focused on interactions between individuals: "who influenced whom."
Models include the independent cascade model, linear threshold model,[4] energy model,[5] HISBmodel,[6] and Galam's Model.[7]
The Independent Cascade (IC) model is a probabilistic framework for modeling the spread of influence through a social network. In this model, the diffusion process unfolds in discrete time steps. When a node becomes active (i.e., influenced), it has a single chance to activate each of its inactive neighbors. The activation attempt succeeds with a certain probability, specific to the edge connecting the nodes. If the attempt fails, the neighbor cannot be activated by that node in future steps. This process continues until no more activations are possible.
Mathematically, the probability that an active node u activates an inactive neighbor v is denoted as p u v {\displaystyle p_{uv}} . The activation process can be described as follows:
This model captures scenarios where individuals independently decide to adopt a behavior or product after being influenced by their peers. It is particularly useful for modeling viral marketing and information dissemination in networks.
[8]
The Linear Threshold (LT) model is another approach to modeling influence spread in social networks. In this model, each node is assigned a threshold value, representing the fraction of its neighbors that need to be active before it becomes active itself. Each edge from a neighbor to a node has an associated weight, indicating the strength of influence. The sum of incoming edge weights to a node is typically constrained to be at most 1.
Mathematically, for a node v {\displaystyle v} , let N ( v ) {\displaystyle N(v)} be the set of its neighbors, and w u v {\displaystyle w_{uv}} be the weight of the edge from neighbor u {\displaystyle u} to v {\displaystyle v} , such that ∑ u ∈ N ( v ) w u v ≤ 1 {\displaystyle \sum _{u\in N(v)}w_{uv}\leq 1} . Each node v {\displaystyle v} is assigned a threshold θ v {\displaystyle \theta _{v}} , drawn uniformly at random from the interval [0,1].
The activation process proceeds as follows:
∑ u ∈ A t − 1 ∩ N ( v ) w u v ≥ θ v {\displaystyle \sum _{u\in A_{t-1}\cap N(v)}w_{uv}\geq \theta _{v}}
The LT model reflects situations where individuals adopt behaviors based on the collective influence of their peers, such as adopting a new technology or participating in social movements.
In recent years, the rampant spread of misinformation and rumors on online social networks (OSNs) has raised significant concerns within the scientific and technological communities. OSNs are powerful tools for disseminating information, but they can also accelerate the propagation of false or misleading content, which may have serious societal consequences. Consequently, the study of rumor dynamics and the development of effective mitigation strategies have become crucial areas of research.
Several models have been proposed to analyze and control rumor spread in OSNs. Among them, the HISB model (Human Individual and Social Behavior model) stands out as a comprehensive approach that incorporates both individual cognitive factors and social influence mechanisms. This model was first introduced in [9], where the authors proposed a rumor diffusion framework based on human behaviors in online social environments. It was later refined and applied in a more practical context for rumor mitigation in [10].
Subsequent enhancements were made in [11], where the model was adapted for breaking news scenarios, addressing the challenges of real-time rumor control. Most recently, a hybrid approach incorporating the HISB model was proposed by [12].
These successive contributions illustrate the robustness and adaptability of the HISB framework in tackling the evolving nature of misinformation in OSNs. This article provides a comprehensive overview of rumor propagation models, with a focus on those incorporating behavioral dynamics, and discusses state-of-the-art strategies for minimizing rumor influence.
This article adopts the HISBmodel for simulating rumor diffusion. Below, we provide an overview of its key components.
The HISBmodel represents the behavior of individuals facing a rumor through three main factors:
An individual's attraction to a rumor over time is modeled after damped harmonic motion:
where A _ u ( t ) {\displaystyle A\_u(t)} is the attraction of user u {\displaystyle u} to the rumor at time t {\displaystyle t} , and A _ i n t u {\displaystyle A\_{int}^{u}} is the initial attraction.
The model operates on a dynamic multilayer graph G _ 1 _ t , … , G _ n _ t {\displaystyle {G\_1}\_t,\ldots ,{G\_n}\_t} , where each layer represents a different OSN or mode of interaction over time t ∈ 0 , 1 , … , T {\displaystyle t\in {0,1,\ldots ,T}} . Each user can be in one of three states:
At each time step t − 1 {\displaystyle t-1} , spreaders may transmit the rumor according to one of two models:
The probability of rumor transmission from node u {\displaystyle u} to v {\displaystyle v} on layer i {\displaystyle i} at time t {\displaystyle t} is given by:
With the components defined as:
where P {\displaystyle P} is a tunable propagation probability and ( d _ i n ) _ i ( u ) {\displaystyle (d\_{in})\_i(u)} is the in-degree of u {\displaystyle u} in layer i {\displaystyle i} .
The global popularity of the rumor is measured as:
The rumor ceases to propagate when R ( t ) ≈ 0 {\displaystyle R(t)\approx 0} .
Individuals' opinions evolve using a discrete-time Markov chain with states:
Each node v {\displaystyle v} has a transition matrix:
P v = [ 1 − ρ ρ 2 ρ 2 0 1 − ρ 3 1 − ρ 3 1 − ρ 3 ρ 1 − ρ 3 1 − ρ 3 1 − ρ 3 ρ 0 1 − ρ 2 1 − ρ 2 ρ ] {\displaystyle P_{v}={\begin{bmatrix}1-\rho &{\frac {\rho }{2}}&{\frac {\rho }{2}}&0\\{\frac {1-\rho }{3}}&{\frac {1-\rho }{3}}&{\frac {1-\rho }{3}}&\rho \\{\frac {1-\rho }{3}}&{\frac {1-\rho }{3}}&{\frac {1-\rho }{3}}&\rho \\0&{\frac {1-\rho }{2}}&{\frac {1-\rho }{2}}&\rho \\\end{bmatrix}}}
The decision factor ρ {\displaystyle \rho } is given by:
where n _ v − , n _ v {\displaystyle n\_v^{-},n\_v} are the numbers of negative and total received rumors, and J _ v {\displaystyle J\_v} reflects personal judgment.
The steady-state probability of denying a rumor is:
Denoting π _ v − = π _ D ( ρ _ v ) {\displaystyle \pi \_v^{-}=\pi \_D(\rho \_v)} and π _ v + = 1 − π _ v − {\displaystyle \pi \_v^{+}=1-\pi \_v^{-}} , the model can predict long-term belief adoption.
This enhanced HISBmodel framework thus enables nuanced, multilayer-aware modeling of how rumors spread and opinions evolve across interconnected OSNs.