4.3 EXPLAINABLE FAKE NEWS DETECTION
In recent years, the computational detection of fake news has produced promising early results. However, a critical piece is missing from these studies: the explainability of such detection, i.e., why a particular piece of news is detected as fake. Here, we introduce two representative approaches, one based on web articles and one based on user comments.
4.3.1 A WEB EVIDENCE-AWARE APPROACH
Evidence from the Web provides important auxiliary information for predicting the credibility of online misinformation and fake news. Existing methods for fake news detection focus on extracting effective features from different sources, such as news content or social media signals, to improve detection performance. However, these approaches do not offer any explanation of their verdicts. In the real world, external evidence or counter-evidence from the Web can serve as a basis for mining user-comprehensible explanations.
Given a set of $N$ news claims $\mathcal{A} = \{a_1, \ldots, a_N\}$ with their corresponding sources $\mathcal{P} = \{p_1, \ldots, p_N\}$, each news claim $a_i$, where $i \in [1, N]$, is reported by a set of $L$ articles $\mathcal{W}_i = \{e_{i,1}, \ldots, e_{i,L}\}$ from sources $\mathcal{WP}_i = \{ep_{i,1}, \ldots, ep_{i,L}\}$. Each tuple $(a_i, p_i, e_{ij}, ep_{ij})$ forms a training instance. The goal is to predict the label for each news claim as fake or real, with user-comprehensible explanations for the prediction results (see Figure 4.9).
Figure 4.9: e illustration of fake news detection with evidence-aware explanations.
The framework DeClarE [110] (see Figure 4.10), which debunks claims with interpretable evidence, mainly consists of the following components: (1) claim and article representation learning; (2) claim-specific attention; and (3) claim credibility prediction.
Figure 4.10: The framework of fake news detection with evidence-aware explanations. Based on [110].
Learning Claim and Article Representations
The claim $a_i$ of length $l$ is represented as $[w_1, w_2, \ldots, w_l]$, where $w_l$ is the word embedding vector of the $l$-th word. The sources of the claim and of the articles are represented by embedding vectors of the same dimensions. A reporting article from the Web $e_{i,j}$ is represented by $[w_{i,j,1}, w_{i,j,2}, \ldots, w_{i,j,k}]$, where $w_{i,j,k}$ is the word embedding vector of the $k$-th word in the article. To obtain the representation of an article $e_{ij}$, we can use bidirectional LSTM networks, as introduced in Section 2.1.3, to learn the neural textual representations. Specifically, given an input word embedding of token $w_k$, an LSTM cell performs nonlinear transformations to generate a hidden state $h_k$ for timestamp $k$, and the last hidden state can be regarded as the representation. Since a bidirectional LSTM is adopted, the final output representation is the concatenation of the outputs of the forward and backward LSTMs, i.e., $h = [\overrightarrow{h}, \overleftarrow{h}]$.
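To make this step concrete, the following is a minimal sketch of the bidirectional LSTM article encoder, assuming PyTorch; the class name and dimension sizes are illustrative, not taken from the original implementation.

```python
import torch
import torch.nn as nn

class ArticleEncoder(nn.Module):
    """Bidirectional LSTM over an article's word sequence (a sketch)."""
    def __init__(self, vocab_size=10000, emb_dim=100, hidden_dim=64):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.lstm = nn.LSTM(emb_dim, hidden_dim,
                            batch_first=True, bidirectional=True)

    def forward(self, token_ids):                 # (batch, seq_len)
        x = self.embed(token_ids)                 # (batch, seq_len, emb_dim)
        outputs, (h_n, _) = self.lstm(x)          # h_n: (2, batch, hidden_dim)
        # h = [h_forward; h_backward], as in the text.
        h = torch.cat([h_n[0], h_n[1]], dim=-1)   # (batch, 2 * hidden_dim)
        return outputs, h
```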
Claim-Specific Attention
To consider the relevance of an article with respect to the claim, we can use an attention mechanism that helps the model focus on salient words in the article. Adding attention also helps make the model transparent and interpretable. First, the overall representation of the input claim is generated by taking the average of the word embeddings of all its words:

$$\bar{a} = \frac{1}{l} \sum_{l} w_l \tag{4.20}$$
and then the overall claim representation $\bar{a}$ is concatenated with each word in the article as follows:

$$\hat{a}_k = e_k \oplus \bar{a}. \tag{4.21}$$

The claim-specific representation of each article word is then transformed through a fully connected layer:

$$a'_k = f(W_a \hat{a}_k + b_a), \tag{4.22}$$

where $W_a$ and $b_a$ are the corresponding weight matrix and bias term, and $f$ is the activation function. Following this step, a softmax activation can be used to calculate an attention score $\alpha_k$ for each word in the article, capturing its relevance to the claim context:

$$\alpha_k = \frac{\exp(a'_k)}{\sum_k \exp(a'_k)}. \tag{4.23}$$
Now that we have the article word annotations $\langle h_k \rangle$ and their relevance to the claim given by $\langle \alpha_k \rangle$, we can combine them to predict the news claim's credibility. The weighted average of the hidden state representations can be calculated as follows:

$$r = \frac{1}{k} \sum_{k} \alpha_k h_k. \tag{4.24}$$
Finally, the article representation $r$ is combined with the article source embedding ($ep$) and the claim source embedding ($p$) through a fully connected layer:

$$l = \mathrm{relu}(W_c (r \oplus ep \oplus p) + b_c). \tag{4.25}$$
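A sketch of the claim-specific attention pipeline (Eqs. (4.20)-(4.25)) may clarify how the pieces fit together; this assumes PyTorch, and all module names and sizes are illustrative rather than the authors' released code.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ClaimSpecificAttention(nn.Module):
    """Claim-specific attention over article words (a sketch of Eqs. 4.20-4.25)."""
    def __init__(self, emb_dim=100, hidden_dim=64, src_dim=8, out_dim=32):
        super().__init__()
        self.att = nn.Linear(2 * emb_dim, 1)                            # W_a, b_a
        self.dense = nn.Linear(2 * hidden_dim + 2 * src_dim, out_dim)   # W_c, b_c

    def forward(self, claim_words, art_words, art_hidden, claim_src, art_src):
        # claim_words: (batch, l, emb); art_words: (batch, k, emb)
        # art_hidden: (batch, k, 2*hidden) biLSTM states; *_src: (batch, src_dim)
        a_bar = claim_words.mean(dim=1, keepdim=True)               # Eq. (4.20)
        a_hat = torch.cat(
            [art_words, a_bar.expand(-1, art_words.size(1), -1)],
            dim=-1)                                                 # Eq. (4.21)
        scores = torch.tanh(self.att(a_hat)).squeeze(-1)            # Eq. (4.22), f = tanh
        alpha = F.softmax(scores, dim=-1)                           # Eq. (4.23)
        r = (alpha.unsqueeze(-1) * art_hidden).mean(dim=1)          # Eq. (4.24)
        l = F.relu(self.dense(
            torch.cat([r, art_src, claim_src], dim=-1)))            # Eq. (4.25)
        return l, alpha
```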
Claim Credibility Prediction
The credibility score $e$ of each article is predicted by feeding the aforementioned representation into a softmax layer:

$$e = \mathrm{softmax}(l). \tag{4.26}$$

Therefore, once we have the per-article credibility scores, we can take their average to generate the overall credibility score for the news claim:

$$y = \frac{1}{L} \sum_{j} e_j. \tag{4.27}$$
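The final scoring and aggregation steps (Eqs. (4.26)-(4.27)) then reduce to a few lines; `classify` below is an assumed final linear layer mapping the representation $l$ to class logits.

```python
import torch

def claim_credibility(article_reprs, classify):
    """Per-article credibility (Eq. 4.26) averaged into a claim score (Eq. 4.27).

    article_reprs: (L, feat_dim) tensor, one row l_j per reporting article.
    classify: an assumed nn.Linear(feat_dim, num_classes) layer.
    """
    e = torch.softmax(classify(article_reprs), dim=-1)  # Eq. (4.26)
    y = e.mean(dim=0)                                   # Eq. (4.27)
    return y
```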
4.3.2 A SOCIAL CONTEXT-AWARE APPROACH
One way is to derive explanations from the perspectives of news contents and user comments (see Figure 4.11) [132]. First, news contents may contain information that is verifiably false. For example, journalists manually check the claims in news articles on fact-checking websites such as PolitiFact (https://www.politifact.com/), which is usually labor intensive and time consuming. Researchers have also attempted to use external sources to fact-check the claims in news articles to decide and explain whether a news piece is fake or not [29], though this may not work for newly emerging events that have not yet been fact-checked. Second, user comments contain rich information from the crowd on social media, including opinions, stances, and sentiment, that is useful for detecting fake news. For example, researchers have proposed using social features to select important comments for predicting fake news pieces [48]. Moreover, news contents and user comments are inherently related to each other and can provide important cues to explain why a given news article is fake or not. For example, in Figure 4.11, we can see users discuss different aspects of the news in comments, such as "St. Nicholas was white? Really?? Lol", which directly responds to the claim in the news content "The Holy Book always said Santa Claus was white."
Figure 4.11: A piece of fake news on PolitiFact, and the user comments on social media. Some
explainable comments are directly related to the sentences in news contents.
Let $a$ be a news article consisting of $L$ sentences $\{s_i\}_{i=1}^{L}$. Each sentence $s_i = \{w_1^i, \ldots, w_{M_i}^i\}$ contains $M_i$ words. Let $\mathcal{C} = \{c_1, c_2, \ldots, c_T\}$ be a set of $T$ comments related to the news $a$, where each comment $c_j = \{w_1^j, \ldots, w_{Q_j}^j\}$ contains $Q_j$ words. Similar to previous research [59, 136], we treat the fake news detection problem as a binary classification problem, i.e., each news article can be real ($y = 0$) or fake ($y = 1$). At the same time, we aim to learn a rank list $RS$ of all sentences in $\{s_i\}_{i=1}^{L}$, and a rank list $RC$ of all comments in $\{c_j\}_{j=1}^{T}$, according to their degree of explainability, where $RS_k$ ($RC_k$) denotes the $k$-th most explainable sentence (comment). The explainability of a sentence in the news content represents the degree to which it is check-worthy, while the explainability of a comment denotes the degree to which it expresses users' beliefs about whether the news is fake or real, closely related to the major claims in the news.
We now present the details of the framework for explainable fake news detection, named dEFEND (neural Explainable FakE News Detection). It consists of four major components (see Figure 4.12): (1) a news content encoder (including a word encoder and a sentence encoder); (2) a user comment encoder; (3) a sentence-comment co-attention component; and (4) a fake news prediction component.
Figure 4.12: e framework dEFEND consists of four components: (1) a news content (in-
cluding word-level and sentence-level) encoder; (2) a user comment encoder; (3) a sentence-
comment co-attention component; and (4) a fake news prediction component. Based on [132].
First, the news content encoder component models the news linguistic features in a latent feature space through hierarchical word- and sentence-level encoding; next, the user comment encoder component extracts latent comment features through word-level attention networks; then, the sentence-comment co-attention component models the mutual influence between the news sentences and user comments for learning feature representations, and the explainability degree of sentences and comments is learned through the attention weights within co-attention learning; finally, the fake news prediction component concatenates news content and user comment features for fake news classification.
News Contents Encoding As fake news pieces are intentionally created to spread inaccurate information rather than to report objective claims, they often have opinionated and sensational language styles, which have the potential to help detect fake news. In addition, a news document contains linguistic cues at different levels, such as the word level and sentence level, which provide different degrees of importance for explaining why the news is fake. For example, in the fake news claim "Pence: Michelle Obama is the most vulgar first lady we've ever had", the word "vulgar" contributes a stronger signal for deciding whether the news claim is fake than the other words in the sentence.
Recently, researchers have found that hierarchical attention neural networks [177] are practical and effective for learning document representations [24] while highlighting important words or sentences for classification. Such a network models word-level and sentence-level representations through self-attention mechanisms. Inspired by [24], we learn the news content representations through a hierarchical structure. Specifically, we first learn the sentence vectors by using a word encoder with attention and then learn the sentence representations through a sentence encoder component.
Word Encoder We learn the sentence representations via an RNN-based word encoder. Although in theory an RNN is able to capture long-term dependencies, in practice old memories fade away as the sequence grows longer. To make it easier for RNNs to capture long-term dependencies, GRUs [27] are designed to have more persistent memory. Similar to [177], we adopt GRUs to encode the word sequence.
To further capture the contextual information in the annotations, we use a bidirectional GRU [8] to model word sequences from both directions. The bidirectional GRU contains a forward GRU $\overrightarrow{f}$, which reads sentence $s_i$ from word $w_{i1}$ to $w_{iM_i}$, and a backward GRU $\overleftarrow{f}$, which reads sentence $s_i$ from word $w_{iM_i}$ to $w_{i1}$:

$$\overrightarrow{h}_{it} = \overrightarrow{\mathrm{GRU}}(w_{it}),\quad t \in \{1, \ldots, M_i\}$$
$$\overleftarrow{h}_{it} = \overleftarrow{\mathrm{GRU}}(w_{it}),\quad t \in \{M_i, \ldots, 1\}. \tag{4.28}$$

We then obtain an annotation of word $w_{it}$ by concatenating the forward hidden state $\overrightarrow{h}_{it}$ and the backward hidden state $\overleftarrow{h}_{it}$, i.e., $h_{it} = [\overrightarrow{h}_{it}, \overleftarrow{h}_{it}]$, which contains the information of the whole sentence centered around $w_{it}$.
Note that not all words contribute equally to the representation of the sentence meaning. Therefore, we introduce an attention mechanism to learn the weights measuring the importance of each word, and the sentence vector $v_i \in \mathbb{R}^{2d \times 1}$ is computed as follows:

$$v_i = \sum_{t=1}^{M_i} \alpha_{it} h_{it}, \tag{4.29}$$

where $\alpha_{it}$ measures the importance of the $t$-th word for sentence $s_i$; $\alpha_{it}$ is calculated as follows:

$$o_{it} = \tanh(W_w h_{it} + b_w)$$
$$\alpha_{it} = \frac{\exp(o_{it} o_w^T)}{\sum_{k=1}^{M_i} \exp(o_{ik} o_w^T)}, \tag{4.30}$$

where $o_{it}$ is a hidden representation of $h_{it}$, obtained by feeding the hidden state $h_{it}$ to a fully connected layer, and $o_w$ is the weight parameter representing the word-level context vector.
Sentence Encoder Similar to the word encoder, we utilize RNNs with GRU units to encode each sentence in a news article. Through the sentence encoder, we capture the context information at the sentence level to learn the sentence representations $h_i$ from the learned sentence vectors $v_i$. Specifically, we can use a bidirectional GRU to encode the sentences as follows:

$$\overrightarrow{h}_i = \overrightarrow{\mathrm{GRU}}(v_i),\quad i \in \{1, \ldots, L\}$$
$$\overleftarrow{h}_i = \overleftarrow{\mathrm{GRU}}(v_i),\quad i \in \{L, \ldots, 1\}. \tag{4.31}$$

We then obtain an annotation of sentence $s_i \in \mathbb{R}^{2d \times 1}$ by concatenating the forward hidden state $\overrightarrow{h}_i$ and the backward hidden state $\overleftarrow{h}_i$, i.e., $s_i = [\overrightarrow{h}_i, \overleftarrow{h}_i]$, which captures the context from the neighboring sentences around sentence $s_i$.
User Comments Encoding People express their emotions and opinions toward fake news through social media comments, including skeptical opinions, sensational reactions, etc. This textual information has been shown to be related to the content of the original news pieces. Thus, comments may contain useful semantic information that has the potential to help fake news detection. Next, we demonstrate how to encode the comments to learn their latent representations. Since comments extracted from social media are usually short texts, we use RNNs to encode the word sequence in a comment directly to learn its latent representation. Similar to the word encoder, we adopt a bidirectional GRU to model the word sequences in comments. Specifically, given a comment $c_j$ with words $w_{jt}, t \in \{1, \ldots, Q_j\}$, we first map each word $w_{jt}$ into a word vector $\mathbf{w}_{jt} \in \mathbb{R}^d$ with an embedding matrix. Then, we obtain the forward and backward hidden states as follows:

$$\overrightarrow{h}_{jt} = \overrightarrow{\mathrm{GRU}}(w_{jt}),\quad t \in \{1, \ldots, Q_j\}$$
$$\overleftarrow{h}_{jt} = \overleftarrow{\mathrm{GRU}}(w_{jt}),\quad t \in \{Q_j, \ldots, 1\}. \tag{4.32}$$
We further obtain the annotation of word $w_{jt}$ by concatenating $\overrightarrow{h}_{jt}$ and $\overleftarrow{h}_{jt}$, i.e., $h_{jt} = [\overrightarrow{h}_{jt}, \overleftarrow{h}_{jt}]$. We again introduce an attention mechanism to learn the weights measuring the importance of each word, and the comment vector $c_j \in \mathbb{R}^{2d}$ is computed as follows:

$$c_j = \sum_{t=1}^{Q_j} \beta_{jt} h_{jt}, \tag{4.33}$$

where $\beta_{jt}$ measures the importance of the $t$-th word for comment $c_j$; $\beta_{jt}$ is calculated as follows:

$$o_{jt} = \tanh(W_c h_{jt} + b_c)$$
$$\beta_{jt} = \frac{\exp(o_{jt} o_c^T)}{\sum_{k=1}^{Q_j} \exp(o_{jk} o_c^T)}, \tag{4.34}$$

where $o_{jt}$ is a hidden representation of $h_{jt}$, obtained by feeding the hidden state $h_{jt}$ to a fully connected layer, and $o_c$ is the weight parameter representing the comment word-level context vector.
Sentence-Comment Co-attention We observe that not all sentences in news contents are fake; in fact, many sentences are true but merely support the false claim sentences [40]. Thus, news sentences are not equally important in determining and explaining whether a piece of news is fake or not. For example, the sentence "Michelle Obama is so vulgar she's not only being vocal.." is strongly related to the major fake claim "Pence: Michelle Obama Is The Most Vulgar First Lady We've Ever Had", while "The First Lady denounced the Republican presidential nominee" expresses a fact and is less helpful in detecting and explaining whether the news is fake.

Similarly, user comments may contain relevant information about the important aspects that explain why a piece of news is fake, but they may also be uninformative and noisy. For example, the comment "Where did Pence say this? I saw him on CBS this morning and he didn't say these things.." is more explainable and more useful for detecting the fake news than comments such as "Pence is absolutely right."

Thus, we aim to select the news sentences and user comments that can explain why a piece of news is fake. Since they provide a good explanation, they should also be helpful in detecting fake news. This suggests designing attention mechanisms that give high weights to the representations of news sentences and comments that are beneficial for fake news detection. Specifically, we use sentence-comment co-attention because it can capture the semantic affinity of sentences and comments and further help learn the attention weights of sentences and comments simultaneously.
We construct the feature matrix of news sentences $S = [s_1; \ldots; s_L] \in \mathbb{R}^{2d \times L}$ and the feature matrix of user comments $C = [c_1, \ldots, c_T] \in \mathbb{R}^{2d \times T}$; the co-attention attends to the sentences and comments simultaneously. Similar to [83, 172], we first compute the affinity matrix $F \in \mathbb{R}^{T \times L}$ as follows:

$$F = \tanh(C^T W_l S), \tag{4.35}$$
where $W_l \in \mathbb{R}^{2d \times 2d}$ is a weight matrix to be learned through the networks. Following the optimization strategy in [83], we can consider the affinity matrix as a feature and learn to predict the sentence and comment attention maps as follows:

$$H^s = \tanh(W_s S + (W_c C)F)$$
$$H^c = \tanh(W_c C + (W_s S)F^T), \tag{4.36}$$
where $W_s, W_c \in \mathbb{R}^{k \times 2d}$ are weight parameters. The attention weights of the sentences and comments are calculated as follows:

$$a^s = \mathrm{softmax}(w_{hs}^T H^s)$$
$$a^c = \mathrm{softmax}(w_{hc}^T H^c), \tag{4.37}$$

where $a^s \in \mathbb{R}^{1 \times L}$ and $a^c \in \mathbb{R}^{1 \times T}$ are the attention probabilities of each sentence $s_i$ and comment $c_j$, respectively, and $w_{hs}, w_{hc} \in \mathbb{R}^{1 \times k}$ are weight parameters. The affinity matrix $F$ transforms the user comment attention space to the news sentence attention space, and vice versa for $F^T$. Based on the above attention weights, the comment and sentence attention vectors are calculated as the weighted sums of the comment and sentence features, i.e.,
$$\hat{s} = \sum_{i=1}^{L} a_i^s s_i, \quad \hat{c} = \sum_{j=1}^{T} a_j^c c_j, \tag{4.38}$$

where $\hat{s} \in \mathbb{R}^{1 \times 2d}$ and $\hat{c} \in \mathbb{R}^{1 \times 2d}$ are the learned features for the news sentences and user comments through co-attention.
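A sketch of the co-attention computation (Eqs. (4.35)-(4.38)) for a single news piece, assuming PyTorch; `d2` stands for the feature size $2d$, `k` for the attention hidden size, and the random initialization is purely illustrative.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CoAttention(nn.Module):
    """Sentence-comment co-attention (a sketch of Eqs. 4.35-4.38), unbatched."""
    def __init__(self, d2=100, k=80):
        super().__init__()
        self.Wl = nn.Parameter(torch.randn(d2, d2))   # W_l
        self.Ws = nn.Parameter(torch.randn(k, d2))    # W_s
        self.Wc = nn.Parameter(torch.randn(k, d2))    # W_c
        self.whs = nn.Parameter(torch.randn(1, k))    # w_hs
        self.whc = nn.Parameter(torch.randn(1, k))    # w_hc

    def forward(self, S, C):
        # S: (d2, L) sentence features; C: (d2, T) comment features.
        Aff = torch.tanh(C.t() @ self.Wl @ S)                   # F, Eq. (4.35), (T, L)
        Hs = torch.tanh(self.Ws @ S + (self.Wc @ C) @ Aff)      # (k, L)
        Hc = torch.tanh(self.Wc @ C + (self.Ws @ S) @ Aff.t())  # Eq. (4.36), (k, T)
        a_s = F.softmax(self.whs @ Hs, dim=-1)                  # Eq. (4.37), (1, L)
        a_c = F.softmax(self.whc @ Hc, dim=-1)                  # (1, T)
        s_hat = a_s @ S.t()                                     # Eq. (4.38), (1, d2)
        c_hat = a_c @ C.t()                                     # (1, d2)
        return s_hat, c_hat, a_s, a_c
```

The returned attention weights `a_s` and `a_c` are exactly what dEFEND uses to rank sentences and comments by explainability.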
Explainable Detection of Fake News We have introduced how to encode news contents by modeling the hierarchical structure from the word level to the sentence level, how to encode comments with word-level attention networks, and the co-attention component that learns sentence and comment representations. We now integrate these components and predict fake news with the following objective:

$$\hat{y} = \mathrm{softmax}([\hat{s}, \hat{c}] W_f + b_f), \tag{4.39}$$

where $\hat{y} = [\hat{y}_0, \hat{y}_1]$ is the predicted probability vector, with $\hat{y}_0$ and $\hat{y}_1$ indicating the predicted probability of the label being 0 (real news) and 1 (fake news), respectively; $y \in \{0, 1\}$ denotes the ground truth label of the news; $[\hat{s}, \hat{c}]$ denotes the concatenation of the learned features for news sentences and user comments; and $b_f \in \mathbb{R}^{1 \times 2}$ is the bias term. Thus, for each news piece, the goal is to minimize the cross-entropy loss:

$$\mathcal{L}(\theta) = -y \log(\hat{y}_1) - (1 - y) \log(1 - \hat{y}_0), \tag{4.40}$$

where $\theta$ denotes the parameters of the network.
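For completeness, the prediction layer and loss (Eqs. (4.39)-(4.40)) in the same sketch style; `Wf` and `bf` are the assumed learned parameters, and `s_hat`, `c_hat` come from the co-attention sketch above.

```python
import torch
import torch.nn.functional as F

def predict_and_loss(s_hat, c_hat, Wf, bf, y):
    """Fake news prediction (Eq. 4.39) and cross-entropy loss (Eq. 4.40).

    s_hat, c_hat: (1, d2) co-attended features; Wf: (2*d2, 2); bf: (2,).
    y: ground-truth label, 1 = fake, 0 = real.
    """
    logits = torch.cat([s_hat, c_hat], dim=-1) @ Wf + bf   # [s_hat, c_hat] W_f + b_f
    y_hat = F.softmax(logits, dim=-1).squeeze(0)           # [y_hat_0, y_hat_1]
    loss = -y * torch.log(y_hat[1]) - (1 - y) * torch.log(1 - y_hat[0])  # Eq. (4.40)
    return y_hat, loss
```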