Fake News Detection

Social Media is a place where a massive data is generated in seconds and it’s the first place for any news to land. Having said that, there is huge chance to spread Fake news and it’s one the biggest challenging issue in social media. Following are some of the high level concepts in Fake new detection.

Characterization and Detection are the two aspects of fake news detection.

What is Characterization ?

This phase talks about fake news and its characteristics. Classified into the following two categories Traditional and Social Media.

Traditional Media ( Psychology and Social Foundation )

Psychology Foundation : How each individual user is responsible
Naıve Realism	Consumers believe their perceptions are correct
Confirmation Bias	Consumers prefer to receive information that confirms to their views

Social Media ( Malicious Accounts and Echo Chambers )

Malicious Accounts	Bots ,Cyborg users, Trolls.
Echo Chambers	Groups of like minded people spreading the news

What is Detection ?

This Phase talks about the fake news detection methodologies. Detection Algorithms are highly classified into two categories Content and Social context based.

**Content Based**
Knowledge Based	Used External Sources for checking the credibility of the news.
Style Based	Relies on the same source but manipulates basing on the writing style.

**Social Context Based**
Stance Based	Uses user viewpoint
Propagation Based	Checks for the relevance in social media posts.

To train the model We can get data from different sources . However checking veracity of the news is challenging . Therefore following are some of the publicly available data sets

BuzzFeedNews	This dataset comprises a complete sample of news published in Facebook from 9 news agencies over a week close to the 2016 U.S. election from September 19 to 23 and September 26 and 27. Every post and the linked article were fact-checked
LIAR	This dataset is collected from fact-checking website PolitiFact through its API
BS Detector	This dataset is collected from a browser extension called BS detector developed for checking news veracity
CREDBANK	This is a large scale crowdsourced dataset of approximately 60 million tweets that cover 96 days starting from October 2015

Fake News Related Areas

Rumor Classification	Aims to classify piece of information as rumor or not and its has different phases as Rumor detection, rumor tracking, stance classification, and veracity classification
Truth Discovery	Detecting truth from multiple conflicting sources.
Clickbait Detection	Detecting eye catching teaser in online Media
Spammer and Bot Detection	Detecting malicious users , Spreading ads, Viruses and Phishing etc.

Future Research

Fake news detection is one of the emerging research areas . Its broadly outlined into four different areas like Data-oriented, Feature-oriented, Model-oriented, and Application-oriented.

Data-oriented	The main focus here is on data such as benchmark data collection, Fake news validation etc
Feature-oriented	Focus here is for detecting fake news from multiple data sources, such as news content and social context.
Model-oriented	Focuses more on practical and effective models for fake news detection, including supervised, semi-supervised and unsupervised models
Application-oriented	This research goes beyond fake news detection, such as fake new diffusion and intervention

Reference:

Click to access 19-1-Article2.pdf

My Experiences with Microservices / Cloud Services / Machine Learning / CI CD

Fake News Detection

Leave a comment Cancel reply

Share this:

Leave a comment Cancel reply