
Fake News Classification using Machine Learning

ABSTRACT

In today's world, "fake news" has become a major concern, spreading like wildfire across many platforms. It not only undermines the credibility of information but also misleads society. Social media is now the primary channel through which fake news propagates, causing problems such as defamation and the promotion of news that favours specific individuals. Fake news often targets the most prominent, powerful, and influential people in society, aiming to tarnish their reputation, and it is frequently biased toward a single person or a section of society for personal benefit. To mitigate these challenges and promote transparency, the spread of fake news must be curbed. A "Fake News Classifier using ML" offers a promising solution: by applying machine learning algorithms, the classifier can identify misleading information as fake news, thereby raising awareness in society and reducing the resulting harm.

INTRODUCTION

Fake news consists primarily of misleading information spread across society, creating turmoil. In this era, information is everywhere and the number of people accessing it is growing substantially. Users should be aware of what kind of information they are consuming: is it real or fake? Moreover, most social media platforms let users share their views through stories, statuses, and posts, directly accelerating the spread of news that may often be fake.

One very famous social media platform, WhatsApp, serves as a channel for the continual sharing of fake news among its users through WhatsApp groups, statuses, and personal messages. If such sharing or spreading of fake news reaches a significant scale, there is a risk of people believing it, leading to disorder.

One recent example is the rumour of a ban on the 10-rupee coin in India. Word spread widely that 10-rupee coins had been banned, with social media facilitating the rapid spread of this misinformation. Nobody would accept 10-rupee coins, leaving people in India worried about what to do with them. However, the government had announced no such ban; it was simply a baseless rumour. After clarification from the Reserve Bank of India (RBI), people calmed down, and acceptance of the 10-rupee coins resumed.

LITERATURE SURVEY:

The first study proposed a machine learning approach for fake news detection on social media using a K-Nearest Neighbors (KNN) classifier. In a study by Smith et al. (2018), titled "Fake News Detection on Social Media using K Nearest Neighbors Algorithm," the authors introduced a method leveraging KNN to identify misinformation on social media platforms. They assembled a comprehensive dataset of news articles and incorporated diverse features, such as linguistic patterns and user engagement metrics, to enhance classification accuracy. Through extensive experimentation, they demonstrated the effectiveness of KNN in achieving a classification accuracy above 79%. This research highlights the promising potential of KNN as a robust tool for combating the proliferation of fake news in online environments.

The second study proposed a deep learning approach for fake news detection using AA-HGNN. The landscape of fake news detection has seen significant advancements with the emergence of Attention-Aware Heterogeneous Graph Neural Networks (AA-HGNN). These models leverage the power of graph representations and attention mechanisms to effectively discern misinformation from legitimate information sources. Prior research has extensively explored the utility of Graph Neural Networks (GNNs) in modeling complex relationships within news articles and their associated entities. The incorporation of attention mechanisms further enhances the capability of AA-HGNNs by allowing them to focus on the salient features and connections crucial for identifying fake news. Studies have demonstrated the effectiveness of AA-HGNNs in various tasks, including social network analysis and recommendation systems, highlighting their potential to combat the proliferation of misinformation on online platforms. The integration of AA-HGNNs represents a promising avenue for fake news detection, offering a sophisticated framework to tackle the multifaceted challenges posed by deceptive content dissemination.

The third study explores the landscape of fake news detection leveraging Random Forest classifiers
achieving a minimum accuracy of 76%. It investigates methodologies, feature sets, and datasets
employed by researchers in this domain. Through a succinct overview, it highlights the significance of
feature engineering, parameter tuning, and evaluation metrics in achieving the desired accuracy
threshold. The survey also identifies ongoing challenges and suggests potential directions for future
research, emphasizing the critical role of machine learning in combating misinformation.

PROBLEM STATEMENT:

The proliferation of fake news has become a significant issue in today's digital age, causing
misinformation and distrust within communities. Manual identification and verification of fake news
articles are time-consuming and often inefficient. To address this challenge, this project aims to
develop a machine learning-based solution for classifying news articles as either authentic or fake.

Using a dataset containing labeled news articles, the project employs Natural Language Processing
(NLP) techniques and machine learning algorithms to automatically classify news articles into two
categories: authentic and fake. The primary objective is to create a robust classification model that
can accurately discern between genuine and fabricated news articles.

The project utilizes a supervised learning approach, leveraging techniques such as text preprocessing,
feature extraction, and classification. Specifically, the Multinomial Naive Bayes algorithm is employed
due to its effectiveness in text classification tasks. The text data is preprocessed to remove stopwords
and converted into numerical vectors using the TF-IDF (Term Frequency-Inverse Document
Frequency) vectorization technique.
The performance of the developed model is evaluated using standard evaluation metrics such as
accuracy, precision, recall, and F1-score. Additionally, confusion matrices are utilized to visualize the
classification results and identify potential areas of improvement.

The project aims to provide a reliable tool for automatically detecting fake news articles, thereby
assisting in combating the spread of misinformation and enhancing trust in online news sources.
Additionally, the project contributes to the advancement of machine learning techniques in
addressing societal challenges related to information integrity and credibility.

METHODOLOGY FOR FAKE NEWS CLASSIFIER:

1. Data Acquisition and Preprocessing:

Data is obtained from a CSV file ('news.csv') containing text and corresponding labels indicating
whether each article is authentic or fake.

The pandas library is used to load the dataset into a DataFrame.

Text data and labels are extracted from the DataFrame and stored in separate variables (X and y,
respectively).
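The loading step above can be sketched as follows. The file name 'news.csv' and the idea of text and label columns come from the text; the column names and the tiny in-memory frame used here in place of the real dataset are illustrative assumptions.

```python
import pandas as pd

# Stand-in for pd.read_csv('news.csv'): a tiny illustrative frame
# with a text column and a label column, as the methodology describes.
df = pd.DataFrame({
    "text": ["RBI bans the 10 rupee coin", "Parliament passes the budget"],
    "label": ["FAKE", "REAL"],
})

X = df["text"]   # article text
y = df["label"]  # authentic/fake label
print(len(X), list(y))
```

In the real project the `pd.DataFrame(...)` call would simply be `pd.read_csv('news.csv')`.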

2. Data Splitting:

The dataset is split into training and testing sets using the train_test_split function from scikit-learn.

80% of the data is used for training, and the remaining 20% is allocated for testing.
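A minimal sketch of the 80/20 split with scikit-learn's `train_test_split`; the toy texts and labels, the `random_state`, and the `stratify` argument (which keeps the class balance in both halves) are assumptions added for illustration.

```python
from sklearn.model_selection import train_test_split

# Toy data: ten articles, alternating labels
texts = [f"article {i}" for i in range(10)]
labels = ["FAKE" if i % 2 else "REAL" for i in range(10)]

# test_size=0.2 gives the 80/20 split described in the methodology
X_train, X_test, y_train, y_test = train_test_split(
    texts, labels, test_size=0.2, random_state=42, stratify=labels
)
print(len(X_train), len(X_test))
```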

3. Feature Engineering:

Text data is transformed into numerical feature vectors using the TF-IDF (Term Frequency-Inverse
Document Frequency) vectorization technique.

Stop words (common words with little semantic value) are removed during vectorization to improve
model performance.

4. Model Selection and Training:

A pipeline is created using scikit-learn's Pipeline module, which sequentially applies TF-IDF
vectorization and the Multinomial Naive Bayes classifier.

The Multinomial Naive Bayes algorithm is chosen due to its effectiveness in text classification tasks
and its suitability for handling sparse data.

The pipeline is trained on the training data using the fit method.
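The pipeline described above can be sketched as follows. The step names and the four training sentences are illustrative assumptions; the structure (TF-IDF vectorization followed by Multinomial Naive Bayes, trained via `fit`) matches the methodology.

```python
from sklearn.pipeline import Pipeline
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB

# TF-IDF vectorization feeding a Multinomial Naive Bayes classifier
pipe = Pipeline([
    ("tfidf", TfidfVectorizer(stop_words="english")),
    ("clf", MultinomialNB()),
])

X_train = [
    "rbi bans ten rupee coin rumour hoax",
    "government announces new budget policy",
    "miracle cure hoax spreads rumour online",
    "court rules on new policy case",
]
y_train = ["FAKE", "REAL", "FAKE", "REAL"]

pipe.fit(X_train, y_train)
print(pipe.predict(["hoax rumour about coins"]))
```

Because the pipeline bundles vectorizer and classifier, raw text can be passed straight to `predict` without a separate transform step.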

5. Model Evaluation:

The trained model is used to make predictions on the test data.

Classification performance is evaluated using standard metrics such as accuracy, precision, recall, and
F1-score.
The scikit-learn classification_report function is employed to generate a comprehensive report of
these metrics.

Confusion matrices are generated using the confusion_matrix function to visualize the distribution of
true positive, true negative, false positive, and false negative predictions.
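The evaluation functions named above can be demonstrated on hand-made predictions; the `y_true`/`y_pred` values here are purely illustrative.

```python
from sklearn.metrics import classification_report, confusion_matrix

y_true = ["FAKE", "FAKE", "REAL", "REAL", "REAL"]
y_pred = ["FAKE", "REAL", "REAL", "REAL", "FAKE"]

# Rows are true classes, columns are predicted classes,
# in the label order given below (FAKE first, then REAL)
cm = confusion_matrix(y_true, y_pred, labels=["FAKE", "REAL"])
print(cm)

# Per-class precision, recall, F1-score, and support
print(classification_report(y_true, y_pred, labels=["FAKE", "REAL"]))
```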

6. Model Serialization:

The trained model is serialized using the pickle module and saved to a file ('model.pickle').

Serialization allows for the model to be easily stored and reloaded for future use without needing to
retrain it.
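A minimal sketch of the serialization round trip; the file name 'model.pickle' comes from the text, while the two-sentence training set is an illustrative stand-in for the real fitted pipeline.

```python
import pickle
from sklearn.pipeline import Pipeline
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB

# A small fitted pipeline standing in for the real trained model
pipe = Pipeline([("tfidf", TfidfVectorizer()), ("clf", MultinomialNB())])
pipe.fit(["fake hoax story", "real budget report"], ["FAKE", "REAL"])

# Serialize the fitted pipeline to disk...
with open("model.pickle", "wb") as f:
    pickle.dump(pipe, f)

# ...and reload it later without retraining
with open("model.pickle", "rb") as f:
    restored = pickle.load(f)

print(restored.predict(["budget report released"]))
```

The restored object behaves exactly like the original pipeline, so the training step never has to be repeated at inference time.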

7. Performance Assessment:

The accuracy of the model is calculated by comparing the predicted labels with the actual labels of
the test data.

The overall accuracy score is printed to assess the performance of the model in classifying fake news
articles.
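The accuracy comparison above amounts to a single scikit-learn call; the labels below are illustrative.

```python
from sklearn.metrics import accuracy_score

y_true = ["FAKE", "REAL", "REAL", "FAKE"]
y_pred = ["FAKE", "REAL", "FAKE", "FAKE"]

# Fraction of predictions that match the actual labels: 3 of 4
print(accuracy_score(y_true, y_pred))
```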

8. Conclusion:

The findings of the experiments are summarized together with their implications for fake news detection, the effectiveness of the implemented model is reflected upon, future research directions are suggested, and the importance of continued efforts in developing robust fake news detection systems is emphasized.

MODEL VALIDATION & MODEL DEPLOYMENT:

After developing the machine learning model for fake news classification, it's crucial to validate its
performance and deploy it for real-world use. Model validation ensures that the classifier generalizes
well to unseen data, while deployment enables its integration into applications or systems where it
can be utilized to classify news articles effectively.

Model Validation:

To validate the model, several steps are undertaken. First, the dataset is split into training and testing
sets using the `train_test_split` function, with 80% of the data allocated for training and the
remaining 20% for testing. This separation ensures that the model is evaluated on data it has not
been exposed to during training, thus providing a reliable assessment of its generalization capability.

Next, a pipeline is constructed, incorporating TF-IDF vectorization to convert textual data into
numerical features and a Multinomial Naive Bayes classifier to perform the classification task. The
pipeline is trained on the training data using the `fit` method. Once trained, the model predicts labels
for the test data using the `predict` method. The classification performance is then evaluated using
metrics such as accuracy, precision, recall, and F1-score, as well as confusion matrices, to gain
insights into the model's behavior across different classes.
Model Deployment:

Following successful validation, the trained model is serialized using the `pickle` module and saved to a file ('model.pickle'). This serialized model file can then be deployed for real-world use. Deployment options include integration into web applications, creation of APIs for remote inference, or incorporation into existing systems. It's essential to ensure that the deployment environment is compatible with the model's requirements, including library dependencies and hardware resources.

Once deployed, the model should be monitored regularly to track its performance in production.
Monitoring involves analyzing prediction accuracy, detecting concept drift, and identifying any issues
or anomalies that may arise. Maintenance tasks may include periodic retraining of the model with
new data to keep it up-to-date and effective in classifying fake news articles.

In addition to deployment, user interface development is crucial if the model is integrated into a
user-facing application. An intuitive user interface can enhance user experience by providing features
such as text input for submitting news articles, visualizations of model predictions, and explanations
of classification decisions.

Comprehensive documentation should accompany the deployed model, providing instructions for
accessing and using it, details on input and output formats, and troubleshooting guidelines. A
support system should also be established to address user inquiries and gather feedback for
continuous improvement of the model and its deployment process. By following these steps, the
developed machine learning model for fake news classification can effectively combat
misinformation and promote the dissemination of accurate information in online news sources.

RESULT & ANALYSIS:

Upon implementing the machine learning model for news article classification and integrating it with
the front-end user interface, several key observations can be made regarding the model's
performance and usability.

Model Performance:

The implemented model utilizes a Multinomial Naive Bayes classifier trained on a dataset of news articles to classify them as either authentic or fake. During training, the dataset is split into training and testing sets, and the model achieves a high accuracy score on the held-out test set.
Additionally, classification metrics such as precision, recall, and F1-score are computed to provide a
comprehensive evaluation of the model's performance. The classification report and confusion
matrix generated during testing offer insights into the model's ability to correctly classify articles
across different classes.

Front-End Integration:

The front-end interface provides a user-friendly platform for users to input the URL of a news article
and receive instant classification results. Upon submitting the URL, the interface sends a request to
the backend server, which utilizes the trained model to classify the article. The predicted
classification (authentic or fake) is then displayed on the interface, providing users with immediate
feedback.

Usability and Accessibility:


The integration of the model with the front-end interface enhances the usability and accessibility of
the classification system. Users can easily access the system through a web browser without
requiring any specialized software or technical expertise. The intuitive design of the interface, with
clear instructions and input fields, ensures a seamless user experience.

Future Enhancements:

While the current implementation demonstrates the effectiveness of the model for news article
classification, there are opportunities for further improvement and enhancement. Future iterations
of the system could incorporate advanced machine learning techniques, such as deep learning
models, to potentially improve classification accuracy further. Additionally, ongoing monitoring and
evaluation of the model's performance in real-world settings will be essential to ensure its
effectiveness over time.

Overall, the integration of the machine learning model with the front-end interface represents a
significant step towards combating misinformation and promoting the dissemination of accurate
information in online news sources. Through continued refinement and innovation, such systems
have the potential to play a crucial role in addressing the challenges posed by fake news in today's
digital landscape.

CONCLUSION:

In this study, we presented a machine learning-based approach for the classification of news articles
as authentic or fake, aimed at combating the proliferation of misinformation in online sources.
Leveraging a Multinomial Naive Bayes classifier trained on a dataset of labelled news articles, we
developed a robust classification model capable of accurately discerning between genuine and
fabricated news content.

Through the integration of the model with a user-friendly front-end interface, users can easily input
the URL of a news article and receive immediate classification results, enhancing accessibility and
usability. The model achieved a high accuracy score on the test dataset, demonstrating its
effectiveness in distinguishing between authentic and fake news articles.

The deployment of such classification systems holds significant promise in addressing the challenges
posed by fake news, promoting the dissemination of accurate information, and fostering trust in
online news sources. By providing users with tools to verify the authenticity of news content, we can
empower individuals to make informed decisions and combat the spread of misinformation in
today's digital age.

Ongoing monitoring and evaluation of classification systems in real-world settings will be crucial to
ensure their effectiveness and reliability over time.

In conclusion, the development and deployment of machine learning-based classification systems represent a significant step towards addressing the challenges of misinformation and promoting the dissemination of accurate information in online news sources. Through interdisciplinary collaboration and continued refinement of these systems, we can work towards a more informed and trustworthy digital information ecosystem.
