GitHub - DeterminedSage/Spam-Mail-Prediction

PreRequist :- make sure you have uploaded the "mail_data.csv" in the google collab from the repository link to run this file

This project demonstrates how to build a spam detection system using Logistic Regression in Python. It involves the following steps:

Data Collection and Preprocessing:

Load email data from a CSV file. Handle missing values. Perform label encoding to represent spam as 0 and ham as 1. Feature Extraction:

Utilize TF-IDF vectorization to convert text data into numerical features. Remove stop words and convert text to lowercase for better feature representation. Model Training:

Train a Logistic Regression model on the labeled data. Model Evaluation:

Evaluate the model's performance on both training and testing datasets using metrics such as accuracy.

Accuracy on training data = 96.70%

Accuracy on test data = 96.59%

Prediction:

Build a system to predict whether new emails are spam or ham based on the trained model. This project showcases a basic implementation of a spam detection system using machine learning.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
README.md		README.md
Spam_Mail_Prediction.ipynb		Spam_Mail_Prediction.ipynb
mail_data.csv		mail_data.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

About

Uh oh!

Releases

Packages

Languages

DeterminedSage/Spam-Mail-Prediction

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages