devxyasir
/

Email_Spam_Detection

+---
+license: mit
+language:
+- en
+tags:
+- email
+- spam
+- spamdetection
+---
+ # 📩 Spam Detection Neural Network (PyTorch)
+[![Python](https://img.shields.io/badge/python-3.10-blue.svg)](https://www.python.org/)
+[![PyTorch](https://img.shields.io/badge/pytorch-2.1-red.svg)](https://pytorch.org/)
+[![License](https://img.shields.io/badge/license-MIT-green.svg)](LICENSE)
+A **simple, real-world spam detection neural network** built from scratch in **PyTorch**.
+This model classifies SMS / short text messages as **Spam** or **Ham (Not Spam)**.
+The project is **small, easy to understand, and perfect for learning**.
+You can fork it, fine-tune it, and use it as a **starting point for your own projects**.
+---
+## 🧠 Model Overview
+- **Framework:** PyTorch
+- **Architecture:** Fully Connected Neural Network (MLP)
+- **Input:** Bag-of-Words text vectors
+- **Output:** Binary classification (Spam / Ham)
+- **Training:** From scratch, small dataset (~5,500 messages)
+> ⚠️ Note: The dataset is intentionally small to keep things simple.
+> You are encouraged to **fork the repo, add more data, and fine-tune the model**.
+---
+## 📂 Repository Structure
+```
+.
+├── spam_nn.pth        # Trained PyTorch model weights
+├── vectorizer.pkl     # CountVectorizer for text preprocessing
+├── model.py           # Neural network architecture
+├── config.json        # Model configuration
+├── inference.py       # Inference / prediction script
+├── README.md          # Documentation
+````
+---
+## 🚀 Usage
+### Load Model
+```python
+import torch
+from model import SpamNN
+import pickle
+# Load model architecture + weights
+model = SpamNN()
+model.load_state_dict(torch.load("spam_nn.pth"))
+model.eval()
+# Load vectorizer
+with open("vectorizer.pkl", "rb") as f:
+    vectorizer = pickle.load(f)
+````
+### Predict Messages
+```python
+def predict(text):
+    vec = vectorizer.transform([text]).toarray()
+    vec = torch.tensor(vec, dtype=torch.float32)
+    with torch.no_grad():
+        output = model(vec)
+    return "Spam" if output.item() > 0.35 else "Ham"
+# Example
+print(predict("Congratulations! You won $1000. Click now!"))
+```
+---
+## 🔧 Training & Fine-Tuning
+The model can be **improved and fine-tuned** by:
+* Adding more data (larger SMS datasets)
+* Increasing n-grams (`ngram_range=(1,2)`)
+* Adjusting class weights in `BCEWithLogitsLoss`
+* Training with more epochs
+* Using embeddings or LSTM for contextual understanding
+💡 **Fork this repo and experiment freely**. Make it your own!
+---
+## 🌟 Support the Project
+If this project is helpful:
+⭐ **Give this repository a star**
+🍴 **Fork it and improve it**
+📢 **Share it with others learning PyTorch**
+> Following and starring helps me keep releasing open-source projects!
+---
+## 📌 Source Code & Updates
+For the **full source code, training scripts, and future updates**,
+please visit the **GitHub repository** linked to this project.
+---
+## 📜 License
+This project is **open-source** and intended for **educational purposes**.
+MIT License applies.
+---
+## 🤗 Hugging Face Friendly
+You can also **upload this model to Hugging Face Model Hub**.
+Include `spam_nn.pth`, `vectorizer.pkl`, `config.json`, and `inference.py` to make it **ready for inference online**.