Data Cleaning project

Logo

A comprehensive data cleaning project using the SmartPrix Laptop dataset, focusing on handling missing values, duplicates, and outliers.

View the Project on GitHub themispap/laptop-data-cleaning

Laptop Data Cleaning Project

Overview

This project focuses on cleaning the “SmartPrix Laptop Unclean Dataset” obtained from Kaggle. The dataset includes various attributes of laptops such as price, specifications, and user ratings. The goal of this project is to demonstrate data cleaning techniques including handling missing values, duplicates, and outliers.

Dataset

The dataset used in this project contains the following columns:

Project Highlights

Tech Stack

Setup Instructions

1. Clone the Repository

git clone https://github.com/yourusername/laptop-data-cleaning.git
cd laptop-data-cleaning

2. Create and Activate Virtual Environment

3. Install Required Libraries

Install all required libraries using the requirements.txt file:

pip install -r requirements.txt

4. Open Jupyter Notebook

Run the project in Jupyter Notebook:

jupyter notebook

Usage

Future Improvements

License

This project is licensed under the MIT License. See the LICENSE file for details.