Welcome to my personal collection of hands-on data science projects! This repository showcases my journey exploring and mastering various data science concepts, tools, and techniques.
๐ Stay tuned as I continue to expand this repository with more exciting projects!
Note
Have a look at this repo about my Machine Learning Subject wich contains even more projects written in R.
- ๐ Notebook
- ๐ ๏ธ Technologies: Pandas, PyTorch.
- ๐งช Summary: A classic classification problem using the Iris dataset to practice data manipulation, visualization, and building simple neural networks.
- ๐ Notebook
- ๐ ๏ธ Technologies: Scikit-learn, TensorFlow.
- ๐งช Summary: Predicting the likelihood of diabetes using machine learning models, focusing on data preprocessing and model evaluation.
- ๐ Notebook
- ๐ ๏ธ Technologies: Autokeras, Scikit-learn.
- ๐งช Summary: Automated approach to classify breast cancer cases. The project leverages AutoKeras to find optimal deep learning models with minimal manual tuning.
- ๐ Notebook
- ๐ ๏ธ Technologies: Pyspark, Pandas.
- ๐งช Summary: The script processes the Wine dataset using Apache Spark, performing data cleaning, exploration, and applying custom pandas UDFs for additional transformations.
- ๐ Notebook
- ๐ ๏ธ Technologies: Dask, Scikit-learn.
- ๐งช Summary: This time, we will use an alternative to Pandas so that parallel computing is considered when manipulating dataframes thanks to the library Dask.
- ๐ Notebook
- ๐ ๏ธ Technologies: NLTK, Scikit-learn.
- ๐งช Summary: The notebook analyzes movie reviews using the NLTK library, focusing on text preprocessing, feature extraction, and sentiment classification with a Naive Bayes model.
- ๐ป Repository
- ๐ ๏ธ Technologies: Pandas, Matplotlib, FPDF, Openpyxl, Streamlit.
- ๐งช Summary: A self-made tool for generating PDF reports from data files locally.
- ๐ Carprice report and Titanic report
- ๐ ๏ธ Technologies: Power BI.
- ๐งช Summary: My first two dashboards ever made with Power BI allowed me to learn the basics of visualizing and manipulating data.
https://github.com/MarkosHB/My-Data-Science-Learning-Projects
Leave a Reply