My-Data-Science-Learning-Projects

๐Ÿš€ My Data Science Learning Projects.

Welcome to my personal collection of hands-on data science projects! This repository showcases my journey exploring and mastering various data science concepts, tools, and techniques.
๐ŸŒŸ Stay tuned as I continue to expand this repository with more exciting projects!

Note

Have a look at this repo about my Machine Learning Subject wich contains even more projects written in R.

๐ŸŒธ Iris Classification.

  • ๐Ÿ“„ Notebook
  • ๐Ÿ› ๏ธ Technologies: Pandas, PyTorch.
  • ๐Ÿงช Summary: A classic classification problem using the Iris dataset to practice data manipulation, visualization, and building simple neural networks.

๐Ÿฉบ Diabetes Prediction.

  • ๐Ÿ“„ Notebook
  • ๐Ÿ› ๏ธ Technologies: Scikit-learn, TensorFlow.
  • ๐Ÿงช Summary: Predicting the likelihood of diabetes using machine learning models, focusing on data preprocessing and model evaluation.

๐Ÿง‘โ€โš•๏ธ Breast Cancer.

  • ๐Ÿ“„ Notebook
  • ๐Ÿ› ๏ธ Technologies: Autokeras, Scikit-learn.
  • ๐Ÿงช Summary: Automated approach to classify breast cancer cases. The project leverages AutoKeras to find optimal deep learning models with minimal manual tuning.

๐Ÿท Wine.

  • ๐Ÿ“„ Notebook
  • ๐Ÿ› ๏ธ Technologies: Pyspark, Pandas.
  • ๐Ÿงช Summary: The script processes the Wine dataset using Apache Spark, performing data cleaning, exploration, and applying custom pandas UDFs for additional transformations.

๐Ÿ  California Housing.

  • ๐Ÿ“„ Notebook
  • ๐Ÿ› ๏ธ Technologies: Dask, Scikit-learn.
  • ๐Ÿงช Summary: This time, we will use an alternative to Pandas so that parallel computing is considered when manipulating dataframes thanks to the library Dask.

๐Ÿ’ญ Movie Reviews.

  • ๐Ÿ“„ Notebook
  • ๐Ÿ› ๏ธ Technologies: NLTK, Scikit-learn.
  • ๐Ÿงช Summary: The notebook analyzes movie reviews using the NLTK library, focusing on text preprocessing, feature extraction, and sentiment classification with a Naive Bayes model.

๐Ÿ“Š ManageYourData.

  • ๐Ÿ’ป Repository
  • ๐Ÿ› ๏ธ Technologies: Pandas, Matplotlib, FPDF, Openpyxl, Streamlit.
  • ๐Ÿงช Summary: A self-made tool for generating PDF reports from data files locally.

๐ŸŽ›๏ธ Data analysis.

  • ๐Ÿ“‹ Carprice report and Titanic report
  • ๐Ÿ› ๏ธ Technologies: Power BI.
  • ๐Ÿงช Summary: My first two dashboards ever made with Power BI allowed me to learn the basics of visualizing and manipulating data.

Visit original content creator repository
https://github.com/MarkosHB/My-Data-Science-Learning-Projects

Comments

Leave a Reply

Your email address will not be published. Required fields are marked *