Название: Data Science Crash Course: Thyroid Disease Classification and Prediction Using Machine Learning and Deep Learning with Python GUI, Second Edition
Автор: Vivian Siahaan, Rismon Sianipar
Издательство: Balige Publishing
Год: July 2023
Страниц: 585
Язык: английский
Формат: epub (true)
Размер: 19.7 MB
Thyroid disease is a prevalent condition that affects the thyroid gland, leading to various health issues. In this session of the Data Science Crash Course, we will explore the classification and prediction of thyroid disease using Machine Learning and Deep Learning techniques, all implemented with the power of Python and a user-friendly GUI built with PyQt. We will start by conducting data exploration on a comprehensive dataset containing relevant features and thyroid disease labels. Through analysis and pattern recognition, we will gain insights into the underlying factors contributing to thyroid disease. Next, we will delve into the Machine Learning phase, where we will implement popular algorithms including Support Vector, Logistic Regression, K-Nearest Neighbors (KNN), Decision Tree, Random Forest, Gradient Boosting, Light Gradient Boosting, Naive Bayes, Adaboost, Extreme Gradient Boosting, and Multi-Layer Perceptron. These models will be trained using different preprocessing techniques, including raw data, normalization, and standardization, to evaluate their performance and accuracy. We train each model on the training dataset and evaluate its performance using appropriate metrics such as accuracy, precision, recall, and F1-score. This helps us assess how well the models can predict stroke based on the given features. To optimize the models' performance, we perform hyperparameter tuning using techniques like grid search or randomized search. This involves systematically exploring different combinations of hyperparameters to find the best configuration for each model. After training and tuning the models, we save them to disk using joblib. This allows us to reuse the trained models for future predictions without having to train them again. Moving beyond traditional Machine Learning, we will build an artificial neural network (ANN) using TensorFlow.