Implement all learned knowledge about data analysis and data mining to make a complete project about Diagnosis of diabetes based on data set of blood test result
The data were collected from the Iraqi society, as they data were acquired from the laboratory of Medical City Hospital and (the Specializes Center for Endocrinology and Diabetes-Al-Kindy Teaching Hospital).
Please add this citation if you use this dataset for any further analysis.
Rashid, Ahlam. Diabetes Dataset. PlumX Metrics, 2020.
Link: https://plu.mx/plum/a?mendeley_data_id=wj9rwkp9c2&theme=plum-bigben-theme
- https://www.kaggle.com/datasets/aravindpcoder/diabetes-dataset
- https://www.kaggle.com/datasets/simaanjali/diabetes-classification-dataset/data
- Used techniques: preprocessing, SMOTE, clustering with K-Means and Hierarchical, classification with KNNs and Decision Tree
- Re-edit the path if you use our code for importing or loading the dataset.
- The report is for reference only, please do not edit or use for other purposes.
All the files are done by me and @phamcongthuan, if you reuse the code please add the citation