Brain Tumour Evaluation

Brain Tumour Evaluation

AI is being used in various fields all over the world. One of the biggest potential beneficiaries of this is the medical field. We can use machine learning models to accurately and quickly identify and then classify different brain tumours from MRI images. I developed a CNN to classify brain tumours into four categories: glioma, meningioma, pituitary, and notumor. For this post, I am not going to talk about the actual design of the model but more just the evaluation of my model, highlighting its strengths, weaknesses, and the implications of using AI in the medical sector.

Dataset Break Down

The dataset is made up of 7023 images, these images are labelled as either pituitary, glioma, meningioma, and notumor.

The dataset is also broken into train and test datasets. The training dataset contains 5712 images, ~81% of the total data and the testing dataset contains 1311 images, ~19% of the total data.

Understanding the Evaluation Metrics

Firstly I am going to explain the different metrics I have used to evaluate my model's performance.

These metrics were calculated for each class, giving us a detailed view of how the model performs across different tumour types. When evaluating our model on the test dataset, we achieved the following results:

Per-Class Performance Metrics

Overall Accuracy: 0.7595 (75.97% of predictions were correct)

Analysis of the Metrics

Confusion Matrix

The above 4x4 confusion matrix, visualised as a heatmap, shows how often the model confuses one class for another. Just like our other metrics showed us my model struggles the most with meningioma tumours. We can also see from this that the model believes that most meningiomas actually are notumors. This can help me understand where the model is going wrong, giving guidance for future improvements.

Implications and Future Directions

My model demonstrates promising capabilities, particularly in identifying non-tumour cases and pituitary tumours. However, several areas require attention.

Moving Towards Clinical Integration

While my model shows potential, it is not yet ready for clinical use. Models like this should be used as a decision support tool to assist radiologists, not replace them. Before deploying a model like this, I would need to:

Conclusion

My brain tumour classification model is a step forward in leveraging AI for medical diagnostics, with strong performance in identifying non-tumour cases and pituitary tumours. However, challenges with glioma recall and meningioma metrics highlight areas for improvement.