This repository implements the enhanced DenseNet121 architecture integrated by dilated convolution and Squeeze-and-Excitation (SE) networks to improve the diagnostic accuracy in brain tumor classification through MRI images.
![Screen Shot 2023-12-10 at 10 39 50 PM](https://private-user-images.githubusercontent.com/89234579/289419552-a6bbfe56-f307-4e04-a74a-3b12ee064747.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkyNDY4ODksIm5iZiI6MTczOTI0NjU4OSwicGF0aCI6Ii84OTIzNDU3OS8yODk0MTk1NTItYTZiYmZlNTYtZjMwNy00ZTA0LWE3NGEtM2IxMmVlMDY0NzQ3LnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTElMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjExVDA0MDMwOVomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTI0Mjk0ZGZmNmQzMmZhYzAzOTQ1YzE1N2ZlMDE4Y2Q5YTAzZjkxZTU1YzAxYWNkODMxNDVlOGI4MjBlM2YyZTkmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.QXVMyBCCgdAFNni-NcKpTcIReFoHFrKmVIhmrRcZT7Y)
We trained and evaluated our model using a comprehensive Kaggle brain tumor dataset comprising 7023 images, classified into four categories, including healthy brain. The dataset was augmented and preprocessed for optimal model training.
The dataset can be found here
Our model advances upon the traditional DenseNet-121 architecture, integrating dilated convolution in place of some standard convolutional layers and augmenting with an SE mechanism. These innovations enhance the model’s representation learning capabilities.
![Screen Shot 2023-12-10 at 10 44 07 PM](https://private-user-images.githubusercontent.com/89234579/289419671-1e657d3b-dd96-4e39-a2a3-f97662170fa9.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkyNDY4ODksIm5iZiI6MTczOTI0NjU4OSwicGF0aCI6Ii84OTIzNDU3OS8yODk0MTk2NzEtMWU2NTdkM2ItZGQ5Ni00ZTM5LWEyYTMtZjk3NjYyMTcwZmE5LnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTElMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjExVDA0MDMwOVomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTZjNDc4Njg2ZWQ0ZDE5YjBiMzVkZTY3Y2YwMjY3NDY4NTRlZDVjODk1YjhlODE2MTlkM2FjMTZjNTA3ODFjNGYmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.zZrRwoizUI6Br8nDbWupCNUTH-_c2iA0_CmOW2YP3co)
We used the AdamW optimizer with a custom Label Smoothing cross-entropy loss function and employed a Cosine Annealing learning rate scheduler. The model was trained over 50 epochs with a batch size of 256.
Our evaluation used a 10-crop method, involving resizing each image to 256 × 256 pixels and producing ten distinct crops per image. The final test report averages the results over these crops.
The model demonstrated superior learning ability, outperforming pre-trained models: ResNet50, VGG16, ViT_16, DenseNet121, and Efficient_V2 in later training epochs and in testing.
![Screen Shot 2023-12-10 at 10 49 14 PM](https://private-user-images.githubusercontent.com/89234579/289420263-ae58fc9d-2886-4d62-8bb7-f7d1f63250a9.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkyNDY4ODksIm5iZiI6MTczOTI0NjU4OSwicGF0aCI6Ii84OTIzNDU3OS8yODk0MjAyNjMtYWU1OGZjOWQtMjg4Ni00ZDYyLThiYjctZjdkMWY2MzI1MGE5LnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTElMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjExVDA0MDMwOVomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTA3NWUwMTdkOTUyNjc2NWRjZjRmYjEwNTYzYzhkNDBlNWJmMTY3YThmZjkzYzFmZjQ2ZDIwZmE1NWYzY2JjMWImWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.MOkmcxgzqJgt1b6HERduz7lvuBfx2XeLhKPS5EIpaHM)
Future research will focus on the implementation of advanced image augmentation techniques, integration of multi-scale network architecture, and adaptive dilation convolution rates.
This work was collaboratively conducted by Yuannong Mao and Edward Jiwook Kim from University of Waterloo.