I made this project based on a project idea from Udacity's Deep Learning Nanodegree; however, I heavily expanded on the problem.
The objective of this model is to determine whether a supplied image of a skin ailment shows melanoma, a nevus, or seborrheic keratosis, based on nothing but the image itself.
I used Keras with the TensorFlow backend to train this model, along with Keras' image processing tools to prepare the data, and of course NumPy. Make sure to install the required libraries from requirements.txt!
Right now, by fine-tuning the weights of Google's InceptionV3 network with my own fully connected classifier layers on top, and training on 2500 images, I am able to get about 77 percent accuracy. I also augmented my data by rotating images by varying degrees using Keras' ImageDataGenerator. With more fine-tuning I was able to increase accuracy by 17 percentage points, from 60 percent to 77 percent! I will keep working on it and hope to hit 90 percent accuracy. This new model can be downloaded here.
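For reference, here is a rough sketch of that setup: load InceptionV3 without its top, attach fully connected classifier layers, freeze the base for the first training phase, and feed it rotation-augmented images. The layer sizes, directory path, and hyperparameters below are only illustrative and are not the exact values used in cnn_network_skin_cancer.py.

```python
# Sketch only: illustrative sizes/paths, not the exact project settings.
from keras.applications.inception_v3 import InceptionV3, preprocess_input
from keras.preprocessing.image import ImageDataGenerator
from keras.layers import Dense, Dropout, GlobalAveragePooling2D
from keras.models import Model

NUM_CLASSES = 3  # melanoma, nevus, seborrheic keratosis

# Load InceptionV3 pretrained on ImageNet, without its classifier.
base = InceptionV3(weights='imagenet', include_top=False,
                   input_shape=(299, 299, 3))

# Attach a small fully connected classifier on top.
x = GlobalAveragePooling2D()(base.output)
x = Dense(512, activation='relu')(x)
x = Dropout(0.5)(x)
predictions = Dense(NUM_CLASSES, activation='softmax')(x)
model = Model(inputs=base.input, outputs=predictions)

# Phase 1: freeze the InceptionV3 base and train only the new head.
for layer in base.layers:
    layer.trainable = False
model.compile(optimizer='rmsprop', loss='categorical_crossentropy',
              metrics=['accuracy'])

# Augment the training data with random rotations (hypothetical path).
train_gen = ImageDataGenerator(preprocessing_function=preprocess_input,
                               rotation_range=40).flow_from_directory(
    'data/train', target_size=(299, 299), batch_size=32,
    class_mode='categorical')

model.fit_generator(train_gen, steps_per_epoch=len(train_gen), epochs=5)
```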
I believe trying the Inception ResNet V2 model and fine-tuning it might give better results; however, since that model is much more complex than InceptionV3, I have not tried training it. One idea I plan to try is fine-tuning only the top few "blocks" of the network instead of the entire network, which could yield better results without huge computation times.
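Continuing from the sketch above, fine-tuning only the top blocks might look something like this. The cut-off index (249 is roughly where the last two inception blocks of InceptionV3 begin) and the optimizer settings are untested guesses, and Inception ResNet V2 would need a different cut-off.

```python
# Sketch only: continues from the previous snippet (model, train_gen).
from keras.optimizers import SGD

FINE_TUNE_FROM = 249  # rough start of the top two InceptionV3 blocks

# Phase 2: keep the lower layers frozen, unfreeze only the top blocks.
for layer in model.layers[:FINE_TUNE_FROM]:
    layer.trainable = False
for layer in model.layers[FINE_TUNE_FROM:]:
    layer.trainable = True

# Recompile with a low learning rate so the pretrained weights are only
# nudged, then continue training on the same augmented generator.
model.compile(optimizer=SGD(lr=0.0001, momentum=0.9),
              loss='categorical_crossentropy', metrics=['accuracy'])
model.fit_generator(train_gen, steps_per_epoch=len(train_gen), epochs=5)
```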
- Download data set (my own):
- Download data sets (provided by Udacity):
- Unzip each data set into its respective directory in the data folder. Leave them in the three separate folders (melanoma, nevus, seborrheic keratosis).
- (Optional) Clean data and add more images (if you don't use my uploaded dataset).
- More images for each category can be found at the ISIC Archive.
- Most of the images found on that website don't appear in the initially downloaded data, with the exception of seborrheic keratosis, since there are few images in the archive for it. So just make sure you are not copying the same data between sets. Also, for better accuracy, try to include pictures without other objects in them, like drawn circles or arrows, or blue/yellow markers next to the lesion for size.
- Some of the data initially provided has those drawn circles and markers in it, so for better results you can look through and delete those images. It took me only a few minutes with file previews on.
- Run the model.
- Open preprocess_data.py and change proccesses_num to the number of cores you are willing to dedicate to processing the data. This can take some time on one core, so I left it at 4. The higher the number, the faster processing will go.
- Change the image size if you are going to change the transferred network; otherwise leave it the same.
- (Optional) If you want to see TensorBoard output like the loss and an image of the network (gradient histograms coming soon), run tensorboard --logdir=tensorboard_logs
- Run cnn_network_skin_cancer.py and enter 'y' to pre-process the data.
- Let the model train! It will print information about trainable and non-trainable parameters, and at the end it will evaluate the model on the test data. The model goes through two training phases: first it trains the fully connected classifier layers on top of the network, then it freezes those weights and fine-tunes the rest of the model.
- Once the model is done training, it will be saved under saved_models.
- Test images.
- You can test individual images by running test_model.py
- It will return the name of the determined disease (a rough sketch of this step is shown after the list).
- This requires a saved model; one is saved by running the steps above completely. I do provide my own trained model for testing if you want. Download the model here.
- Models are saved under the saved_models directory.
- Usage is:
python test_model.py /path/to/image/file /path/to/saved/model
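For the curious, a prediction step like this roughly amounts to loading the saved model, preprocessing one image, and printing the predicted class name. The target image size and the class ordering below are assumptions; check preprocess_data.py and test_model.py for the real values.

```python
# Rough sketch of a test_model.py-style prediction; sizes and class
# order are assumptions, not copied from the actual script.
import sys
import numpy as np
from keras.models import load_model
from keras.preprocessing import image
from keras.applications.inception_v3 import preprocess_input

CLASS_NAMES = ['melanoma', 'nevus', 'seborrheic keratosis']  # assumed order

image_path, model_path = sys.argv[1], sys.argv[2]
model = load_model(model_path)

# Load and preprocess a single image the way the network expects.
img = image.load_img(image_path, target_size=(299, 299))
x = preprocess_input(np.expand_dims(image.img_to_array(img), axis=0))

# Predict and print the most likely class name.
probs = model.predict(x)[0]
print(CLASS_NAMES[int(np.argmax(probs))])
```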
Right now, I am trying to get the TensorBoard callback to work during training, so model training can be visualized with TensorBoard. However, I am getting a ValueError because the gradients returned from the model are None. It doesn't seem to be an issue with freezing layers, and I really cannot find anything on this issue. Still working on it.
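For reference, attaching the callback looks roughly like the sketch below; the arguments are illustrative. With histogram_freq=0 it only logs scalars (loss/accuracy) and the graph, which matches the TensorBoard output described above; gradient histograms need histogram_freq > 0 (plus write_grads), which may be where the None-gradient ValueError is coming from.

```python
# Sketch only: reuses model and train_gen from the earlier snippets.
from keras.callbacks import TensorBoard

tb_callback = TensorBoard(log_dir='tensorboard_logs',
                          histogram_freq=0,  # histograms disabled for now
                          write_graph=True)

model.fit_generator(train_gen, steps_per_epoch=len(train_gen),
                    epochs=5, callbacks=[tb_callback])
```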
I hope you can get better results than I did! You can speed up training by increasing the batch size (and the number of epochs), so that each epoch runs fewer batches. Good luck!