diff options
author | Navan Chauhan <navanchauhan@gmail.com> | 2021-05-26 23:58:29 +0530 |
---|---|---|
committer | Navan Chauhan <navanchauhan@gmail.com> | 2021-05-26 23:58:29 +0530 |
commit | bfd3a825c2d73bd842769cdfaf11ad0319a3bd6e (patch) | |
tree | 7b2c052bdf539f433ed3ab6bd133b6d46c7ff7e5 /Content/posts/2020-01-16-Image-Classifier-Using-Turicreate.md | |
parent | 2cb28c0dd749611e6edd4688955769bda3381453 (diff) |
added code and content
Diffstat (limited to 'Content/posts/2020-01-16-Image-Classifier-Using-Turicreate.md')
-rw-r--r-- | Content/posts/2020-01-16-Image-Classifier-Using-Turicreate.md | 268 |
1 files changed, 268 insertions, 0 deletions
diff --git a/Content/posts/2020-01-16-Image-Classifier-Using-Turicreate.md b/Content/posts/2020-01-16-Image-Classifier-Using-Turicreate.md new file mode 100644 index 0000000..5045a57 --- /dev/null +++ b/Content/posts/2020-01-16-Image-Classifier-Using-Turicreate.md @@ -0,0 +1,268 @@ +--- +date: 2020-01-16 10:36 +description: Tutorial on creating a custom Image Classifier using Turicreate and a dataset from Kaggle +tags: Tutorial, Colab, Turicreate +--- + +# Creating a Custom Image Classifier using Turicreate to detect Smoke and Fire + +*For setting up Kaggle with Google Colab, please refer to <a href="/posts/2020-01-15-Setting-up-Kaggle-to-use-with-Colab/"> my previous post</a>* + + +## Dataset + +### Mounting Google Drive + +```python +import os +from google.colab import drive +drive.mount('/content/drive') +``` + +### Downloading Dataset from Kaggle + +```python +os.environ['KAGGLE_CONFIG_DIR'] = "/content/drive/My Drive/" +!kaggle datasets download ashutosh69/fire-and-smoke-dataset +!unzip "fire-and-smoke-dataset.zip" +``` + +## Pre-Processing + +```Termcap +!mkdir default smoke fire +``` + + +\ + +```Termcap +!ls data/data/img_data/train/default/*.jpg +``` + +\ + +```Termcap +img_1002.jpg img_20.jpg img_519.jpg img_604.jpg img_80.jpg +img_1003.jpg img_21.jpg img_51.jpg img_60.jpg img_8.jpg +img_1007.jpg img_22.jpg img_520.jpg img_61.jpg img_900.jpg +img_100.jpg img_23.jpg img_521.jpg 'img_62 (2).jpg' img_920.jpg +img_1014.jpg img_24.jpg 'img_52 (2).jpg' img_62.jpg img_921.jpg +img_1018.jpg img_29.jpg img_522.jpg 'img_63 (2).jpg' img_922.jpg +img_101.jpg img_3000.jpg img_523.jpg img_63.jpg img_923.jpg +img_1027.jpg img_335.jpg img_524.jpg img_66.jpg img_924.jpg +img_102.jpg img_336.jpg img_52.jpg img_67.jpg img_925.jpg +img_1042.jpg img_337.jpg img_530.jpg img_68.jpg img_926.jpg +img_1043.jpg img_338.jpg img_531.jpg img_700.jpg img_927.jpg +img_1046.jpg img_339.jpg 'img_53 (2).jpg' img_701.jpg img_928.jpg +img_1052.jpg img_340.jpg img_532.jpg img_702.jpg img_929.jpg +img_107.jpg img_341.jpg img_533.jpg img_703.jpg img_930.jpg +img_108.jpg img_3.jpg img_537.jpg img_704.jpg img_931.jpg +img_109.jpg img_400.jpg img_538.jpg img_705.jpg img_932.jpg +img_10.jpg img_471.jpg img_539.jpg img_706.jpg img_933.jpg +img_118.jpg img_472.jpg img_53.jpg img_707.jpg img_934.jpg +img_12.jpg img_473.jpg img_540.jpg img_708.jpg img_935.jpg +img_14.jpg img_488.jpg img_541.jpg img_709.jpg img_938.jpg +img_15.jpg img_489.jpg 'img_54 (2).jpg' img_70.jpg img_958.jpg +img_16.jpg img_490.jpg img_542.jpg img_710.jpg img_971.jpg +img_17.jpg img_491.jpg img_543.jpg 'img_71 (2).jpg' img_972.jpg +img_18.jpg img_492.jpg img_54.jpg img_71.jpg img_973.jpg +img_19.jpg img_493.jpg 'img_55 (2).jpg' img_72.jpg img_974.jpg +img_1.jpg img_494.jpg img_55.jpg img_73.jpg img_975.jpg +img_200.jpg img_495.jpg img_56.jpg img_74.jpg img_980.jpg +img_201.jpg img_496.jpg img_57.jpg img_75.jpg img_988.jpg +img_202.jpg img_497.jpg img_58.jpg img_76.jpg img_9.jpg +img_203.jpg img_4.jpg img_59.jpg img_77.jpg +img_204.jpg img_501.jpg img_601.jpg img_78.jpg +img_205.jpg img_502.jpg img_602.jpg img_79.jpg +img_206.jpg img_50.jpg img_603.jpg img_7.jpg +``` + + +The image files are not actually JPEG, thus we first need to save them in the correct format for Turicreate + +```python +from PIL import Image +import glob + + +folders = ["default","smoke","fire"] +for folder in folders: + n = 1 + for file in glob.glob("./data/data/img_data/train/" + folder + "/*.jpg"): + im = Image.open(file) + rgb_im = im.convert('RGB') + rgb_im.save((folder + "/" + str(n) + ".jpg"), quality=100) + n +=1 + for file in glob.glob("./data/data/img_data/train/" + folder + "/*.jpg"): + im = Image.open(file) + rgb_im = im.convert('RGB') + rgb_im.save((folder + "/" + str(n) + ".jpg"), quality=100) + n +=1 +``` + +\ + +```Termcap +!mkdir train +!mv default ./train +!mv smoke ./train +!mv fire ./train +``` + +## Making the Image Classifier + +### Making an SFrame + +```Termcap +!pip install turicreate +``` + +\ + +```python +import turicreate as tc +import os + +data = tc.image_analysis.load_images("./train", with_path=True) + +data["label"] = data["path"].apply(lambda path: os.path.basename(os.path.dirname(path))) + +print(data) + +data.save('fire-smoke.sframe') +``` + +\ + +```Termcap ++-------------------------+------------------------+ +| path | image | ++-------------------------+------------------------+ +| ./train/default/1.jpg | Height: 224 Width: 224 | +| ./train/default/10.jpg | Height: 224 Width: 224 | +| ./train/default/100.jpg | Height: 224 Width: 224 | +| ./train/default/101.jpg | Height: 224 Width: 224 | +| ./train/default/102.jpg | Height: 224 Width: 224 | +| ./train/default/103.jpg | Height: 224 Width: 224 | +| ./train/default/104.jpg | Height: 224 Width: 224 | +| ./train/default/105.jpg | Height: 224 Width: 224 | +| ./train/default/106.jpg | Height: 224 Width: 224 | +| ./train/default/107.jpg | Height: 224 Width: 224 | ++-------------------------+------------------------+ +[2028 rows x 2 columns] +Note: Only the head of the SFrame is printed. +You can use print_rows(num_rows=m, num_columns=n) to print more rows and columns. ++-------------------------+------------------------+---------+ +| path | image | label | ++-------------------------+------------------------+---------+ +| ./train/default/1.jpg | Height: 224 Width: 224 | default | +| ./train/default/10.jpg | Height: 224 Width: 224 | default | +| ./train/default/100.jpg | Height: 224 Width: 224 | default | +| ./train/default/101.jpg | Height: 224 Width: 224 | default | +| ./train/default/102.jpg | Height: 224 Width: 224 | default | +| ./train/default/103.jpg | Height: 224 Width: 224 | default | +| ./train/default/104.jpg | Height: 224 Width: 224 | default | +| ./train/default/105.jpg | Height: 224 Width: 224 | default | +| ./train/default/106.jpg | Height: 224 Width: 224 | default | +| ./train/default/107.jpg | Height: 224 Width: 224 | default | ++-------------------------+------------------------+---------+ +[2028 rows x 3 columns] +Note: Only the head of the SFrame is printed. +You can use print_rows(num_rows=m, num_columns=n) to print more rows and columns. +``` + + +### Making the Model + +```python + +import turicreate as tc + +# Load the data +data = tc.SFrame('fire-smoke.sframe') + +# Make a train-test split +train_data, test_data = data.random_split(0.8) + +# Create the model +model = tc.image_classifier.create(train_data, target='label') + +# Save predictions to an SArray +predictions = model.predict(test_data) + +# Evaluate the model and print the results +metrics = model.evaluate(test_data) +print(metrics['accuracy']) + +# Save the model for later use in Turi Create +model.save('fire-smoke.model') + +# Export for use in Core ML +model.export_coreml('fire-smoke.mlmodel') +``` + +\ + +```Termcap +Performing feature extraction on resized images... +Completed 64/1633 +Completed 128/1633 +Completed 192/1633 +Completed 256/1633 +Completed 320/1633 +Completed 384/1633 +Completed 448/1633 +Completed 512/1633 +Completed 576/1633 +Completed 640/1633 +Completed 704/1633 +Completed 768/1633 +Completed 832/1633 +Completed 896/1633 +Completed 960/1633 +Completed 1024/1633 +Completed 1088/1633 +Completed 1152/1633 +Completed 1216/1633 +Completed 1280/1633 +Completed 1344/1633 +Completed 1408/1633 +Completed 1472/1633 +Completed 1536/1633 +Completed 1600/1633 +Completed 1633/1633 +PROGRESS: Creating a validation set from 5 percent of training data. This may take a while. + You can set ``validation_set=None`` to disable validation tracking. + +Logistic regression: +-------------------------------------------------------- +Number of examples : 1551 +Number of classes : 3 +Number of feature columns : 1 +Number of unpacked features : 2048 +Number of coefficients : 4098 +Starting L-BFGS +-------------------------------------------------------- ++-----------+----------+-----------+--------------+-------------------+---------------------+ +| Iteration | Passes | Step size | Elapsed Time | Training Accuracy | Validation Accuracy | ++-----------+----------+-----------+--------------+-------------------+---------------------+ +| 0 | 6 | 0.018611 | 0.891830 | 0.553836 | 0.560976 | +| 1 | 10 | 0.390832 | 1.622383 | 0.744681 | 0.792683 | +| 2 | 11 | 0.488541 | 1.943987 | 0.733075 | 0.804878 | +| 3 | 14 | 2.442703 | 2.512545 | 0.727917 | 0.841463 | +| 4 | 15 | 2.442703 | 2.826964 | 0.861380 | 0.853659 | +| 9 | 28 | 2.340435 | 5.492035 | 0.941328 | 0.975610 | ++-----------+----------+-----------+--------------+-------------------+---------------------+ +Performing feature extraction on resized images... +Completed 64/395 +Completed 128/395 +Completed 192/395 +Completed 256/395 +Completed 320/395 +Completed 384/395 +Completed 395/395 +0.9316455696202531 +``` + +We just got an accuracy of 94% on Training Data and 97% on Validation Data! |