Commit d8fcff6

Rebase upstream repository (#100)
1 parent 18976b8 commit d8fcff6

File tree

56 files changed (+1678, -293 lines)


Diff for: README.md (+8, -1)

```diff
@@ -1,6 +1,8 @@
 # Active learning + object detection
 Labeling images for object detection is a commonly required task when getting started with a Computer Vision project.
-Good news: you do not have to label all images (draw bounding boxes) from scratch -- the goal of this project is to add (semi-)automation to the process.
+Good news: you do not have to label all images (draw bounding boxes) from scratch -- the goal of this project is to add (semi-)automation to the process.
+Please refer to this blog post that describes Active Learning and the semi-automated flow:
+[Active Learning for Object Detection in Partnership with Conservation Metrics](https://www.microsoft.com/developerblog/2018/11/06/active-learning-for-object-detection/)
 We will use Transfer Learning and Active Learning as core Machine Learning components of the pipeline.
 - Transfer Learning: use a powerful model pre-trained on a big dataset (COCO) as a starting point for fine-tuning on the needed classes.
 - Active Learning: a human annotator labels a small set of images (set1), trains an Object Detection model (model1) on set1, and then uses model1 to predict bounding boxes on further images (thus pre-labeling them). The annotator reviews model1's predictions where the model was less confident -- and thus comes up with a new set of images, set2. The next phase is to train a more powerful model2 on a bigger training set that includes set1 and set2, and to use model2's predictions as a draft of labeled set3…
@@ -28,6 +30,11 @@ There is config.ini that needs to be updated with details like blob storage conn
 More details TBD.
 Basically the idea is to kick off the Active Learning cycle with model retraining as soon as the human annotator reviews a new set of images.
 
+# Notes before we get started
+- The steps below refer to updating config.ini. You can find a detailed description of the config [here](config_description.md)
+- Got several thousand images (or many more) and not sure whether random sampling will be helpful to get rolling with labeling data?
+Take a look at the [Guide to "initialization" predictions](init_pred_desription.md).
+
 # How to run semi-automated pipeline
 The flow below assumes the following:
 1) We use the Tensorflow Object Detection API (Faster RCNN with Resnet 50 as the default option) to fine-tune object detection.
```
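The Active Learning step described in the README hinges on surfacing the images where model1 was least confident. A minimal sketch of such a selection (the function name and thresholds below are hypothetical illustrations, not code from this repo):

```python
def select_for_review(predictions, low=0.2, high=0.7):
    """Keep images whose top detection confidence falls in the uncertain band.

    predictions: iterable of (image_name, top_confidence) pairs.
    """
    return [name for name, conf in predictions if low <= conf <= high]

# Confident (0.9) and near-empty (0.05) images are skipped; the 0.5 image
# is queued for human review in the next labeling round.
print(select_for_review([("a.jpg", 0.9), ("b.jpg", 0.5), ("c.jpg", 0.05)]))
# → ['b.jpg']
```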

Diff for: config.ini (+6, -1)

```diff
@@ -39,6 +39,8 @@ min_confidence=.5
 test_percentage=.2
 model_name=faster_rcnn_resnet50_coco_2018_01_28
 optional_pipeline_url=https://raw.githubusercontent.com/tensorflow/models/master/research/object_detection/samples/configs/faster_rcnn_resnet50_pets.config
+#Init Predictions
+init_model_name=faster_rcnn_resnet101_coco_2018_01_28
 # Config File Details
 old_label_path=PATH_TO_BE_CONFIGURED/pet_label_map.pbtxt
 old_train_path=PATH_TO_BE_CONFIGURED/pet_faces_train.record-?????-of-00010
@@ -65,4 +67,7 @@ tf_val_record=${tf_record_location%.*}_val.${tf_record_location##*.}
 tf_url=http://download.tensorflow.org/models/object_detection/${model_name}.tar.gz
 pipeline_file=${download_location}/${model_name}/pipeline.config
 fine_tune_checkpoint=${download_location}/${model_name}/model.ckpt
-tagging_output=${data_dir}/tagging.csv
+tagging_output=${data_dir}/tagging.csv
+init_pred_tf_url=http://download.tensorflow.org/models/object_detection/${init_model_name}.tar.gz
+init_model_graph=${download_location}/${init_model_name}/frozen_inference_graph.pb
+
```
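The new `${init_model_name}` references are expanded by the pipeline's shell scripts with bash-style substitution. For illustration, the same expansion can be reproduced in Python with configparser's ExtendedInterpolation (a sketch under that assumption, not code from this repo; the real config.ini has no `[DEFAULT]` section header):

```python
from configparser import ConfigParser, ExtendedInterpolation

cfg = ConfigParser(interpolation=ExtendedInterpolation())
cfg.read_string("""
[DEFAULT]
init_model_name = faster_rcnn_resnet101_coco_2018_01_28
init_pred_tf_url = http://download.tensorflow.org/models/object_detection/${init_model_name}.tar.gz
""")

# ${init_model_name} is resolved on lookup:
print(cfg["DEFAULT"]["init_pred_tf_url"])
# → http://download.tensorflow.org/models/object_detection/faster_rcnn_resnet101_coco_2018_01_28.tar.gz
```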

Diff for: functions/README.md (+1, -1)

````diff
@@ -208,4 +208,4 @@ Showing our function running:
 ```bash
 curl "https://jmsactlrnpipeline.azurewebsites.net/api/download?code=AARPr45D5K6AIEWv8bEaqWalSaddrUzd4aydOxmhSPauGUrsPvzw==&imageCount=1"
 ["https://csehackstorage.blob.core.windows.net/image-to-tag/1.jpg"]
-```
+```
````

Diff for: functions/pipeline/labels/__init__.py (+1)

```diff
@@ -117,4 +117,5 @@ def __create_ImageTag_list(image_id, tags_list):
     image_tags = []
     for tag in tags_list:
         image_tags.append(ImageTag(image_id, tag['x1'], tag['x2'], tag['y1'], tag['y2'], tag['classes']))
+
     return image_tags
```
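For context, the helper shown in the hunk can be exercised stand-alone with a stub ImageTag (the real class lives elsewhere in the functions package; the namedtuple below is only an assumed stand-in for illustration):

```python
from collections import namedtuple

# Stub standing in for the project's real ImageTag class (assumed shape).
ImageTag = namedtuple("ImageTag", ["image_id", "x1", "x2", "y1", "y2", "classes"])

def create_image_tag_list(image_id, tags_list):
    # Same logic as __create_ImageTag_list in the diff above.
    image_tags = []
    for tag in tags_list:
        image_tags.append(ImageTag(image_id, tag['x1'], tag['x2'],
                                   tag['y1'], tag['y2'], tag['classes']))

    return image_tags

tags = [{"x1": 0.1, "x2": 0.4, "y1": 0.2, "y2": 0.5, "classes": ["person"]}]
print(create_image_tag_list(42, tags))
# → [ImageTag(image_id=42, x1=0.1, x2=0.4, y1=0.2, y2=0.5, classes=['person'])]
```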

Diff for: functions/pipeline/onboardcontainer/__init__.py (+111, -111)

All 111 lines of the file were removed and re-added with textually identical content, so the file is shown once:

```python
import os
import logging
import json
import azure.functions as func
from urlpath import URL
from datetime import datetime, timedelta
from ..shared.constants import ImageFileType
from ..shared.storage_utils import get_filepath_from_url

from azure.storage.blob import BlockBlobService, BlobPermissions
from azure.storage.queue import QueueService, QueueMessageFormat

DEFAULT_RETURN_HEADER = {
    "content-type": "application/json"
}


def main(req: func.HttpRequest) -> func.HttpResponse:
    logging.info('Python HTTP trigger function processed a request.')

    user_name = req.params.get('userName')

    if not user_name:
        return func.HttpResponse(
            status_code=401,
            headers=DEFAULT_RETURN_HEADER,
            body=json.dumps({"error": "invalid userName given or omitted"})
        )

    try:
        req_body = req.get_json()
        logging.debug(req.get_json())
        storage_account = req_body["storageAccount"]
        storage_account_key = req_body["storageAccountKey"]
        storage_container = req_body["storageContainer"]
    except ValueError:
        return func.HttpResponse(
            "ERROR: Unable to decode POST body",
            status_code=400
        )

    if not storage_container or not storage_account or not storage_account_key:
        return func.HttpResponse(
            "ERROR: storage container/account/key/queue not specified.",
            status_code=401
        )

    # Create blob service for storage account (retrieval source)
    blob_service = BlockBlobService(
        account_name=storage_account,
        account_key=storage_account_key)

    # Queue service for perm storage and queue
    queue_service = QueueService(
        account_name=os.getenv('STORAGE_ACCOUNT_NAME'),
        account_key=os.getenv('STORAGE_ACCOUNT_KEY')
    )

    queue_service.encode_function = QueueMessageFormat.text_base64encode

    try:
        blob_list = []

        for blob_object in blob_service.list_blobs(storage_container):
            blob_url = URL(
                blob_service.make_blob_url(
                    storage_container,
                    blob_object.name
                )
            )
            # Check for supported image types here.
            if ImageFileType.is_supported_filetype(blob_url.suffix):
                logging.debug("INFO: Building sas token for blob " + blob_object.name)
                # create sas signature
                sas_signature = blob_service.generate_blob_shared_access_signature(
                    storage_container,
                    blob_object.name,
                    BlobPermissions.READ,
                    datetime.utcnow() + timedelta(hours=1)
                )

                logging.debug("INFO: have sas signature {}".format(sas_signature))

                signed_url = blob_url.with_query(sas_signature)

                blob_list.append(signed_url.as_uri())

                logging.debug("INFO: Built signed url: {}".format(signed_url))

                msg_body = {
                    "imageUrl": signed_url.as_uri(),
                    "fileName": str(blob_url.name),
                    "fileExtension": str(blob_url.suffix),
                    "directoryComponents": get_filepath_from_url(blob_url, storage_container),
                    "userName": user_name
                }

                body_str = json.dumps(msg_body)
                queue_service.put_message("onboardqueue", body_str)
            else:
                logging.info("Blob object not supported. Object URL={}".format(blob_url.as_uri))

        return func.HttpResponse(
            status_code=202,
            headers=DEFAULT_RETURN_HEADER,
            body=json.dumps(blob_list)
        )
    except Exception as e:
        logging.error("ERROR: Could not build blob object list. Exception: " + str(e))
        return func.HttpResponse("ERROR: Could not get list of blobs in storage_container={0}. Exception={1}".format(
            storage_container, e), status_code=500)
```
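The queue message built above splits each blob URL into name, extension, and directory components via urlpath and get_filepath_from_url. A rough stand-alone equivalent using only the standard library (the field semantics are assumptions for illustration; the real helper lives in the shared storage_utils module):

```python
from urllib.parse import urlparse
from pathlib import PurePosixPath

def describe_blob(url, container):
    """Split a blob URL into fields like those queued for the onboard pipeline."""
    path = PurePosixPath(urlparse(url).path)   # e.g. /images/cam1/day2/0001.jpg
    rel = path.relative_to("/" + container)    # cam1/day2/0001.jpg
    return {
        "fileName": rel.name,
        "fileExtension": rel.suffix,
        "directoryComponents": str(rel.parent),  # '.' when blob sits at container root
    }

print(describe_blob(
    "https://acct.blob.core.windows.net/images/cam1/day2/0001.jpg", "images"))
# → {'fileName': '0001.jpg', 'fileExtension': '.jpg', 'directoryComponents': 'cam1/day2'}
```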

Diff for: images/VOTT_animal.PNG (binary, 2.06 MB)

Diff for: images/init_predict.PNG (binary, 60.3 KB)

Diff for: init_pred_desription.md (+76, new file)

# Guide to "initialization" predictions
Assuming you have a dataset containing many thousands of images -- how do you get started with labeling the first few hundred images?
What about the unbalanced case, where most of the pictures do not have much going on?
If you just random-sample pictures _blindly_, it may take quite a few Active Learning cycles to set your model and training set onto the right path.

## Let's get "metadata" about each image
We could use a pretrained model that can decently detect a few dozen or more object classes to get an idea of what kinds of objects are in the images. The model might not provide super-accurate results, but some of them may be useful for more targeted image sampling.
For example, if your dataset has common scenes of nature or city life, then using a model trained on the [COCO dataset](https://github.com/amikelive/coco-labels/blob/master/coco-labels-paper.txt) might give you an idea of which images have objects that _resemble_ a person, car, deer, and so on. And depending on your scenario you might focus your initial labeling efforts on images that have (or don't have) a particular class.

![Flow](images/init_predict.PNG)
## Settings in config.ini
The following settings control which model is used for "initialization" predictions:
- `init_model_name=faster_rcnn_resnet101_coco_2018_01_28`
  Model name to be used for predictions. The current code assumes it is a COCO-based model.
- `init_pred_tf_url=http://download.tensorflow.org/models/object_detection/${init_model_name}.tar.gz`
  URL for downloading the model from the Tensorflow Object Detection model zoo.
- `init_model_graph=${download_location}/${init_model_name}/frozen_inference_graph.pb`
  Location (on the DSVM) of the inference graph used for producing "initialization" predictions.
## Running the "initialization" predictions flow
Once the config settings are in place (and the images are on blob storage), the user needs to do the following:
- SSH to the DSVM and run the script that actually produces the predictions
- provide the desired mapping (and merging) of detected classes to the classes of interest (more details below)
- download the specified number of images to the client machine and review the tags

*Produce predictions*
SSH to the DSVM, activate the needed Tensorflow virtual environment if necessary, and run:
`. ./active_learning_init_pred.sh ../config.ini`
The output _init_totag*.csv_ contains bbox information for all detected objects. It is probably worth spending some time analyzing those results.
*Class mapping json*
Please refer to _sample_init_classes_map.json_ for reference.
First we define that class "1" should be shown as class "person" in VOTT when the user reviews labels.
We also want 60% of the images pulled for review to contain class "person":

```json
{
    "initclass": "1",
    "map": "person",
    "balance": "0.6"
}
```

Then we _merge_ several classes: "19" (horse) and "21" (cow) will both be displayed in VOTT as "animal":

```json
{
    "initclass": "19",
    "map": "animal",
    "balance": "0.2"
},
{
    "initclass": "21",
    "map": "animal",
    "balance": "0.2"
}
```

We specify that 20% of each _animal_ class (40% in total) should be present in the dataset the user will review in VOTT.
We also specifically request not to include images where no known COCO classes were detected. Given that a COCO-based model may miss quite a few objects, it is still good practice to review some of those.
The model might also detect classes that would clutter images during the review process. For example, the dataset may have basket images that are wrongly classified as a "vase". If we are interested in detecting neither baskets nor vases, we may want to simply "drop" bboxes for the "vase" class (class 86 in COCO):

`"unmapclass": ["64", "86", "15"],`

Finally, for _everything else_ -- classes we are not sure what to do with at this stage but whose bboxes we still want to preserve -- we map them to a "default" class. We can set the name of the "default" class in the mapping json.
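The mapping, merging, and "unmapclass" rules above can be sketched in a few lines. This is a hypothetical illustration of the semantics only; the field names follow the examples above, but the actual implementation in download_vott_json.py may differ:

```python
# Sketch: apply a class-mapping spec (shaped like sample_init_classes_map.json,
# as described above) to raw COCO class IDs. "classes"/"default" keys are
# assumed wrapper names for this illustration.
def apply_class_map(detections, mapping):
    class_map = {m["initclass"]: m["map"] for m in mapping["classes"]}
    drop = set(mapping.get("unmapclass", []))
    default = mapping.get("default", "default")
    out = []
    for det in detections:
        cls = det["class"]
        if cls in drop:
            continue  # e.g. drop "vase" bboxes we are not interested in
        # merged classes ("19", "21") both map to "animal"; unknown ones
        # fall through to the "default" class
        out.append({**det, "class": class_map.get(cls, default)})
    return out

mapping = {
    "classes": [
        {"initclass": "1", "map": "person", "balance": "0.6"},
        {"initclass": "19", "map": "animal", "balance": "0.2"},
        {"initclass": "21", "map": "animal", "balance": "0.2"},
    ],
    "unmapclass": ["64", "86", "15"],
    "default": "default",
}
dets = [{"class": "1"}, {"class": "21"}, {"class": "86"}, {"class": "7"}]
print([d["class"] for d in apply_class_map(dets, mapping)])
# → ['person', 'animal', 'default']
```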
*Review predictions in VOTT*
On the client (tagger) machine, run the usual script to download images. The only difference is that you provide the "class mapping json" as a 3rd parameter:
`D:\repo\active-learning-detect\tag>python download_vott_json.py 200 ..\config.ini ..\sample_init_classes_map.json`

![Flow](images/VOTT_animal.PNG)
