Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Marco leonardi is 

Main feedback, summed up: 

...

Darknet YOLO (You Only Look Once)

Darknet YOLO is a veryo very common neural network, but the reason why it is so broadly used is because people adapt from others and this one is known as the one "everybody uses " (according to Leonardi).

In our small period of trying this out, we experienced that training this network with the raw images resulted in relatively bad performance. Lots of false positives and few actual detections. However our results improved drastically when the dataset was binarized with edge detection and dilation (hence a fairly heavy and slow preprocessing procedure in advance). According to Leonardi this makes sense as these networks are may way easier to train on binary images. However, he was not convinced that the results were optimal.

...

There are two main differences between these to two neural networks that in Leonardi's opinion would lead to way better performance using RCNN.

  1. Mask RCNN labels each pixel in the image. YOLO on the other side only tries to find a bounding box around the desired contours.  This has proven to give better performance in object detection.

  2. Mask RCNN is a significantly larger neural network. It is therefore slower in execution (and each training takes longer), but on the other hand there are more filters and the object detection becomes more accurate. The networks is capable of detecting more complex objects than Darknet YOLO.


Challenges with Mask RCNN

A larger and more complex neural network is also harder to understand and adapt to your desired usage. Hence understanding how the network functions and how to retrain the last layers may be challenging. Also, there are fewer people out there with experience and probably less help available in forums etc.


Computational cost vs. speed is always a challenge.


The value of bubble detection vs. the time necessary to implement this into the system must definitely be taken into account before deciding on this approach.