Skip to content

Image Object Detection ‐ Detecto

Jeffrey K Gillan edited this page Sep 18, 2024 · 39 revisions

Workshops sessions are recorded and posted to the UA Datalab Youtube Channel about 1 week after the live session.


Decto Documentation

Detecto is a python library built on top of deep learning framework Pytorch. It makes using Pytorch simpler by reducing the amount of tools and options.

By default, Detecto uses the convolutional neural network architecture Faster R-CNN ResNet-50 FPN. The architecture was pre-trained on the COCO (common objects in context) dataset which contains over 330,000 images, each annotated with 80 object classes (e.g., person, bicycle, car, motorcycle, airplane, bus, train, truck, boat, kite, knife, spoon, tv, book)

Object labels need to be in an xml file using the PASCAL VOC format. Try Label Studio tool to create labels.

NWPU VHR-10 dataset ( Cheng et al., 2014a ). 1 This is a challenging 10-class geospatial object detection dataset, which can be used for both single class and multi-class objects detection. These ten classes of objects are airplane, ship, storage tank, baseball diamond, tennis court, basketball court, ground track field, harbor, bridge, and vehicle. This dataset contains totally 800 VHR optical RSIs, from which 757 airplanes, 302 ships, 655 storage tanks, 390 baseball diamonds, 524 tennis courts, 159 basketball courts, 163 ground track fields, 224 harbors, 124 bridges, and 477 vehicles were manually annotated with axis-aligned bounding boxes used for ground truth. 715 color images were acquired from Google Earth with the spatial resolution ranging from 0.5-m to 2-m, and 85 pansharpened color infrared (CIR) images were acquired from Vaihingen data with a spatial resolution of 0.08-m.

https://huggingface.co/datasets/satellite-image-deep-learning/SODA-A

Run the Jupyter Notebook in Google Colab

One option to run the notebook is in Google Colab. This is probably the easiest option to get started. Google provides you with a virtual machine with optional (but limited) GPU resources. Open Colab Notebook

Clone this wiki locally