cv-detection

star 13.4k

Best practices for object detection tasks. Use when working on COCO, VOC, or detection architectures like YOLO and DETR.

aiming-lab

By aiming-lab schedule Updated 3/23/2026

play_arrow Run Skill in Manus View GitHub

name: cv-detection description: Best practices for object detection tasks. Use when working on COCO, VOC, or detection architectures like YOLO and DETR. metadata: category: domain trigger-keywords: "detection,object,bbox,yolo,coco,anchor,faster rcnn" applicable-stages: "9,10" priority: "5" version: "1.0" author: researchclaw references: "Ren et al., Faster R-CNN, NeurIPS 2015; Carion et al., End-to-End Object Detection with Transformers, ECCV 2020"

Object Detection Best Practice

Architecture families:

One-stage: YOLO (v5/v8), SSD, RetinaNet, FCOS
Two-stage: Faster R-CNN, Cascade R-CNN
Transformer: DETR, DINO, RT-DETR

Training recipe:

Use pre-trained backbone (ImageNet)
Multi-scale training and testing
IoU threshold: 0.5 for mAP50, 0.5:0.95 for mAP
Use FPN for multi-scale feature extraction
Focal loss for class imbalance in one-stage detectors

Standard benchmarks:

COCO val2017: ~37 mAP (Faster R-CNN R50), ~51 mAP (DINO Swin-L)
Pascal VOC: ~80 mAP50 (Faster R-CNN)

Install via CLI

npx skills add https://github.com/aiming-lab/AutoResearchClaw --skill cv-detection

Repository Details

star Stars 13,443

call_split Forks 1,577

navigation Branch main

article Path SKILL.md

Occupations

Computer And Information Research Scientists 151221

More from Creator

aiming-lab

aiming-lab Explore all skills →