YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

For years, the YOLO series has been the de facto industry-level standard for efficient object detection. The YOLO community has prospered overwhelmingly to enrich its use in a multitude of hardware platforms and abundant scenarios. In this technical report, we strive to push its limits to the next level, stepping forward with an unwavering mindset for industry application. Considering the diverse requirements for speed and accuracy in the real environment, we extensively examine the up-to-date object detection advancements either from industry or academia. Specifically, we heavily assimilate ideas from recent network design, training strategies, testing techniques, quantization, and optimization methods. On top of this, we integrate our thoughts and practice to build a suite of deployment-ready networks at various scales to accommodate diversified use cases. With the generous permission of YOLO authors, we name it YOLOv6. We also express our warm welcome to users and contributors for further enhancement. For a glimpse of performance, our YOLOv6-N hits 35.9% AP on the COCO dataset at a throughput of 1234 FPS on an NVIDIA Tesla T4 GPU. YOLOv6-S strikes 43.5% AP at 495 FPS, outperforming other mainstream detectors at the same scale~(YOLOv5-S, YOLOX-S, and PPYOLOE-S). Our quantized version of YOLOv6-S even brings a new state-of-the-art 43.3% AP at 869 FPS. Furthermore, YOLOv6-M/L also achieves better accuracy performance (i.e., 49.5%/52.3%) than other detectors with a similar inference speed. We carefully conducted experiments to validate the effectiveness of each component. Our code is made available at https://github.com/meituan/YOLOv6.

Abstract

technical report

Related collections

Author and article information

Journal

Publisher: arXiv

Publication date (Electronic): 2022

Publication date Submitted: 07 September 2022

Publication date Updated: 08 September 2022

Publication date Available: September 2022

Article

DOI: 10.48550/ARXIV.2209.02976

SO-VID: a8ddc4d9-4c19-4d35-ba98-4dfd3f6f1424

License:

arXiv.org perpetual, non-exclusive license

History

Keywords: Computer Vision and Pattern Recognition (cs.CV),FOS: Computer and information sciences

Data availability:

Keywords: Computer Vision and Pattern Recognition (cs.CV), FOS: Computer and information sciences

YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications

Read this article at

Abstract

Abstract

Related collections

Nanopublications (single, attributable and machine-readable assertions in scientific literature)

Author and article information

Journal

Article

History

Comments

Comment on this article

Similar content 25

Cited by 26