New: Ultralytics YOLO-Human #12702

Laughing-q · 2024-05-15T08:25:36Z

🛠️ PR Summary

Made with ❤️ by Ultralytics Actions

🌟 Summary

Introducing the new YOLOHuman model for human attribute detection! 🚀

📊 Key Changes

Added YOLOHuman class as part of the model imports.
Introduced a new YAML configuration for the YOLOv8 human detection model.
Implemented additional augmentations to include human attribute data handling.
Established a new dataset class, HumanDataset, for loading and processing human-related datasets.
Included Human object in results to encapsulate detected human attributes.
Enriched model __init__.py to include YOLOHuman.
Formulated HumanPredictor, HumanTrainer, and HumanValidator under the new YOLO human module for prediction, training, and validation.

🎯 Purpose & Impact

Enhances Model Catalog: Expands Ultralytics' model offerings to include human-specific attribute detection.
Improves Dataset Handling: Offers a streamlined process for datasets involving human features.
Facilitates Human-centric Applications: Paves the way for more sophisticated applications such as demographic analysis, security enhancements, and personalized customer experiences.


glenn-jocher · 2024-05-30T21:09:26Z

@Laughing-q @ambitious-octopus val test is also failing because our save_txt functionality is not adapted correctly (should output txt files in the same format as the labels).

yolo val human model=weights/yolov8n-human.pt data=human8.yaml imgsz=32 save_txt

Save JSON is working correctly:

yolo val human model=weights/yolov8n-human.pt data=human8.yaml imgsz=32 save_json

ambitious-octopus · 2024-05-31T08:01:14Z

@glenn-jocher Updated https://github.com/ultralytics/assets/releases/download/v8.2.0/yolov8n-human.pt with a freshly trained model.

Laughing-q · 2024-05-31T08:11:40Z

@glenn-jocher @ambitious-octopus Guys, I removed the YOLOHuman class since it's not needed now that we treat human as a new YOLO task.
I've also fixed the save_one_txt issue for the human task. One more update from me: I figured we can use the save_txt method in Results directly, instead of recreating something similar and redundant for the val mode of each task.

ultralytics/ultralytics/engine/results.py

Lines 332 to 366 in bac868c

    
    def save_txt(self, txt_file, save_conf=False):
        """
        Save predictions into txt file.

        Args:
            txt_file (str): txt file path.
            save_conf (bool): save confidence score or not.
        """
        is_obb = self.obb is not None
        boxes = self.obb if is_obb else self.boxes
        masks = self.masks
        probs = self.probs
        kpts = self.keypoints
        texts = []
        if probs is not None:
            # Classify
            [texts.append(f"{probs.data[j]:.2f} {self.names[j]}") for j in probs.top5]
        elif boxes:
            # Detect/segment/pose
            for j, d in enumerate(boxes):
                c, conf, id = int(d.cls), float(d.conf), None if d.id is None else int(d.id.item())
                line = (c, *(d.xyxyxyxyn.view(-1) if is_obb else d.xywhn.view(-1)))
                if masks:
                    seg = masks[j].xyn[0].copy().reshape(-1)  # reversed mask.xyn, (n,2) to (n*2)
                    line = (c, *seg)
                if kpts is not None:
                    kpt = torch.cat((kpts[j].xyn, kpts[j].conf[..., None]), 2) if kpts[j].has_visible else kpts[j].xyn
                    line += (*kpt.reshape(-1).tolist(),)
                line += (conf,) * save_conf + (() if id is None else (id,))
                texts.append(("%g " * len(line)).rstrip() % line)

        if texts:
            Path(txt_file).parent.mkdir(parents=True, exist_ok=True)  # make directory
            with open(txt_file, "a") as f:
                f.writelines(text + "\n" for text in texts)

ultralytics/ultralytics/models/yolo/detect/val.py

Lines 256 to 263 in 7593b4a

    
    def save_one_txt(self, predn, save_conf, shape, file):
        """Save YOLO detections to a txt file in normalized coordinates in a specific format."""
        gn = torch.tensor(shape)[[1, 0, 1, 0]]  # normalization gain whwh
        for *xyxy, conf, cls in predn.tolist():
            xywh = (ops.xyxy2xywh(torch.tensor(xyxy).view(1, 4)) / gn).view(-1).tolist()  # normalized xywh
            line = (cls, *xywh, conf) if save_conf else (cls, *xywh)  # label format
            with open(file, "a") as f:
                f.write(("%g " * len(line)).rstrip() % line + "\n")

So I updated our detect/obb/human tasks to use Results.save_txt. The other tasks, i.e. segment/pose, actually have the save_one_txt part commented out, so I left them as they are for now; let's develop these two in another PR later.

ultralytics/ultralytics/models/yolo/segment/val.py

Lines 157 to 158 in 7593b4a

    
    # if self.args.save_txt:
    #    save_one_txt(predn, save_conf, shape, file=save_dir / 'labels' / f'{path.stem}.txt')
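For readers following the save_txt discussion, here is a minimal pure-Python sketch of the per-box line that save_one_txt writes for the detect task: a pixel-space xyxy box is converted to normalized xywh and formatted with "%g". The helper names `xyxy_to_xywhn` and `format_label_line` are illustrative assumptions, not Ultralytics functions, and the sketch deliberately avoids torch; it does not cover the extra attribute columns of the human task.

```python
def xyxy_to_xywhn(xyxy, shape):
    """Convert a pixel-space (x1, y1, x2, y2) box to normalized (xc, yc, w, h).

    `shape` is (height, width), the same order as the `shape` argument of
    save_one_txt in the embed above. Helper name is hypothetical.
    """
    x1, y1, x2, y2 = xyxy
    h, w = shape
    return ((x1 + x2) / 2 / w, (y1 + y2) / 2 / h, (x2 - x1) / w, (y2 - y1) / h)


def format_label_line(cls, xyxy, conf, shape, save_conf=False):
    """Build one 'cls xc yc w h [conf]' line using "%g" formatting."""
    line = (cls, *xyxy_to_xywhn(xyxy, shape))
    # Multiplying a tuple by a bool keeps it (True == 1) or empties it (False == 0).
    line += (conf,) * save_conf
    return ("%g " * len(line)).rstrip() % line


print(format_label_line(0, (10, 20, 30, 60), 0.9, shape=(100, 200)))
print(format_label_line(0, (10, 20, 30, 60), 0.9, shape=(100, 200), save_conf=True))
```

The tuple-times-bool idiom is the same one Results.save_txt uses for its optional confidence column.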

ambitious-octopus · 2024-05-31T08:14:41Z

@ambitious-octopus test_data_utils() is failing because we need to upload a HUB-format dataset to https://github.com/ultralytics/hub/tree/main/example_datasets to join the datasets for other tasks.

@glenn-jocher Uploaded HUB-format dataset PR.

Laughing-q · 2024-05-31T08:22:41Z

@glenn-jocher Meanwhile I noticed that we have a lot of duplicated code in Validator.update_metrics across different tasks.

ultralytics/ultralytics/models/yolo/segment/val.py

Line 100 in 7593b4a

def update_metrics(self, preds, batch):

Looking at this PR: #12645, the author had to update every val.py to add the feature, because we have multiple modified versions of this method. That's also why @ambitious-octopus hit the missing target_img key issue after that PR was merged (I had to create another modified version for the human task).
I actually tried to refactor this part a bit in the OBB PR, so we don't have a modified version of update_metrics sitting in obb/val.py. But there's still a lot of duplication with the other tasks.
I think I'll revisit this code and try to eliminate as much of the duplication as possible in another PR later.
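One possible shape for such a refactor, sketched under assumptions: the shared per-batch loop lives once in a base class and each task overrides only small hooks. The class names, hook names, and batch keys below are hypothetical and are not the actual Ultralytics validator API.

```python
class BaseValidatorSketch:
    """Hypothetical base class that owns the shared per-batch loop."""

    def __init__(self):
        self.stats = []

    def update_metrics(self, preds, batch):
        # The loop is written once; task-specific work is delegated to hooks.
        for i, pred in enumerate(preds):
            target = self.prepare_target(i, batch)
            pred = self.prepare_pred(pred, target)
            self.stats.append(self.match(pred, target))

    def prepare_target(self, i, batch):
        return batch["labels"][i]

    def prepare_pred(self, pred, target):
        return pred

    def match(self, pred, target):
        return pred == target


class HumanValidatorSketch(BaseValidatorSketch):
    """Hypothetical task subclass that overrides hooks, not the whole loop."""

    def prepare_target(self, i, batch):
        # Task-specific extra: carry per-sample attributes with the label.
        return (batch["labels"][i], batch["attributes"][i])

    def match(self, pred, target):
        label, _attributes = target
        return pred == label


v = HumanValidatorSketch()
v.update_metrics(preds=[1, 0], batch={"labels": [1, 1], "attributes": [[30], [25]]})
print(v.stats)
```

With this layout, a feature added to the base `update_metrics` loop would reach every task without editing each val.py separately.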

Laughing-q · 2024-05-31T08:27:03Z

@ambitious-octopus the CI tests are failing, which seems to be related to the newly updated yolov8n-human.pt.

That's because the model was trained with task=detect, but we're now updating the logic here to treat human as a new task, so the tests fail when training from the new yolov8n-human.pt.
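To illustrate this kind of mismatch, here is a small sketch of a guard that compares the task a checkpoint was trained with against the task being requested. `check_task` and the `checkpoint_meta` dict are hypothetical stand-ins; this is not how the Ultralytics loader is implemented, only an illustration of the failure mode.

```python
def check_task(checkpoint_meta, requested_task):
    """Raise early if a checkpoint was trained for a different task.

    `checkpoint_meta` is a hypothetical dict standing in for whatever
    metadata a real checkpoint stores about its training run.
    """
    trained_task = checkpoint_meta.get("task")
    if trained_task != requested_task:
        raise ValueError(
            f"checkpoint was trained with task={trained_task!r}, "
            f"but task={requested_task!r} was requested"
        )
    return trained_task


print(check_task({"task": "human"}, "human"))
try:
    check_task({"task": "detect"}, "human")
except ValueError as err:
    print(err)
```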

Laughing-q · 2024-05-31T08:30:01Z

@ambitious-octopus let's reset the weights back, and later today I'll launch new training runs on our server with task=human to get all model sizes. :)

Laughing-q · 2024-05-31T09:07:37Z

@glenn-jocher @ambitious-octopus OK, I've re-uploaded the weights and now everything works properly in the tests except the hub dataset, which I guess will be fine once the PR that @ambitious-octopus opened is merged. :)

Several GPUs have now been freed up on our server, so I'll launch several training runs right now.

ambitious-octopus · 2024-05-31T10:22:07Z

docs image

Burhan-Q

I did not serve the docs locally, just quickly reviewed the raw markdown on GitHub. A few notes and suggestions, but overall looks excellent!

Burhan-Q · 2024-05-31T14:50:11Z

docs/en/reference/models/yolo/human/predict.md

@@ -0,0 +1,16 @@
+---
+description: TODO ADD DESCRIPTION


Fill in TODO sections for predict.md and train.md

Burhan-Q · 2024-05-31T14:51:57Z

docs/en/tasks/human.md

+Human detection and attributes estimation is a task that involves identifying humans in an image or video stream and estimating their attributes, such as age, gender, weight, height, and ethnicity.
+The output of the detector is a set of bounding boxes that enclose the humans in the image, along with class labels, confidence scores, and estimated attributes for each person. This task is useful for applications in surveillance, retail analytics, and human-computer interaction.
+
+## [Models](https://github.com/ultralytics/ultralytics/tree/main/ultralytics/cfg/models/v8)


Do we want the header to be a link like this? I haven't looked at how it renders, but I don't think this is normally something we do.

Burhan-Q · 2024-05-31T14:52:36Z

docs/en/tasks/human.md

+
+## [Models](https://github.com/ultralytics/ultralytics/tree/main/ultralytics/cfg/models/v8)
+
+YOLOv8 pretrained Human models are shown here. Detect, Segment and Pose models are pretrained on the [COCO](https://github.com/ultralytics/ultralytics/blob/main/ultralytics/cfg/datasets/coco.yaml) dataset, while Classify models are pretrained on the [ImageNet](https://github.com/ultralytics/ultralytics/blob/main/ultralytics/cfg/datasets/ImageNet.yaml) dataset.


Instead of referencing the YAML files, probably better to reference the docs pages for the relevant datasets.

Burhan-Q · 2024-05-31T14:53:45Z

docs/en/tasks/human.md

+
+YOLOv8 pretrained Human models are shown here. Detect, Segment and Pose models are pretrained on the [COCO](https://github.com/ultralytics/ultralytics/blob/main/ultralytics/cfg/datasets/coco.yaml) dataset, while Classify models are pretrained on the [ImageNet](https://github.com/ultralytics/ultralytics/blob/main/ultralytics/cfg/datasets/ImageNet.yaml) dataset.
+
+[Models](https://github.com/ultralytics/ultralytics/tree/main/ultralytics/cfg/models) download automatically from the latest Ultralytics [release](https://github.com/ultralytics/assets/releases) on first use.


Not sure if Models here needs to be a link.

Burhan-Q · 2024-05-31T14:57:35Z

docs/en/tasks/human.md

+
+!!! note
+
+    It is important to note that these models have been trained on a specially curated, artificially annotated version of the COCO dataset. This custom dataset was meticulously crafted to enhance the models' performance on specific tasks by incorporating additional annotations and adjustments beyond those available in the public COCO dataset. Due to proprietary reasons, this enhanced version of the dataset is not publicly available. The artificial annotations were designed to provide more comprehensive and nuanced data, enabling the models to achieve higher accuracy and robustness in their predictions. The proprietary nature of this dataset ensures that the models possess a competitive edge, offering advanced capabilities and superior performance in their respective applications.


A few notes:

I think that "artificially annotated" should be bolded and underlined, i.e. **<u>artificially annotated</u>**, for emphasis

I don't think a reason ("Due to proprietary reasons") for why the dataset isn't available should be given, just that we've decided to not make it publicly available.

I think this section should be removed "ensures that the models possess a competitive edge, offering advanced capabilities and superior performance" as it's fairly baseless and seems (to me) like it's asking for trouble.

Burhan-Q · 2024-05-31T15:14:50Z

docs/en/datasets/human/index.md

+
+- Weight (Kg): The weight of the person is annotated in kilograms. This numeric value is essential for applications requiring precise biometric data.
+
+- Height (Cm): The height of the person is annotated in centimeters. Accurate height measurements are crucial for many analytical and identification purposes.


- Height (Cm)
+ Height (cm)

Burhan-Q · 2024-05-31T15:15:50Z

docs/en/models/yolo-human.md

+- **Real-time Human Attribute Estimation**: Leverages the computational speed of CNNs to provide fast and accurate human attribute estimation in real-time.
+- **Efficiency and Performance**: Optimized for reduced computational and resource requirements without sacrificing performance, enabling deployment in real-time applications.
+- **Comprehensive Attribute Estimation**: Capable of estimating multiple human attributes such as age, gender, ethnicity, weight, and height, providing detailed demographic analysis.
+- **Detection of Flocks of Humans**: Enhanced to detect both individual humans and groups of humans, expanding its applicability in various scenarios.


Flocks of Humans

Burhan-Q · 2024-05-31T15:17:16Z

docs/en/models/yolo-human.md

+
+!!! note
+
+    It is important to note that these models have been trained on a specially curated, artificially annotated version of the COCO dataset. This custom dataset was meticulously crafted to enhance the models' performance on specific tasks by incorporating additional annotations and adjustments beyond those available in the public COCO dataset. Due to proprietary reasons, this enhanced version of the dataset is not publicly available. The artificial annotations were designed to provide more comprehensive and nuanced data, enabling the models to achieve higher accuracy and robustness in their predictions. The proprietary nature of this dataset ensures that the models possess a competitive edge, offering advanced capabilities and superior performance in their respective applications.


See comments in docs/en/tasks/human.md on this text.

Burhan-Q · 2024-05-31T15:18:33Z

docs/en/models/yolo-human.md

+
+### Val Usage
+
+Validate trained YOLOv8n-human model accuracy on the COCO8-human dataset. No argument need to passed as the `model` retains it's training `data` and arguments as model attributes.


I would include data="coco8-human" even tho it will work without. Trust me, it will avoid confusion later.

Burhan-Q · 2024-05-31T15:20:02Z

docs/en/tasks/human.md

+
+## Val
+
+Validate trained YOLOv8n-human model accuracy on the COCO8-human dataset. No argument need to passed as the `model` retains it's training `data` and arguments as model attributes.


Same comment from earlier

Laughing-q and others added 30 commits May 14, 2024 10:37

add HumanDataset

ded08c9

update augmentation pipeline

b318d8c

add HumanDetect

3556627

update HumanDetect

7d4dd77

improve DFL robustness

c1a3445

add yolov8-human.yaml

62dc8ca

add Human loss

f3b31c8

add human trainer/validator/predictor

6ac588c

update

1eab940

update model

73ce12d

update build_dataset

7f15953

add YOLOHuman

962ce31

clean useless arg

488efc4

fix

bc444b1

update loss

855fdd2

add preprocess in val

b00860b

fix nms

affc0dc

fix training prompt

2873f35

update YOLOHuman

f264ace

add Human in Results

2a51b44

add visualization

7b4c5d8

update human plot color

d335815

add metrics

73562ca

Merge branch 'main' into human

18d953f

update

f12f57c

add plot_results for human

9c56c8d

simplify accracy

c89acd8

update

d319bef

add plot_attributes

44934c8

Auto-format by https://ultralytics.com/actions

ad04c8c

Burhan-Q and others added 2 commits May 30, 2024 21:03

Revert "habana_frameworks imports for training"

a2a4676

This reverts commit bd5587c.

Merge branch 'main' into human

7849ac7

glenn-jocher force-pushed the main branch from a2a4676 to cbcb494 on May 30, 2024 21:17

Laughing-q and others added 9 commits May 31, 2024 15:13

remove YOLOHuman

f85d4d3

update save_txt for human

fab1146

changed human8 to coco8-human

b6a5aea

updated download link for coco8-human

084b835

update save_one_txt by using Results

96c552c

update

f3557dd

Auto-format by https://ultralytics.com/actions

13f3f90

fix

3377dc1

Auto-update Ultralytics Docs by https://ultralytics.com/actions

def81fc

update letter

000df7b

ambitious-octopus added 2 commits May 31, 2024 11:45

updated human task docs page

773df49

added new human task to task-index page

e1a397a

ambitious-octopus added 4 commits May 31, 2024 12:24

added coco8-human dataset docs

b037c11

added YOLO-human model page

6db963f

added dataset notice

7658a49

Merge branch 'main' into human

8f7459d

Burhan-Q reviewed May 31, 2024


Merge branch 'main' into human

600320c
