Adapt segmentation trainer to work with ViT #1403
Conversation
I assume this will also be needed in PixelwiseRegressionTask?
This is how I have hacked it so far.
@@ -93,7 +104,7 @@ def config_task(self) -> None:
         _, state_dict = utils.extract_backbone(weights)
     else:
         state_dict = get_weight(weights).get_state_dict(progress=True)
-    self.model.encoder.load_state_dict(state_dict)
+    self.model.encoder.load_state_dict(state_dict, strict=False)
This is done because for the tu-vit model, self.model.encoder has head.weight and bias.weight keys, which a pretrained model does not have, so a strict load would fail on the extra keys.
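The bookkeeping that `strict=False` relaxes can be sketched without PyTorch. This is only an illustration of the key-matching logic (the key names below are hypothetical ViT parameter names, not taken from the PR):

```python
def match_state_dicts(model_keys, ckpt_keys):
    """Mimic load_state_dict's bookkeeping: keys the model expects but the
    checkpoint lacks are "missing"; extra checkpoint keys are "unexpected".
    With strict=True either list being non-empty raises; strict=False just
    skips them."""
    missing = sorted(set(model_keys) - set(ckpt_keys))
    unexpected = sorted(set(ckpt_keys) - set(model_keys))
    return missing, unexpected

# A tu-vit encoder carries an extra classification head;
# the pretrained checkpoint does not (illustrative key names).
model_keys = [
    "patch_embed.proj.weight",
    "blocks.0.attn.qkv.weight",
    "head.weight",
    "head.bias",
]
ckpt_keys = ["patch_embed.proj.weight", "blocks.0.attn.qkv.weight"]

missing, unexpected = match_state_dicts(model_keys, ckpt_keys)
print(missing)     # the head keys absent from the checkpoint
print(unexpected)  # nothing extra in the checkpoint
```

With `strict=True` the mismatch above would raise a `RuntimeError`; `strict=False` loads the overlapping keys and leaves the head randomly initialized.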
Additionally, the state dict of self.model.encoder uses keys named model.{module}, whereas ResNet backbones for Unet use plain {module} keys.
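The prefix mismatch can be fixed by renaming checkpoint keys before loading. A minimal sketch, assuming the `model.` prefix described above (`strip_prefix` is a hypothetical helper, not part of torchgeo):

```python
from collections import OrderedDict


def strip_prefix(state_dict, prefix="model."):
    """Drop a leading prefix from every key so checkpoint names line up
    with the encoder's own parameter names."""
    return OrderedDict(
        (k[len(prefix):] if k.startswith(prefix) else k, v)
        for k, v in state_dict.items()
    )


ckpt = OrderedDict(
    [
        ("model.patch_embed.proj.weight", 0),
        ("model.blocks.0.norm1.weight", 1),
    ]
)
print(list(strip_prefix(ckpt)))
```

Note that the diff above uses `k.replace("model.", "")`, which rewrites every occurrence of the substring; matching only a leading prefix with `startswith` avoids mangling keys that happen to contain `model.` elsewhere.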
@@ -37,6 +37,9 @@ def extract_backbone(path: str) -> tuple[str, "OrderedDict[str, Tensor]"]:
     state_dict = OrderedDict(
         {k.replace("model.", ""): v for k, v in state_dict.items()}
     )
+elif "vits16" in checkpoint["hyper_parameters"]["backbone"]:
The state dict of self.model.encoder uses keys named model.{module}. This naming is created in the conversion script.
This seems very hyper-specific but I don't know enough to offer an alternative
Needs shebang, copyright, executable
@@ -36,19 +36,30 @@ def config_task(self) -> None:
     """Configures the task based on kwargs parameters passed to the constructor."""
     weights = self.hyperparams["weights"]

+    if self.hyperparams["backbone"].startswith("tu-vit"):
+        encoder_depth = 4
+        decoder_channels = (256, 128, 64, 32)
-        decoder_channels = (256, 128, 64, 32)
+        decoder_channels: tuple[int, ...] = (256, 128, 64, 32)
This should placate mypy
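The reason the annotation helps: mypy infers the type of a tuple literal as a fixed-length `tuple[int, int, int, int]`, so assigning a tuple of a different length in another branch fails type checking. Annotating the variable as `tuple[int, ...]` makes both branches compatible. A small sketch (the backbone name and the else-branch tuple are illustrative, not from the PR):

```python
# Without the annotation, mypy would infer tuple[int, int, int, int] from
# the first assignment and reject the 5-element tuple in the else branch.
backbone = "tu-vit_small_patch16_224"  # hypothetical backbone name

decoder_channels: tuple[int, ...]
if backbone.startswith("tu-vit"):
    decoder_channels = (256, 128, 64, 32)
else:
    decoder_channels = (256, 128, 64, 32, 16)  # ok: lengths may differ

print(decoder_channels)
```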
I don't know why the tests are failing, but we need to fix that.