[FEATURE] Support FasterViT #1842

seefun · 2023-06-12T06:23:51Z

"FasterViT: Fast Vision Transformers with Hierarchical Attention"

The code is built on timm and provides pretrained weights for ImageNet-1k. However, it contains many custom layers that differ from timm's implementations, so I'm not sure whether we would need to make significant adjustments to this code.

It looks interesting, but it doesn't seem like the paper has been released.

rwightman · 2023-06-13T17:52:49Z

Yeah, noticed this one. It is timm-oriented but, as always, it has baked-in square image size assumptions and puts the downsample at the end of the blocks, so it needs a decent amount of attention to fix and remap :(
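To illustrate the square image size concern, here is a minimal sketch in plain Python. It is not taken from the FasterViT or timm code; the function names `grid_from_tokens_square` and `grid_from_input` are hypothetical. It contrasts recovering the feature-map shape from the token count (which only works for square inputs) with tracking height and width explicitly.

```python
import math
from typing import Tuple


def grid_from_tokens_square(num_tokens: int) -> Tuple[int, int]:
    """Recover (H, W) from a token count, assuming a square feature map.

    Hypothetical helper showing the assumption being criticised.
    """
    side = math.isqrt(num_tokens)
    if side * side != num_tokens:
        raise ValueError(f"{num_tokens} tokens do not form a square grid")
    return side, side


def grid_from_input(img_size: Tuple[int, int], stride: int) -> Tuple[int, int]:
    """Track (H, W) explicitly from the input size and the total stride."""
    height, width = img_size
    return height // stride, width // stride


# A 128x512 input at stride 16 yields an 8x32 grid, i.e. 256 tokens.
true_grid = grid_from_input((128, 512), stride=16)
# The square assumption sees 256 tokens and silently reports 16x16.
assumed_grid = grid_from_tokens_square(true_grid[0] * true_grid[1])
```

The second case is the troublesome one: no error is raised, but any reshape or window partition based on `assumed_grid` would scramble the spatial layout.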

I really truly don't understand the obsession with putting downsample at the end of vit/hybrid blocks :(

Other thing is, I've never found gcvit (same authors) to be particularly easy to train or fine-tune (including reproducing the original results) compared to vit, swin, and convnext (which I've successfully managed to reproduce and improve on the originals). I wonder how this compares. Given the complexity of the model code, I found the throughput #s surprising, as more code usually == more activations and slower speeds.

tp-nan · 2023-08-11T09:18:25Z

Hi guys, is there any update on this issue? The throughput is really high.

youssefadr · 2023-09-03T14:16:47Z

Hi, I can take this one. I'll begin by moving the downsamples as mentioned here
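Moving a downsample from the end of one stage to the start of the next also means remapping the pretrained checkpoint keys. A minimal sketch of such a remap follows. The key layout `stages.{i}.downsample.*` and the function `remap_downsample_keys` are assumptions for illustration, not the actual FasterViT or timm naming.

```python
import re
from typing import Any, Dict

# Hypothetical key layout: "stages.<index>.downsample.<rest>".
_DOWNSAMPLE_KEY = re.compile(r"^stages\.(\d+)\.downsample\.(.+)$")


def remap_downsample_keys(state_dict: Dict[str, Any]) -> Dict[str, Any]:
    """Shift each downsample entry from stage i to stage i + 1.

    All other keys are copied unchanged. A new dict is built so that
    shifted keys never overwrite entries that are still to be read.
    """
    remapped: Dict[str, Any] = {}
    for key, value in state_dict.items():
        match = _DOWNSAMPLE_KEY.match(key)
        if match:
            stage_idx = int(match.group(1))
            key = f"stages.{stage_idx + 1}.downsample.{match.group(2)}"
        remapped[key] = value
    return remapped


# Placeholder values stand in for tensors.
original = {
    "stages.0.blocks.0.weight": "w_block",
    "stages.0.downsample.conv.weight": "w_down0",
    "stages.1.downsample.conv.weight": "w_down1",
}
converted = remap_downsample_keys(original)
```

In a real port, the model definition would need the matching change (each stage applying its downsample before its blocks), and the first stage would have no downsample entry after the shift.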

seefun added the enhancement label Jun 12, 2023

rwightman mentioned this issue Aug 21, 2023

[FEATURE] Support EfficientViT #1815

Closed
