Add lw-detr models by srishtiii28 · Pull Request #332 · mlverse/torchvision

srishtiii28 · 2026-06-14T12:00:01Z

Closes #328

Adds four LW-DETR variants, ported from the Atten4Vis implementation:

model_lw_detr_tiny - ViT-Ti (6 layers, embed_dim=192), 100 queries
model_lw_detr_small - ViT-Ti (10 layers, embed_dim=192), 300 queries
model_lw_detr_medium - ViT-S (10 layers, embed_dim=384), 300 queries
model_lw_detr_large - ViT-S (10 layers, embed_dim=384), 2-scale projector, 300 queries
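
For quick reference, the four variants can be written as a lookup table. This is only an illustrative Python sketch: the dictionary name, field names, and the `variant()` helper are hypothetical and are not part of the package. The values are copied from the list above, and the projector scale count is recorded only for the large variant because it is the only one stated.

```python
# Variant settings copied from the PR description; all names here are illustrative.
LW_DETR_VARIANTS = {
    "tiny":   {"backbone": "ViT-Ti", "layers": 6,  "embed_dim": 192, "queries": 100},
    "small":  {"backbone": "ViT-Ti", "layers": 10, "embed_dim": 192, "queries": 300},
    "medium": {"backbone": "ViT-S",  "layers": 10, "embed_dim": 384, "queries": 300},
    # Only the large variant is described as using a 2-scale projector.
    "large":  {"backbone": "ViT-S",  "layers": 10, "embed_dim": 384, "queries": 300,
               "projector_scales": 2},
}


def variant(name):
    """Look up a variant by short name, failing loudly on unknown names."""
    try:
        return LW_DETR_VARIANTS[name]
    except KeyError:
        known = ", ".join(sorted(LW_DETR_VARIANTS))
        raise ValueError(f"unknown variant {name!r}; expected one of: {known}")
```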

The architecture combines a ViT encoder with interleaved window and global attention, a C2f projector (from YOLOv8), and a 3-layer DETR decoder with deformable cross-attention. Two-stage query selection and Group DETR (13 groups) are used during training; only the primary group is used at inference. Pretrained COCO weights are fetched via download_and_cache(), and all four checkpoints load with zero missing or unexpected keys.
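
The train/inference split for Group DETR can be sketched as a slice over a flat query bank. This is an illustrative Python sketch, not the code in this PR. It assumes the bank stores the 13 groups back to back and that the primary group is the first slice; `select_queries` and `query_bank` are hypothetical names.

```python
NUM_GROUPS = 13  # group count stated in the PR description


def select_queries(query_bank, num_queries, training):
    """Return every group while training, and only the first group at inference.

    Assumes `query_bank` holds NUM_GROUPS * num_queries entries laid out
    group by group, with the primary group first (an assumption of this sketch).
    """
    expected = NUM_GROUPS * num_queries
    if len(query_bank) != expected:
        raise ValueError(f"expected {expected} queries, got {len(query_bank)}")
    if training:
        return query_bank
    return query_bank[:num_queries]


# With 300 queries per group, training sees 3900 queries and inference sees 300.
bank = list(range(NUM_GROUPS * 300))
train_queries = select_queries(bank, 300, training=True)
eval_queries = select_queries(bank, 300, training=False)
```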

Since torchvisionlib #25 is still open, deformable cross-attention uses a pure torch fallback built on nnf_grid_sample, with no CUDA dependency. The CUDA operation can be swapped in once that issue is resolved.
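
To show what a grid-sample fallback computes, here is a minimal pure-Python sketch of bilinear sampling followed by the attention-weighted sum used in deformable attention. It is not the code in this PR. It assumes a single-channel feature map, normalized coordinates in [-1, 1], align_corners = FALSE, and zero padding; the settings actually passed to nnf_grid_sample may differ. Both function names are hypothetical.

```python
import math


def bilinear_sample(feat, gx, gy):
    """Bilinearly sample a 2-D feature map (list of rows) at normalized (gx, gy).

    Coordinates are in [-1, 1]; this sketch assumes align_corners=False
    and treats out-of-range pixels as zero.
    """
    h, w = len(feat), len(feat[0])
    # Map normalized coordinates to pixel-center coordinates.
    x = ((gx + 1.0) * w - 1.0) / 2.0
    y = ((gy + 1.0) * h - 1.0) / 2.0
    x0, y0 = math.floor(x), math.floor(y)
    wx, wy = x - x0, y - y0

    def at(r, c):
        return feat[r][c] if 0 <= r < h and 0 <= c < w else 0.0

    return (at(y0, x0) * (1 - wx) * (1 - wy)
            + at(y0, x0 + 1) * wx * (1 - wy)
            + at(y0 + 1, x0) * (1 - wx) * wy
            + at(y0 + 1, x0 + 1) * wx * wy)


def deformable_sample(feat, points, weights):
    """Weighted sum of bilinear samples at the given sampling points."""
    return sum(wt * bilinear_sample(feat, gx, gy)
               for (gx, gy), wt in zip(points, weights))


feat = [[0.0, 1.0],
        [2.0, 3.0]]
center = bilinear_sample(feat, 0.0, 0.0)  # average of the four pixels
mixed = deformable_sample(feat, [(0.0, 0.0), (0.5, 0.5)], [0.5, 0.5])
```

The CUDA kernel would replace the per-point Python loop with a fused operation; the arithmetic it performs per sampling point is the same in this sketch.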

Input images should be ImageNet-normalized tensors of shape (B, 3, H, W), with square dimensions (H = W) divisible by 64; 640×640 is recommended. The output is a list with one entry per image, each containing boxes (xyxy, in pixels), labels, and scores.
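
The input constraints and the box format can be sketched as two small checks. This Python sketch is illustrative and not part of the package; both function names are hypothetical. The conversion assumes the decoder predicts normalized (cx, cy, w, h) boxes, as is common for DETR-style heads, which the description above does not state.

```python
def check_input_shape(shape):
    """Validate a (B, 3, H, W) shape: 3 channels, square, side divisible by 64."""
    if len(shape) != 4:
        raise ValueError(f"expected 4 dimensions (B, 3, H, W), got {len(shape)}")
    _, channels, height, width = shape
    if channels != 3:
        raise ValueError(f"expected 3 channels, got {channels}")
    if height != width:
        raise ValueError(f"expected a square image, got {height}x{width}")
    if height % 64 != 0:
        raise ValueError(f"side length {height} is not divisible by 64")
    return True


def cxcywh_to_xyxy_pixels(box, size):
    """Convert a normalized (cx, cy, w, h) box to (x1, y1, x2, y2) in pixels.

    `size` is the side length of the square input image.
    """
    cx, cy, bw, bh = box
    return ((cx - bw / 2) * size, (cy - bh / 2) * size,
            (cx + bw / 2) * size, (cy + bh / 2) * size)


ok = check_input_shape((1, 3, 640, 640))
pixel_box = cxcywh_to_xyxy_pixels((0.5, 0.5, 0.25, 0.5), 640)
```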

Add lw-detr models

d87c3ff

srishtiii28 marked this pull request as draft June 14, 2026 12:00

Clean up lw-detr: remove unused imports and params

84fe947

srishtiii28 marked this pull request as ready for review June 14, 2026 17:56

srishtiii28 marked this pull request as draft June 19, 2026 07:08

srishtiii28 added 2 commits June 19, 2026 12:58

Add lw-detr docs, NEWS entry, and tests

c4d0790

Merge branch 'main' of https://github.com/mlverse/torchvision into fe…

e52a1b0

…ature/lw-detr

srishtiii28 force-pushed the feature/lw-detr branch from d0703f0 to e52a1b0 Compare June 19, 2026 07:32

srishtiii28 marked this pull request as ready for review June 19, 2026 07:33

srishtiii28 marked this pull request as draft June 21, 2026 21:13

srishtiii28 closed this Jun 24, 2026

srishtiii28 wants to merge 4 commits into mlverse:main from srishtiii28:feature/lw-detr