Add LW-DETR object detection models by srishtiii28 · Pull Request #334 · mlverse/torchvision

srishtiii28 · 2026-06-24T17:58:21Z

Closes #328

Implements LW-DETR (https://arxiv.org/abs/2406.03459), a real-time detection transformer

Adds model_lw_detr_tiny(), model_lw_detr_small(), model_lw_detr_medium(), and model_lw_detr_large() with COCO-pretrained weights. The architecture comprises a ViT backbone, a C2f multi-scale projector, and a two-stage deformable-attention decoder. All four variants load the official weights with exact key/shape match and reproduce reference detections; pixel_mask is supported for letterboxed (non-square) inputs.

cregouby

praise again an impressive contribution
todo see inline

cregouby · 2026-06-28T05:45:08Z

+  },
+  forward = function(x) {
+    patches <- self$embeddings(x)
+    B <- patches$size(1L); H <- patches$size(2L); W <- patches$size(3L); C <- patches$size(4L)


todo style We prefer not to use multiple instruction in one line. I.e. avoid the ";"
suggestion you can use the zeallot multi affectation %<-% (as soon as patches is always 4-D)

Suggested change

B <- patches$size(1L); H <- patches$size(2L); W <- patches$size(3L); C <- patches$size(4L)

c(B,H,W,C) %<-% patches$size

cregouby · 2026-06-28T09:45:49Z

+    for (i in seq_along(feats)) {
+      f  <- feats[[i]]
+      h_i <- as.integer(f$size(3L)); w_i <- as.integer(f$size(4L))
+      shapes[[i]]   <- c(h_i, w_i)
+      lvl_start     <- c(lvl_start, cur); cur <- cur + h_i * w_i


todo performance lvl_start <- c(lvl_start, cur) reallocates the vector at each iteration which is very costly in performance.

Suggested change

for (i in seq_along(feats)) {

f <- feats[[i]]

h_i <- as.integer(f$size(3L)); w_i <- as.integer(f$size(4L))

shapes[[i]] <- c(h_i, w_i)

lvl_start <- c(lvl_start, cur); cur <- cur + h_i * w_i

lvl_start <- integer(length(feats))

for (i in seq_along(feats)) {

f <- feats[[i]]

h_i <- as.integer(f$size(3L))

w_i <- as.integer(f$size(4L))

shapes[[i]] <- c(h_i, w_i)

lvl_start[i] <- cur

cur <- cur + h_i * w_i

cregouby · 2026-06-28T10:04:30Z

praise this is a massive addition ( actually the largest in the code base), thanks and congratulation !
todo style multiple affectation can be turned nicer using zeallot %<-% (see exemple in line L202)
todo style we do not use multiple command per line with ";" air formatter will fix it in a single pass via $ air format R/models-lw_detr.R
suggestion performance list() preallocation should be allocated with the right size to prevent reallocation. We can manage this as a specific issue (as it is wider than your contribution).

cregouby · 2026-06-28T10:15:20Z

+    shortcut <- x
+
+    if (!self$window) {
+      bw <- x$size(1L); N <- x$size(2L); C <- x$size(3L)


suggestion You may use %<-% instead

cregouby · 2026-06-28T10:19:19Z

+  },
+  forward = function(x, x_value = NULL) {
+    if (is.null(x_value)) x_value <- x
+    B <- x$size(1L); N <- x$size(2L); C <- x$size(3L)


suggestionYou may use %<-%

cregouby · 2026-06-28T10:35:00Z

+    self$depth <- depth
+  },
+  forward = function(x, out_flags) {
+    out <- list()


todo perfromance please allocate the out list to its target size (to prevent the most impactfull reallocation time)

cregouby · 2026-06-28T10:42:01Z

+#' norm_std  <- c(0.229, 0.224, 0.225)
+#'
+#' # Letterbox a non-square image to 640x640 and build the matching pixel mask
+#' img <- magick_loader("path/to/image.jpg") |> transform_to_tensor()


todo Exemple must be executable without thinking by the end user. So please choose a working image to load
suggestion you may use an image by url of your choice (or the demo one in the lw-detr repo : "https://github.com/Atten4Vis/LW-DETR/blob/main/demo/000000496954.jpg?raw=true")

cregouby · 2026-06-28T13:15:23Z

+#' pred <- torch::with_no_grad(
+#'   model(canvas$unsqueeze(1), pixel_mask = mask$unsqueeze(1))
+#' )$detections[[1]]
+#' labels <- coco_classes(as.integer(pred$labels))


improvement missing Could we end the exemple with the code for a visual result through `draw_bounding_box(..) |> .. |> tensor_image_browse()

cregouby · 2026-06-28T13:25:38Z

+  skip_if(Sys.getenv("TEST_LARGE_MODELS", unset = 0) != 1,
+          "Skipping test: set TEST_LARGE_MODELS=1 to enable tests requiring large downloads.")


improvement We usually apply skip_if(TEST_LARGE_MODEL) to model larger than 100MB. So I would remove for the tiny model

cregouby · 2026-06-28T13:30:10Z

praise I like a lot the test for correctness at the end of the tiny pretrained model. A good practice to generalize !
todo missing Can we have the "tests for pretrained model_lw_detr_tiny" duplicated to cover "tests for pretrained model_lw_detr_small" and "tests for pretrained model_lw_detr_medium" and "tests for pretrained model_lw_detr_large" ? (all with a skip_if TEST_LARGE_MODEL != 1)

srishtiii28 added 2 commits June 24, 2026 23:23

Add LW-DETR object detection models

5e173a9

Updated NEWS.md

3d11f48

srishtiii28 marked this pull request as draft June 24, 2026 17:59

srishtiii28 added 2 commits June 24, 2026 23:34

Add models-lw_detr.R to Collate

592c49b

Cross-link model_lw_detr in detection model docs

6de639d

srishtiii28 marked this pull request as ready for review June 24, 2026 18:08

Add pixel_mask test for model_lw_detr

2a32933

cregouby reviewed Jun 26, 2026

View reviewed changes

Comment thread NEWS.md

Updated NEWS.md to fix misplacement of model

c333e16

cregouby requested changes Jun 28, 2026

View reviewed changes

cregouby and others added 2 commits June 28, 2026 15:49

usethis::use_air()

9ad1337

Address review feedback for lw_detr model

11cd2af

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add LW-DETR object detection models#334

Add LW-DETR object detection models#334
srishtiii28 wants to merge 8 commits into
mlverse:mainfrom
srishtiii28:feature/lw-detr-clean

srishtiii28 commented Jun 24, 2026

Uh oh!

Uh oh!

cregouby left a comment

Uh oh!

cregouby Jun 28, 2026

Uh oh!

cregouby Jun 28, 2026

Uh oh!

cregouby Jun 28, 2026 •

edited

Loading

Uh oh!

cregouby Jun 28, 2026

Uh oh!

cregouby Jun 28, 2026

Uh oh!

cregouby Jun 28, 2026

Uh oh!

cregouby Jun 28, 2026

Uh oh!

cregouby Jun 28, 2026

Uh oh!

cregouby Jun 28, 2026

Uh oh!

cregouby Jun 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	B <- patches$size(1L); H <- patches$size(2L); W <- patches$size(3L); C <- patches$size(4L)
	c(B,H,W,C) %<-% patches$size

		skip_if(Sys.getenv("TEST_LARGE_MODELS", unset = 0) != 1,
		"Skipping test: set TEST_LARGE_MODELS=1 to enable tests requiring large downloads.")

Uh oh!

Conversation

srishtiii28 commented Jun 24, 2026

Uh oh!

Uh oh!

cregouby left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cregouby Jun 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cregouby Jun 28, 2026 •

edited

Loading