Feat: Add SAHI (Slicing Aided Hyper Inference) Transform by DerrickUnleashed · Pull Request #329 · mlverse/torchvision

DerrickUnleashed · 2026-06-10T16:49:12Z

This PR Adds:

The implementation of transform_sahi_crop()
The implementation of target_transform_sahi_crop()
Adds test suites for the same
Adds export command to fix this error

✖ transforms-tensor.R:480: S3 method
  `get_image_size.torch_tensor` needs
  @export or @exportS3Method tag.

Updates nms.Rd Documentation to Roxygen2
Fixes Implement SAHI: (Slicing Aided Hyper Inference) as both transform_ and target_transform_ #324

DerrickUnleashed · 2026-06-11T20:03:10Z

@cregouby, I'm still a bit unclear about the usage of target_transform_sahi_crop() However, the transform_sahi_crop() seems to be working as expected

cregouby

praise I think that the cropping values logic is there.
todo design all transform_ functions shall return a object of the same type as the input (except transform to tensor) as they will be piped. Your only freedom is to change the shape of the tensor, and move the crops into a sequential batch, as transform_five_crop() do.
ex for x$shape [1, 3, 10, 10], transform_sahi_crop(x, c(4,4) , c(.2, .2)) shall output a tensor of shape [25, 3, 3, 3] (see

torchvision/tests/testthat/test-transforms-tensor.R

Lines 161 to 174 in ac32fa5

    
           test_that("five_crop", { 
        
             x <- torch_randn(3, 10, 12) 
        
             o <- transform_five_crop(x, c(3, 3)) 
        
             expect_length(o, 5) 
        
             expect_tensor_shape(o[[1]], c(3,3,3)) 
        
             ob <- transform_five_crop(x$unsqueeze(1), c(3, 3)) 
        
             expect_length(ob, 5) 
        
             expect_tensor_shape(ob[[1]], c(1,3,3,3)) 
        
           })

)
suggestion you should rely on the existing crop function to crop the tensor, saving you a lot of unit tests.
todo software design this transform shall be an S3 transform so shall have an entry in transform-generics.R for the dispatch, one in transforms-defaults.R, one in transforms-magick.R, one in transforms-tensor.R, as you never know what transform goes before, what transform is piped after.

todo code relocation Please keep (the badly named, my fault) transform-segmentation.R file for target_transforms
suggestion for clarity, you may rename the transform-segmentation.R into target-transform-segmentation.R (and the test file accordingly)

thought as SAHI intimately rely on input image size and coco bbox (so both x and y at the same time), and as there is no way to pass information from transform_sahi_crop to target_transform_sahi_crop, I think maybe a prepare_sahi_crop(dataset, size, overlap_size_ratio, ... ) function gathering all the needed context into a specific class object sahi_preparation and having both functions using it a the only input parameter transform_sahi_crop(x, sahi_preparation) and target_transform_sahi_crop(x, sahi_preparation) could be a nice API to SAHI.

DerrickUnleashed · 2026-06-13T17:56:37Z

todo software design this transform shall be an S3 transform so shall have an entry in transform-generics.R for the dispatch, one in transforms-defaults.R, one in transforms-magick.R, one in transforms-tensor.R, as you never know what transform goes before, what transform is piped after.

I didn't understand properly, Do I need to move the code of transform_sahi and target_transform_sahi to transform-generics ?

cregouby · 2026-06-15T05:23:18Z

No. Transform_sahi_crop must manage magick images, tensors and nothing else. The way to do it is S3 dispatch which required an entry in each of the mentioned R files. Le 13 juin 2026 19:56:59 GMT+02:00, Derrick Richard ***@***.***> a écrit :

…

DerrickUnleashed left a comment (mlverse/torchvision#329) > **todo** **software design** this transform shall be an S3 transform so shall have an entry in transform-generics.R for the dispatch, one in transforms-defaults.R, one in transforms-magick.R, one in transforms-tensor.R, as you never know what transform goes before, what transform is piped after. I didn't understand properly, Do I need to move the code of transform_sahi and target_transform_sahi to transform-generics ? -- Reply to this email directly or view it on GitHub: #329 (comment) You are receiving this because you were mentioned. Message ID: ***@***.***>

DerrickUnleashed · 2026-06-23T16:52:47Z

# From a dataset (SAHI as part of the transform pipeline)
ds <- coco_detection_dataset(train = FALSE, year = "2017", download = TRUE)
sp_ds <- prepare_sahi_split(ds, size = c(200, 200), overlap_size_ratio = c(0.2, 0.2))

ds <- coco_detection_dataset(train = FALSE, year = "2017",
  transform = . %>% transform_to_tensor() %>%
    transform_sahi_crop(sp_ds))

item <- ds[1]
grid <- vision_make_grid(item$x, scale = TRUE, num_rows = 3)
tensor_image_browse(grid)

img_url <- "https://raw.githubusercontent.com/obss/sahi/main/demo/demo_data/small-vehicles1.jpeg"
img <- base_loader(img_url) %>% transform_to_tensor()

sp <- prepare_sahi_split(img, size = c(512, 512))

crops <- transform_sahi_crop(img, sp)

# Synthetic target with a box straddling the first two crops
y <- list(
  boxes = torch_tensor(matrix(c(400, 100, 600, 300), nrow = 1, byrow = TRUE),
                       dtype = torch_float()),
  labels = "car"
)

targets <- target_transform_sahi_crop(y, sp, min_area_ratio = 0.1)

targets[[1]]$boxes  # Box clipped and translated to first crop coordinates

targets[[2]]$boxes  # Box in second crop (the portion that spilled over)

# Visualize the crops with bounding boxes overlaid
preview <- lapply(1:dim(crops)[1], function(i) {
  item <- list(x = crops[i, ..], y = targets[[i]])
  class(item) <- "image_with_bounding_box"
  draw_bounding_boxes(item, colors = "red")
})
grid <- vision_make_grid(torch_stack(preview), scale = FALSE, num_rows = 3)
tensor_image_browse(grid)

DerrickUnleashed · 2026-06-23T17:03:25Z

thought the question now is how many input S3 format shall prepare_sahi_split() cover ? I'd start with the usual magick image then tensor image (single and batch of) and also cover the dataset..

it supports magick and tensor images as input now

cregouby

praise very significant improvement toward the merge
todo see inline

cregouby

.

DerrickUnleashed · 2026-06-28T08:38:03Z

## Not run: 
# Full SAHI pipeline: prepare split, crop image, adjust targets
img_url <- "https://raw.githubusercontent.com/obss/sahi/main/demo/demo_data/small-vehicles1.jpeg"
img <- base_loader(img_url) %>% transform_to_tensor()

sp <- prepare_sahi_split(img, size = c(512, 512), overlap_size_ratio = c(0.2, 0.2))

crops <- transform_sahi_crop(img, sp)
crops$shape

# Synthetic target with a box straddling the first two crops
y <- list(
  boxes = torch_tensor(matrix(c(400, 100, 600, 300), nrow = 1, byrow = TRUE),
                       dtype = torch_float()),
  labels = "car"
)
targets <- target_transform_sahi_crop(y, sp, min_area_ratio = 0.1)

targets[[1]]$boxes  # Box clipped and translated to first crop
targets[[2]]$boxes  # Box in second crop (the portion that spilled over)

# Visualize crops with bounding boxes overlaid
preview <- lapply(1:dim(crops)[1], function(i) {
  item <- list(x = crops[i, ..], y = targets[[i]])
  class(item) <- "image_with_bounding_box"
  draw_bounding_boxes(item, colors = "red")
})
grid <- vision_make_grid(torch_stack(preview), scale = FALSE, num_rows = 3)
tensor_image_browse(grid)

## End(Not run)

…l three functions

cregouby

praise thanks for your modifications.
todo a small effort again for completness.

…-frame magick)

Co-authored-by: cregouby <cregouby@users.noreply.github.com>

…op-count test

…hed/torchvision into feat/sahiTransform

… expect_tensor_shape and expect_tensor_dtype

DerrickUnleashed and others added 19 commits June 10, 2026 21:38

Chore: Update Documentation

ac2a0a8

Chore: Update Description & Namespace

9f1b9b0

Feat: Add SAHI transform

835cb2d

Chore: Add tests for SAHI

3b2555d

Fix: missing exporrt error

6f08cf5

Chore: Update the Documentation similar to other transforms

1a9be76

Merge branch 'main' into feat/sahiTransform

543fb9a

Chore: Update nms.Rd to Roxygen2

00dccd1

Init: Transform and Target Transform code

efbf0e7

Revert Documentation

0e407d6

Working versions of transforms

6be235f

Remove torch:: imports

f0a5c81

Fix lint

4374933

Chore: Add Documentation

d002a3d

Chore: Update NEWS.md

4afd862

Update doc for transform

720a06c

Update doc for transform

accf698

Chore Add tests for transforms

e01b8a1

Chore: Update docs for target_tranform

7ef61dd

DerrickUnleashed marked this pull request as ready for review June 11, 2026 20:02

DerrickUnleashed added 2 commits June 12, 2026 01:46

Add tests for target_transforms

732c11c

Using distince expect_ fucns for testing

d0633fa

cregouby requested changes Jun 13, 2026

View reviewed changes

Comment thread .vscode/settings.json Outdated

Comment thread R/transforms-tensor.R

Comment thread R/transforms-segmentation.R Outdated

Comment thread R/transforms-segmentation.R Outdated

DerrickUnleashed added 3 commits June 13, 2026 21:10

Remove unnecessary files

0c7f428

Remove family transforms

286b029

Add realistic example

5ba16b6

DerrickUnleashed force-pushed the feat/sahiTransform branch from a56d682 to 5ba16b6 Compare June 13, 2026 17:34

Add tests

8358f25

Update target transform docs

e0fd6ba

DerrickUnleashed requested a review from cregouby June 23, 2026 16:53

cregouby requested changes Jun 24, 2026

View reviewed changes

cregouby requested changes Jun 26, 2026

View reviewed changes

DerrickUnleashed added 4 commits June 28, 2026 12:50

Change family to combining transforms

355ec98

Removing redundant check

0c4c701

Move shape_y outside lapply to avoid redefinition per crop window

cf20fb4

Remove from SAHI crop targets to avoid length inconsistency

91ccdab

DerrickUnleashed added 3 commits June 28, 2026 14:18

Merge SAHI docs into single Rd file with combined example covering al…

32b7e60

…l three functions

Adding explicit message on how to prepare the sahi_split if fails

d23f11d

Update NEWS.md

306a02b

DerrickUnleashed requested a review from cregouby June 28, 2026 09:23

cregouby reviewed Jun 28, 2026

View reviewed changes

Comment thread tests/testthat/test-transforms-array.R Outdated

cregouby reviewed Jun 28, 2026

View reviewed changes

Comment thread tests/testthat/test-transforms-array.R Outdated

cregouby reviewed Jun 28, 2026

View reviewed changes

Comment thread tests/testthat/test-transforms-tensor.R

cregouby reviewed Jun 28, 2026

View reviewed changes

Comment thread tests/testthat/test-transforms-tensor.R Outdated

cregouby requested changes Jun 28, 2026

View reviewed changes

Comment thread tests/testthat/test-target-transforms-segmentation.R

DerrickUnleashed and others added 8 commits June 28, 2026 21:23

Updating tests

9111c5a

Add batch input tests for SAHI transforms (4D tensor, 4D array, multi…

9c550c8

…-frame magick)

Update tests/testthat/test-transforms-tensor.R

f8007cb

Co-authored-by: cregouby <cregouby@users.noreply.github.com>

Add batch input handling to target_transform_sahi_crop and overlap cr…

a27a4e5

…op-count test

Merge branch 'feat/sahiTransform' of https://github.com/DerrickUnleas…

d1f329e

…hed/torchvision into feat/sahiTransform

Remove convert-then-split test case

265a59c

Remove redundant array SAHI tests, strengthen batched array test with…

f16b75a

… expect_tensor_shape and expect_tensor_dtype

Fix iscrowd field name (COCO uses iscrowd not is_crowd)

5ecc45e

DerrickUnleashed requested a review from cregouby June 28, 2026 17:35

	test_that("five_crop", {

	x <- torch_randn(3, 10, 12)
	o <- transform_five_crop(x, c(3, 3))

	expect_length(o, 5)
	expect_tensor_shape(o[[1]], c(3,3,3))

	ob <- transform_five_crop(x$unsqueeze(1), c(3, 3))

	expect_length(ob, 5)
	expect_tensor_shape(ob[[1]], c(1,3,3,3))

	})

Uh oh!

Conversation

DerrickUnleashed commented Jun 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

DerrickUnleashed commented Jun 11, 2026

Uh oh!

cregouby left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

DerrickUnleashed commented Jun 13, 2026

Uh oh!

cregouby commented Jun 15, 2026 via email

Uh oh!

DerrickUnleashed commented Jun 23, 2026

Uh oh!

DerrickUnleashed commented Jun 23, 2026

Uh oh!

cregouby left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cregouby left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

DerrickUnleashed commented Jun 28, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cregouby left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

DerrickUnleashed commented Jun 10, 2026 •

edited

Loading

cregouby left a comment •

edited

Loading

cregouby left a comment •

edited

Loading