Question 1

How is this different from the background remover?

Accepted Answer

The background remover picks the whole foreground for you (one model, binary output). The subject cutout lets YOU choose what to extract by clicking on it. If your image has three people and you only want one, the background remover keeps all three; the cutout tool keeps just the person you click. Different shape, different use cases.

Question 2

Does my image get uploaded anywhere?

Accepted Answer

No. The image stays in your browser the entire time. The Segment Anything model runs locally on your device using WebGPU (or WebAssembly on devices without WebGPU). There's no server in the loop, no upload, no logs.

Question 3

How do I tell it what I want?

Accepted Answer

Click anywhere on the object you want to extract. SAM produces a mask in milliseconds. If the mask covers too much (it grabs a person AND the chair they're sitting on), right-click on the part you don't want (or switch to Exclude mode and tap) to refine. Add as many positive and negative points as you need until the mask matches your intent. Each click also produces three candidate masks (e.g. 'just the person', 'person and chair', 'whole foreground'), shown as numbered chips with a confidence score — switch to a different candidate if the auto-picked one isn't what you meant.

Question 4

What's the small lag the first time I drop an image?

Accepted Answer

The model has to encode your image once before clicks become instant. The encoder is the expensive part of Segment Anything; it produces an embedding the decoder then uses for every click. On WebGPU this takes about 1 to 3 seconds; on WebAssembly (mostly iOS) it can take 5 to 15 seconds. After that, every click is sub-50ms because the decoder is tiny.

Question 5

Why is there a 1024-pixel cap on input size?

Accepted Answer

SAM's image encoder reshapes inputs to 1024×1024 internally regardless of source size, so anything larger is wasted work plus extra memory. Capping at 1024 long edge before encoding keeps memory low (important on iPhones) without losing any mask quality. Output mask resolution matches the source you provide, up to that cap.

Question 6

What do the four output modes do?

Accepted Answer

Subject keeps only what's inside the mask, with everything else transparent (the most common case). Sticker wraps your cutout in a thick white border with a soft drop shadow, like an iMessage or Telegram sticker, and downloads as a transparent PNG you can drop into chat or layer in any editor. Background keeps everything outside the mask transparent in the masked area (useful when you want to delete one object from a photo). Mask exports the binary alpha mask as a black-and-white PNG (useful for image editors like Photoshop or Affinity that want a mask layer).

Question 7

Can I make stickers from my cutouts?

Accepted Answer

Yes. Switch the output to Sticker after you've selected your subject. A thick white border traces the cutout and a soft drop shadow sits behind it, the same look used by iMessage, Telegram, and WhatsApp stickers. The thickness slider runs from 1 to 20 pixels, so you can go from a thin outline to a chunky sticker frame. The preview updates as you drag, and the result is a transparent PNG you can drop into a chat, paste onto a poster, or layer over any background.

Question 8

Can I rotate or flip the cutout before downloading?

Accepted Answer

Yes. Use the rotate-left, rotate-right, flip-horizontal, and flip-vertical buttons under 'Rotate and flip' on the controls panel. Transforms apply to the final PNG, not to your source image. You can stack them.

Question 9

What does 'Crop to subject' do?

Accepted Answer

Trims transparent margins to the bounding box of the mask. Only meaningful for the Subject output mode. If you click a small object in a large image, this option gives you a tight crop instead of a big mostly-transparent PNG.

Question 10

Which model is this using?

Accepted Answer

SlimSAM-77, a distilled variant of Meta's Segment Anything Model. Apache 2.0 licensed, about 22 MB once downloaded, and engineered to run in browsers via WebGPU and WebAssembly. We picked it for the size/quality balance: it's small enough to ship to phones, and the canonical Hugging Face transformers.js example uses it for the same workflow.

Question 11

Does this work on iPhone?

Accepted Answer

Yes. SlimSAM is small enough to run cleanly on iOS Safari via WebAssembly. The encoder takes longer than on desktop with WebGPU, but the model fits well within iOS's per-tab memory budget, so the tab won't be killed mid-encode like with heavier vision models.

Question 12

Can I use the result commercially?

Accepted Answer

Yes. The output is yours, just like with any image editor. SlimSAM is Apache 2.0 (permissive open source); SAM-derived weights inherit that license. We claim no rights over images you process here.

Question 13

What's the right workflow for compositing the cutout into a new background?

Accepted Answer

Download the Subject PNG, then drop it into Figma, Canva, Photoshop, or any layer-based editor on top of your new background. The transparent edges blend cleanly with whatever's behind. For e-commerce product shots, downloading with 'Crop to subject' on gives you a tight crop ready to drop into a product grid.

Subject Cutout

Click any object. Cut it out. Download as a transparent PNG.

How to use it

When this beats a background remover

How clicks become a mask

Private by design, free forever

Frequently asked questions

Related tools