ECCV 2024 Day3 - SAM2: Meta Technical Presentation

2024. 11. 18. 18:48 · ArtificialIntelligence/ECCV2024

 

 

 

ECCV 2024. 10. 01. Tuesday 
SAM2: Meta Technical Presentation  

 

SAM2: Segment Anything in Images and Videos (6:00 ~ 6:30) 

So many people showed up..! I went early and still could not find a seat . .

 

 

 

Headsets you could borrow by leaving your passport as collateral 🎧

 

 

 

 

📌 SAM2: Segment Anything in Images and Videos

The SAM2 presentation

 

 

 

 

The difference from the earlier SAM1 model is that SAM2 handles video data

 

 

 

 

Video segmentation - autonomous driving, drone video tracking, video editing

 

 

 

 

VOS - what makes Video Object Segmentation hard

* The speakers mentioned that it is hard to cleanly separate objects that look alike

+ They walked through the limitations with several examples, which was fun :)

 

 

 

 

Existing approach - Semi-Supervised VOS

What if you want to correct the mask that was picked up in the initial frame?

 

 

 

 

Promptable Segmentation

Here, "prompt" does not mean an NLP prompt; it means guidance given on the pixels.
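To make the idea of a pixel-level prompt concrete, here is a minimal toy sketch. It assumes a prompt is a click on a pixel marked as either "part of the object" or "not part of the object". The names `PointPrompt` and `mask_agrees_with_prompts` are hypothetical and made up for illustration; this is not the SAM 2 API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PointPrompt:
    """Hypothetical pixel prompt: a click plus a foreground/background flag."""
    x: int          # column index of the clicked pixel
    y: int          # row index of the clicked pixel
    positive: bool  # True: pixel belongs to the object, False: it does not


def mask_agrees_with_prompts(mask, prompts):
    """Return True if a binary mask (list of rows of 0/1) honors every click."""
    for p in prompts:
        inside = mask[p.y][p.x] == 1
        if inside != p.positive:
            return False
    return True


# A 4x4 toy mask: the object occupies the top-left 2x2 block.
mask = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
]
prompts = [
    PointPrompt(x=0, y=0, positive=True),   # click on the object
    PointPrompt(x=3, y=3, positive=False),  # click on the background
]
print(mask_agrees_with_prompts(mask, prompts))
```

A real model goes the other way around (it predicts a mask from the clicks), but the check shows what kind of information a prompt carries: locations in the image, not words.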

 

 

 

 

Video data - a set of image frames (ordered along the time axis)

Memory attention is added
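As a rough illustration of why a memory helps when frames arrive one after another, the sketch below treats a video as a time-ordered list of per-frame feature vectors and lets each frame attend to a small bank of earlier frames. Everything here is an assumption made for illustration: the function names, the fixed memory size, the plain dot-product softmax, and the 50/50 mixing rule. The actual SAM 2 architecture is considerably more involved.

```python
import math
from collections import deque


def attend(query, memory):
    """Softmax-weighted average of memory vectors, weighted by dot-product
    similarity to the query."""
    scores = [sum(q * m for q, m in zip(query, mem)) for mem in memory]
    peak = max(scores)                       # subtract the max for numerical stability
    weights = [math.exp(s - peak) for s in scores]
    total = sum(weights)
    dim = len(query)
    return [sum(w * mem[i] for w, mem in zip(weights, memory)) / total
            for i in range(dim)]


def process_video(frame_features, memory_size=3):
    """Visit frames in time order, conditioning each one on recent frames."""
    memory = deque(maxlen=memory_size)       # oldest entries are dropped automatically
    outputs = []
    for feat in frame_features:
        if memory:
            context = attend(feat, list(memory))
            # Arbitrary illustrative mixing rule: average frame and context.
            fused = [0.5 * f + 0.5 * c for f, c in zip(feat, context)]
        else:
            fused = list(feat)               # first frame: nothing to attend to yet
        outputs.append(fused)
        memory.append(fused)
    return outputs


# Three toy "frames", each a 2-dimensional feature vector.
video = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
for t, out in enumerate(process_video(video)):
    print(t, [round(v, 3) for v in out])
```

The point of the sketch is only the data flow: each frame's result depends on what was stored from earlier frames, which is what lets a segmentation carry over from one frame to the next.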

 

 

 

Explanation of the video data

 

 

 

 

๋ชจ๋ธ ํ•™์Šต ์‹œ annotation ๊ด€๋ จ ๋ฐฉ๋ฒ•๋ก  

 

 

 

 

 

 

📌 SA-V dataset

Segment Anything Video (SA-V) Dataset

https://ai.meta.com/datasets/segment-anything-video/

 

SA-V | Meta AI Research

Overview SA-V consists of 51K diverse videos and 643K spatio-temporal segmentation masks (i.e., masklets). It is intended to be used for computer vision research for the purposes permitted under the CC by 4.0 license. The videos were collected via a contra


 

 

 

https://github.com/facebookresearch/sam2/blob/main/sav_dataset/README.md

 

sam2/sav_dataset/README.md at main · facebookresearch/sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th...


 

 

 

 

 

Result

 

 

 

 

 

 

SAM2 Demo

https://sam2.metademolab.com

 

SAM 2 Demo | By Meta FAIR

Track an object across any video and create fun effects interactively, with as little as a single click on one frame.


 

 

 

Examples where tracking does not work well

 

 

 

 

 

 

 

 

👩‍💻

Right before the session ended, I rushed to return the headset,

and since my professor was treating us to dinner, I headed to the meeting spot.

 

Such a shame .. 🥹

 

 

 

 

 

KHU CV Night ✨

📸

 

 

 

 

 
