[Paper reading] ResNet

2023. 8. 16. 12:49ใ†ArtificialIntelligence/PaperReading

 

 

 

 

 

 

ResNet Intro

  • Short Cut connections
  • adding neither extra params nor computational complexity
  • 100 ~ 1000 layers / 3.57%

 

 

 

Conclusion

์žฅ์ 

  • ์•„์ด๋””์–ด๊ฐ€ ๋…์ฐฝ์ ์ด๋‹ค.
  • ๊ฐ„๋‹จํ•œ ์•„์ด๋””์–ด๋กœ ๋„คํŠธ์›Œํฌ์˜ ๊นŠ์ด๋ฅผ ๋งค์šฐ ๊นŠ๊ฒŒ ์Œ“์•˜๋‹ค.
    • optimization ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐ, ์„ฑ๋Šฅ ํ–ฅ์ƒ 

 

๋‹จ์  

  • ์™œ ์ด๋Ÿฌํ•œ skip connection์„ ์‚ฌ์šฉํ•˜๋ฉด ์„ฑ๋Šฅ์ด ๋” ์ž˜ ๋‚˜์˜ค๋Š” ์ง€
  • ๊ธฐ์กด์˜ gradient descent ์˜ ์ •์˜์™€ ๋ฐฉํ–ฅ์ด ๋‹ฌ๋ผ์ง€์ง€ ์•Š๋‚˜?

 

๊ฐœ์„ ํ•  ์ 

  • ๋‹ค๋ฅธ function๋“ค๋„ skip ํ•ด๋ณด๊ธฐ
    • ๊ตณ์ด - x, + x ๋˜์–ด์•ผ ํ•˜๋Š” ์ด์œ ?
    • + x, - x ์ˆœ์„œ๋กœ ์ง„ํ–‰๋˜๋ฉด ์–ด๋– ํ•œ ์ฐจ์ด๊ฐ€ ์žˆ์„์ง€
    • ์ž”์ฐจ๋ผ๋Š” ๊ฐœ๋…์ด ์–ด๋””์—์„œ ์ฒ˜์Œ ๋‚˜์™”์„๊นŒ(related work)
    • ๋‹ค๋ฅธ ๊ฐ„๊ฒฐํ•œ ์ˆ˜์‹๋„ ๋งŽ์„ ํ…๋ฐ, ์™œ ์ž”์ฐจ ๊ฐœ๋…์œผ๋กœ ์„ ์ •?

 

 

 

Question 

1. ์ธต์ด ๊นŠ์–ด์งˆ ๋•Œ, ๋ฐœ์ƒํ•˜๋Š” ์„ฑ๋Šฅ ์ €ํ•˜

  • degradation 
  • Overfitting ๋ฌธ์ œ๊ฐ€ ์•„๋‹Œ, optimize ๋ฌธ์ œ
  • ํ›„๋ฐ˜๋ถ€์— ์˜ˆ์‹œ ๋‚˜์˜ค๋Š”๋ฐ, ์ด ๋ถ€๋ถ„์ด ์ž˜ ์ดํ•ด๋˜์ง€ ์•Š์•˜๋‹ค.

 

2. 1000 Layer ์ดํ›„์˜ ์˜ค๋ฒ„ํ”ผํŒ… ๋ฐœ์ƒ

  • Overfitting ์™œ ํŠน์ • ์ธต ์ดํ›„ ๋ฐœ์ƒ?
    • data์— ๋น„ํ•ด ๋ชจ๋ธ ํฌ๊ธฐ๊ฐ€ ๋งค์šฐ ์ปค์ ธ์„œ?
    • VGGNet์˜ ํ•œ๊ณ„(19)๋Š” ์˜ค๋ฒ„ํ”ผํŒ…์ด ์•„๋‹Œ Optimize ๋ฌธ์ œ?

 

'ArtificialIntelligence > PaperReading' ์นดํ…Œ๊ณ ๋ฆฌ์˜ ๋‹ค๋ฅธ ๊ธ€

[Paper reading] Transformers for image recognition, ViT  (0) 2023.08.28
[Paper reading] Attention is all you need, Transformer  (0) 2023.08.25
[Paper reading] DenseNet  (0) 2023.08.22
[Paper reading] GoogleNet  (0) 2023.08.18
[Paper reading] VGGNet  (0) 2023.08.16