Computer Vision (4)

[Paper Review] You Only Look Once: Unified, Real-Time Object Detection
Paper Details
Title: You Only Look Once: Unified, Real-Time Object Detection
Authors: Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi
Conference: IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Year of Publication: 2016
Link: https://arxiv.org/abs/1506.02640
Key Focus: YOLO reframes object detection as a single regression problem, predicting bounding boxes and class probabi..
2025. 2. 5.

[Paper Review] InstructPix2Pix: Learning to Follow Image Editing Instructions
Paper Details
Title: InstructPix2Pix: Learning to Follow Image Editing Instructions
Authors: Tim Brooks, Aleksander Holynski, Alexei A. Efros
Conference: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023
Year of Publication: 2023
Link: https://arxiv.org/abs/2211.09800 / https://www.timothybrooks.com/instruct-pix2pix
Key Focus: This paper introduces InstructPix2Pix, a condition..
2025. 1. 27.

[Paper Review] U-Net: Convolutional Networks for Biomedical Image Segmentation
Paper Details
Title: U-Net: Convolutional Networks for Biomedical Image Segmentation
Authors: Olaf Ronneberger, Philipp Fischer, Thomas Brox
Conference: Medical Image Computing and Computer-Assisted Intervention (MICCAI) 2015
Year of Publication: 2015
Link: https://arxiv.org/abs/1505.04597
Key Focus: This paper presents U-Net, a convolutional neural network designed for biomedical image segmentati..
2024. 11. 18.

[Paper Review] AN IMAGE IS WORTH 16X16 WORDS: TRANSFORMERS FOR IMAGE RECOGNITION AT SCALE
Paper Details
Title: An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Authors: Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, Jakob Uszkoreit, Neil Houlsby
Conference: International Conference on Learning Representations (ICLR 2021)
Year of Publ..
2024. 10. 14.