Skip to main content

Showing 1–21 of 21 results for author: Guler, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2411.04133  [pdf, ps, other

    cs.AI

    Enhancement of Approximation Spaces by the Use of Primals and Neighborhood

    Authors: A. Çaksu Güler

    Abstract: Rough set theory is one of the most widely used and significant approaches for handling incomplete information. It divides the universe in the beginning and uses equivalency relations to produce blocks. Numerous generalized rough set models have been put out and investigated in an effort to increase flexibility and extend the range of possible uses. We introduce four new generalized rough set mode… ▽ More

    Submitted 23 October, 2024; originally announced November 2024.

  2. arXiv:2407.14064  [pdf, other

    cs.CV

    Refining Tuberculosis Detection in CXR Imaging: Addressing Bias in Deep Neural Networks via Interpretability

    Authors: Özgür Acar Güler, Manuel Günther, André Anjos

    Abstract: Automatic classification of active tuberculosis from chest X-ray images has the potential to save lives, especially in low- and mid-income countries where skilled human experts can be scarce. Given the lack of available labeled data to train such systems and the unbalanced nature of publicly available datasets, we argue that the reliability of deep learning models is limited, even if they can be s… ▽ More

    Submitted 8 October, 2024; v1 submitted 19 July, 2024; originally announced July 2024.

    Comments: Preprint of paper presented at EUVIP 2024

  3. arXiv:2406.10180  [pdf, other

    cs.CV

    MeshPose: Unifying DensePose and 3D Body Mesh reconstruction

    Authors: Eric-Tuan Lê, Antonis Kakolyris, Petros Koutras, Himmy Tam, Efstratios Skordos, George Papandreou, Rıza Alp Güler, Iasonas Kokkinos

    Abstract: DensePose provides a pixel-accurate association of images with 3D mesh coordinates, but does not provide a 3D mesh, while Human Mesh Reconstruction (HMR) systems have high 2D reprojection error, as measured by DensePose localization metrics. In this work we introduce MeshPose to jointly tackle DensePose and HMR. For this we first introduce new losses that allow us to use weak DensePose supervision… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

    MSC Class: 68 ACM Class: I.2.10

    Journal ref: CVPR 2024

  4. arXiv:2402.05235  [pdf, other

    cs.CV

    SPAD : Spatially Aware Multiview Diffusers

    Authors: Yash Kant, Ziyi Wu, Michael Vasilkovsky, Guocheng Qian, Jian Ren, Riza Alp Guler, Bernard Ghanem, Sergey Tulyakov, Igor Gilitschenski, Aliaksandr Siarohin

    Abstract: We present SPAD, a novel approach for creating consistent multi-view images from text prompts or single images. To enable multi-view generation, we repurpose a pretrained 2D diffusion model by extending its self-attention layers with cross-view interactions, and fine-tune it on a high quality subset of Objaverse. We find that a naive extension of the self-attention proposed in prior work (e.g. MVD… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: Webpage: https://yashkant.github.io/spad

  5. arXiv:2310.16167  [pdf, other

    cs.CV

    iNVS: Repurposing Diffusion Inpainters for Novel View Synthesis

    Authors: Yash Kant, Aliaksandr Siarohin, Michael Vasilkovsky, Riza Alp Guler, Jian Ren, Sergey Tulyakov, Igor Gilitschenski

    Abstract: We present a method for generating consistent novel views from a single source image. Our approach focuses on maximizing the reuse of visible pixels from the source image. To achieve this, we use a monocular depth estimator that transfers visible pixels from the source view to the target view. Starting from a pre-trained 2D inpainting diffusion model, we train our method on the large-scale Objaver… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted to SIGGRAPH Asia, 2023 (Conference Papers)

  6. arXiv:2303.14479  [pdf, other

    eess.IV cs.CV

    Explainable Image Quality Assessment for Medical Imaging

    Authors: Caner Ozer, Arda Guler, Aysel Turkvatan Cansever, Ilkay Oksuz

    Abstract: Medical image quality assessment is an important aspect of image acquisition, as poor-quality images may lead to misdiagnosis. Manual labelling of image quality is a tedious task for population studies and can lead to misleading results. While much research has been done on automated analysis of image quality to address this issue, relatively little work has been done to explain the methodologies.… ▽ More

    Submitted 25 March, 2023; originally announced March 2023.

    ACM Class: I.4.9

  7. arXiv:2302.09227  [pdf, other

    cs.CV cs.GR

    Invertible Neural Skinning

    Authors: Yash Kant, Aliaksandr Siarohin, Riza Alp Guler, Menglei Chai, Jian Ren, Sergey Tulyakov, Igor Gilitschenski

    Abstract: Building animatable and editable models of clothed humans from raw 3D scans and poses is a challenging problem. Existing reposing methods suffer from the limited expressiveness of Linear Blend Skinning (LBS), require costly mesh extraction to generate each new pose, and typically do not preserve surface correspondences across different poses. In this work, we introduce Invertible Neural Skinning (… ▽ More

    Submitted 4 March, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

  8. arXiv:2208.06034  [pdf, other

    eess.IV cs.CV

    Shifted Windows Transformers for Medical Image Quality Assessment

    Authors: Caner Ozer, Arda Guler, Aysel Turkvatan Cansever, Deniz Alis, Ercan Karaarslan, Ilkay Oksuz

    Abstract: To maintain a standard in a medical imaging study, images should have necessary image quality for potential diagnostic use. Although CNN-based approaches are used to assess the image quality, their performance can still be improved in terms of accuracy. In this work, we approach this problem by using Swin Transformer, which improves the poor-quality image classification performance that causes the… ▽ More

    Submitted 11 August, 2022; originally announced August 2022.

    Comments: 10 pages, 3 figures, 4 tables. Accepted in 13th Machine Learning in Medical Imaging (MLMI 2022) workshop

  9. Context-self contrastive pretraining for crop type semantic segmentation

    Authors: Michail Tarasiou, Riza Alp Guler, Stefanos Zafeiriou

    Abstract: In this paper, we propose a fully supervised pre-training scheme based on contrastive learning particularly tailored to dense classification tasks. The proposed Context-Self Contrastive Loss (CSCL) learns an embedding space that makes semantic boundaries pop-up by use of a similarity metric between every location in a training sample and its local context. For crop type semantic segmentation from… ▽ More

    Submitted 5 February, 2024; v1 submitted 9 April, 2021; originally announced April 2021.

    Comments: 15 pages, 17 figures

  10. arXiv:2012.02342  [pdf, other

    cs.LG cs.AI math.OC

    Divide and Learn: A Divide and Conquer Approach for Predict+Optimize

    Authors: Ali Ugur Guler, Emir Demirovic, Jeffrey Chan, James Bailey, Christopher Leckie, Peter J. Stuckey

    Abstract: The predict+optimize problem combines machine learning ofproblem coefficients with a combinatorial optimization prob-lem that uses the predicted coefficients. While this problemcan be solved in two separate stages, it is better to directlyminimize the optimization loss. However, this requires dif-ferentiating through a discrete, non-differentiable combina-torial function. Most existing approaches… ▽ More

    Submitted 3 December, 2020; originally announced December 2020.

  11. arXiv:2004.01946  [pdf, other

    cs.CV

    Weakly-Supervised Mesh-Convolutional Hand Reconstruction in the Wild

    Authors: Dominik Kulon, Riza Alp Güler, Iasonas Kokkinos, Michael Bronstein, Stefanos Zafeiriou

    Abstract: We introduce a simple and effective network architecture for monocular 3D hand pose estimation consisting of an image encoder followed by a mesh convolutional decoder that is trained through a direct 3D hand mesh reconstruction loss. We train our network by gathering a large-scale dataset of hand action in YouTube videos and use it as a source of weak supervision. Our weakly-supervised mesh convol… ▽ More

    Submitted 4 April, 2020; originally announced April 2020.

    Comments: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2020). Additional resources: https://arielai.com/mesh_hands

  12. arXiv:1906.05706  [pdf, other

    cs.CV

    Slim DensePose: Thrifty Learning from Sparse Annotations and Motion Cues

    Authors: Natalia Neverova, James Thewlis, Rıza Alp Güler, Iasonas Kokkinos, Andrea Vedaldi

    Abstract: DensePose supersedes traditional landmark detectors by densely mapping image pixels to body surface coordinates. This power, however, comes at a greatly increased annotation time, as supervising the model requires to manually label hundreds of points per pose instance. In this work, we thus seek methods to significantly slim down the DensePose annotations, proposing more efficient data collection… ▽ More

    Submitted 13 June, 2019; originally announced June 2019.

    Comments: CVPR 2019

  13. arXiv:1905.01326  [pdf, other

    cs.CV

    Single Image 3D Hand Reconstruction with Mesh Convolutions

    Authors: Dominik Kulon, Haoyang Wang, Riza Alp Güler, Michael Bronstein, Stefanos Zafeiriou

    Abstract: Monocular 3D reconstruction of deformable objects, such as human body parts, has been typically approached by predicting parameters of heavyweight linear models. In this paper, we demonstrate an alternative solution that is based on the idea of encoding images into a latent non-linear representation of meshes. The prior on 3D hand shapes is learned by training an autoencoder with intrinsic graph c… ▽ More

    Submitted 5 August, 2019; v1 submitted 4 May, 2019; originally announced May 2019.

    Comments: Proceedings of the British Machine Vision Conference (BMVC 2019)

  14. arXiv:1904.11960  [pdf, other

    cs.CV

    Lifting AutoEncoders: Unsupervised Learning of a Fully-Disentangled 3D Morphable Model using Deep Non-Rigid Structure from Motion

    Authors: Mihir Sahasrabudhe, Zhixin Shu, Edward Bartrum, Riza Alp Guler, Dimitris Samaras, Iasonas Kokkinos

    Abstract: In this work we introduce Lifting Autoencoders, a generative 3D surface-based model of object categories. We bring together ideas from non-rigid structure from motion, image formation, and morphable models to learn a controllable, geometric model of 3D categories in an entirely unsupervised manner from an unstructured set of images. We exploit the 3D geometric nature of our model and use normal in… ▽ More

    Submitted 26 April, 2019; originally announced April 2019.

    Comments: 19 pages; 12 figures; code will be released; Project page: https://msahasrabudhe.github.io/projects/lae/

  15. arXiv:1809.01995  [pdf, other

    cs.CV

    Dense Pose Transfer

    Authors: Natalia Neverova, Riza Alp Guler, Iasonas Kokkinos

    Abstract: In this work we integrate ideas from surface-based modeling with neural synthesis: we propose a combination of surface-based pose estimation and deep generative models that allows us to perform accurate pose transfer, i.e. synthesize a new image of a person based on a single image of that person and the image of a pose donor. We use a dense pose estimation system that maps pixels from both images… ▽ More

    Submitted 6 September, 2018; originally announced September 2018.

    Comments: ECCV 2018

  16. arXiv:1806.06503  [pdf, other

    cs.CV

    Deforming Autoencoders: Unsupervised Disentangling of Shape and Appearance

    Authors: Zhixin Shu, Mihir Sahasrabudhe, Alp Guler, Dimitris Samaras, Nikos Paragios, Iasonas Kokkinos

    Abstract: In this work we introduce Deforming Autoencoders, a generative model for images that disentangles shape from appearance in an unsupervised manner. As in the deformable template paradigm, shape is represented as a deformation between a canonical coordinate system (`template') and an observed image, while appearance is modeled in `canonical', template, coordinates, thus discarding variability due to… ▽ More

    Submitted 18 June, 2018; originally announced June 2018.

    Comments: 17 pages including references, plus 12 pages appendix. Video available at : https://youtu.be/Oi7pyxKkF1g Code will be made available soon

  17. arXiv:1803.02188  [pdf, other

    cs.CV

    DenseReg: Fully Convolutional Dense Shape Regression In-the-Wild

    Authors: Riza Alp Guler, Yuxiang Zhou, George Trigeorgis, Epameinondas Antonakos, Patrick Snape, Stefanos Zafeiriou, Iasonas Kokkinos

    Abstract: In this work we use deep learning to establish dense correspondences between a 3D object model and an image "in the wild". We introduce "DenseReg", a fully-convolutional neural network (F-CNN) that densely regresses at every foreground pixel a pair of U-V template coordinates in a single feedforward pass. To train DenseReg we construct a supervision signal by combining 3D deformable model fitting… ▽ More

    Submitted 11 March, 2018; v1 submitted 5 March, 2018; originally announced March 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1612.01202

  18. arXiv:1802.00434  [pdf, other

    cs.CV

    DensePose: Dense Human Pose Estimation In The Wild

    Authors: Rıza Alp Güler, Natalia Neverova, Iasonas Kokkinos

    Abstract: In this work, we establish dense correspondences between RGB image and a surface-based representation of the human body, a task we refer to as dense human pose estimation. We first gather dense correspondences for 50K persons appearing in the COCO dataset by introducing an efficient annotation pipeline. We then use our dataset to train CNN-based systems that deliver dense correspondence 'in the wi… ▽ More

    Submitted 1 February, 2018; originally announced February 2018.

  19. arXiv:1707.03984  [pdf, ps, other

    cs.NI

    Spatial Interference Detection for Mobile Visible Light Communication

    Authors: Ali Ugur Guler, Tristan Braud, Pan Hui

    Abstract: Taking advantage of the rolling shutter effect of CMOS cameras in smartphones is a common practice to increase the transfered data rate with visible light communication (VLC) without employing external equipment such as photodiodes. VLC can then be used as replacement of other marker based techniques for object identification for Augmented Reality and Ubiquitous computing applications. However, th… ▽ More

    Submitted 13 July, 2017; originally announced July 2017.

  20. arXiv:1612.01202  [pdf, other

    cs.CV

    DenseReg: Fully Convolutional Dense Shape Regression In-the-Wild

    Authors: Rıza Alp Güler, George Trigeorgis, Epameinondas Antonakos, Patrick Snape, Stefanos Zafeiriou, Iasonas Kokkinos

    Abstract: In this paper we propose to learn a mapping from image pixels into a dense template grid through a fully convolutional network. We formulate this task as a regression problem and train our network by leveraging upon manually annotated facial landmarks "in-the-wild". We use such landmarks to establish a dense correspondence field between a three-dimensional object template and the input image, whic… ▽ More

    Submitted 19 June, 2017; v1 submitted 4 December, 2016; originally announced December 2016.

    Comments: CVPR 2017

  21. arXiv:1606.01419  [pdf

    cs.OH

    One-dimensional Cutting Stock Problem with Divisible Items

    Authors: Deniz Tanir, Onur Ugurlu, Asli Guler, Urfat Nuriyev

    Abstract: This paper considers the one-dimensional cutting stock problem with divisible items, which is a new problem in the cutting stock literature. The problem exists in steel industries. In the new problem, each item can be divided into smaller pieces, then they can be recombined again by welding. The objective is to minimize both the trim loss and the number of the welds. We present a mathematical mode… ▽ More

    Submitted 4 June, 2016; originally announced June 2016.

    Comments: 12 pages, 2 figures

    MSC Class: 90B99

    Journal ref: TWMS Journal of Applied and Engineering Mathematics, 9(3), (2019). 473-484