-
Descanning: From Scanned to the Original Images with a Color Correction Diffusion Model
Authors:
Junghun Cha,
Ali Haider,
Seoyun Yang,
Hoeyeong Jin,
Subin Yang,
A. F. M. Shahab Uddin,
Jaehyoung Kim,
Soo Ye Kim,
Sung-Ho Bae
Abstract:
A significant volume of analog information, i.e., documents and images, has been digitized in the form of scanned copies for storing, sharing, and/or analyzing in the digital world. However, the quality of such content is severely degraded by various distortions caused by printing, storing, and scanning processes in the physical world. Although restoring high-quality content from scanned copies has become an indispensable task for many products, it has not been systematically explored, and to the best of our knowledge, no public datasets are available. In this paper, we define this problem as Descanning and introduce a new high-quality and large-scale dataset named DESCAN-18K. It contains 18K pairs of original and scanned images collected in the wild containing multiple complex degradations. In order to eliminate such complex degradations, we propose a new image restoration model called DescanDiffusion consisting of a color encoder that corrects the global color degradation and a conditional denoising diffusion probabilistic model (DDPM) that removes local degradations. To further improve the generalization ability of DescanDiffusion, we also design a synthetic data generation scheme by reproducing prominent degradations in scanned images. We demonstrate that our DescanDiffusion outperforms other baselines including commercial restoration products, objectively and subjectively, via comprehensive experiments and analyses.
Submitted 7 February, 2024;
originally announced February 2024.
-
Real-World Single Image Super-Resolution Under Rainy Condition
Authors:
Mohammad Shahab Uddin
Abstract:
Image super-resolution is an important research area in computer vision with a wide variety of applications, including surveillance, medical imaging, etc. Real-world single image super-resolution has become very popular due to its real-time applications. There is still considerable scope to improve real-world single image super-resolution, especially in challenging weather scenarios. In this paper, we propose a new algorithm to perform real-world single image super-resolution under rainy conditions. Our proposed method can mitigate the influence of rainy conditions during image super-resolution. Our experimental results show that the proposed algorithm can perform image super-resolution while decreasing the negative effects of the rain.
Submitted 16 June, 2022;
originally announced June 2022.
-
Characterizing User Behaviors in Open-Source Software User Forums: An Empirical Study
Authors:
Jazlyn Hellman,
Jiahao Chen,
Md. Sami Uddin,
Jinghui Cheng,
Jin L. C. Guo
Abstract:
User forums of Open Source Software (OSS) enable end-users to collaboratively discuss problems concerning the OSS applications. Despite decades of research on OSS, we know very little about how end-users engage with OSS communities on these forums, in particular, the challenges that hinder their continuous and meaningful participation in the OSS community. Many previous works are developer-centric and overlook the importance of end-user forums. As a result, end-users' expectations are seldom reflected in OSS development. To better understand user behaviors in OSS user forums, we carried out an empirical study analyzing about 1.3 million posts from user forums of four popular OSS applications: Zotero, Audacity, VLC, and RStudio. Through analyzing the contribution patterns of three common user types (end-users, developers, and organizers), we observed that end-users not only initiated most of the threads (above 96% of threads in three projects, 86% in the other), but also acted as the significant contributors for responding to other users' posts, even though they tended to lack confidence in their activities as indicated by psycho-linguistic analyses. Moreover, we found end-users more open, reflecting a more positive emotion in communication than organizers and developers in the forums. Our work contributes new knowledge about end-users' activities and behaviors in OSS user forums that the vital OSS stakeholders can leverage to improve end-user engagement in the OSS development process.
Submitted 7 April, 2022;
originally announced April 2022.
-
Software Process Improvement Based on Defect Prevention Using Capability and Testing Model Integration in Extreme Programming
Authors:
Md. Habibur Rahman,
Ziaur Rahman,
Md. Al - Mustanjid,
Muhammad Shahin Uddin,
Mehedy Hasan Rafsan Jany
Abstract:
Software Process Improvement (SPI) has received considerable attention as a continuous process for improving software quality. Several Agile methodologies have previously worked with Extreme Programming (XP). Before improving the process, defect prevention (DP) is essential. In addition, DP largely depends on defect detection, whether defects are found earlier in the design and implementation stages or in the testing phases. Testing maturity model integration (TMMI) also plays a crucial role in DP as well as in process improvement of the software. In particular, when software is validated by being tested and having its defects fixed, it achieves the maximum capability maturity model integration (CMMI), aiming at process improvement. This article proposes an improved defect detection and prevention model to enhance the software process following the XP approach. In addition, as a unique contribution, we unite the capability and testing model integration to ensure better SPI.
Submitted 11 July, 2021;
originally announced July 2021.
-
SaliencyMix: A Saliency Guided Data Augmentation Strategy for Better Regularization
Authors:
A. F. M. Shahab Uddin,
Mst. Sirazam Monira,
Wheemyung Shin,
TaeChoong Chung,
Sung-Ho Bae
Abstract:
Advanced data augmentation strategies have widely been studied to improve the generalization ability of deep learning models. Regional dropout is one of the popular solutions that guides the model to focus on less discriminative parts by randomly removing image regions, resulting in improved regularization. However, such information removal is undesirable. On the other hand, recent strategies suggest to randomly cut and mix patches and their labels among training images, to enjoy the advantages of regional dropout without having any pointless pixel in the augmented images. We argue that such random selection strategies of the patches may not necessarily represent sufficient information about the corresponding object and thereby mixing the labels according to that uninformative patch enables the model to learn unexpected feature representation. Therefore, we propose SaliencyMix that carefully selects a representative image patch with the help of a saliency map and mixes this indicative patch with the target image, thus leading the model to learn more appropriate feature representation. SaliencyMix achieves the best known top-1 error of 21.26% and 20.09% for ResNet-50 and ResNet-101 architectures on ImageNet classification, respectively, and also improves the model robustness against adversarial perturbations. Furthermore, models that are trained with SaliencyMix help to improve the object detection performance. Source code is available at https://github.com/SaliencyMix/SaliencyMix.
Submitted 27 July, 2021; v1 submitted 2 June, 2020;
originally announced June 2020.
-
Dynamic Channel Allocation for QoS Provisioning in Visible Light Communication
Authors:
Mostafa Zaman Chowdhury,
Muhammad Shahin Uddin,
Yeong Min Jang
Abstract:
In visible light communication (VLC), diverse types of traffic are supported while the number of optical channels is limited. In this paper, we propose a dynamic channel reservation scheme for higher priority calls that does not reduce the channel utilization. The number of reserved channels for each traffic class is calculated using real-time observation of the call arrival rates of each traffic class. The numerical results show that the proposed scheme is able to reduce the call blocking probability of the higher priority user within a reasonable range without sacrificing channel utilization.
Submitted 4 October, 2018;
originally announced October 2018.
-
Dynamic Channel Allocation for Class-Based QoS Provisioning and Call Admission in Visible Light Communication
Authors:
Mostafa Zaman Chowdhury,
Muhammad Shahin Uddin,
Yeong Min Jang
Abstract:
Provisioning of quality of service (QoS) is a key issue in visible light communication (VLC) systems as well as in other wireless communication systems. Because QoS requirements are not equally strict for all traffic types, more calls of higher priority traffic classes can be accommodated by blocking some more calls of lower priority traffic classes. Diverse types of high data rate traffic are supported by existing wireless communication systems while the resource is limited. Hence, priority-based resource allocation can ensure the service quality for the calls of important traffic classes. Fixed guard channels used to prioritize any class of calls always reduce the channel utilization. In this paper, we propose a priority-based dynamic channel reservation scheme for higher priority calls that does not reduce the channel utilization significantly. The number of reserved channels for each of the individual traffic classes is calculated using real-time observation of the call arrival rates of all the traffic classes. The features of the scheme allow reduction of the call blocking probability of higher priority calls along with an increase in the channel utilization. The proposed Markov chain model is expected to be very effective for queuing analysis, especially for priority schemes with any number of traffic classes. The numerical results show that the proposed scheme is able to attain a reasonable call blocking probability for higher priority calls without sacrificing channel utilization.
Submitted 11 December, 2014;
originally announced December 2014.
-
Analysis Tool for UNL-Based Knowledge Representation
Authors:
Shamim Ripon,
Aoyan Barua,
Mohammad Salah Uddin
Abstract:
The fundamental issue in knowledge representation is to provide a precise definition of knowledge in a manner that is independent of procedural considerations, context free, and easy to manipulate, exchange, and reason about. Knowledge must be accessible to everyone regardless of their native language. Universal Networking Language (UNL) is a declarative formal language and a generalized form of human language in a machine-independent digital platform for defining, recapitulating, amending, storing, and disseminating knowledge among people of different affiliations. UNL extracts semantic data from a native language for interlingua machine translation. This paper presents the development of a graphical tool that incorporates UNL to provide a visual means of representing the semantic data available in a native text. UNL represents the semantics of a sentence as a conceptual hyper-graph. We translate this information into XML format and create a graph from the XML, representing the actual concepts available in the native language.
Submitted 4 May, 2014;
originally announced May 2014.
-
Web Service Composition - BPEL vs cCSP Process Algebra
Authors:
Shamim Ripon,
Mohammad Salah Uddin,
Aoyan Barua
Abstract:
Web services technology provides a platform on which we can develop distributed services. The interoperability among these services is achieved by various standard protocols. In recent years, several studies have suggested that process algebras provide satisfactory assistance to the whole process of web services development. Business transactions, on the other hand, involve the coordination and interaction between multiple partners. With the emergence of web services, business transactions are conducted using these services. The coordination among the business processes is crucial, as is the handling of faults that can arise at any stage of a transaction. BPEL models the behavior of business process interaction by providing an XML-based grammar to describe the control logic required to coordinate the web services participating in a process flow. However, BPEL lacks a proper formal description, so the composition of business processes cannot be formally verified. Process algebra, on the other hand, provides a formal foundation for rigorous verification of the composition. This paper presents a comparison of web service composition between BPEL and the process algebra cCSP.
Submitted 23 February, 2014;
originally announced February 2014.
-
Extraction of Facial Feature Points Using Cumulative Histogram
Authors:
Sushil Kumar Paul,
Mohammad Shorif Uddin,
Saida Bouakaz
Abstract:
This paper proposes a novel adaptive algorithm to automatically extract facial feature points, such as eyebrow corners, eye corners, nostrils, nose tip, and mouth corners, in frontal view faces. The algorithm is based on a cumulative histogram approach with varying threshold values. First, the method adopts the Viola-Jones face detector to detect the location of the face and crop the face region in an image. Based on the structure of the human face, six relevant regions (right eyebrow, left eyebrow, right eye, left eye, nose, and mouth areas) are cropped from the face image. Then the histogram of each cropped region is computed, and its cumulative histogram value is used with varying threshold values to create a new filtered image in an adaptive way. The connected component of the area of interest in each filtered image indicates the respective feature region. A simple linear search algorithm for the eyebrow, eye, and mouth filtered images and a contour algorithm for the nose filtered image are applied to extract the desired corner points automatically. The method was tested on the large BioID frontal face database under different illuminations, expressions, and lighting conditions, and the experimental results show an average success rate of 95.27%.
Submitted 15 March, 2012;
originally announced March 2012.