-
Alternate Loss Functions for Classification and Robust Regression Can Improve the Accuracy of Artificial Neural Networks
Authors:
Mathew Mithra Noel,
Arindam Banerjee,
Yug Oswal,
Geraldine Bessie Amali D,
Venkataraman Muthiah-Nakarajan
Abstract:
All machine learning algorithms use a loss, cost, utility or reward function to encode the learning objective and oversee the learning process. This function that supervises learning is a frequently unrecognized hyperparameter that determines how incorrect outputs are penalized and can be tuned to improve performance. This paper shows that training speed and final accuracy of neural networks can s…
▽ More
All machine learning algorithms use a loss, cost, utility or reward function to encode the learning objective and oversee the learning process. This function that supervises learning is a frequently unrecognized hyperparameter that determines how incorrect outputs are penalized and can be tuned to improve performance. This paper shows that training speed and final accuracy of neural networks can significantly depend on the loss function used to train neural networks. In particular derivative values can be significantly different with different loss functions leading to significantly different performance after gradient descent based Backpropagation (BP) training. This paper explores the effect on performance of using new loss functions that are also convex but penalize errors differently compared to the popular Cross-entropy loss. Two new classification loss functions that significantly improve performance on a wide variety of benchmark tasks are proposed. A new loss function call smooth absolute error that outperforms the Squared error, Huber and Log-Cosh losses on datasets with significantly many outliers is proposed. This smooth absolute error loss function is infinitely differentiable and more closely approximates the absolute error loss compared to the Huber and Log-Cosh losses used for robust regression.
△ Less
Submitted 5 November, 2024; v1 submitted 17 March, 2023;
originally announced March 2023.
-
The 100-month Swift catalogue of supergiant fast X-ray transients II. SFXT diagnostics from outburst properties
Authors:
Romano P.,
Evans P. A.,
Bozzo E.,
Mangano V.,
Vercellone S.,
Guidorzi C.,
Ducci L.,
Kennea J. A.,
Barthelmy S. D.,
Palmer D. M.,
Krimm H. A.,
Cenko B.
Abstract:
Supergiant Fast X-ray Transients (SFXT) are High Mass X-ray Binaries displaying X-ray outbursts reaching peak luminosities of 10$^{38}$ erg/s and spend most of their life in more quiescent states with luminosities as low as 10$^{32}$-10$^{33}$ erg/s. The main goal of our comprehensive and uniform analysis of the SFXT Swift triggers is to provide tools to predict whether a transient which has no kn…
▽ More
Supergiant Fast X-ray Transients (SFXT) are High Mass X-ray Binaries displaying X-ray outbursts reaching peak luminosities of 10$^{38}$ erg/s and spend most of their life in more quiescent states with luminosities as low as 10$^{32}$-10$^{33}$ erg/s. The main goal of our comprehensive and uniform analysis of the SFXT Swift triggers is to provide tools to predict whether a transient which has no known X-ray counterpart may be an SFXT candidate. These tools can be exploited for the development of future missions exploring the variable X-ray sky through large FoV instruments. We examined all available data on outbursts of SFXTs that triggered the Swift/BAT collected between 2005-08-30 and 2014-12-31, in particular those for which broad-band data, including the Swift/XRT ones, are also available. We processed all BAT and XRT data uniformly with the Swift Burst Analyser to produce spectral evolution dependent flux light curves for each outburst. The BAT data allowed us to infer useful diagnostics to set SFXT triggers apart from the general GRB population, showing that SFXTs give rise uniquely to image triggers and are simultaneously very long, faint, and `soft' hard-X-ray transients. The BAT data alone can discriminate very well the SFXTs from other fast transients such as anomalous X-ray pulsars and soft gamma repeaters. However, to distinguish SFXTs from, for instance, accreting millisecond X-ray pulsars and jetted tidal disruption events, the XRT data collected around the time of the BAT triggers are decisive. The XRT observations of 35/52 SFXT BAT triggers show that in the soft X-ray energy band, SFXTs display a decay in flux from the peak of the outburst of at least 3 orders of magnitude within a day and rarely undergo large re-brightening episodes, favouring in most cases a rapid decay down to the quiescent level within 3-5 days (at most). [Abridged]
△ Less
Submitted 9 December, 2022;
originally announced December 2022.
-
The MUSE second-generation VLT instrument
Authors:
Bacon R.,
Accardo M.,
Adjali L.,
Anwand H.,
Bauer S.,
Biswas I.,
Blaizot J.,
Boudon D.,
Brau-Nogue S.,
Brinchmann J.,
Caillier P.,
Capoani L.,
Carollo C. M.,
Contini T.,
Couderc P.,
Daguise E.,
Deiries S.,
Delabre B.,
Dreizler S.,
Dubois J. P.,
Dupieux M.,
Dupuy C.,
Emsellem E.,
Fechner T.,
Fleischmann A.
, et al. (43 additional authors not shown)
Abstract:
The Multi Unit Spectroscopic Explorer (MUSE) is a second-generation VLT panoramic integral-field spectrograph currently in manufacturing, assembly and integration phase. MUSE has a field of 1x1 arcmin2 sampled at 0.2x0.2 arcsec2 and is assisted by the VLT ground layer adaptive optics ESO facility using four laser guide stars. The instrument is a large assembly of 24 identical high performance inte…
▽ More
The Multi Unit Spectroscopic Explorer (MUSE) is a second-generation VLT panoramic integral-field spectrograph currently in manufacturing, assembly and integration phase. MUSE has a field of 1x1 arcmin2 sampled at 0.2x0.2 arcsec2 and is assisted by the VLT ground layer adaptive optics ESO facility using four laser guide stars. The instrument is a large assembly of 24 identical high performance integral field units, each one composed of an advanced image slicer, a spectrograph and a 4kx4k detector. In this paper we review the progress of the manufacturing and report the performance achieved with the first integral field unit.
△ Less
Submitted 30 November, 2022;
originally announced November 2022.
-
Confirmation of the star J004229.87+410551.8 in M31 as a B[e] supergiant
Authors:
Sarkisyan A.,
Vinokurov A.,
Solovyeva Yu.,
Atapin K.,
Sholukhova O.,
Fabrika S.,
Bizyaev D
Abstract:
We study the luminous blue variable candidate J004229.87+410551.8 in the Andromeda Galaxy. Earlier, the star displayed a spectral anomaly: although a hot emission spectrum had been detected, it had strong CaII H and K absorption lines. Subsequently the star was assumed to be a hot hypergiant or a B[e] supergiant. For the purpose of clear star classification, we conducted spectroscopic and photomet…
▽ More
We study the luminous blue variable candidate J004229.87+410551.8 in the Andromeda Galaxy. Earlier, the star displayed a spectral anomaly: although a hot emission spectrum had been detected, it had strong CaII H and K absorption lines. Subsequently the star was assumed to be a hot hypergiant or a B[e] supergiant. For the purpose of clear star classification, we conducted spectroscopic and photometric analysis of the object and its surroundings with the 6-m telescope of SAO RAS, the 3.5-m ARC telescope of the Apache Point Observatory and the 2.5-m telescope of the Caucasus Mountain Observatory of the Sternberg Astronomical Institute. The spectrum of the star has the FeII, [FeII], [OI], and Balmer emission lines. Its spectral energy distribution shows an excess in the near-infrared range due to hot circumstellar dust. The indicated features and a high estimated value of star's luminosity ($\log (L/L_{\odot}) = 4.6\pm 0.2$) allow us to finally classify the object as a B[e] supergiant.
△ Less
Submitted 24 October, 2022;
originally announced October 2022.
-
Space Photometry with BRITE-Constellation
Authors:
Weiss W. W,
Zwintz K.,
Kuschnig R.,
Handler G.,
Moffat A. F. J.,
Baade D.,
Bowman D. M.,
Granzer T.,
Kallinger T.,
Koudelka O. F.,
Lovekin C. C.,
Neiner C.,
Pablo H.,
Pigulski A.,
Popowicz A.,
Ramiaramanantsoa T.,
Rucinski S. M.,
Strassmeier K. G.,
Wade G. A
Abstract:
BRITE-Constellation is devoted to high-precision optical photometric monitoring of bright stars, distributed all over the Milky Way, in red and/or blue passbands. Photometry from space avoids the turbulent and absorbing terrestrial atmosphere and allows for very long and continuous observing runs with high time resolution and thus provides the data necessary for understanding various processes ins…
▽ More
BRITE-Constellation is devoted to high-precision optical photometric monitoring of bright stars, distributed all over the Milky Way, in red and/or blue passbands. Photometry from space avoids the turbulent and absorbing terrestrial atmosphere and allows for very long and continuous observing runs with high time resolution and thus provides the data necessary for understanding various processes inside stars (e.g., asteroseismology) and in their immediate environment. While the first astronomical observations from space focused on the spectral regions not accessible from the ground it soon became obvious around 1970 that avoiding the turbulent terrestrial atmosphere significantly improved the accuracy of photometry and satellites explicitly dedicated to high-quality photometry were launched. A perfect example is BRITE-Constellation, which is the result of a very successful cooperation between Austria, Canada and Poland. Research highlights for targets distributed nearly over the entire HRD are presented, but focus primarily on massive and hot stars.
△ Less
Submitted 24 June, 2021;
originally announced June 2021.
-
Exploiting timing capabilities of the CHEOPS mission with warm-Jupiter planets
Authors:
Borsato L,
Piotto G,
Gandolfi D,
Nascimbeni V,
Lacedelli G,
Marzari F,
Billot N,
Maxted P,
Sousa S G,
Cameron A C,
Bonfanti A,
Wilson T,
Serrano L,
Garai Z,
Alibert Y,
Alonso R,
Asquier J,
Bárczy T,
Bandy T,
Barrado D,
Barros S C,
Baumjohann W,
Beck M,
Beck T,
Benz W
, et al. (53 additional authors not shown)
Abstract:
We present 17 transit light curves of seven known warm-Jupiters observed with the CHaracterising ExOPlanet Satellite (CHEOPS). The light curves have been collected as part of the CHEOPS Guaranteed Time Observation (GTO) program that searches for transit-timing variation (TTV) of warm-Jupiters induced by a possible external perturber to shed light on the evolution path of such planetary systems. We…
▽ More
We present 17 transit light curves of seven known warm-Jupiters observed with the CHaracterising ExOPlanet Satellite (CHEOPS). The light curves have been collected as part of the CHEOPS Guaranteed Time Observation (GTO) program that searches for transit-timing variation (TTV) of warm-Jupiters induced by a possible external perturber to shed light on the evolution path of such planetary systems. We describe the CHEOPS observation process, from the planning to the data analysis. In this work we focused on the timing performance of CHEOPS, the impact of the sampling of the transit phases, and the improvement we can obtain combining multiple transits together. We reached the highest precision on the transit time of about 13-16 s for the brightest target (WASP-38, G = 9.2) in our sample. From the combined analysis of multiple transits of fainter targets with G >= 11 we obtained a timing precision of about 2 min. Additional observations with CHEOPS, covering a longer temporal baseline, will further improve the precision on the transit times and will allow us to detect possible TTV signals induced by an external perturber.
△ Less
Submitted 21 June, 2021;
originally announced June 2021.
-
Control Intervention Strategies for Within-Host, Between-Host and their Efficacy in the Treatment, Spread of COVID-19 : A Multi Scale Modeling Approach
Authors:
Bhanu Prakash D,
D. K. K. Vamsi,
D. Bangaru Rajesh,
Carani B Sanjeevi
Abstract:
The COVID-19 pandemic has resulted in more than 14.5 million infections and 6,04,917 deaths in 212 countries over the last few months. Different drug intervention acting at multiple stages of pathogenesis of COVID-19 can substantially reduce the infection induced,thereby decreasing the mortality. Also population level control strategies can reduce the spread of the COVID-19 substantially. Motivate…
▽ More
The COVID-19 pandemic has resulted in more than 14.5 million infections and 6,04,917 deaths in 212 countries over the last few months. Different drug intervention acting at multiple stages of pathogenesis of COVID-19 can substantially reduce the infection induced,thereby decreasing the mortality. Also population level control strategies can reduce the spread of the COVID-19 substantially. Motivated by these observations, in this work we propose and study a multi scale model linking both within-host and between-host dynamics of COVID-19. Initially the natural history dealing with the disease dynamics is studied. Later, comparative effectiveness is performed to understand the efficacy of both the within-host and population level interventions. Findings of this study suggest that a combined strategy involving treatment with drugs such as Arbidol, remdesivir, Lopinavir/Ritonavir that inhibits viral replication and immunotherapies like monoclonal antibodies, along with environmental hygiene and generalized social distancing proved to be the best and optimal in reducing the basic reproduction number and environmental spread of the virus at the population level.
△ Less
Submitted 24 September, 2020;
originally announced September 2020.
-
KRNET: Image Denoising with Kernel Regulation Network
Authors:
Peng Liu,
Xiaoxiao Zhou,
Junyiyang Li,
El Basha Mohammad D,
Ruogu Fang
Abstract:
One popular strategy for image denoising is to design a generalized regularization term that is capable of exploring the implicit prior underlying data observation. Convolutional neural networks (CNN) have shown the powerful capability to learn image prior information through a stack of layers defined by a combination of kernels (filters) on the input. However, existing CNN-based methods mainly fo…
▽ More
One popular strategy for image denoising is to design a generalized regularization term that is capable of exploring the implicit prior underlying data observation. Convolutional neural networks (CNN) have shown the powerful capability to learn image prior information through a stack of layers defined by a combination of kernels (filters) on the input. However, existing CNN-based methods mainly focus on synthetic gray-scale images. These methods still exhibit low performance when tackling multi-channel color image denoising. In this paper, we optimize CNN regularization capability by developing a kernel regulation module. In particular, we propose a kernel regulation network-block, referred to as KR-block, by integrating the merits of both large and small kernels, that can effectively estimate features in solving image denoising. We build a deep CNN-based denoiser, referred to as KRNET, via concatenating multiple KR-blocks. We evaluate KRNET on additive white Gaussian noise (AWGN), multi-channel (MC) noise, and realistic noise, where KRNET obtains significant performance gains over state-of-the-art methods across a wide spectrum of noise levels.
△ Less
Submitted 19 October, 2019;
originally announced October 2019.
-
Image Restoration Using Deep Regulated Convolutional Networks
Authors:
Peng Liu,
Xiaoxiao Zhou,
Yangjunyi Li,
El Basha Mohammad D,
Ruogu Fang
Abstract:
While the depth of convolutional neural networks has attracted substantial attention in the deep learning research, the width of these networks has recently received greater interest. The width of networks, defined as the size of the receptive fields and the density of the channels, has demonstrated crucial importance in low-level vision tasks such as image denoising and restoration. However, the…
▽ More
While the depth of convolutional neural networks has attracted substantial attention in the deep learning research, the width of these networks has recently received greater interest. The width of networks, defined as the size of the receptive fields and the density of the channels, has demonstrated crucial importance in low-level vision tasks such as image denoising and restoration. However, the limited generalization ability, due to the increased width of networks, creates a bottleneck in designing wider networks. In this paper, we propose the Deep Regulated Convolutional Network (RC-Net), a deep network composed of regulated sub-network blocks cascaded by skip-connections, to overcome this bottleneck. Specifically, the Regulated Convolution block (RC-block), featured by a combination of large and small convolution filters, balances the effectiveness of prominent feature extraction and the generalization ability of the network. RC-Nets have several compelling advantages: they embrace diversified features through large-small filter combinations, alleviate the hazy boundary and blurred details in image denoising and super-resolution problems, and stabilize the learning process. Our proposed RC-Nets outperform state-of-the-art approaches with significant performance gains in various image restoration tasks while demonstrating promising generalization ability. The code is available at https://github.com/cswin/RC-Nets.
△ Less
Submitted 21 June, 2024; v1 submitted 19 October, 2019;
originally announced October 2019.
-
An iterative estimation for disturbances of semi-wavefronts to the delayed Fisher-KPP equation
Authors:
Rafael Benguria D.,
Abraham Solar
Abstract:
We give an iterative method to estimate the disturbance of semi-wavefronts of the equation: $\dot{u}(t,x) = u''(t,x) +u(t,x)(1-u(t-h,x)),$ $x \in \mathbb{R},\ t >0;$ where $h>0.$ As a consequence, we show the exponential stability, with an unbounded weight, of semi-wavefronts with speed $c>2\sqrt{2}$ and $h>0$. Under the same restriction of $c$ and $h$, the uniqueness of semi-wavefronts is obtaine…
▽ More
We give an iterative method to estimate the disturbance of semi-wavefronts of the equation: $\dot{u}(t,x) = u''(t,x) +u(t,x)(1-u(t-h,x)),$ $x \in \mathbb{R},\ t >0;$ where $h>0.$ As a consequence, we show the exponential stability, with an unbounded weight, of semi-wavefronts with speed $c>2\sqrt{2}$ and $h>0$. Under the same restriction of $c$ and $h$, the uniqueness of semi-wavefronts is obtained.
△ Less
Submitted 11 June, 2018;
originally announced June 2018.
-
EURONEAR - Recovery, Follow-up and Discovery of NEAs and MBAs using Large Field 1-2m Telescopes
Authors:
O. Vaduvescu,
M. Birlan,
A. Tudorica,
A. Sonka,
F. Pozo N.,
A. Barr D.,
D. J. Asher,
J. Licandro,
J. L. Ortiz,
E. Unda-Sanzana,
M. Popescu,
A. Nedelcu,
D. Dumitru,
R. Toma,
I. Comsa,
C. Vancea,
D. Vidican,
C. Opriseanu,
T. Badescu,
M. Badea,
M. Constantinescu
Abstract:
We report on the follow-up and recovery of 100 program NEAs, PHAs and VIs using the ESO/MPG 2.2m, Swope 1m and INT 2.5m telescopes equipped with large field cameras. The 127 fields observed during 11 nights covered 29 square degrees. Using these data, we present the incidental survey work which includes 558 known MBAs and 628 unknown moving objects mostly consistent with MBAs from which 58 objects…
▽ More
We report on the follow-up and recovery of 100 program NEAs, PHAs and VIs using the ESO/MPG 2.2m, Swope 1m and INT 2.5m telescopes equipped with large field cameras. The 127 fields observed during 11 nights covered 29 square degrees. Using these data, we present the incidental survey work which includes 558 known MBAs and 628 unknown moving objects mostly consistent with MBAs from which 58 objects became official discoveries. We planned the runs using six criteria and four servers which focus mostly on faint and poorly observed objects in need of confirmation, follow-up and recovery. We followed 62 faint NEAs within one month after discovery and we recovered 10 faint NEAs having big uncertainties at their second or later opposition. Using the INT we eliminated 4 PHA candidates and VIs. We observed in total 1,286 moving objects and we reported more than 10,000 positions. All data were reduced by the members of our network in a team effort, and reported promptly to the MPC. The positions of the program NEAs were published in 27 MPC and MPEC references and used to improve their orbits. The O-C residuals for known MBAs and program NEAs are smallest for the ESO/MPG and Swope and about four times larger for the INT whose field is more distorted. The incidental survey allowed us to study statistics of the MBA and NEA populations observable today with 1--2m facilities. We calculate preliminary orbits for all unknown objects, classifying them as official discoveries, later identifications and unknown outstanding objects. The orbital elements a, e, i calculated by FIND_ORB software for the official discoveries and later identified objects are very similar with the published elements which take into account longer observational arcs; thus preliminary orbits were used in statistics for the whole unknown dataset. (CONTINUED)
△ Less
Submitted 29 August, 2011;
originally announced August 2011.
-
The structure of graphite oxide: Investigation of its surface chemical groups
Authors:
D. W. Lee,
L. De Los Santos V.,
J. W. Seo,
L. Leon Felix,
A. Bustamante D.,
J. M. Cole,
C. ~H. ~W. ~Barnes
Abstract:
The structure of graphite oxide (GO) has been systematically studied using various tools such as SEM, TEM, XRD, Fourier transform infrared spectroscopy (FT-IR), X-ray photoemission spectroscopy (XPS), 13C solid state NMR, and O K-edge X-ray absorption near edge structure (XANES). The TEM data reveal that GO consists of amorphous and crystalline phases. The XPS data show that some carbon atoms have…
▽ More
The structure of graphite oxide (GO) has been systematically studied using various tools such as SEM, TEM, XRD, Fourier transform infrared spectroscopy (FT-IR), X-ray photoemission spectroscopy (XPS), 13C solid state NMR, and O K-edge X-ray absorption near edge structure (XANES). The TEM data reveal that GO consists of amorphous and crystalline phases. The XPS data show that some carbon atoms have sp3 orbitals and others have sp2 orbitals. The ratio of sp2 to sp3 bonded carbon atoms decreases as sample preparation times increase. The 13C solid-state NMR spectra of GO indicate the existence of -OH and -O- groups for which peaks appear at 60 and 70 ppm, respectively. FT-IR results corroborate these findings. The existence of ketone groups is also implied by FT-IR, which is verified by O K-edge XANES and 13C solid-state NMR. We propose a new model for GO based on the results; -O-, -OH, and -C=O groups are on the surface.
△ Less
Submitted 5 August, 2010;
originally announced August 2010.