Skip to main content

Showing 1–3 of 3 results for author: Oswal, Y

.
  1. arXiv:2405.04459  [pdf, other

    cs.AI cs.CV cs.LG cs.NE

    A Significantly Better Class of Activation Functions Than ReLU Like Activation Functions

    Authors: Mathew Mithra Noel, Yug Oswal

    Abstract: This paper introduces a significantly better class of activation functions than the almost universally used ReLU like and Sigmoidal class of activation functions. Two new activation functions referred to as the Cone and Parabolic-Cone that differ drastically from popular activation functions and significantly outperform these on the CIFAR-10 and Imagenette benchmmarks are proposed. The cone activa… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 14 pages

    MSC Class: 68T07

  2. arXiv:2310.02901  [pdf, other

    cs.NE cs.CV

    Efficient Vectorized Backpropagation Algorithms for Training Feedforward Networks Composed of Quadratic Neurons

    Authors: Mathew Mithra Noel, Venkataraman Muthiah-Nakarajan, Yug D Oswal

    Abstract: Higher order artificial neurons whose outputs are computed by applying an activation function to a higher order multinomial function of the inputs have been considered in the past, but did not gain acceptance due to the extra parameters and computational cost. However, higher order neurons have significantly greater learning capabilities since the decision boundaries of higher order neurons can be… ▽ More

    Submitted 21 April, 2025; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: 8 pages

    MSC Class: 68T07 ACM Class: I.5.0

  3. arXiv:2303.09935  [pdf, other

    cs.NE

    Alternate Loss Functions for Classification and Robust Regression Can Improve the Accuracy of Artificial Neural Networks

    Authors: Mathew Mithra Noel, Arindam Banerjee, Yug Oswal, Geraldine Bessie Amali D, Venkataraman Muthiah-Nakarajan

    Abstract: All machine learning algorithms use a loss, cost, utility or reward function to encode the learning objective and oversee the learning process. This function that supervises learning is a frequently unrecognized hyperparameter that determines how incorrect outputs are penalized and can be tuned to improve performance. This paper shows that training speed and final accuracy of neural networks can s… ▽ More

    Submitted 5 November, 2024; v1 submitted 17 March, 2023; originally announced March 2023.

    MSC Class: 68 ACM Class: I.2