Photometric Analysis for Predicting Star Formation Rates in Large Galaxies Using Machine Learning and Deep Learning Techniques
Authors:
Satvik Raghav,
Prasanth Ayitapu,
Sathwik Narkedimilli,
Sujith Makam,
Aswath Babu H
Abstract:
Star formation rates (SFRs) are a crucial observational tracer of galaxy formation and evolution. SFRs are traditionally estimated from spectroscopy, which is expensive. This study tests whether the SFRs of large samples of galaxies can be inferred from photometric data alone, using state-of-the-art machine learning and deep learning algorithms. The dataset adopted in this work is the one collected by Delli Veneri et al. (2019): it includes photometric data for more than 27 million galaxies from the Sloan Digital Sky Survey Data Release 7 (SDSS-DR7). The algorithms we implemented and compared are Linear Regression, Long Short-Term Memory (LSTM) networks, Support Vector Regression (SVR), Random Forest Regressor, Decision Tree Regressor, Gradient Boosting Regressor, and classical deep learning models.
Our results show that the Linear Regression model achieved a reported accuracy of 98.97 percent, as measured by the Mean Absolute Error (MAE), demonstrating that machine-learning approaches can be effective for photometric SFR estimation. The paper also reports the results of the other algorithms, providing a detailed comparison of their performance in photometric SFR estimation. This study not only shows that estimating SFRs from photometric data is promising but also opens a door toward the application of machine learning and deep learning in astrophysics.
Submitted 9 October, 2024;
originally announced October 2024.
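The kind of comparison described in the abstract above, a regressor scored by MAE on held-out galaxies, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the data are synthetic (five made-up magnitudes standing in for SDSS bands, with invented coefficients), the helper names `fit_ols`, `predict`, and `mae` are hypothetical, and the least-squares fit is a plain normal-equations solve, not the authors' pipeline.

```python
import random

def fit_ols(X, y):
    """Ordinary least squares via the normal equations (X^T X) w = X^T y."""
    rows = [[1.0] + list(r) for r in X]  # prepend an intercept column
    n = len(rows[0])
    # Augmented matrix [X^T X | X^T y]
    A = [[sum(r[i] * r[j] for r in rows) for j in range(n)]
         + [sum(r[i] * t for r, t in zip(rows, y))] for i in range(n)]
    # Gauss-Jordan elimination with partial pivoting
    for c in range(n):
        p = max(range(c, n), key=lambda k: abs(A[k][c]))
        A[c], A[p] = A[p], A[c]
        pivot = A[c][c]
        A[c] = [v / pivot for v in A[c]]
        for k in range(n):
            if k != c:
                f = A[k][c]
                A[k] = [vk - f * vc for vk, vc in zip(A[k], A[c])]
    return [A[i][n] for i in range(n)]  # [intercept, w1, ..., wn]

def predict(w, X):
    return [w[0] + sum(wi * xi for wi, xi in zip(w[1:], r)) for r in X]

def mae(y_true, y_pred):
    return sum(abs(a - b) for a, b in zip(y_true, y_pred)) / len(y_true)

# Synthetic stand-in data (assumption): 5 magnitudes per galaxy and a
# log-SFR target that is linear in them plus Gaussian noise.
rng = random.Random(0)
TRUE_W = [0.5, -0.30, 0.10, 0.05, 0.05, 0.10]  # invented coefficients
X = [[rng.uniform(15.0, 22.0) for _ in range(5)] for _ in range(500)]
y = [v + rng.gauss(0.0, 0.05) for v in predict(TRUE_W, X)]

# Hold out the last 100 galaxies for testing.
X_train, X_test, y_train, y_test = X[:400], X[400:], y[:400], y[400:]

w = fit_ols(X_train, y_train)
mae_linear = mae(y_test, predict(w, X_test))

# Baseline for context: always predict the training-set mean.
mean_train = sum(y_train) / len(y_train)
mae_baseline = mae(y_test, [mean_train] * len(y_test))

print(f"MAE linear:   {mae_linear:.4f}")
print(f"MAE baseline: {mae_baseline:.4f}")
```

Reporting the MAE next to a trivial baseline makes the number interpretable on its own scale. The sketch does not attempt to reproduce how the abstract converts MAE into a percentage accuracy, since the abstract does not say.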
Predicting Stellar Metallicity: A Comparative Analysis of Regression Models for Solar Twin Stars
Authors:
Sathwik Narkedimilli,
Satvik Raghav,
Sujith Makam,
Prasanth Ayitapu,
Aswath Babu H
Abstract:
This research focuses on predicting the metallicity ([Fe/H]) of solar twin stars using several regression techniques: Random Forest, Linear Regression, Decision Tree, Support Vector, and Gradient Boosting. The dataset includes stellar parameters and chemical abundances derived from a high-accuracy abundance catalog of solar twins from the GALAH survey. Missing values are handled through intensive preprocessing, including imputation. Each model is trained and then evaluated with several metrics: Mean Squared Error (MSE), Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), and R-squared. The models use feature sets comprising the effective temperature (Teff), the surface gravity (log g), and 14 chemical abundances: [Na/Fe], [Mg/Fe], [Al/Fe], [Si/Fe], [Ca/Fe], [Sc/Fe], [Ti/Fe], [Cr/Fe], [Mn/Fe], [Ni/Fe], [Cu/Fe], [Zn/Fe], [Y/Fe], and [Ba/Fe].
The target variable considered is the metallicity ([Fe/H]). The findings indicate that the Random Forest model achieved the highest accuracy, with an MSE of 0.001628 and an R-squared value of 0.9266. The results highlight the efficacy of ensemble methods in handling complex datasets with high dimensionality. Additionally, this study underscores the importance of selecting appropriate regression models for astronomical data analysis, providing a foundation for future research in predicting stellar properties with machine learning techniques.
Submitted 9 October, 2024;
originally announced October 2024.
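The four evaluation metrics named in the abstract above (MSE, MAE, RMSE, and R-squared) have standard definitions, which the following minimal sketch computes directly. The function name `regression_metrics` and the [Fe/H] values are hypothetical, chosen only to show the arithmetic; they are not taken from the paper or from the GALAH catalog.

```python
import math

def regression_metrics(y_true, y_pred):
    """Return MSE, MAE, RMSE, and R-squared for paired true/predicted values."""
    n = len(y_true)
    residuals = [t - p for t, p in zip(y_true, y_pred)]
    ss_res = sum(r * r for r in residuals)           # residual sum of squares
    mse = ss_res / n
    mae = sum(abs(r) for r in residuals) / n
    rmse = math.sqrt(mse)
    mean_true = sum(y_true) / n
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)  # total sum of squares
    r2 = 1.0 - ss_res / ss_tot
    return {"mse": mse, "mae": mae, "rmse": rmse, "r2": r2}

# Hypothetical [Fe/H] values in dex, for illustration only.
feh_true = [-0.10, 0.00, 0.05, 0.12]
feh_pred = [-0.08, 0.01, 0.03, 0.10]

metrics = regression_metrics(feh_true, feh_pred)
for name, value in metrics.items():
    print(f"{name}: {value:.6f}")
```

MSE and RMSE penalize large errors more heavily than MAE does, while R-squared expresses the residual error relative to the variance of the target, which is why abstracts such as this one often quote an error metric and R-squared together.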