-
Some Results on Generalized Familywise Error Rate Controlling Procedures under Dependence
Authors:
Monitirtha Dey,
Subir Kumar Bhandari
Abstract:
The topic of multiple hypotheses testing now has a potpourri of novel theories and ubiquitous applications in diverse scientific fields. However, the universal utility of this field often hinders the possibility of having a generalized theory that accommodates every scenario. This tradeoff is better reflected through the lens of dependence, a central piece behind the theoretical and applied develo…
▽ More
The topic of multiple hypotheses testing now has a potpourri of novel theories and ubiquitous applications in diverse scientific fields. However, the universal utility of this field often hinders the possibility of having a generalized theory that accommodates every scenario. This tradeoff is better reflected through the lens of dependence, a central piece behind the theoretical and applied developments of multiple testing. Although omnipresent in many scientific avenues, the nature and extent of dependence vary substantially with the context and complexity of the particular scenario. Positive dependence is the norm in testing many treatments versus a single control or in spatial statistics. On the contrary, negative dependence arises naturally in tests based on split samples and in cyclical, ordered comparisons. In GWAS, the SNP markers are generally considered to be weakly dependent. Generalized familywise error rate (k-FWER) control has been one of the prominent frequentist approaches in simultaneous inference. However, the performances of k-FWER controlling procedures are yet unexplored under different dependencies. This paper revisits the classical testing problem of normal means in different correlated frameworks. We establish upper bounds on the generalized familywise error rates under each dependence, consequently giving rise to improved testing procedures. Towards this, we present improved probability inequalities, which are of independent theoretical interest
△ Less
Submitted 24 April, 2025;
originally announced April 2025.
-
Progressive Rock Music Classification
Authors:
Arpan Nagar,
Joseph Bensabat,
Jokent Gaza,
Moinak Dey
Abstract:
This study investigates the classification of progressive rock music, a genre characterized by complex compositions and diverse instrumentation, distinct from other musical styles. Addressing this Music Information Retrieval (MIR) task, we extracted comprehensive audio features, including spectrograms, Mel-Frequency Cepstral Coefficients (MFCCs), chromagrams, and beat positions from song snippets…
▽ More
This study investigates the classification of progressive rock music, a genre characterized by complex compositions and diverse instrumentation, distinct from other musical styles. Addressing this Music Information Retrieval (MIR) task, we extracted comprehensive audio features, including spectrograms, Mel-Frequency Cepstral Coefficients (MFCCs), chromagrams, and beat positions from song snippets using the Librosa library. A winner-take-all voting strategy was employed to aggregate snippet-level predictions into final song classifications. We conducted a comparative analysis of various machine learning techniques. Ensemble methods, encompassing Bagging (Random Forest, ExtraTrees, Bagging Classifier) and Boosting (XGBoost, Gradient Boosting), were explored, utilizing Principal Component Analysis (PCA) for dimensionality reduction to manage computational constraints with high-dimensional feature sets. Additionally, deep learning approaches were investigated, including the development of custom 1D Convolutional Neural Network (1D CNN) architectures (named "Zuck" and "Satya") featuring specific layer configurations, normalization, and activation functions. Furthermore, we fine-tuned a state-of-the-art Audio Spectrogram Transformer (AST) model, leveraging its attention-based mechanisms for audio classification. Performance evaluation on validation and test sets revealed varying effectiveness across models, with ensemble methods like Extra Trees achieving test accuracies up to 76.38%. This research provides insights into the application and relative performance of diverse machine learning paradigms for the nuanced task of progressive rock genre classification.
△ Less
Submitted 14 April, 2025;
originally announced April 2025.
-
MMTEB: Massive Multilingual Text Embedding Benchmark
Authors:
Kenneth Enevoldsen,
Isaac Chung,
Imene Kerboua,
Márton Kardos,
Ashwin Mathur,
David Stap,
Jay Gala,
Wissam Siblini,
Dominik Krzemiński,
Genta Indra Winata,
Saba Sturua,
Saiteja Utpala,
Mathieu Ciancone,
Marion Schaeffer,
Gabriel Sequeira,
Diganta Misra,
Shreeya Dhakal,
Jonathan Rystrøm,
Roman Solomatin,
Ömer Çağatan,
Akash Kundu,
Martin Bernstorff,
Shitao Xiao,
Akshita Sukhlecha,
Bhavish Pahwa
, et al. (61 additional authors not shown)
Abstract:
Text embeddings are typically evaluated on a limited set of tasks, which are constrained by language, domain, and task diversity. To address these limitations and provide a more comprehensive evaluation, we introduce the Massive Multilingual Text Embedding Benchmark (MMTEB) - a large-scale, community-driven expansion of MTEB, covering over 500 quality-controlled evaluation tasks across 250+ langua…
▽ More
Text embeddings are typically evaluated on a limited set of tasks, which are constrained by language, domain, and task diversity. To address these limitations and provide a more comprehensive evaluation, we introduce the Massive Multilingual Text Embedding Benchmark (MMTEB) - a large-scale, community-driven expansion of MTEB, covering over 500 quality-controlled evaluation tasks across 250+ languages. MMTEB includes a diverse set of challenging, novel tasks such as instruction following, long-document retrieval, and code retrieval, representing the largest multilingual collection of evaluation tasks for embedding models to date. Using this collection, we develop several highly multilingual benchmarks, which we use to evaluate a representative set of models. We find that while large language models (LLMs) with billions of parameters can achieve state-of-the-art performance on certain language subsets and task categories, the best-performing publicly available model is multilingual-e5-large-instruct with only 560 million parameters. To facilitate accessibility and reduce computational cost, we introduce a novel downsampling method based on inter-task correlation, ensuring a diverse selection while preserving relative model rankings. Furthermore, we optimize tasks such as retrieval by sampling hard negatives, creating smaller but effective splits. These optimizations allow us to introduce benchmarks that drastically reduce computational demands. For instance, our newly introduced zero-shot English benchmark maintains a ranking order similar to the full-scale version but at a fraction of the computational cost.
△ Less
Submitted 8 June, 2025; v1 submitted 19 February, 2025;
originally announced February 2025.
-
StructTest: Benchmarking LLMs' Reasoning through Compositional Structured Outputs
Authors:
Hailin Chen,
Fangkai Jiao,
Mathieu Ravaut,
Nawshad Farruque,
Xuan Phi Nguyen,
Chengwei Qin,
Manan Dey,
Bosheng Ding,
Caiming Xiong,
Shafiq Joty,
Yingbo Zhou
Abstract:
The rapid advancement of large language models (LLMs) demands robust, unbiased, and scalable evaluation methods. However, human annotations are costly to scale, model-based evaluations are susceptible to stylistic biases, and target-answer-based benchmarks are vulnerable to data contamination and cheating. To address these limitations, we propose StructTest, a novel benchmark that evaluates LLMs o…
▽ More
The rapid advancement of large language models (LLMs) demands robust, unbiased, and scalable evaluation methods. However, human annotations are costly to scale, model-based evaluations are susceptible to stylistic biases, and target-answer-based benchmarks are vulnerable to data contamination and cheating. To address these limitations, we propose StructTest, a novel benchmark that evaluates LLMs on their ability to follow compositional instructions and generate structured outputs, providing an unbiased, cost-effective, and difficult-to-cheat evaluation framework. Assessments are conducted deterministically using a rule-based evaluator, which can be easily extended to new tasks and datasets. By testing structured outputs across diverse domains including Summarization, Code, HTML, and Math, and evaluating 17 popular LLMs, we demonstrate that StructTest remains challenging even for top-performing models like Deepseek-V3/R1 and GPT-4o, establishing it as a robust proxy for measuring reasoning capabilities. We believe StructTest offers a critical and complementary approach to achieving objective and comprehensive model evaluation.
△ Less
Submitted 19 March, 2025; v1 submitted 23 December, 2024;
originally announced December 2024.
-
Bridging the Data Provenance Gap Across Text, Speech and Video
Authors:
Shayne Longpre,
Nikhil Singh,
Manuel Cherep,
Kushagra Tiwary,
Joanna Materzynska,
William Brannon,
Robert Mahari,
Naana Obeng-Marnu,
Manan Dey,
Mohammed Hamdy,
Nayan Saxena,
Ahmad Mustafa Anis,
Emad A. Alghamdi,
Vu Minh Chien,
Da Yin,
Kun Qian,
Yizhi Li,
Minnie Liang,
An Dinh,
Shrestha Mohanty,
Deividas Mataciunas,
Tobin South,
Jianguo Zhang,
Ariel N. Lee,
Campbell S. Lund
, et al. (18 additional authors not shown)
Abstract:
Progress in AI is driven largely by the scale and quality of training data. Despite this, there is a deficit of empirical analysis examining the attributes of well-established datasets beyond text. In this work we conduct the largest and first-of-its-kind longitudinal audit across modalities--popular text, speech, and video datasets--from their detailed sourcing trends and use restrictions to thei…
▽ More
Progress in AI is driven largely by the scale and quality of training data. Despite this, there is a deficit of empirical analysis examining the attributes of well-established datasets beyond text. In this work we conduct the largest and first-of-its-kind longitudinal audit across modalities--popular text, speech, and video datasets--from their detailed sourcing trends and use restrictions to their geographical and linguistic representation. Our manual analysis covers nearly 4000 public datasets between 1990-2024, spanning 608 languages, 798 sources, 659 organizations, and 67 countries. We find that multimodal machine learning applications have overwhelmingly turned to web-crawled, synthetic, and social media platforms, such as YouTube, for their training sets, eclipsing all other sources since 2019. Secondly, tracing the chain of dataset derivations we find that while less than 33% of datasets are restrictively licensed, over 80% of the source content in widely-used text, speech, and video datasets, carry non-commercial restrictions. Finally, counter to the rising number of languages and geographies represented in public AI training datasets, our audit demonstrates measures of relative geographical and multilingual representation have failed to significantly improve their coverage since 2013. We believe the breadth of our audit enables us to empirically examine trends in data sourcing, restrictions, and Western-centricity at an ecosystem-level, and that visibility into these questions are essential to progress in responsible AI. As a contribution to ongoing improvements in dataset transparency and responsible use, we release our entire multimodal audit, allowing practitioners to trace data provenance across text, speech, and video.
△ Less
Submitted 18 February, 2025; v1 submitted 18 December, 2024;
originally announced December 2024.
-
Asymptotics in Multiple Hypotheses Testing under Dependence: beyond Normality
Authors:
Monitirtha Dey
Abstract:
Correlated observations are ubiquitous phenomena in a plethora of scientific avenues. Tackling this dependence among test statistics has been one of the pertinent problems in simultaneous inference. However, very little literature exists that elucidates the effect of correlation on different testing procedures under general distributional assumptions. In this work, we address this gap in a unified…
▽ More
Correlated observations are ubiquitous phenomena in a plethora of scientific avenues. Tackling this dependence among test statistics has been one of the pertinent problems in simultaneous inference. However, very little literature exists that elucidates the effect of correlation on different testing procedures under general distributional assumptions. In this work, we address this gap in a unified way by considering the multiple testing problem under a general correlated framework. We establish an upper bound on the family-wise error rate(FWER) of Bonferroni's procedure for equicorrelated test statistics. Consequently, we find that for a quite general class of distributions, Bonferroni FWER asymptotically tends to zero when the number of hypotheses approaches infinity. We extend this result to general positively correlated elliptically contoured setups. We also present examples of distributions for which Bonferroni FWER has a strictly positive limit under equicorrelation. We extend the limiting zero results to the class of step-down procedures under quite general correlated setups. Specifically, the probability of rejecting at least one hypothesis approaches zero asymptotically for any step-down procedure. The results obtained in this work generalize existing results for correlated Normal test statistics and facilitate new insights into the performances of multiple testing procedures under dependence.
△ Less
Submitted 18 November, 2024;
originally announced November 2024.
-
Consent in Crisis: The Rapid Decline of the AI Data Commons
Authors:
Shayne Longpre,
Robert Mahari,
Ariel Lee,
Campbell Lund,
Hamidah Oderinwale,
William Brannon,
Nayan Saxena,
Naana Obeng-Marnu,
Tobin South,
Cole Hunter,
Kevin Klyman,
Christopher Klamm,
Hailey Schoelkopf,
Nikhil Singh,
Manuel Cherep,
Ahmad Anis,
An Dinh,
Caroline Chitongo,
Da Yin,
Damien Sileo,
Deividas Mataciunas,
Diganta Misra,
Emad Alghamdi,
Enrico Shippole,
Jianguo Zhang
, et al. (24 additional authors not shown)
Abstract:
General-purpose artificial intelligence (AI) systems are built on massive swathes of public web data, assembled into corpora such as C4, RefinedWeb, and Dolma. To our knowledge, we conduct the first, large-scale, longitudinal audit of the consent protocols for the web domains underlying AI training corpora. Our audit of 14,000 web domains provides an expansive view of crawlable web data and how co…
▽ More
General-purpose artificial intelligence (AI) systems are built on massive swathes of public web data, assembled into corpora such as C4, RefinedWeb, and Dolma. To our knowledge, we conduct the first, large-scale, longitudinal audit of the consent protocols for the web domains underlying AI training corpora. Our audit of 14,000 web domains provides an expansive view of crawlable web data and how codified data use preferences are changing over time. We observe a proliferation of AI-specific clauses to limit use, acute differences in restrictions on AI developers, as well as general inconsistencies between websites' expressed intentions in their Terms of Service and their robots.txt. We diagnose these as symptoms of ineffective web protocols, not designed to cope with the widespread re-purposing of the internet for AI. Our longitudinal analyses show that in a single year (2023-2024) there has been a rapid crescendo of data restrictions from web sources, rendering ~5%+ of all tokens in C4, or 28%+ of the most actively maintained, critical sources in C4, fully restricted from use. For Terms of Service crawling restrictions, a full 45% of C4 is now restricted. If respected or enforced, these restrictions are rapidly biasing the diversity, freshness, and scaling laws for general-purpose AI systems. We hope to illustrate the emerging crises in data consent, for both developers and creators. The foreclosure of much of the open web will impact not only commercial AI, but also non-commercial AI and academic research.
△ Less
Submitted 24 July, 2024; v1 submitted 20 July, 2024;
originally announced July 2024.
-
Neutrino Mixing from a Fresh Perspective
Authors:
Pralay Chakraborty,
Manash Dey,
Biswajit Karmakar,
Subhankar Roy
Abstract:
We propose a neutrino mass matrix texture bearing a suitable correlation $m_{22}=-2\,m_{13}$ and study its phenomenological implications. In light of both normal and inverted hierarchies, the texture imposes specific bounds on some observational parameters. As a potential application, the prediction of effective Majorana neutrino mass $m_{ββ}$ is visualized for both hierarchies. To understand the…
▽ More
We propose a neutrino mass matrix texture bearing a suitable correlation $m_{22}=-2\,m_{13}$ and study its phenomenological implications. In light of both normal and inverted hierarchies, the texture imposes specific bounds on some observational parameters. As a potential application, the prediction of effective Majorana neutrino mass $m_{ββ}$ is visualized for both hierarchies. To understand the proposed texture from the first principle, we incorporate the type-I+II seesaw mechanism in association with $A_4 \times Z_{10} \times Z_2$ group.
△ Less
Submitted 16 September, 2024; v1 submitted 16 May, 2024;
originally announced May 2024.
-
Revisiting the Dirac Nature of Neutrinos in the Light of $Δ(27)$ and Cyclic Symmetries
Authors:
Manash Dey,
Subhankar Roy
Abstract:
Amid the uncertainty regarding the fundamental nature of neutrinos, we adhere to the Dirac description, and construct a model in the framework of $Δ(27)$ symmetry. The model successfully accounts for the hierarchical patterns of both charged lepton and neutrino masses. The neutrino mass matrix exhibits four texture zeroes, and the associated mixing scheme aligns with the experimental data, notably…
▽ More
Amid the uncertainty regarding the fundamental nature of neutrinos, we adhere to the Dirac description, and construct a model in the framework of $Δ(27)$ symmetry. The model successfully accounts for the hierarchical patterns of both charged lepton and neutrino masses. The neutrino mass matrix exhibits four texture zeroes, and the associated mixing scheme aligns with the experimental data, notably controlled by a single parameter.
△ Less
Submitted 9 January, 2025; v1 submitted 19 March, 2024;
originally announced March 2024.
-
StarCoder 2 and The Stack v2: The Next Generation
Authors:
Anton Lozhkov,
Raymond Li,
Loubna Ben Allal,
Federico Cassano,
Joel Lamy-Poirier,
Nouamane Tazi,
Ao Tang,
Dmytro Pykhtar,
Jiawei Liu,
Yuxiang Wei,
Tianyang Liu,
Max Tian,
Denis Kocetkov,
Arthur Zucker,
Younes Belkada,
Zijian Wang,
Qian Liu,
Dmitry Abulkhanov,
Indraneil Paul,
Zhuang Li,
Wen-Ding Li,
Megan Risdal,
Jia Li,
Jian Zhu,
Terry Yue Zhuo
, et al. (41 additional authors not shown)
Abstract:
The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership with Software Heritage (SWH), we build The Stack v2 on top of the digital commons of their source code archive. Alongside the SWH repositories spanning 619 programming languages, we carefully select other high-quality data…
▽ More
The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership with Software Heritage (SWH), we build The Stack v2 on top of the digital commons of their source code archive. Alongside the SWH repositories spanning 619 programming languages, we carefully select other high-quality data sources, such as GitHub pull requests, Kaggle notebooks, and code documentation. This results in a training set that is 4x larger than the first StarCoder dataset. We train StarCoder2 models with 3B, 7B, and 15B parameters on 3.3 to 4.3 trillion tokens and thoroughly evaluate them on a comprehensive set of Code LLM benchmarks. We find that our small model, StarCoder2-3B, outperforms other Code LLMs of similar size on most benchmarks, and also outperforms StarCoderBase-15B. Our large model, StarCoder2- 15B, significantly outperforms other models of comparable size. In addition, it matches or outperforms CodeLlama-34B, a model more than twice its size. Although DeepSeekCoder- 33B is the best-performing model at code completion for high-resource languages, we find that StarCoder2-15B outperforms it on math and code reasoning benchmarks, as well as several low-resource languages. We make the model weights available under an OpenRAIL license and ensure full transparency regarding the training data by releasing the SoftWare Heritage persistent IDentifiers (SWHIDs) of the source code data.
△ Less
Submitted 29 February, 2024;
originally announced February 2024.
-
Asymptotically Optimal Sequential Multiple Testing Procedures for Correlated Normal
Authors:
Monitirtha Dey,
Subir Kumar Bhandari
Abstract:
Simultaneous statistical inference has been a cornerstone in the statistics methodology literature because of its fundamental theory and paramount applications. The mainstream multiple testing literature has traditionally considered two frameworks: the sample size is deterministic, and the test statistics corresponding to different tests are independent. However, in many modern scientific avenues,…
▽ More
Simultaneous statistical inference has been a cornerstone in the statistics methodology literature because of its fundamental theory and paramount applications. The mainstream multiple testing literature has traditionally considered two frameworks: the sample size is deterministic, and the test statistics corresponding to different tests are independent. However, in many modern scientific avenues, these assumptions are often violated. There is little study that explores the multiple testing problem in a sequential framework where the test statistics corresponding to the various streams are dependent. This work fills this gap in a unified way by considering the classical means-testing problem in an equicorrelated Gaussian and sequential framework. We focus on sequential test procedures that control the type I and type II familywise error probabilities at pre-specified levels. We establish that our proposed test procedures achieve the optimal expected sample sizes under every possible signal configuration asymptotically, as the two error probabilities vanish at arbitrary rates. Towards this, we elucidate that the ratio of the expected sample size of our proposed rule and that of the classical SPRT goes to one asymptotically, thus illustrating their connection. Generalizing this, we show that our proposed procedures, with appropriately adjusted critical values, are asymptotically optimal for controlling any multiple testing error metric lying between multiples of FWER in a certain sense. This class of metrics includes FDR/FNR, pFDR/pFNR, the per-comparison and per-family error rates, and the false positive rate.
△ Less
Submitted 20 March, 2025; v1 submitted 28 September, 2023;
originally announced September 2023.
-
Unveiling Neutrino Mysteries with $Δ(27)$ Symmetry
Authors:
Manash Dey,
Subhankar Roy
Abstract:
An elegant model is proposed by extending the Standard Model using the $Δ(27)\times Z_3 \times Z_{10}$ symmetry within the framework of the Type-I + Type-II seesaw mechanism. This model is particularly noteworthy for its ability to restrict the atmospheric mixing angle, $θ_{23}$, to specific values, and provides an explanation for the observed hierarchy of charged lepton masses. The neutrino mass…
▽ More
An elegant model is proposed by extending the Standard Model using the $Δ(27)\times Z_3 \times Z_{10}$ symmetry within the framework of the Type-I + Type-II seesaw mechanism. This model is particularly noteworthy for its ability to restrict the atmospheric mixing angle, $θ_{23}$, to specific values, and provides an explanation for the observed hierarchy of charged lepton masses. The neutrino mass matrix texture defined by three real parameters, predicts the three neutrino mass eigenvalues and the two Majorana phases. Furthermore, the model is tested against the experimental results of neutrino-less double beta ($0νββ$) decay and charged lepton flavour violation (cLFV) experiments.
△ Less
Submitted 10 July, 2024; v1 submitted 26 September, 2023;
originally announced September 2023.
-
StarCoder: may the source be with you!
Authors:
Raymond Li,
Loubna Ben Allal,
Yangtian Zi,
Niklas Muennighoff,
Denis Kocetkov,
Chenghao Mou,
Marc Marone,
Christopher Akiki,
Jia Li,
Jenny Chim,
Qian Liu,
Evgenii Zheltonozhskii,
Terry Yue Zhuo,
Thomas Wang,
Olivier Dehaene,
Mishig Davaadorj,
Joel Lamy-Poirier,
João Monteiro,
Oleh Shliazhko,
Nicolas Gontier,
Nicholas Meade,
Armel Zebaze,
Ming-Ho Yee,
Logesh Kumar Umapathi,
Jian Zhu
, et al. (42 additional authors not shown)
Abstract:
The BigCode community, an open-scientific collaboration working on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder and StarCoderBase: 15.5B parameter models with 8K context length, infilling capabilities and fast large-batch inference enabled by multi-query attention. StarCoderBase is trained on 1 trillion tokens sourced from The Stack, a large colle…
▽ More
The BigCode community, an open-scientific collaboration working on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder and StarCoderBase: 15.5B parameter models with 8K context length, infilling capabilities and fast large-batch inference enabled by multi-query attention. StarCoderBase is trained on 1 trillion tokens sourced from The Stack, a large collection of permissively licensed GitHub repositories with inspection tools and an opt-out process. We fine-tuned StarCoderBase on 35B Python tokens, resulting in the creation of StarCoder. We perform the most comprehensive evaluation of Code LLMs to date and show that StarCoderBase outperforms every open Code LLM that supports multiple programming languages and matches or outperforms the OpenAI code-cushman-001 model. Furthermore, StarCoder outperforms every model that is fine-tuned on Python, can be prompted to achieve 40\% pass@1 on HumanEval, and still retains its performance on other programming languages. We take several important steps towards a safe open-access model release, including an improved PII redaction pipeline and a novel attribution tracing tool, and make the StarCoder models publicly available under a more commercially viable version of the Open Responsible AI Model license.
△ Less
Submitted 13 December, 2023; v1 submitted 9 May, 2023;
originally announced May 2023.
-
A Realistic Neutrino mixing scheme arising from $A_4$ symmetry
Authors:
Manash Dey,
Subhankar Roy
Abstract:
We propose a unique lepton mixing scheme and its association with an exact hierarchy-philic neutrino mass matrix texture in the light of Type-I+Type-II seesaw mechanism under the framework of $A_4 \times Z_{10}$ discrete flavour symmetry. The proposed model successfully predicts the normal ordering of neutrino masses and the two Majorana phases. Additionally, the analysis extends to the effective…
▽ More
We propose a unique lepton mixing scheme and its association with an exact hierarchy-philic neutrino mass matrix texture in the light of Type-I+Type-II seesaw mechanism under the framework of $A_4 \times Z_{10}$ discrete flavour symmetry. The proposed model successfully predicts the normal ordering of neutrino masses and the two Majorana phases. Additionally, the analysis extends to the effective Majorana neutrino mass, in the context of neutrinoless double beta\,($0νββ$)-decay.
△ Less
Submitted 23 July, 2024; v1 submitted 14 April, 2023;
originally announced April 2023.
-
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Authors:
Hugo Laurençon,
Lucile Saulnier,
Thomas Wang,
Christopher Akiki,
Albert Villanova del Moral,
Teven Le Scao,
Leandro Von Werra,
Chenghao Mou,
Eduardo González Ponferrada,
Huu Nguyen,
Jörg Frohberg,
Mario Šaško,
Quentin Lhoest,
Angelina McMillan-Major,
Gerard Dupont,
Stella Biderman,
Anna Rogers,
Loubna Ben allal,
Francesco De Toni,
Giada Pistilli,
Olivier Nguyen,
Somaieh Nikpoor,
Maraim Masoud,
Pierre Colombo,
Javier de la Rosa
, et al. (29 additional authors not shown)
Abstract:
As language models grow ever larger, the need for large-scale high-quality text datasets has never been more pressing, especially in multilingual settings. The BigScience workshop, a 1-year international and multidisciplinary initiative, was formed with the goal of researching and training large language models as a values-driven undertaking, putting issues of ethics, harm, and governance in the f…
▽ More
As language models grow ever larger, the need for large-scale high-quality text datasets has never been more pressing, especially in multilingual settings. The BigScience workshop, a 1-year international and multidisciplinary initiative, was formed with the goal of researching and training large language models as a values-driven undertaking, putting issues of ethics, harm, and governance in the foreground. This paper documents the data creation and curation efforts undertaken by BigScience to assemble the Responsible Open-science Open-collaboration Text Sources (ROOTS) corpus, a 1.6TB dataset spanning 59 languages that was used to train the 176-billion-parameter BigScience Large Open-science Open-access Multilingual (BLOOM) language model. We further release a large initial subset of the corpus and analyses thereof, and hope to empower large-scale monolingual and multilingual modeling projects with both the data and the processing tools, as well as stimulate research around this large multilingual corpus.
△ Less
Submitted 7 March, 2023;
originally announced March 2023.
-
SantaCoder: don't reach for the stars!
Authors:
Loubna Ben Allal,
Raymond Li,
Denis Kocetkov,
Chenghao Mou,
Christopher Akiki,
Carlos Munoz Ferrandis,
Niklas Muennighoff,
Mayank Mishra,
Alex Gu,
Manan Dey,
Logesh Kumar Umapathi,
Carolyn Jane Anderson,
Yangtian Zi,
Joel Lamy Poirier,
Hailey Schoelkopf,
Sergey Troshin,
Dmitry Abulkhanov,
Manuel Romero,
Michael Lappert,
Francesco De Toni,
Bernardo García del Río,
Qian Liu,
Shamik Bose,
Urvashi Bhattacharyya,
Terry Yue Zhuo
, et al. (16 additional authors not shown)
Abstract:
The BigCode project is an open-scientific collaboration working on the responsible development of large language models for code. This tech report describes the progress of the collaboration until December 2022, outlining the current state of the Personally Identifiable Information (PII) redaction pipeline, the experiments conducted to de-risk the model architecture, and the experiments investigat…
▽ More
The BigCode project is an open-scientific collaboration working on the responsible development of large language models for code. This tech report describes the progress of the collaboration until December 2022, outlining the current state of the Personally Identifiable Information (PII) redaction pipeline, the experiments conducted to de-risk the model architecture, and the experiments investigating better preprocessing methods for the training data. We train 1.1B parameter models on the Java, JavaScript, and Python subsets of The Stack and evaluate them on the MultiPL-E text-to-code benchmark. We find that more aggressive filtering of near-duplicates can further boost performance and, surprisingly, that selecting files from repositories with 5+ GitHub stars deteriorates performance significantly. Our best model outperforms previous open-source multilingual code generation models (InCoder-6.7B and CodeGen-Multi-2.7B) in both left-to-right generation and infilling on the Java, JavaScript, and Python portions of MultiPL-E, despite being a substantially smaller model. All models are released under an OpenRAIL license at https://hf.co/bigcode.
△ Less
Submitted 24 February, 2023; v1 submitted 9 January, 2023;
originally announced January 2023.
-
On Asymptotic Behaviors of Stepwise Multiple Testing Procedures
Authors:
Monitirtha Dey
Abstract:
Stepwise multiple testing procedures have attracted several statisticians for decades and are also quite popular with statistics users because of their technical simplicity. The Bonferroni procedure has been one of the earliest and most prominent testing rules for controlling the familywise error rate (FWER). A recent article established that the FWER for the Bonferroni method asymptotically (i.e.…
▽ More
Stepwise multiple testing procedures have attracted several statisticians for decades and are also quite popular with statistics users because of their technical simplicity. The Bonferroni procedure has been one of the earliest and most prominent testing rules for controlling the familywise error rate (FWER). A recent article established that the FWER for the Bonferroni method asymptotically (i.e., when the number of hypotheses becomes arbitrarily large) approaches zero under any positively equicorrelated multivariate normal framework. However, similar results for the limiting behaviors of FWER of general stepwise procedures are nonexistent. The present work addresses this gap in a unified manner by studying the limiting behaviors of the FWER of several stepwise testing rules for correlated normal setups. Specifically, we show that the limiting FWER approaches zero for any step-down rule (e.g., Holm's method) provided the infimum of the correlations is strictly positive. We also establish similar limiting zero results on FWER of other popular multiple testing rules, e.g., Hochberg's and Hommel's procedures. We then extend these results to any configuration of true and false null hypotheses. It turns out that, within our chosen asymptotic framework, the Benjamini-Hochberg method can hold the FWER at a strictly positive level asymptotically under the equicorrelated normality. We finally discuss the limiting powers of various procedures.
△ Less
Submitted 18 February, 2023; v1 submitted 16 December, 2022;
originally announced December 2022.
-
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
Authors:
BigScience Workshop,
:,
Teven Le Scao,
Angela Fan,
Christopher Akiki,
Ellie Pavlick,
Suzana Ilić,
Daniel Hesslow,
Roman Castagné,
Alexandra Sasha Luccioni,
François Yvon,
Matthias Gallé,
Jonathan Tow,
Alexander M. Rush,
Stella Biderman,
Albert Webson,
Pawan Sasanka Ammanamanchi,
Thomas Wang,
Benoît Sagot,
Niklas Muennighoff,
Albert Villanova del Moral,
Olatunji Ruwase,
Rachel Bawden,
Stas Bekman,
Angelina McMillan-Major
, et al. (369 additional authors not shown)
Abstract:
Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access…
▽ More
Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.
△ Less
Submitted 27 June, 2023; v1 submitted 9 November, 2022;
originally announced November 2022.
-
The $μ$-$τ$ mixed symmetry and neutrino mass matrix
Authors:
Manash Dey,
Pralay Chakraborty,
Subhankar Roy
Abstract:
We propose an elegant neutrino mass matrix texture entitled $μ$-$τ$ mixed symmetry highlighting two simple correlations among its elements and a detailed analysis is carried out to see its phenomenological implications. The proposed texture is motivated in the framework of Seesaw mechanism in association with $A_4$ symmetry
We propose an elegant neutrino mass matrix texture entitled $μ$-$τ$ mixed symmetry highlighting two simple correlations among its elements and a detailed analysis is carried out to see its phenomenological implications. The proposed texture is motivated in the framework of Seesaw mechanism in association with $A_4$ symmetry
△ Less
Submitted 16 April, 2024; v1 submitted 2 November, 2022;
originally announced November 2022.
-
Radiative neutron capture reaction rates for r-process nucleosynthesis
Authors:
Vinay Singh,
Joydev Lahiri,
Malay Kanti Dey,
D. N. Basu
Abstract:
About half of the elements beyond iron are synthesized in stars by rapid-neutron capture process (r-process). The stellar environment provides very high neutron flux in a short time ($\sim$ seconds) which is conducive for the creation of progressively neutron-rich nuclei till the waiting point is reached after which no further neutron capture reactions proceed. At this point such extremely neutron…
▽ More
About half of the elements beyond iron are synthesized in stars by rapid-neutron capture process (r-process). The stellar environment provides very high neutron flux in a short time ($\sim$ seconds) which is conducive for the creation of progressively neutron-rich nuclei till the waiting point is reached after which no further neutron capture reactions proceed. At this point such extremely neutron-rich nuclei become stable via $β^-$ decay. A detailed understanding of the r-process remains illusive. In the present work, we explore the radiative neutron-capture (n,$γ$) cross sections and reaction rates around the r-process peak near mass number eighty. The inherent uncertainties remain large in some cases, particularly in case of neutron-rich nuclei. When the low-energy enhancement exists, it results in significant increase in the reaction rate for neutron-capture.
△ Less
Submitted 30 June, 2022;
originally announced June 2022.
-
Demand Analysis with a Thin Price Sample
Authors:
Monitirtha Dey,
Arpan Kumar,
Diganta Mukherjee
Abstract:
For about 125 items of food, the Consumer Expenditure Survey (CES) schedule of the Indian National Sample Survey asks the interviewer to obtain both quantity and value of household consumption during the reference period from the respondent. This would appear to put a great burden on the respondent. But it is likely that the price usually paid is almost the same within each first stage unit (fsu).…
▽ More
For about 125 items of food, the Consumer Expenditure Survey (CES) schedule of the Indian National Sample Survey asks the interviewer to obtain both quantity and value of household consumption during the reference period from the respondent. This would appear to put a great burden on the respondent. But it is likely that the price usually paid is almost the same within each first stage unit (fsu). The present work proposes a new sampling scheme to estimate demand elasticities of essential food items. While the conventional sampling method used in practice (e.g. in NSS consumer expenditure survey) involves seeking price information from many households sampled from a fsu, the proposed procedure involves only one household chosen randomly from every fsu for price data collection and thus requires much less interview burden. Using unit records for vegetable items in the NSS's 2011-12 CES, our results show that in spite of requiring much less data, the new scheme captures the household food consumption behavior as precisely as before.
△ Less
Submitted 19 June, 2022;
originally announced June 2022.
-
How sensitive are translation systems to extra contexts? Mitigating gender bias in Neural Machine Translation models through relevant contexts
Authors:
Shanya Sharma,
Manan Dey,
Koustuv Sinha
Abstract:
Neural Machine Translation systems built on top of Transformer-based architectures are routinely improving the state-of-the-art in translation quality according to word-overlap metrics. However, a growing number of studies also highlight the inherent gender bias that these models incorporate during training, which reflects poorly in their translations. In this work, we investigate whether these mo…
▽ More
Neural Machine Translation systems built on top of Transformer-based architectures are routinely improving the state-of-the-art in translation quality according to word-overlap metrics. However, a growing number of studies also highlight the inherent gender bias that these models incorporate during training, which reflects poorly in their translations. In this work, we investigate whether these models can be instructed to fix their bias during inference using targeted, guided instructions as contexts. By translating relevant contextual sentences during inference along with the input, we observe large improvements in reducing the gender bias in translations, across three popular test suites (WinoMT, BUG, SimpleGen). We further propose a novel metric to assess several large pre-trained models (OPUS-MT, M2M-100) on their sensitivity towards using contexts during translation to correct their biases. Our approach requires no fine-tuning and thus can be used easily in production systems to de-bias translations from stereotypical gender-occupation bias 1. We hope our method, along with our metric, can be used to build better, bias-free translation systems.
△ Less
Submitted 17 October, 2022; v1 submitted 22 May, 2022;
originally announced May 2022.
-
Constrained Neutrino Mass Matrix and Majorana Phases
Authors:
Pralay Chakraborty,
Manash Dey,
Subhankar Roy
Abstract:
We endeavor to constrain the neutrino mass matrix on the phenomenological ground and procure model-independent textures by emphasizing on the simple linear relationships among the mass matrix elements. These simple textures predict the two Majorana phases. In this regard, two types of parametrization of neutrino mass matrix: general and exponential are employed. We obtain fifty-three predictive ne…
▽ More
We endeavor to constrain the neutrino mass matrix on the phenomenological ground and procure model-independent textures by emphasizing on the simple linear relationships among the mass matrix elements. These simple textures predict the two Majorana phases. In this regard, two types of parametrization of neutrino mass matrix: general and exponential are employed. We obtain fifty-three predictive neutrino mass matrix textures, out of which twenty-eight are associated with the general parametrization, and the rest belong to the exponential one. Apart from Type-A/P textures, the rest deal with the prediction of a few other oscillation parameters as well. We try to realize the proposed textures in the light of $A_4$, $Δ\,(27)$ and $T_7$ symmetry groups.
△ Less
Submitted 7 December, 2023; v1 submitted 10 February, 2022;
originally announced February 2022.
-
PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts
Authors:
Stephen H. Bach,
Victor Sanh,
Zheng-Xin Yong,
Albert Webson,
Colin Raffel,
Nihal V. Nayak,
Abheesht Sharma,
Taewoon Kim,
M Saiful Bari,
Thibault Fevry,
Zaid Alyafeai,
Manan Dey,
Andrea Santilli,
Zhiqing Sun,
Srulik Ben-David,
Canwen Xu,
Gunjan Chhablani,
Han Wang,
Jason Alan Fries,
Maged S. Al-shaibani,
Shanya Sharma,
Urmish Thakker,
Khalid Almubarak,
Xiangru Tang,
Dragomir Radev
, et al. (2 additional authors not shown)
Abstract:
PromptSource is a system for creating, sharing, and using natural language prompts. Prompts are functions that map an example from a dataset to a natural language input and target output. Using prompts to train and query language models is an emerging area in NLP that requires new tools that let users develop and refine these prompts collaboratively. PromptSource addresses the emergent challenges…
▽ More
PromptSource is a system for creating, sharing, and using natural language prompts. Prompts are functions that map an example from a dataset to a natural language input and target output. Using prompts to train and query language models is an emerging area in NLP that requires new tools that let users develop and refine these prompts collaboratively. PromptSource addresses the emergent challenges in this new setting with (1) a templating language for defining data-linked prompts, (2) an interface that lets users quickly iterate on prompt development by observing outputs of their prompts on many examples, and (3) a community-driven set of guidelines for contributing new prompts to a common pool. Over 2,000 prompts for roughly 170 datasets are already available in PromptSource. PromptSource is available at https://github.com/bigscience-workshop/promptsource.
△ Less
Submitted 29 March, 2022; v1 submitted 2 February, 2022;
originally announced February 2022.
-
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
Authors:
Sabrina J. Mielke,
Zaid Alyafeai,
Elizabeth Salesky,
Colin Raffel,
Manan Dey,
Matthias Gallé,
Arun Raja,
Chenglei Si,
Wilson Y. Lee,
Benoît Sagot,
Samson Tan
Abstract:
What are the units of text that we want to model? From bytes to multi-word expressions, text can be analyzed and generated at many granularities. Until recently, most natural language processing (NLP) models operated over words, treating those as discrete and atomic tokens, but starting with byte-pair encoding (BPE), subword-based approaches have become dominant in many areas, enabling small vocab…
▽ More
What are the units of text that we want to model? From bytes to multi-word expressions, text can be analyzed and generated at many granularities. Until recently, most natural language processing (NLP) models operated over words, treating those as discrete and atomic tokens, but starting with byte-pair encoding (BPE), subword-based approaches have become dominant in many areas, enabling small vocabularies while still allowing for fast inference. Is the end of the road character-level model or byte-level processing? In this survey, we connect several lines of work from the pre-neural and neural era, by showing how hybrid approaches of words and characters as well as subword-based approaches based on learned segmentation have been proposed and evaluated. We conclude that there is and likely will never be a silver bullet singular solution for all applications and that thinking seriously about tokenization remains important for many applications.
△ Less
Submitted 20 December, 2021;
originally announced December 2021.
-
Multitask Prompted Training Enables Zero-Shot Task Generalization
Authors:
Victor Sanh,
Albert Webson,
Colin Raffel,
Stephen H. Bach,
Lintang Sutawika,
Zaid Alyafeai,
Antoine Chaffin,
Arnaud Stiegler,
Teven Le Scao,
Arun Raja,
Manan Dey,
M Saiful Bari,
Canwen Xu,
Urmish Thakker,
Shanya Sharma Sharma,
Eliza Szczechla,
Taewoon Kim,
Gunjan Chhablani,
Nihal Nayak,
Debajyoti Datta,
Jonathan Chang,
Mike Tian-Jian Jiang,
Han Wang,
Matteo Manica,
Sheng Shen
, et al. (16 additional authors not shown)
Abstract:
Large language models have recently been shown to attain reasonable zero-shot generalization on a diverse set of tasks (Brown et al., 2020). It has been hypothesized that this is a consequence of implicit multitask learning in language models' pretraining (Radford et al., 2019). Can zero-shot generalization instead be directly induced by explicit multitask learning? To test this question at scale,…
▽ More
Large language models have recently been shown to attain reasonable zero-shot generalization on a diverse set of tasks (Brown et al., 2020). It has been hypothesized that this is a consequence of implicit multitask learning in language models' pretraining (Radford et al., 2019). Can zero-shot generalization instead be directly induced by explicit multitask learning? To test this question at scale, we develop a system for easily mapping any natural language tasks into a human-readable prompted form. We convert a large set of supervised datasets, each with multiple prompts with diverse wording. These prompted datasets allow for benchmarking the ability of a model to perform completely held-out tasks. We fine-tune a pretrained encoder-decoder model (Raffel et al., 2020; Lester et al., 2021) on this multitask mixture covering a wide variety of tasks. The model attains strong zero-shot performance on several standard datasets, often outperforming models up to 16x its size. Further, our approach attains strong performance on a subset of tasks from the BIG-bench benchmark, outperforming models up to 6x its size. All trained models are available at https://github.com/bigscience-workshop/t-zero and all prompts are available at https://github.com/bigscience-workshop/promptsource.
△ Less
Submitted 17 March, 2022; v1 submitted 15 October, 2021;
originally announced October 2021.
-
FWER Goes to Zero for Correlated Normal
Authors:
Monitirtha Dey,
Subir Kumar Bhandari
Abstract:
Familywise error rate (FWER) has been a cornerstone in simultaneous inference for decades, and the classical Bonferroni method has been one of the most prominent frequentist approaches for controlling FWER. The present article studies the limiting behavior of Bonferroni FWER in a multiple testing problem as the number of hypotheses grows to infinity. We establish that in the equicorrelated normal…
▽ More
Familywise error rate (FWER) has been a cornerstone in simultaneous inference for decades, and the classical Bonferroni method has been one of the most prominent frequentist approaches for controlling FWER. The present article studies the limiting behavior of Bonferroni FWER in a multiple testing problem as the number of hypotheses grows to infinity. We establish that in the equicorrelated normal setup with positive equicorrelation, Bonferroni FWER tends to zero asymptotically. We extend this result for generalized familywise error rates and to arbitrarily correlated setups.
△ Less
Submitted 6 December, 2021; v1 submitted 11 October, 2021;
originally announced October 2021.
-
Behaviour of FWER in Normal Distributions
Authors:
Monitirtha Dey
Abstract:
Familywise error rate (FWER) has been a cornerstone in simultaneous inference for decades, and the classical Bonferroni method has been one of the most prominent frequentist approaches for controlling FWER. The present article studies the behavior of the FWER for Bonferroni procedure in a multiple testing problem. We establish upper bounds on FWER for Bonferroni method under the equicorrelated and…
▽ More
Familywise error rate (FWER) has been a cornerstone in simultaneous inference for decades, and the classical Bonferroni method has been one of the most prominent frequentist approaches for controlling FWER. The present article studies the behavior of the FWER for Bonferroni procedure in a multiple testing problem. We establish upper bounds on FWER for Bonferroni method under the equicorrelated and general normal setups in nonasymptotic case.
△ Less
Submitted 26 January, 2022; v1 submitted 30 June, 2021;
originally announced July 2021.
-
Evaluating Gender Bias in Natural Language Inference
Authors:
Shanya Sharma,
Manan Dey,
Koustuv Sinha
Abstract:
Gender-bias stereotypes have recently raised significant ethical concerns in natural language processing. However, progress in detection and evaluation of gender bias in natural language understanding through inference is limited and requires further investigation. In this work, we propose an evaluation methodology to measure these biases by constructing a challenge task that involves pairing gend…
▽ More
Gender-bias stereotypes have recently raised significant ethical concerns in natural language processing. However, progress in detection and evaluation of gender bias in natural language understanding through inference is limited and requires further investigation. In this work, we propose an evaluation methodology to measure these biases by constructing a challenge task that involves pairing gender-neutral premises against a gender-specific hypothesis. We use our challenge task to investigate state-of-the-art NLI models on the presence of gender stereotypes using occupations. Our findings suggest that three models (BERT, RoBERTa, BART) trained on MNLI and SNLI datasets are significantly prone to gender-induced prediction errors. We also find that debiasing techniques such as augmenting the training dataset to ensure a gender-balanced dataset can help reduce such bias in certain cases.
△ Less
Submitted 12 May, 2021;
originally announced May 2021.
-
Assessing Viewer's Mental Health by Detecting Depression in YouTube Videos
Authors:
Shanya Sharma,
Manan Dey
Abstract:
Depression is one of the most prevalent mental health issues around the world, proving to be one of the leading causes of suicide and placing large economic burdens on families and society. In this paper, we develop and test the efficacy of machine learning techniques applied to the content of YouTube videos captured through their transcripts and determine if the videos are depressive or have a de…
▽ More
Depression is one of the most prevalent mental health issues around the world, proving to be one of the leading causes of suicide and placing large economic burdens on families and society. In this paper, we develop and test the efficacy of machine learning techniques applied to the content of YouTube videos captured through their transcripts and determine if the videos are depressive or have a depressing trigger. Our model can detect depressive videos with an accuracy of 83%. We also introduce a real-life evaluation technique to validate our classification based on the comments posted on a video by calculating the CES-D scores of the comments. This work conforms greatly with the UN Sustainable Goal of ensuring Good Health and Well Being with major conformity with section UN SDG 3.4.
△ Less
Submitted 29 July, 2020;
originally announced August 2020.
-
Correlation between two non-linear events in a complex dusty plasma system
Authors:
Rinku Mishra,
S. Adhikari,
Rupak Mukherjee,
M. Dey
Abstract:
A phenomenological model using fluid theory is developed to show that the decay of two seemingly independent nonlinear structures namely the dust void and dust soliton strongly depends on the plasma ionization parameter in an unmagnetized complex plasma system. Numerical solution of model equations has shown that the evolution of dust voids and their subsequent decay in a time frame is intimately…
▽ More
A phenomenological model using fluid theory is developed to show that the decay of two seemingly independent nonlinear structures namely the dust void and dust soliton strongly depends on the plasma ionization parameter in an unmagnetized complex plasma system. Numerical solution of model equations has shown that the evolution of dust voids and their subsequent decay in a time frame is intimately related with the ionization parameter. A similar result also holds good in case of dust solitons where stability of soliton is found to depend critically upon ionization parameter. Most importantly, it is observed that time of the collapse of a dust soliton precedes the onset time of a dust void decay and therefore soliton decay acts as a precursor for void decay to occur in a given dusty plasma system.
△ Less
Submitted 15 October, 2018; v1 submitted 7 August, 2018;
originally announced August 2018.
-
Challenges in QCD matter physics - The Compressed Baryonic Matter experiment at FAIR
Authors:
CBM Collaboration,
T. Ablyazimov,
A. Abuhoza,
R. P. Adak,
M. Adamczyk,
K. Agarwal,
M. M. Aggarwal,
Z. Ahammed,
F. Ahmad,
N. Ahmad,
S. Ahmad,
A. Akindinov,
P. Akishin,
E. Akishina,
T. Akishina,
V. Akishina,
A. Akram,
M. Al-Turany,
I. Alekseev,
E. Alexandrov,
I. Alexandrov,
S. Amar-Youcef,
M. Anđelić,
O. Andreeva,
C. Andrei
, et al. (563 additional authors not shown)
Abstract:
Substantial experimental and theoretical efforts worldwide are devoted to explore the phase diagram of strongly interacting matter. At LHC and top RHIC energies, QCD matter is studied at very high temperatures and nearly vanishing net-baryon densities. There is evidence that a Quark-Gluon-Plasma (QGP) was created at experiments at RHIC and LHC. The transition from the QGP back to the hadron gas is…
▽ More
Substantial experimental and theoretical efforts worldwide are devoted to explore the phase diagram of strongly interacting matter. At LHC and top RHIC energies, QCD matter is studied at very high temperatures and nearly vanishing net-baryon densities. There is evidence that a Quark-Gluon-Plasma (QGP) was created at experiments at RHIC and LHC. The transition from the QGP back to the hadron gas is found to be a smooth cross over. For larger net-baryon densities and lower temperatures, it is expected that the QCD phase diagram exhibits a rich structure, such as a first-order phase transition between hadronic and partonic matter which terminates in a critical point, or exotic phases like quarkyonic matter. The discovery of these landmarks would be a breakthrough in our understanding of the strong interaction and is therefore in the focus of various high-energy heavy-ion research programs. The Compressed Baryonic Matter (CBM) experiment at FAIR will play a unique role in the exploration of the QCD phase diagram in the region of high net-baryon densities, because it is designed to run at unprecedented interaction rates. High-rate operation is the key prerequisite for high-precision measurements of multi-differential observables and of rare diagnostic probes which are sensitive to the dense phase of the nuclear fireball. The goal of the CBM experiment at SIS100 (sqrt(s_NN) = 2.7 - 4.9 GeV) is to discover fundamental properties of QCD matter: the phase structure at large baryon-chemical potentials (mu_B > 500 MeV), effects of chiral symmetry, and the equation-of-state at high density as it is expected to occur in the core of neutron stars. In this article, we review the motivation for and the physics programme of CBM, including activities before the start of data taking in 2022, in the context of the worldwide efforts to explore high-density QCD matter.
△ Less
Submitted 29 March, 2017; v1 submitted 6 July, 2016;
originally announced July 2016.
-
A Strange Star Scenario for the Formation of Eccentric Millisecond Pulsar/Helium White Dwarf Binaries
Authors:
Long Jiang,
Xiang-Dong Li,
Jishnu Dey,
Mira Dey
Abstract:
According to the recycling scenario, millisecond pulsars (MSPs) have evolved from low-mass X-ray binaries (LMXBs). Their orbits are expected to be circular due to tidal interactions during the binary evolution, as observed in most of the binary MSPs. There are some peculiar systems that do not fit this picture. Three recent examples are PSRs J2234$+$06, J1946$+$3417 and J1950$+$2414, all of which…
▽ More
According to the recycling scenario, millisecond pulsars (MSPs) have evolved from low-mass X-ray binaries (LMXBs). Their orbits are expected to be circular due to tidal interactions during the binary evolution, as observed in most of the binary MSPs. There are some peculiar systems that do not fit this picture. Three recent examples are PSRs J2234$+$06, J1946$+$3417 and J1950$+$2414, all of which are MSPs in eccentric orbits but with mass functions compatible with expected He white dwarf companions. It has been suggested these MSPs may have formed from delayed accretion-induced collapse of massive white dwarfs, or the eccentricity may be induced by dynamical interaction between the binary and a circumbinary disk. Assuming that the core density of accreting neutron stars in LMXBs may reach the density of quark deconfinement, which can lead to phase transition from neutron stars to strange quark stars, we show that the resultant MSPs are likely to have an eccentric orbit, due to the sudden loss of the gravitational mass of the neutron star during the transition. The eccentricities can be reproduced with a reasonable estimate of the mass loss. This scenario might also account for the formation of the youngest known X-ray binary Cir X$-$1, which also possesses a low-field compact star in an eccentric orbit.
△ Less
Submitted 18 May, 2015;
originally announced May 2015.
-
Persistent charge and spin currents in a quantum ring using Green's function technique: Interplay between magnetic flux and spin-orbit interactions
Authors:
Santanu K. Maiti,
Moumita Dey,
S. N. Karmakar
Abstract:
We put forward a new approach based on Green's function formalism to evaluate precisely persistent charge and spin currents in an Aharonov-Bohm ring subjected to Rashba and Dresselhaus spin-orbit interactions. Unlike conventional methods our present scheme circumvents direct evaluation of eigenvalues and eigenstates of the system Hamiltonian to determine persistent currents which essentially reduc…
▽ More
We put forward a new approach based on Green's function formalism to evaluate precisely persistent charge and spin currents in an Aharonov-Bohm ring subjected to Rashba and Dresselhaus spin-orbit interactions. Unlike conventional methods our present scheme circumvents direct evaluation of eigenvalues and eigenstates of the system Hamiltonian to determine persistent currents which essentially reduces possible numerical errors, especially for larger rings. The interplay of Aharonov-Bohm flux and spin-orbit interactions in persistent charge and spin currents of quantum rings is analyzed in detail and our results lead to a possibility of estimating the strength of any one of the spin-orbit fields provided the other one is known. All these features are exactly invariant even in presence of impurities, and therefore, can be substantiated experimentally.
△ Less
Submitted 23 July, 2014; v1 submitted 1 January, 2014;
originally announced January 2014.
-
Spin-orbit interaction induced spin selective transmission through a multi-terminal mesoscopic ring
Authors:
Moumita Dey,
Santanu K. Maiti,
Sreekantha Sil,
S. N. Karmakar
Abstract:
Spin dependent transport in a multi-terminal mesoscopic ring is investigated in presence of Rashba and Dresselhaus spin-orbit interactions. Within a tight-binding framework we use a general spin density matrix formalism to evaluate all three components ($P_x$, $P_y$ and $P_z$) of the polarization vector associated with the charge current through the outgoing leads. It explores the dynamics of the…
▽ More
Spin dependent transport in a multi-terminal mesoscopic ring is investigated in presence of Rashba and Dresselhaus spin-orbit interactions. Within a tight-binding framework we use a general spin density matrix formalism to evaluate all three components ($P_x$, $P_y$ and $P_z$) of the polarization vector associated with the charge current through the outgoing leads. It explores the dynamics of the spin polarization vector of current propagating through the system subjected to the Rashba and/or the Dresselhaus spin-orbit couplings. The sensitivity of the polarization components on the electrode-ring interface geometry is discussed in detail. Our present analysis provides an understanding of the coupled spin and electron transport in mesoscopic bridge systems.
△ Less
Submitted 10 September, 2013; v1 submitted 17 July, 2013;
originally announced July 2013.
-
Strange star equation of state fits the refined mass measurement of 12 pulsars and predicts their radii
Authors:
Taparati Gangopadhyay,
Subharthi Ray,
Xiang-Dong Li,
Jishnu Dey,
Mira Dey
Abstract:
There are three categories of stars whose masses have been found accurately in recent times: (1) two for which Shapiro delay is used which is possible due to GR light bending as the partner is heavy : PSR J1614-2230 and PSR J1903+0327 (2) six eclipsing stars for which numerical Roche Lobe geometry is used and (3) 3 stars for which spectroscopic methods are used and in fact for these three the mass…
▽ More
There are three categories of stars whose masses have been found accurately in recent times: (1) two for which Shapiro delay is used which is possible due to GR light bending as the partner is heavy : PSR J1614-2230 and PSR J1903+0327 (2) six eclipsing stars for which numerical Roche Lobe geometry is used and (3) 3 stars for which spectroscopic methods are used and in fact for these three the mass and radii both are estimated. Motivated by large color (N_c) expansion using a modified Richardson potential, along with density dependent quark masses thereby allowing chiral symmetry restoration, we get compact strange stars fitting all the observed masses.
△ Less
Submitted 8 March, 2013;
originally announced March 2013.
-
Magnetic-non-magnetic superlattice chain with external electric field: Spin transport and the selective switching effect
Authors:
Moumita Dey,
Santanu K. Maiti,
S. N. Karmakar
Abstract:
Based on Green's function formalism, the existence of multiple mobility edges in a one-dimensional magnetic-non-magnetic superlattice geometry in presence of external electric field is predicted, and, it leads to the possibility of getting a metal-insulator transition at multiple values of Fermi energy. The role of electric field on electron localization is discussed for different arrangements of…
▽ More
Based on Green's function formalism, the existence of multiple mobility edges in a one-dimensional magnetic-non-magnetic superlattice geometry in presence of external electric field is predicted, and, it leads to the possibility of getting a metal-insulator transition at multiple values of Fermi energy. The role of electric field on electron localization is discussed for different arrangements of magnetic and non-magnetic atomic sites in the chain. We also analyze that the model quantum system can be used as a perfect spin filter for a wide range of energy.
△ Less
Submitted 5 August, 2013; v1 submitted 6 March, 2012;
originally announced March 2012.
-
Spin Hall effect in a Kagome lattice driven by Rashba spin-orbit interaction
Authors:
Moumita Dey,
Santanu K. Maiti,
S. N. Karmakar
Abstract:
Using four-terminal Landauer-Büttiker formalism and Green's function technique, in this present paper, we calculate numerically spin Hall conductance (SHC) and longitudinal conductance of a finite size kagome lattice with Rashba spin-orbit (SO) interaction both in presence and absence of external magnetic flux in clean limit. In the absence of magnetic flux, we observe that depending on the Fermi…
▽ More
Using four-terminal Landauer-Büttiker formalism and Green's function technique, in this present paper, we calculate numerically spin Hall conductance (SHC) and longitudinal conductance of a finite size kagome lattice with Rashba spin-orbit (SO) interaction both in presence and absence of external magnetic flux in clean limit. In the absence of magnetic flux, we observe that depending on the Fermi surface topology of the system SHC changes its sign at different values of Fermi energy, along with the band center. Unlike the infinite system (where SHC is a universal constant $\pm \frac{e}{8 π}$), here SHC depends on the external parameters like SO coupling strength, Fermi energy, etc. We show that in the presence of any arbitrary magnetic flux, periodicity of the system is lost and the features of SHC tends to get reduced because of elastic scattering. But again at some typical values of flux ($φ=1/2, 1/4, 3/4..., etc.) the system retains its periodicity depending on its size and the features of spin Hall effect (SHE) reappears. Our predicted results may be useful in providing a deeper insight into the experimental realization of SHE in such geometries.
△ Less
Submitted 19 June, 2012; v1 submitted 22 January, 2012;
originally announced January 2012.
-
Integer quantum Hall effect in a square lattice revisited
Authors:
Santanu K. Maiti,
Moumita Dey,
S. N. Karmakar
Abstract:
We investigate the phenomenon of integer quantum Hall effect in a square lattice, subjected to a perpendicular magnetic field, through Landauer-Büttiker formalism within the tight-binding framework. The oscillating nature of longitudinal resistance and near complete suppression of momentum relaxation processes are examined by studying the flow of charge current using Landauer-Keldysh prescription.…
▽ More
We investigate the phenomenon of integer quantum Hall effect in a square lattice, subjected to a perpendicular magnetic field, through Landauer-Büttiker formalism within the tight-binding framework. The oscillating nature of longitudinal resistance and near complete suppression of momentum relaxation processes are examined by studying the flow of charge current using Landauer-Keldysh prescription. Our analysis for the lattice model corroborates the finding obtained in the continuum model and provides a simple physical understanding.
△ Less
Submitted 5 March, 2012; v1 submitted 17 August, 2011;
originally announced August 2011.
-
Magnetic field induced metal-insulator transition in a Kagome Nanoribbon
Authors:
Moumita Dey,
Santanu K. Maiti,
S. N. Karmakar
Abstract:
In the present work we investigate two-terminal electron transport through a finite width kagome lattice nanoribbon in presence of a perpendicular magnetic field. We employ a simple tight-binding (T-B) Hamiltonian to describe the system and obtain the transmission properties by using Green's function technique within the framework of Landauer-Büttiker formalism. After presenting an analytical desc…
▽ More
In the present work we investigate two-terminal electron transport through a finite width kagome lattice nanoribbon in presence of a perpendicular magnetic field. We employ a simple tight-binding (T-B) Hamiltonian to describe the system and obtain the transmission properties by using Green's function technique within the framework of Landauer-Büttiker formalism. After presenting an analytical description of energy dispersion relation of a kagome nanoribbon in presence of the magnetic field, we investigate numerically the transmittance spectra together with the density of states and current-voltage characteristics. It is shown that for a specific value of the Fermi energy the kagome network can exhibit a magnetic field induced metal-insulator transition which is the central investigation of this communication. Our analysis may be inspiring in designing low-dimensional switching devices.
△ Less
Submitted 28 June, 2011; v1 submitted 27 June, 2011;
originally announced June 2011.
-
Magneto-transport in a mesoscopic ring with Rashba and Dresselhaus spin-orbit interactions
Authors:
Santanu K. Maiti,
Moumita Dey,
Shreekantha Sil,
Arunava Chakrabarti,
S. N. Karmakar
Abstract:
Electronic transport in a one-dimensional mesoscopic ring threaded by a magnetic flux is studied in presence of Rashba and Dresselhaus spin-orbit interactions. A completely analytical technique within a tight-binding formalism unveils the spin-split bands in presence of the spin-orbit interactions and leads to a method of determining the strength of the Dresselhaus interaction. In addition to this…
▽ More
Electronic transport in a one-dimensional mesoscopic ring threaded by a magnetic flux is studied in presence of Rashba and Dresselhaus spin-orbit interactions. A completely analytical technique within a tight-binding formalism unveils the spin-split bands in presence of the spin-orbit interactions and leads to a method of determining the strength of the Dresselhaus interaction. In addition to this, the persistent currents for ordered and disordered rings have been investigated numerically. It is observed that, the presence of the spin-orbit interaction, in general, leads to an enhanced amplitude of the persistent current. Numerical results corroborate the respective analytical findings.
△ Less
Submitted 23 July, 2011; v1 submitted 2 March, 2011;
originally announced March 2011.
-
KHz QPOs in LMXBs, relations between different frequencies and compactness of stars
Authors:
Taparati Gangopadhyay,
Xiang-Dong Li,
Subharthi Ray,
Mira Dey,
Jishnu Dey
Abstract:
We suggest that the mass of four compact stars SAX J1808.4$-$3658, KS 1731$-$260, SAX J1750.8$-$2900 and IGR J17191$-$2821 can be determined from the difference in the observed kiloHertz quasi periodic oscillations (kHz QPO-s) of these stars. The stellar radius is very close to the marginally stable orbit $R_{ms}$ as predicted by Einstein's general relativity. It may be noted that the first of the…
▽ More
We suggest that the mass of four compact stars SAX J1808.4$-$3658, KS 1731$-$260, SAX J1750.8$-$2900 and IGR J17191$-$2821 can be determined from the difference in the observed kiloHertz quasi periodic oscillations (kHz QPO-s) of these stars. The stellar radius is very close to the marginally stable orbit $R_{ms}$ as predicted by Einstein's general relativity. It may be noted that the first of these stars was suggested to be a strange star more than a decade back by Li \emph{et al.} (1999) from the unique millisecond X-ray pulsations with an accurate determination of its rotation period. It showed kHz QPO-s eight years back and so far it is the only set that has been observed. This is the first time we give an estimate of the mass of the star and of three other compact stars in Low-Mass X-ray Binaries using their observed kHz QPO-s.
△ Less
Submitted 10 February, 2011;
originally announced February 2011.
-
Effect of Dephasing on Electron Transport in a Molecular Wire: Green's Function Approach
Authors:
Moumita Dey,
Santanu K. Maiti,
S. N. karmakar
Abstract:
The effect of dephasing on electron transport through a benzene molecule is carefully examined using a phenomenological model introduced by Büttiker. Within a tight-binding framework all the calculations are performed based on the Green's function formalism. We investigate the influence of dephasing on transmission probability and current-voltage characteristics for three different configurations…
▽ More
The effect of dephasing on electron transport through a benzene molecule is carefully examined using a phenomenological model introduced by Büttiker. Within a tight-binding framework all the calculations are performed based on the Green's function formalism. We investigate the influence of dephasing on transmission probability and current-voltage characteristics for three different configurations ({\em ortho}, {\em meta} and {\em para}) of the molecular system depending on the locations of two contacting leads. The presence of dephasing provides a significant change in the spectral properties of the molecule and exhibits several interesting patterns that have so far remain unexplored.
△ Less
Submitted 5 March, 2011; v1 submitted 9 November, 2010;
originally announced November 2010.
-
Spin transport through a quantum network: Effects of Rashba spin orbit interaction and Aharonov-Bohm flux
Authors:
Moumita Dey,
Santanu K. Maiti,
S. N. Karmakar
Abstract:
We address spin dependent transport through an array of diamonds in the presence of Rashba spin-orbit (SO) interaction where each diamond plaquette is penetrated by an Aharonov-Bohm (AB) flux $φ$. The diamond chain is attached symmetrically to two semi-infinite one-dimensional non-magnetic metallic leads. We adopt a single particle tight-binding Hamiltonian to describe the system and study spin tr…
▽ More
We address spin dependent transport through an array of diamonds in the presence of Rashba spin-orbit (SO) interaction where each diamond plaquette is penetrated by an Aharonov-Bohm (AB) flux $φ$. The diamond chain is attached symmetrically to two semi-infinite one-dimensional non-magnetic metallic leads. We adopt a single particle tight-binding Hamiltonian to describe the system and study spin transport using Green's function formalism. After presenting an analytical method for the energy dispersion relation of an infinite diamond chain in the presence of Rashba SO interaction, we study numerically the conductance-energy characteristics together with the density of states of a finite sized diamond network. At the typical flux $φ=φ_0/2$, a delocalizing effect is observed in the presence of Rashba SO interaction, and, depending on the specific choices of SO interaction strength and AB flux the quantum network can be used as a spin filter. Our analysis may be inspiring in designing spintronic devices.
△ Less
Submitted 21 September, 2010; v1 submitted 6 July, 2010;
originally announced July 2010.
-
Topological effect on spin transport in a magnetic quantum wire: Green's function approach
Authors:
Moumita Dey,
Santanu K. Maiti,
S. N. Karmakar
Abstract:
We explore spin dependent transport through a magnetic quantum wire which is attached to two non-magnetic metallic electrodes. We adopt a simple tight-binding Hamiltonian to describe the model where the quantum wire is attached to two semi-infinite one-dimensional non-magnetic electrodes. Based on single particle Green's function formalism all the calculations are performed numerically which descr…
▽ More
We explore spin dependent transport through a magnetic quantum wire which is attached to two non-magnetic metallic electrodes. We adopt a simple tight-binding Hamiltonian to describe the model where the quantum wire is attached to two semi-infinite one-dimensional non-magnetic electrodes. Based on single particle Green's function formalism all the calculations are performed numerically which describe two-terminal conductance and current-voltage characteristics through the wire. Quite interestingly we see that, beyond a critical system size probability of spin flipping enhances significantly that can be used to design a spin flip device. Our numerical study may be helpful in fabricating mesoscopic or nano-scale spin devices.
△ Less
Submitted 1 April, 2010; v1 submitted 11 February, 2010;
originally announced February 2010.
-
Magnetic Quantum Wire as a Spin Filter: An Exact Study
Authors:
Moumita Dey,
Santanu K. Maiti,
S. N. Karmakar
Abstract:
We propose that a magnetic quantum wire composed of magnetic and non-magnetic atomic sites can be used as a spin filter for a wide range of applied bias voltage. We adopt a simple tight-binding Hamiltonian to describe the model where the quantum wire is attached to two semi-infinite one-dimensional non-magnetic electrodes. Based on single particle Green's function formalism all the calculations…
▽ More
We propose that a magnetic quantum wire composed of magnetic and non-magnetic atomic sites can be used as a spin filter for a wide range of applied bias voltage. We adopt a simple tight-binding Hamiltonian to describe the model where the quantum wire is attached to two semi-infinite one-dimensional non-magnetic electrodes. Based on single particle Green's function formalism all the calculations are performed numerically which describe two-terminal conductance and current through the wire. Our exact results may be helpful in fabricating mesoscopic or nano-scale spin filter.
△ Less
Submitted 29 December, 2009; v1 submitted 22 December, 2009;
originally announced December 2009.
-
Logical XOR gate response in a quantum interferometer: A spin dependent transport
Authors:
Moumita Dey,
Santanu K. Maiti,
S. N. Karmakar
Abstract:
We examine spin dependent transport in a quantum interferometer composed of magnetic atomic sites based on transfer matrix formalism. The interferometer, threaded by a magnetic flux $φ$, is symmetrically attached to two semi-infinite one-dimensional (1D) non-magnetic electrodes, namely, source and drain. A simple tight-binding model is used to describe the bridge system, and, here we address numer…
▽ More
We examine spin dependent transport in a quantum interferometer composed of magnetic atomic sites based on transfer matrix formalism. The interferometer, threaded by a magnetic flux $φ$, is symmetrically attached to two semi-infinite one-dimensional (1D) non-magnetic electrodes, namely, source and drain. A simple tight-binding model is used to describe the bridge system, and, here we address numerically the conductance-energy and current-voltage characteristics as functions of the interferometer-to-electrode coupling strength, magnetic flux and the orientation of local the magnetic moments associated with each atomic site. Quite interestingly it is observed that, for $φ=φ_0/2$ ($φ_0=ch/e$, the elementary flux-quantum) a logical XOR gate like response is observed, depending on the orientation of the local magnetic moments associated with the magnetic atoms in the upper and lower arms of the interferometer, and it can be changed by an externally applied gate magnetic field. This aspect may be utilized in designing a spin based electronic logic gate.
△ Less
Submitted 27 January, 2011; v1 submitted 12 November, 2009;
originally announced November 2009.
-
Selective spin transport through a quantum heterostructure: Transfer matrix method
Authors:
Moumita Dey,
Santanu K. Maiti
Abstract:
In the present work we propose that a one-dimensional quantum heterostructure composed of magnetic and non-magnetic atomic sites can be utilized as a spin filter for a wide range of applied bias voltage. A simple tight-binding framework is given to describe the conducting junction where the heterostructure is coupled to two semi-infinite one-dimensional non-magnetic electrodes. Based on transfer m…
▽ More
In the present work we propose that a one-dimensional quantum heterostructure composed of magnetic and non-magnetic atomic sites can be utilized as a spin filter for a wide range of applied bias voltage. A simple tight-binding framework is given to describe the conducting junction where the heterostructure is coupled to two semi-infinite one-dimensional non-magnetic electrodes. Based on transfer matrix method all the calculations are performed numerically which describe two-terminal spin dependent transmission probability along with junction current through the wire. Our detailed analysis may provide fundamental aspects of selective spin transport phenomena in one-dimensional heterostructures at nano-scale level.
△ Less
Submitted 18 May, 2016; v1 submitted 20 August, 2009;
originally announced August 2009.
-
The micro-glitch in PSR B1821-24 : A case for a strange pulsar?
Authors:
Raka Dona Ray Mandal,
Sushan Konar,
Mira Dey,
Jishnu Dey
Abstract:
The single glitch observed in PSR B1821-24, a millisecond pulsar in M28, is unusual on two counts. First, the magnitude of this glitch is at least an order of magnitude smaller ($Δν/ ν\sim 10^{-11}$) than the smallest glitch observed to date. Secondly, all other glitching pulsars have strong magnetic fields with $B \gsim 10^{11} G$ and are young, whereas PSR B1821-24 is an old recycled pulsar wi…
▽ More
The single glitch observed in PSR B1821-24, a millisecond pulsar in M28, is unusual on two counts. First, the magnitude of this glitch is at least an order of magnitude smaller ($Δν/ ν\sim 10^{-11}$) than the smallest glitch observed to date. Secondly, all other glitching pulsars have strong magnetic fields with $B \gsim 10^{11} G$ and are young, whereas PSR B1821-24 is an old recycled pulsar with a field strength of $2.25\times10^9 G$. We have suggested earlier that some of the recycled pulsars could actually be strange quark stars. In this work we argue that the crustal properties of such a {\em strange} pulsar are just right to give rise to a glitch of this magnitude, explaining the scarcity of larger glitches in millisecond pulsars.
△ Less
Submitted 26 June, 2009; v1 submitted 29 April, 2009;
originally announced April 2009.
-
Strange Stars : An interesting member of the compact object family
Authors:
Manjari Bagchi,
Subharthi Ray,
Jishnu Dey,
Mira Dey
Abstract:
We have studied strange star properties both at zero temperature and at finite temperatures and searched signatures of strange stars in gamma-ray, x-ray and radio astronomy. We have a set of Equations of State (EoS) for strange quark matter (SQM) and solving the TOV equations, we get the structure of strange stars. The maximum mass for a strange star decreases with the increase of temperature, b…
▽ More
We have studied strange star properties both at zero temperature and at finite temperatures and searched signatures of strange stars in gamma-ray, x-ray and radio astronomy. We have a set of Equations of State (EoS) for strange quark matter (SQM) and solving the TOV equations, we get the structure of strange stars. The maximum mass for a strange star decreases with the increase of temperature, because at high temperatures, the EoS become softer. One important aspect of strange star is that, surface tension depends on the size and structure of the star and is significantly larger than the conventional values. Moment of inertia is another important parameter for compact stars as by comparing theoretical values with observed estimate, it is possible to constrain the dense matter Equation of State. We hope that this approach will help us to decide whether the members of the double pulsar system PSR J0737-3039 are neutron stars or strange stars.
△ Less
Submitted 10 February, 2008;
originally announced February 2008.