-
Self-rationalization improves LLM as a fine-grained judge
Authors:
Prapti Trivedi,
Aditya Gulati,
Oliver Molenschot,
Meghana Arakkal Rajeev,
Rajkumar Ramamurthy,
Keith Stevens,
Tanveesh Singh Chaudhery,
Jahnavi Jambholkar,
James Zou,
Nazneen Rajani
Abstract:
LLM-as-a-judge models have been used for evaluating both human and AI generated content, specifically by providing scores and rationales. Rationales, in addition to increasing transparency, help models learn to calibrate their judgments. Enhancing a model's rationale can therefore improve its calibration abilities and ultimately the ability to score content. We introduce Self-Rationalization, an iterative process of improving the rationales for the judge models, which consequently improves the score for fine-grained customizable scoring criteria (i.e., Likert-scale scoring with arbitrary evaluation criteria). Self-rationalization works by having the model generate multiple judgments with rationales for the same input, curating a preference pair dataset from its own judgments, and iteratively fine-tuning the judge via DPO. Intuitively, this approach allows the judge model to self-improve by learning from its own rationales, leading to better alignment and evaluation accuracy. After just two iterations -- while only relying on examples in the training set -- human evaluation shows that our judge model learns to produce higher quality rationales, with a win rate of $62\%$ on average compared to models just trained via SFT on rationale. This judge model also achieves high scoring accuracy on BigGen Bench and Reward Bench, outperforming even bigger sized models trained using SFT with rationale, self-consistency or best-of-$N$ sampling by $3\%$ to $9\%$.
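The pair-curation step described in this abstract could be sketched roughly as follows. This is only an illustration: the abstract does not say how pairs are selected, so the rule used here (prefer a sampled judgment whose score matches a reference score from the training set, reject the one furthest from it) is an assumption, and the names `Judgment` and `curate_pair` are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Judgment:
    """One sampled judgment: a free-text rationale plus a Likert-scale score."""
    rationale: str
    score: int


def curate_pair(
    judgments: List[Judgment], reference_score: int
) -> Optional[Tuple[Judgment, Judgment]]:
    """Return a (chosen, rejected) pair, or None if no contrast exists.

    ASSUMPTION: the selection rule below is illustrative, not the
    paper's stated procedure.
    """
    matching = [j for j in judgments if j.score == reference_score]
    differing = [j for j in judgments if j.score != reference_score]
    if not matching or not differing:
        # All samples agree or all disagree with the reference: no usable pair.
        return None
    # Reject the sample whose score is furthest from the reference score.
    rejected = max(differing, key=lambda j: abs(j.score - reference_score))
    return matching[0], rejected
```

Pairs produced this way would then feed a preference-optimization step such as DPO, which is outside the scope of this sketch.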
Submitted 7 October, 2024;
originally announced October 2024.
-
The SPARC Toroidal Field Model Coil Program
Authors:
Zachary Hartwig,
Rui Vieira,
Darby Dunn,
Theodore Golfinopoulos,
Brian LaBombard,
Christopher Lammi,
Phil Michael,
Susan Agabian,
David Arsenault,
Raheem Barnett,
Mike Barry,
Larry Bartoszek,
William Beck,
David Bellofatto,
Daniel Brunner,
William Burke,
Jason Burrows,
William Byford,
Charles Cauley,
Sarah Chamberlain,
David Chavarria,
JL Cheng,
James Chicarello,
Karen Cote,
Corinne Cotta
, et al. (75 additional authors not shown)
Abstract:
The SPARC Toroidal Field Model Coil (TFMC) Program was a three-year effort between 2018 and 2021 that developed novel Rare Earth Yttrium Barium Copper Oxide (REBCO) superconductor technologies and then successfully utilized these technologies to design, build, and test a first-in-class, high-field (~20 T), representative-scale (~3 m) superconducting toroidal field coil. With the principal objective of demonstrating mature, large-scale, REBCO magnets, the project was executed jointly by the MIT Plasma Science and Fusion Center (PSFC) and Commonwealth Fusion Systems (CFS). The TFMC achieved its programmatic goal of experimentally demonstrating a large-scale high-field REBCO magnet, achieving 20.1 T peak field-on-conductor with 40.5 kA of terminal current, 815 kN/m of Lorentz loading on the REBCO stacks, and almost 1 GPa of mechanical stress accommodated by the structural case. Fifteen internal demountable pancake-to-pancake joints operated in the 0.5 to 2.0 nOhm range at 20 K and in magnetic fields up to 12 T. The DC and AC electromagnetic performance of the magnet, predicted by new advances in high-fidelity computational models, was confirmed in two test campaigns while the massively parallel, single-pass, pressure-vessel style coolant scheme capable of large heat removal was validated. The REBCO current lead and feeder system was experimentally qualified up to 50 kA, and the cryocooler-based cryogenic system provided 600 W of cooling power at 20 K with mass flow rates up to 70 g/s at a maximum design pressure of 20 bar-a for the test campaigns. Finally, the feasibility of using passive, self-protection against a quench in a fusion-scale NI TF coil was experimentally assessed with an intentional open-circuit quench at 31.5 kA terminal current.
Submitted 18 August, 2023;
originally announced August 2023.
-
OpenAssistant Conversations -- Democratizing Large Language Model Alignment
Authors:
Andreas Köpf,
Yannic Kilcher,
Dimitri von Rütte,
Sotiris Anagnostidis,
Zhi-Rui Tam,
Keith Stevens,
Abdullah Barhoum,
Nguyen Minh Duc,
Oliver Stanley,
Richárd Nagyfi,
Shahul ES,
Sameer Suri,
David Glushkov,
Arnav Dantuluri,
Andrew Maguire,
Christoph Schuhmann,
Huu Nguyen,
Alexander Mattick
Abstract:
Aligning large language models (LLMs) with human preferences has proven to drastically improve usability and has driven rapid adoption as demonstrated by ChatGPT. Alignment techniques such as supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF) greatly reduce the required skill and domain knowledge to effectively harness the capabilities of LLMs, increasing their accessibility and utility across various domains. However, state-of-the-art alignment techniques like RLHF rely on high-quality human feedback data, which is expensive to create and often remains proprietary. In an effort to democratize research on large-scale alignment, we release OpenAssistant Conversations, a human-generated, human-annotated assistant-style conversation corpus consisting of 161,443 messages in 35 different languages, annotated with 461,292 quality ratings, resulting in over 10,000 complete and fully annotated conversation trees. The corpus is a product of a worldwide crowd-sourcing effort involving over 13,500 volunteers. Models trained on OpenAssistant Conversations show consistent improvements on standard benchmarks over respective base models. We release our code and data under a fully permissive licence.
Submitted 31 October, 2023; v1 submitted 14 April, 2023;
originally announced April 2023.
-
NusaCrowd: Open Source Initiative for Indonesian NLP Resources
Authors:
Samuel Cahyawijaya,
Holy Lovenia,
Alham Fikri Aji,
Genta Indra Winata,
Bryan Wilie,
Rahmad Mahendra,
Christian Wibisono,
Ade Romadhony,
Karissa Vincentio,
Fajri Koto,
Jennifer Santoso,
David Moeljadi,
Cahya Wirawan,
Frederikus Hudi,
Ivan Halim Parmonangan,
Ika Alfina,
Muhammad Satrio Wicaksono,
Ilham Firdausi Putra,
Samsul Rahmadani,
Yulianti Oenang,
Ali Akbar Septiandri,
James Jaya,
Kaustubh D. Dhole,
Arie Ardiyanti Suryani,
Rifki Afina Putri
, et al. (22 additional authors not shown)
Abstract:
We present NusaCrowd, a collaborative initiative to collect and unify existing resources for Indonesian languages, including opening access to previously non-public resources. Through this initiative, we have brought together 137 datasets and 118 standardized data loaders. The quality of the datasets has been assessed manually and automatically, and their value is demonstrated through multiple experiments. NusaCrowd's data collection enables the creation of the first zero-shot benchmarks for natural language understanding and generation in Indonesian and the local languages of Indonesia. Furthermore, NusaCrowd brings the creation of the first multilingual automatic speech recognition benchmark in Indonesian and the local languages of Indonesia. Our work strives to advance natural language processing (NLP) research for languages that are under-represented despite being widely spoken.
Submitted 21 July, 2023; v1 submitted 19 December, 2022;
originally announced December 2022.
-
SKM-TEA: A Dataset for Accelerated MRI Reconstruction with Dense Image Labels for Quantitative Clinical Evaluation
Authors:
Arjun D Desai,
Andrew M Schmidt,
Elka B Rubin,
Christopher M Sandino,
Marianne S Black,
Valentina Mazzoli,
Kathryn J Stevens,
Robert Boutin,
Christopher Ré,
Garry E Gold,
Brian A Hargreaves,
Akshay S Chaudhari
Abstract:
Magnetic resonance imaging (MRI) is a cornerstone of modern medical imaging. However, long image acquisition times, the need for qualitative expert analysis, and the lack of (and difficulty extracting) quantitative indicators that are sensitive to tissue health have curtailed widespread clinical and research studies. While recent machine learning methods for MRI reconstruction and analysis have shown promise for reducing this burden, these techniques are primarily validated with imperfect image quality metrics, which are discordant with clinically-relevant measures that ultimately hamper clinical deployment and clinician trust. To mitigate this challenge, we present the Stanford Knee MRI with Multi-Task Evaluation (SKM-TEA) dataset, a collection of quantitative knee MRI (qMRI) scans that enables end-to-end, clinically-relevant evaluation of MRI reconstruction and analysis tools. This 1.6TB dataset consists of raw-data measurements of ~25,000 slices (155 patients) of anonymized patient MRI scans, the corresponding scanner-generated DICOM images, manual segmentations of four tissues, and bounding box annotations for sixteen clinically relevant pathologies. We provide a framework for using qMRI parameter maps, along with image reconstructions and dense image labels, for measuring the quality of qMRI biomarker estimates extracted from MRI reconstruction, segmentation, and detection techniques. Finally, we use this framework to benchmark state-of-the-art baselines on this dataset. We hope our SKM-TEA dataset and code can enable a broad spectrum of research for modular image reconstruction and image analysis in a clinically informed manner. Dataset access, code, and benchmarks are available at https://github.com/StanfordMIMI/skm-tea.
Submitted 13 March, 2022;
originally announced March 2022.
-
Hierarchical Document Encoder for Parallel Corpus Mining
Authors:
Mandy Guo,
Yinfei Yang,
Keith Stevens,
Daniel Cer,
Heming Ge,
Yun-Hsuan Sung,
Brian Strope,
Ray Kurzweil
Abstract:
We explore using multilingual document embeddings for nearest neighbor mining of parallel data. Three document-level representations are investigated: (i) document embeddings generated by simply averaging multilingual sentence embeddings; (ii) a neural bag-of-words (BoW) document encoding model; (iii) a hierarchical multilingual document encoder (HiDE) that builds on our sentence-level model. The results show document embeddings derived from sentence-level averaging are surprisingly effective for clean datasets, but suggest models trained hierarchically at the document-level are more effective on noisy data. Analysis experiments demonstrate our hierarchical models are very robust to variations in the underlying sentence embedding quality. Using document embeddings trained with HiDE achieves state-of-the-art performance on United Nations (UN) parallel document mining, 94.9% P@1 for en-fr and 97.3% P@1 for en-es.
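As a rough illustration of representation (i) and the P@1 metric mentioned in this abstract, the sketch below averages sentence vectors into a document vector and mines the nearest neighbor by cosine similarity. It is a toy, pure-Python sketch under stated assumptions: the function names are hypothetical, the vectors stand in for real multilingual sentence embeddings, and brute-force search replaces whatever nearest-neighbor method the authors actually used.

```python
import math
from typing import List, Sequence

Vector = List[float]


def average_embedding(sentence_vectors: Sequence[Vector]) -> Vector:
    """Document embedding as the element-wise mean of its sentence embeddings."""
    n = len(sentence_vectors)
    dim = len(sentence_vectors[0])
    return [sum(v[i] for v in sentence_vectors) / n for i in range(dim)]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def precision_at_1(source_docs: Sequence[Vector], target_docs: Sequence[Vector]) -> float:
    """Fraction of source documents whose nearest target is the true parallel.

    ASSUMPTION: source_docs[i] and target_docs[i] are the gold parallel pair.
    """
    hits = 0
    for i, src in enumerate(source_docs):
        best = max(range(len(target_docs)), key=lambda j: cosine(src, target_docs[j]))
        hits += int(best == i)
    return hits / len(source_docs)
```

For example, averaging `[[1.0, 0.0], [1.0, 0.2]]` gives a document vector close to `[1.0, 0.1]`, which can then be matched against candidate target-language documents with `precision_at_1`.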
Submitted 30 June, 2019; v1 submitted 19 June, 2019;
originally announced June 2019.
-
Donors and Deep Acceptors in $β$-Ga2O3
Authors:
Adam T. Neal,
Shin Mou,
Subrina Rafique,
Hongping Zhao,
Elaheh Ahmadi,
James S. Speck,
Kevin T. Stevens,
John D. Blevins,
Darren B. Thomson,
Neil Moser,
Kelson D. Chabak,
Gregg H. Jessen
Abstract:
We have studied the properties of Si, Ge shallow donors and Fe, Mg deep acceptors in $β$-Ga2O3 through temperature dependent van der Pauw and Hall effect measurements of samples grown by a variety of methods, including edge-defined film-fed (EFG), Czochralski (CZ), molecular beam epitaxy (MBE), and low pressure chemical vapor deposition (LPCVD). Through simultaneous, self-consistent fitting of the temperature dependent carrier density and mobility, we are able to accurately estimate the donor energy of Si and Ge to be 30 meV in $β$-Ga2O3. Additionally, we show that our measured Hall effect data are consistent with Si and Ge acting as typical shallow donors, rather than shallow DX centers. High temperature Hall effect measurement of Fe doped $β$-Ga2O3 indicates that the material remains weakly n-type even with the Fe doping, with an acceptor energy of 860 meV relative to the conduction band for the Fe deep acceptor. Van der Pauw measurements of Mg doped Ga2O3 indicate an activation energy of 1.1 eV, as determined from the temperature dependent conductivity.
Submitted 4 September, 2018;
originally announced September 2018.
-
Effective Parallel Corpus Mining using Bilingual Sentence Embeddings
Authors:
Mandy Guo,
Qinlan Shen,
Yinfei Yang,
Heming Ge,
Daniel Cer,
Gustavo Hernandez Abrego,
Keith Stevens,
Noah Constant,
Yun-Hsuan Sung,
Brian Strope,
Ray Kurzweil
Abstract:
This paper presents an effective approach for parallel corpus mining using bilingual sentence embeddings. Our embedding models are trained to produce similar representations exclusively for bilingual sentence pairs that are translations of each other. This is achieved using a novel training method that introduces hard negatives consisting of sentences that are not translations but that have some degree of semantic similarity. The quality of the resulting embeddings is evaluated on parallel corpus reconstruction and by assessing machine translation systems trained on gold vs. mined sentence pairs. We find that the sentence embeddings can be used to reconstruct the United Nations Parallel Corpus at the sentence level with a precision of 48.9% for en-fr and 54.9% for en-es. When adapted to document level matching, we achieve a parallel document matching accuracy that is comparable to the significantly more computationally intensive approach of [Jakob 2010]. Using reconstructed parallel data, we are able to train NMT models that perform nearly as well as models trained on the original data (within 1-2 BLEU).
Submitted 2 August, 2018; v1 submitted 31 July, 2018;
originally announced July 2018.
-
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Authors:
Yonghui Wu,
Mike Schuster,
Zhifeng Chen,
Quoc V. Le,
Mohammad Norouzi,
Wolfgang Macherey,
Maxim Krikun,
Yuan Cao,
Qin Gao,
Klaus Macherey,
Jeff Klingner,
Apurva Shah,
Melvin Johnson,
Xiaobing Liu,
Łukasz Kaiser,
Stephan Gouws,
Yoshikiyo Kato,
Taku Kudo,
Hideto Kazawa,
Keith Stevens,
George Kurian,
Nishant Patil,
Wei Wang,
Cliff Young,
Jason Smith
, et al. (6 additional authors not shown)
Abstract:
Neural Machine Translation (NMT) is an end-to-end learning approach for automated translation, with the potential to overcome many of the weaknesses of conventional phrase-based translation systems. Unfortunately, NMT systems are known to be computationally expensive both in training and in translation inference. Also, most NMT systems have difficulty with rare words. These issues have hindered NMT's use in practical deployments and services, where both accuracy and speed are essential. In this work, we present GNMT, Google's Neural Machine Translation system, which attempts to address many of these issues. Our model consists of a deep LSTM network with 8 encoder and 8 decoder layers using attention and residual connections. To improve parallelism and therefore decrease training time, our attention mechanism connects the bottom layer of the decoder to the top layer of the encoder. To accelerate the final translation speed, we employ low-precision arithmetic during inference computations. To improve handling of rare words, we divide words into a limited set of common sub-word units ("wordpieces") for both input and output. This method provides a good balance between the flexibility of "character"-delimited models and the efficiency of "word"-delimited models, naturally handles translation of rare words, and ultimately improves the overall accuracy of the system. Our beam search technique employs a length-normalization procedure and uses a coverage penalty, which encourages generation of an output sentence that is most likely to cover all the words in the source sentence. On the WMT'14 English-to-French and English-to-German benchmarks, GNMT achieves competitive results to state-of-the-art. Using a human side-by-side evaluation on a set of isolated simple sentences, it reduces translation errors by an average of 60% compared to Google's phrase-based production system.
Submitted 8 October, 2016; v1 submitted 26 September, 2016;
originally announced September 2016.
-
Exploring Practitioner Perspectives of Sourcing Risks: Towards the Development of an Integrated Risk and Control Framework
Authors:
Deborah Bunker,
Catherine Hardy,
Abdul Babar,
Ken Stevens
Abstract:
Outsourcing of information and communication technologies (ICT) and related services is an established and growing industry. Recent trends, such as the move toward multi-sourcing have increased the complexity and risk of these outsourcing arrangements. There is a critical research need to identify the risks faced by both the organisations that outsource ICT and the vendors that provide it in this changing landscape. To address growing concerns regarding the best way to deal with risk and control in this environment, our research focuses on establishing a Sourcing Risk and Control Framework to assist organisations identify these risks and develop effective mitigation strategies. In this paper we report on the first stage of our research that sought to document how sourcing risk is represented and considered in practice. To date, limited empirical research has been conducted in an Australian context. Using a series of workshops involving client and vendor representatives, we identified a broad range of risks and developed a cohesive categorisation scheme that incorporates functional and multi-stakeholder perspectives.
Submitted 8 June, 2016;
originally announced June 2016.
-
Ball-grid array architecture for microfabricated ion traps
Authors:
Nicholas D. Guise,
Spencer D. Fallek,
Kelly E. Stevens,
K. R. Brown,
Curtis Volin,
Alexa W. Harter,
Jason M. Amini,
Robert E. Higashi,
Son Thai Lu,
Helen M. Chanhvongsak,
Thi A. Nguyen,
Matthew S. Marcus,
Thomas R. Ohnstein,
Daniel W. Youngner
Abstract:
State-of-the-art microfabricated ion traps for quantum information research are approaching nearly one hundred control electrodes. We report here on the development and testing of a new architecture for microfabricated ion traps, built around ball-grid array (BGA) connections, that is suitable for increasingly complex trap designs. In the BGA trap, through-substrate vias bring electrical signals from the back side of the trap die to the surface trap structure on the top side. Gold-ball bump bonds connect the back side of the trap die to an interposer for signal routing from the carrier. Trench capacitors fabricated into the trap die replace area-intensive surface or edge capacitors. Wirebonds in the BGA architecture are moved to the interposer. These last two features allow the trap die to be reduced to only the area required to produce trapping fields. The smaller trap dimensions allow tight focusing of an addressing laser beam for fast single-qubit rotations. Performance of the BGA trap as characterized with $^{40}$Ca$^+$ ions is comparable to previous surface-electrode traps in terms of ion heating rate, mode frequency stability, and storage lifetime. We demonstrate two-qubit entanglement operations with $^{171}$Yb$^+$ ions in a second BGA trap.
Submitted 5 May, 2015; v1 submitted 17 December, 2014;
originally announced December 2014.
-
Bayesian Wavelet Shrinkage of the Haar-Fisz Transformed Wavelet Periodogram
Authors:
Guy P. Nason,
Kara N. Stevens
Abstract:
It is increasingly being realised that many real world time series are not stationary and exhibit evolving second-order autocovariance or spectral structure. This article introduces a Bayesian approach for modelling the evolving wavelet spectrum of a locally stationary wavelet time series. Our new method works by combining the advantages of a Haar-Fisz transformed spectrum with a simple, but powerful, Bayesian wavelet shrinkage method. Our new method produces excellent and stable spectral estimates and this is demonstrated via simulated data and on differenced infant ECG data. A major additional benefit of the Bayesian paradigm is that we obtain rigorous and useful credible intervals of the evolving spectral structure. We show how the Bayesian credible intervals provide extra insight into the infant ECG data.
Submitted 10 September, 2013;
originally announced September 2013.
-
Comparison of ancilla preparation and measurement procedures for the Steane [[7,1,3]] code on a model ion trap quantum computer
Authors:
Yu Tomita,
Mauricio Gutiérrez,
Chingiz Kabytayev,
Kenneth R. Brown,
M. R. Hutsel,
A. P. Morris,
Kelly E. Stevens,
G. Mohler
Abstract:
We schedule the Steane [[7,1,3]] error correction on a model ion trap architecture with ballistic transport. We compare the level one error rates for syndrome extraction using the Shor method of ancilla prepared in verified cat states to the DiVincenzo-Aliferis method without verification. The study examines how the quantum error correction circuit latency and error vary with the number of available ancilla and the choice of protocol for ancilla preparation and measurement. We find that with few exceptions the DiVincenzo-Aliferis method without cat state verification outperforms the standard Shor method. We also find that additional ancilla always reduces the latency but does not significantly change the error due to the high memory fidelity.
Submitted 13 May, 2013; v1 submitted 2 May, 2013;
originally announced May 2013.
-
Population genomics of sub-Saharan Drosophila melanogaster: African diversity and non-African admixture
Authors:
John E. Pool,
Russell B. Corbett-Detig,
Ryuichi P. Sugino,
Kristian A. Stevens,
Charis M. Cardeno,
Marc W. Crepeau,
Pablo Duchen,
J. J. Emerson,
Perot Saelao,
David J. Begun,
Charles H. Langley
Abstract:
(ABRIDGED) We report the genome sequencing of 139 wild-derived strains of D. melanogaster, representing 22 population samples from the sub-Saharan ancestral range of this species, along with one European population. Most genomes were sequenced above 25X depth from haploid embryos. Results indicated a pervasive influence of non-African admixture in many African populations, motivating the development and application of a novel admixture detection method. Admixture proportions varied among populations, with greater admixture in urban locations. Admixture levels also varied across the genome, with localized peaks and valleys suggestive of a non-neutral introgression process. Genomes from the same location differed starkly in ancestry, suggesting that isolation mechanisms may exist within African populations. After removing putatively admixed genomic segments, the greatest genetic diversity was observed in southern Africa (e.g. Zambia), while diversity in other populations was largely consistent with a geographic expansion from this potentially ancestral region. The European population showed different levels of diversity reduction on each chromosome arm, and some African populations displayed chromosome arm-specific diversity reductions. Inversions in the European sample were associated with strong elevations in diversity across chromosome arms. Genomic scans were conducted to identify loci that may represent targets of positive selection. A disproportionate number of candidate selective sweep regions were located near genes with varied roles in gene regulation. Outliers for Europe-Africa FST were found to be enriched in genomic regions of locally elevated cosmopolitan admixture, possibly reflecting a role for some of these loci in driving the introgression of non-African alleles into African populations.
Submitted 23 August, 2012;
originally announced August 2012.
-
Non-existence of Asymptotically Flat Geons in 2+1 Gravity
Authors:
Kory A. Stevens,
Kristin Schleich,
Donald M. Witt
Abstract:
Geons, small topological structures that exhibit particle properties such as charge and angular momentum without the presence of matter sources, have been extensively discussed in 3+1-dimensional general relativity. Given the recent renewal of interest in 2+1 gravity, it is natural to ask whether or not the notion of geons extends to three dimensions. We prove here that, in contrast to the 3+1-dimensional case, there are no 2+1-dimensional asymptotically flat solutions of the vacuum Einstein or Einstein-Maxwell equations containing geons. In contrast, 2+1-dimensional asymptotically anti-de Sitter spacetimes can indeed contain geons; however, the geons are always hidden behind a single black hole horizon. We also prove sufficient conditions for the non-existence of 2+1-dimensional asymptotically flat geon-containing solutions.
Submitted 24 October, 2008; v1 submitted 17 September, 2008;
originally announced September 2008.
-
Integrability of Particle Motion and Scalar Field Propagation in Kerr-(Anti) de Sitter Black Hole Spacetimes in All Dimensions
Authors:
Muraari Vasudevan,
Kory A. Stevens
Abstract:
We study the Hamilton-Jacobi and massive Klein-Gordon equations in the general Kerr-(Anti) de Sitter black hole background in all dimensions. Complete separation of both equations is carried out in cases when there are two sets of equal black hole rotation parameters. We analyze explicitly the symmetry properties of these backgrounds that allow for this Liouville integrability and construct a nontrivial irreducible Killing tensor associated with the enlarged symmetry group which permits separation. We also derive first-order equations of motion for particles in these backgrounds and examine some of their properties. This work greatly generalizes previously known results for both the Myers-Perry metrics, and the Kerr-(Anti) de Sitter metrics in higher dimensions.
Submitted 22 July, 2005;
originally announced July 2005.
-
Anomalous Peaks in the Fourier Transformed Density of States of a Bilayer D-Wave Superconductor
Authors:
K. M. Stevens,
W. A. Atkinson
Abstract:
This paper has been withdrawn due to a sign error in the equation for F_11 which invalidates many results.
Submitted 23 December, 2004; v1 submitted 18 August, 2004;
originally announced August 2004.
-
Particle Motion and Scalar Field Propagation in Myers-Perry Black Hole Spacetimes in All Dimensions
Authors:
Muraari Vasudevan,
Kory A. Stevens,
Don N. Page
Abstract:
We study separability of the Hamilton-Jacobi and massive Klein-Gordon equations in the general Myers-Perry black hole background in all dimensions. Complete separation of both equations is carried out in cases when there are two sets of equal black hole rotation parameters, which significantly enlarges the rotational symmetry group. We explicitly construct a nontrivial irreducible Killing tensor associated with the enlarged symmetry group which permits separation. We also derive first-order equations of motion for particles in these backgrounds and examine some of their properties.
Submitted 7 July, 2004;
originally announced July 2004.
-
Separability of the Hamilton-Jacobi and Klein-Gordon Equations in Kerr-de Sitter Metrics
Authors:
Muraari Vasudevan,
Kory A. Stevens,
Don N. Page
Abstract:
We study separability of the Hamilton-Jacobi and massive Klein-Gordon equations in the general Kerr-de Sitter spacetime in all dimensions. Complete separation of both equations is carried out in 2n+1 spacetime dimensions with all n rotation parameters equal, in which case the rotational symmetry group is enlarged from (U(1))^n to U(n). We explicitly construct the additional Killing vectors associated with the enlarged symmetry group which permit separation. We also derive first-order equations of motion for particles in these backgrounds and examine some of their properties.
Submitted 1 June, 2004; v1 submitted 26 May, 2004;
originally announced May 2004.
-
Stationary strings near a higher-dimensional rotating black hole
Authors:
Valeri P. Frolov,
Kory A. Stevens
Abstract:
We study stationary string configurations in a space-time of a higher-dimensional rotating black hole. We demonstrate that the Nambu-Goto equations for a stationary string in the 5D Myers-Perry metric allow a separation of variables. We present these equations in the first-order form and study their properties. We prove that the only stationary string configuration which crosses the infinite red-shift surface and remains regular there is a principal Killing string. A worldsheet of such a string is generated by a principal null geodesic and a timelike at infinity Killing vector field. We obtain principal Killing string solutions in the Myers-Perry metrics with an arbitrary number of dimensions. It is shown that due to the interaction of a string with a rotating black hole there is an angular momentum transfer from the black hole to the string. We calculate the rate of this transfer in a spacetime with an arbitrary number of dimensions. This effect slows down the rotation of the black hole. We discuss possible final stationary configurations of a rotating black hole interacting with a string.
Submitted 15 June, 2004; v1 submitted 7 April, 2004;
originally announced April 2004.