-
Hardware and Software Platform Inference
Authors:
Cheng Zhang,
Hanna Foerster,
Robert D. Mullins,
Yiren Zhao,
Ilia Shumailov
Abstract:
It is now a common business practice to buy access to large language model (LLM) inference rather than self-host, because of significant upfront hardware infrastructure and energy costs. However, as a buyer, there is no mechanism to verify the authenticity of the advertised service including the serving hardware platform, e.g. that it is actually being served using an NVIDIA H100. Furthermore, the…
▽ More
It is now a common business practice to buy access to large language model (LLM) inference rather than self-host, because of significant upfront hardware infrastructure and energy costs. However, as a buyer, there is no mechanism to verify the authenticity of the advertised service including the serving hardware platform, e.g. that it is actually being served using an NVIDIA H100. Furthermore, there are reports suggesting that model providers may deliver models that differ slightly from the advertised ones, often to make them run on less expensive hardware. That way, a client pays premium for a capable model access on more expensive hardware, yet ends up being served by a (potentially less capable) cheaper model on cheaper hardware. In this paper we introduce \textit{\textbf{hardware and software platform inference (HSPI)}} -- a method for identifying the underlying \GPU{} architecture and software stack of a (black-box) machine learning model solely based on its input-output behavior. Our method leverages the inherent differences of various \GPU{} architectures and compilers to distinguish between different \GPU{} types and software stacks. By analyzing the numerical patterns in the model's outputs, we propose a classification framework capable of accurately identifying the \GPU{} used for model inference as well as the underlying software configuration. Our findings demonstrate the feasibility of inferring \GPU{} type from black-box models. We evaluate HSPI against models served on different real hardware and find that in a white-box setting we can distinguish between different \GPU{}s with between $83.9\%$ and $100\%$ accuracy. Even in a black-box setting we are able to achieve results that are up to three times higher than random guess accuracy.
△ Less
Submitted 7 November, 2024;
originally announced November 2024.
-
Optimised Grouped-Query Attention Mechanism for Transformers
Authors:
Yuang Chen,
Cheng Zhang,
Xitong Gao,
Robert D. Mullins,
George A. Constantinides,
Yiren Zhao
Abstract:
Grouped-query attention (GQA) has been widely adopted in LLMs to mitigate the complexity of multi-head attention (MHA). To transform an MHA to a GQA, neighbour queries in MHA are evenly split into groups where each group shares the value and key layers. In this work, we propose AsymGQA, an activation-informed approach to asymmetrically grouping an MHA to a GQA for better model performance. Our Asy…
▽ More
Grouped-query attention (GQA) has been widely adopted in LLMs to mitigate the complexity of multi-head attention (MHA). To transform an MHA to a GQA, neighbour queries in MHA are evenly split into groups where each group shares the value and key layers. In this work, we propose AsymGQA, an activation-informed approach to asymmetrically grouping an MHA to a GQA for better model performance. Our AsymGQA outperforms the GQA within the same model size budget. For example, AsymGQA LLaMA-2-7B has an accuracy increase of 7.5% on MMLU compared to neighbour grouping. Our approach addresses the GQA's trade-off problem between model performance and hardware efficiency.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
Unlocking the Global Synergies in Low-Rank Adapters
Authors:
Zixi Zhang,
Cheng Zhang,
Xitong Gao,
Robert D. Mullins,
George A. Constantinides,
Yiren Zhao
Abstract:
Low-rank Adaption (LoRA) has been the de-facto parameter-efficient fine-tuning technique for large language models. We present HeteroLoRA, a light-weight search algorithm that leverages zero-cost proxies to allocate the limited LoRA trainable parameters across the model for better fine-tuned performance. In addition to the allocation for the standard LoRA-adapted models, we also demonstrate the ef…
▽ More
Low-rank Adaption (LoRA) has been the de-facto parameter-efficient fine-tuning technique for large language models. We present HeteroLoRA, a light-weight search algorithm that leverages zero-cost proxies to allocate the limited LoRA trainable parameters across the model for better fine-tuned performance. In addition to the allocation for the standard LoRA-adapted models, we also demonstrate the efficacy of HeteroLoRA by performing the allocation in a more challenging search space that includes LoRA modules and LoRA-adapted shortcut connections. Experiments show that HeteroLoRA enables improvements in model performance given the same parameter budge. For example, on MRPC, we see an improvement of 1.6% in accuracy with similar training parameter budget. We will open-source our algorithm once the paper is accepted.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
Revisiting Structured Dropout
Authors:
Yiren Zhao,
Oluwatomisin Dada,
Xitong Gao,
Robert D Mullins
Abstract:
Large neural networks are often overparameterised and prone to overfitting, Dropout is a widely used regularization technique to combat overfitting and improve model generalization. However, unstructured Dropout is not always effective for specific network architectures and this has led to the formation of multiple structured Dropout approaches to improve model performance and, sometimes, reduce t…
▽ More
Large neural networks are often overparameterised and prone to overfitting, Dropout is a widely used regularization technique to combat overfitting and improve model generalization. However, unstructured Dropout is not always effective for specific network architectures and this has led to the formation of multiple structured Dropout approaches to improve model performance and, sometimes, reduce the computational resources required for inference. In this work, we revisit structured Dropout comparing different Dropout approaches to natural language processing and computer vision tasks for multiple state-of-the-art networks. Additionally, we devise an approach to structured Dropout we call \textbf{\emph{ProbDropBlock}} which drops contiguous blocks from feature maps with a probability given by the normalized feature salience values. We find that with a simple scheduling strategy the proposed approach to structured Dropout consistently improved model performance compared to baselines and other Dropout approaches on a diverse range of tasks and models. In particular, we show \textbf{\emph{ProbDropBlock}} improves RoBERTa finetuning on MNLI by $0.22\%$, and training of ResNet50 on ImageNet by $0.28\%$.
△ Less
Submitted 5 October, 2022;
originally announced October 2022.
-
DARTFormer: Finding The Best Type Of Attention
Authors:
Jason Ross Brown,
Yiren Zhao,
Ilia Shumailov,
Robert D Mullins
Abstract:
Given the wide and ever growing range of different efficient Transformer attention mechanisms, it is important to identify which attention is most effective when given a task. In this work, we are also interested in combining different attention types to build heterogeneous Transformers. We first propose a DARTS-like Neural Architecture Search (NAS) method to find the best attention for a given ta…
▽ More
Given the wide and ever growing range of different efficient Transformer attention mechanisms, it is important to identify which attention is most effective when given a task. In this work, we are also interested in combining different attention types to build heterogeneous Transformers. We first propose a DARTS-like Neural Architecture Search (NAS) method to find the best attention for a given task, in this setup, all heads use the same attention (homogeneous models). Our results suggest that NAS is highly effective on this task, and it identifies the best attention mechanisms for IMDb byte level text classification and Listops. We then extend our framework to search for and build Transformers with multiple different attention types, and call them heterogeneous Transformers. We show that whilst these heterogeneous Transformers are better than the average homogeneous models, they cannot outperform the best. We explore the reasons why heterogeneous attention makes sense, and why it ultimately fails.
△ Less
Submitted 2 October, 2022;
originally announced October 2022.
-
Wide Attention Is The Way Forward For Transformers?
Authors:
Jason Ross Brown,
Yiren Zhao,
Ilia Shumailov,
Robert D Mullins
Abstract:
The Transformer is an extremely powerful and prominent deep learning architecture. In this work, we challenge the commonly held belief in deep learning that going deeper is better, and show an alternative design approach that is building wider attention Transformers. We demonstrate that wide single layer Transformer models can compete with or outperform deeper ones in a variety of Natural Language…
▽ More
The Transformer is an extremely powerful and prominent deep learning architecture. In this work, we challenge the commonly held belief in deep learning that going deeper is better, and show an alternative design approach that is building wider attention Transformers. We demonstrate that wide single layer Transformer models can compete with or outperform deeper ones in a variety of Natural Language Processing (NLP) tasks when both are trained from scratch. The impact of changing the model aspect ratio on Transformers is then studied systematically. This ratio balances the number of layers and the number of attention heads per layer while keeping the total number of attention heads and all other hyperparameters constant. On average, across 4 NLP tasks and 10 attention types, single layer wide models perform 0.3% better than their deep counterparts. We show an in-depth evaluation and demonstrate how wide models require a far smaller memory footprint and can run faster on commodity hardware, in addition, these wider models are also more interpretable. For example, a single layer Transformer on the IMDb byte level text classification has 3.1x faster inference latency on a CPU than its equally accurate deeper counterpart, and is half the size. We therefore put forward wider and shallower models as a viable and desirable alternative for small models on NLP tasks, and as an important area of research for domains beyond this.
△ Less
Submitted 8 November, 2022; v1 submitted 2 October, 2022;
originally announced October 2022.
-
Revisiting Embeddings for Graph Neural Networks
Authors:
S. Purchase,
A. Zhao,
R. D. Mullins
Abstract:
Current graph representation learning techniques use Graph Neural Networks (GNNs) to extract features from dataset embeddings. In this work, we examine the quality of these embeddings and assess how changing them can affect the accuracy of GNNs. We explore different embedding extraction techniques for both images and texts; and find that the performance of different GNN architectures is dependent…
▽ More
Current graph representation learning techniques use Graph Neural Networks (GNNs) to extract features from dataset embeddings. In this work, we examine the quality of these embeddings and assess how changing them can affect the accuracy of GNNs. We explore different embedding extraction techniques for both images and texts; and find that the performance of different GNN architectures is dependent on the embedding style used. We see a prevalence of bag of words (BoW) embeddings and text classification tasks in available graph datasets. Given the impact embeddings has on GNN performance. this leads to a phenomenon that GNNs being optimised for BoW vectors.
△ Less
Submitted 29 November, 2022; v1 submitted 19 September, 2022;
originally announced September 2022.
-
E-Scooter Rider Detection and Classification in Dense Urban Environments
Authors:
Shane Gilroy,
Darragh Mullins,
Edward Jones,
Ashkan Parsi,
Martin Glavin
Abstract:
Accurate detection and classification of vulnerable road users is a safety critical requirement for the deployment of autonomous vehicles in heterogeneous traffic. Although similar in physical appearance to pedestrians, e-scooter riders follow distinctly different characteristics of movement and can reach speeds of up to 45kmph. The challenge of detecting e-scooter riders is exacerbated in urban e…
▽ More
Accurate detection and classification of vulnerable road users is a safety critical requirement for the deployment of autonomous vehicles in heterogeneous traffic. Although similar in physical appearance to pedestrians, e-scooter riders follow distinctly different characteristics of movement and can reach speeds of up to 45kmph. The challenge of detecting e-scooter riders is exacerbated in urban environments where the frequency of partial occlusion is increased as riders navigate between vehicles, traffic infrastructure and other road users. This can lead to the non-detection or mis-classification of e-scooter riders as pedestrians, providing inaccurate information for accident mitigation and path planning in autonomous vehicle applications. This research introduces a novel benchmark for partially occluded e-scooter rider detection to facilitate the objective characterization of detection models. A novel, occlusion-aware method of e-scooter rider detection is presented that achieves a 15.93% improvement in detection performance over the current state of the art.
△ Less
Submitted 20 May, 2022;
originally announced May 2022.
-
An Objective Method for Pedestrian Occlusion Level Classification
Authors:
Shane Gilroy,
Martin Glavin,
Edward Jones,
Darragh Mullins
Abstract:
Pedestrian detection is among the most safety-critical features of driver assistance systems for autonomous vehicles. One of the most complex detection challenges is that of partial occlusion, where a target object is only partially available to the sensor due to obstruction by another foreground object. A number of current pedestrian detection benchmarks provide annotation for partial occlusion t…
▽ More
Pedestrian detection is among the most safety-critical features of driver assistance systems for autonomous vehicles. One of the most complex detection challenges is that of partial occlusion, where a target object is only partially available to the sensor due to obstruction by another foreground object. A number of current pedestrian detection benchmarks provide annotation for partial occlusion to assess algorithm performance in these scenarios, however each benchmark varies greatly in their definition of the occurrence and severity of occlusion. In addition, current occlusion level annotation methods contain a high degree of subjectivity by the human annotator. This can lead to inaccurate or inconsistent reporting of an algorithm's detection performance for partially occluded pedestrians, depending on which benchmark is used. This research presents a novel, objective method for pedestrian occlusion level classification for ground truth annotation. Occlusion level classification is achieved through the identification of visible pedestrian keypoints and through the use of a novel, effective method of 2D body surface area estimation. Experimental results demonstrate that the proposed method reflects the pixel-wise occlusion level of pedestrians in images and is effective for all forms of occlusion, including challenging edge cases such as self-occlusion, truncation and inter-occluding pedestrians.
△ Less
Submitted 31 May, 2022; v1 submitted 11 May, 2022;
originally announced May 2022.
-
The Impact of Partial Occlusion on Pedestrian Detectability
Authors:
Shane Gilroy,
Darragh Mullins,
Edward Jones,
Ashkan Parsi,
Martin Glavin
Abstract:
Robust detection of vulnerable road users is a safety critical requirement for the deployment of autonomous vehicles in heterogeneous traffic. One of the most complex outstanding challenges is that of partial occlusion where a target object is only partially available to the sensor due to obstruction by another foreground object. A number of leading pedestrian detection benchmarks provide annotati…
▽ More
Robust detection of vulnerable road users is a safety critical requirement for the deployment of autonomous vehicles in heterogeneous traffic. One of the most complex outstanding challenges is that of partial occlusion where a target object is only partially available to the sensor due to obstruction by another foreground object. A number of leading pedestrian detection benchmarks provide annotation for partial occlusion, however each benchmark varies greatly in their definition of the occurrence and severity of occlusion. Recent research demonstrates that a high degree of subjectivity is used to classify occlusion level in these cases and occlusion is typically categorized into 2 to 3 broad categories such as partially and heavily occluded. This can lead to inaccurate or inconsistent reporting of pedestrian detection model performance depending on which benchmark is used. This research introduces a novel, objective benchmark for partially occluded pedestrian detection to facilitate the objective characterization of pedestrian detection models. Characterization is carried out on seven popular pedestrian detection models for a range of occlusion levels from 0-99%, in order to demonstrate the efficacy and increased analysis capabilities of the proposed characterization method. Results demonstrate that pedestrian detection performance degrades, and the number of false negative detections increase as pedestrian occlusion level increases. Of the seven popular pedestrian detection routines characterized, CenterNet has the greatest overall performance, followed by SSDlite. RetinaNet has the lowest overall detection performance across the range of occlusion levels.
△ Less
Submitted 27 July, 2023; v1 submitted 10 May, 2022;
originally announced May 2022.
-
The design of the n2EDM experiment
Authors:
N. J. Ayres,
G. Ban,
L. Bienstman,
G. Bison,
K. Bodek,
V. Bondar,
T. Bouillaud,
E. Chanel,
J. Chen,
P. -J. Chiu,
B. Clément,
C. Crawford,
M. Daum,
B. Dechenaux,
C. B. Doorenbos,
S. Emmenegger,
L. Ferraris-Bouchez,
M. Fertl,
A. Fratangelo,
P. Flaux,
D. Goupillière,
W. C. Griffith,
Z. D. Grujic,
P. G. Harris,
K. Kirch
, et al. (36 additional authors not shown)
Abstract:
We present the design of a next-generation experiment, n2EDM, currently under construction at the ultracold neutron source at the Paul Scherrer Institute (PSI) with the aim of carrying out a high-precision search for an electric dipole moment of the neutron. The project builds on experience gained with the previous apparatus operated at PSI until 2017, and is expected to deliver an order of magnit…
▽ More
We present the design of a next-generation experiment, n2EDM, currently under construction at the ultracold neutron source at the Paul Scherrer Institute (PSI) with the aim of carrying out a high-precision search for an electric dipole moment of the neutron. The project builds on experience gained with the previous apparatus operated at PSI until 2017, and is expected to deliver an order of magnitude better sensitivity with provision for further substantial improvements. An overview is given of the experimental method and setup, the sensitivity requirements for the apparatus are derived, and its technical design is described.
△ Less
Submitted 22 January, 2021; v1 submitted 21 January, 2021;
originally announced January 2021.
-
Using Reinforcement Learning to Perform Qubit Routing in Quantum Compilers
Authors:
Matteo G. Pozzi,
Steven J. Herbert,
Akash Sengupta,
Robert D. Mullins
Abstract:
"Qubit routing" refers to the task of modifying quantum circuits so that they satisfy the connectivity constraints of a target quantum computer. This involves inserting SWAP gates into the circuit so that the logical gates only ever occur between adjacent physical qubits. The goal is to minimise the circuit depth added by the SWAP gates.
In this paper, we propose a qubit routing procedure that u…
▽ More
"Qubit routing" refers to the task of modifying quantum circuits so that they satisfy the connectivity constraints of a target quantum computer. This involves inserting SWAP gates into the circuit so that the logical gates only ever occur between adjacent physical qubits. The goal is to minimise the circuit depth added by the SWAP gates.
In this paper, we propose a qubit routing procedure that uses a modified version of the deep Q-learning paradigm. The system is able to outperform the qubit routing procedures from two of the most advanced quantum compilers currently available, on both random and realistic circuits, across near-term architecture sizes.
△ Less
Submitted 31 July, 2020;
originally announced July 2020.
-
If your P value looks too good to be true, it probably is: Communicating reproducibility and variability in cell biology
Authors:
Samuel J. Lord,
Katrina B. Velle,
R. Dyche Mullins,
Lillian K. Fritz-Laylin
Abstract:
The cell biology literature is littered with erroneously tiny P values, often the result of evaluating individual cells as independent samples. Because readers use P values and error bars to infer whether a reported difference would likely recur if the experiment were repeated, the sample size N used for statistical tests should actually be the number of times an experiment is performed, not the n…
▽ More
The cell biology literature is littered with erroneously tiny P values, often the result of evaluating individual cells as independent samples. Because readers use P values and error bars to infer whether a reported difference would likely recur if the experiment were repeated, the sample size N used for statistical tests should actually be the number of times an experiment is performed, not the number of cells (or subcellular structures) analyzed across all experiments. P values calculated using the number of cells do not reflect the reproducibility of the result and are thus highly misleading. To help authors avoid this mistake, we provide examples and practical tutorials for creating figures that communicate both the cell-level variability and the experimental reproducibility.
△ Less
Submitted 20 December, 2019; v1 submitted 8 November, 2019;
originally announced November 2019.
-
Characterizing a Dramatic $ΔV\sim-9$ Flare on an Ultracool Dwarf Found by the ASAS-SN Survey
Authors:
Sarah J. Schmidt,
Jose L. Prieto,
K. Z. Stanek,
Benjamin J. Shappee,
Nidia Morrell,
Daniella C. Bardalez Gagliuffi,
C. S. Kochanek,
J. Jencson,
T. W-S. Holoien,
U. Basu,
John. F. Beacom,
D. M. Szczygiel,
G. Pojmanski,
J. Brimacombe,
M. Dubberley,
M. Elphick,
S. Foale,
E. Hawkins,
D. Mullins,
W. Rosing,
R. Ross,
Z. Walker
Abstract:
We analyze a $ΔV\sim-9$ magnitude flare on the newly identified M8 dwarf SDSS J022116.84+194020.4 (hereafter SDSSJ0221) detected as part of the All-Sky Automated Survey for Supernovae (ASAS-SN). Using infrared and optical spectra, we confirm that SDSSJ0221 is a relatively nearby (d$\sim$76 pc) M8 dwarf with strong quiescent H$α$ emission. Based on kinematics and the absence of features consistent…
▽ More
We analyze a $ΔV\sim-9$ magnitude flare on the newly identified M8 dwarf SDSS J022116.84+194020.4 (hereafter SDSSJ0221) detected as part of the All-Sky Automated Survey for Supernovae (ASAS-SN). Using infrared and optical spectra, we confirm that SDSSJ0221 is a relatively nearby (d$\sim$76 pc) M8 dwarf with strong quiescent H$α$ emission. Based on kinematics and the absence of features consistent with low-gravity (young) ultracool dwarfs, we place a lower limit of 200 Myr on the age of SDSSJ0221. When modeled with a simple, classical flare light-curve, this flare is consistent with a total $U$-band flare energy $E_U\sim$ 10$^{34}$ erg, confirming that the most dramatic flares are not limited to warmer, more massive stars. Scaled to include a rough estimate of the emission line contribution to the $V$ band, we estimate a blackbody filling factor of $\sim$$10-30\%$ during the flare peak and $\sim$$0.5-1.6\%$ during the flare decay phase. These filling factors correspond to flare areas that are an order of magnitude larger than those measured for most mid-M dwarf flares.
△ Less
Submitted 21 November, 2013; v1 submitted 16 October, 2013;
originally announced October 2013.
-
The Man Behind the Curtain: X-rays Drive the UV through NIR Variability in the 2013 AGN Outburst in NGC 2617
Authors:
B. J. Shappee,
J. L. Prieto,
D. Grupe,
C. S. Kochanek,
K. Z. Stanek,
G. De Rosa,
S. Mathur,
Y. Zu,
B. M. Peterson,
R. W. Pogge,
S. Komossa,
M. Im,
J. Jencson,
T. W-S. Holoien,
U. Basu,
J. F. Beacom,
D. M. Szczygiel,
J. Brimacombe,
S. Adams,
A. Campillay,
C. Choi,
C. Contreras,
M. Dietrich,
M. Dubberley,
M. Elphick
, et al. (22 additional authors not shown)
Abstract:
After the All-Sky Automated Survey for SuperNovae (ASAS-SN) discovered a significant brightening of the inner region of NGC 2617, we began a ~70 day photometric and spectroscopic monitoring campaign from the X-ray through near-infrared (NIR) wavelengths. We report that NGC 2617 went through a dramatic outburst, during which its X-ray flux increased by over an order of magnitude followed by an incr…
▽ More
After the All-Sky Automated Survey for SuperNovae (ASAS-SN) discovered a significant brightening of the inner region of NGC 2617, we began a ~70 day photometric and spectroscopic monitoring campaign from the X-ray through near-infrared (NIR) wavelengths. We report that NGC 2617 went through a dramatic outburst, during which its X-ray flux increased by over an order of magnitude followed by an increase of its optical/ultraviolet (UV) continuum flux by almost an order of magnitude. NGC 2617, classified as a Seyfert 1.8 galaxy in 2003, is now a Seyfert 1 due to the appearance of broad optical emission lines and a continuum blue bump. Such "changing look Active Galactic Nuclei (AGN)" are rare and provide us with important insights about AGN physics. Based on the Hbeta line width and the radius-luminosity relation, we estimate the mass of central black hole to be (4 +/- 1) x 10^7 M_sun. When we cross-correlate the light curves, we find that the disk emission lags the X-rays, with the lag becoming longer as we move from the UV (2-3 days) to the NIR (6-9 days). Also, the NIR is more heavily temporally smoothed than the UV. This can largely be explained by a simple model of a thermally emitting thin disk around a black hole of the estimated mass that is illuminated by the observed, variable X-ray fluxes.
△ Less
Submitted 26 June, 2014; v1 submitted 8 October, 2013;
originally announced October 2013.
-
Las Cumbres Observatory Global Telescope Network
Authors:
T. M. Brown,
N. Baliber,
F. B. Bianco,
M. Bowman,
B. Burleson,
P. Conway,
M. Crellin,
É. Depagne,
J. De Vera,
B. Dilday,
D. Dragomir,
M. Dubberley,
J. D. Eastman,
M. Elphick,
M. Falarski,
S. Foale,
M. Ford,
B. J. Fulton,
J. Garza,
E. L. Gomez,
M. Graham,
R. Greene,
B. Haldeman,
E. Hawkins,
B. Haworth
, et al. (30 additional authors not shown)
Abstract:
Las Cumbres Observatory Global Telescope (LCOGT) is a young organization dedicated to time-domain observations at optical and (potentially) near-IR wavelengths. To this end, LCOGT is constructing a world-wide network of telescopes, including the two 2m Faulkes telescopes, as many as 17 x 1m telescopes, and as many as 23 x 40cm telescopes. These telescopes initially will be outfitted for imaging an…
▽ More
Las Cumbres Observatory Global Telescope (LCOGT) is a young organization dedicated to time-domain observations at optical and (potentially) near-IR wavelengths. To this end, LCOGT is constructing a world-wide network of telescopes, including the two 2m Faulkes telescopes, as many as 17 x 1m telescopes, and as many as 23 x 40cm telescopes. These telescopes initially will be outfitted for imaging and (excepting the 40cm telescopes) spectroscopy at wavelengths between the atmospheric UV cutoff and the roughly 1-micron limit of silicon detectors. Since the first of LCOGT's 1m telescopes are now being deployed, we lay out here LCOGT's scientific goals and the requirements that these goals place on network architecture and performance, we summarize the network's present and projected level of development, and we describe our expected schedule for completing it. In the bulk of the paper, we describe in detail the technical approaches that we have adopted to attain the desired performance. In particular, we discuss our choices for the number and location of network sites, for the number and sizes of telescopes, for the specifications of the first generation of instruments, for the software that will schedule and control the network's telescopes and reduce and archive its data, and for the structure of the scientific and educational programs for which the network will provide observations.
△ Less
Submitted 29 July, 2013; v1 submitted 10 May, 2013;
originally announced May 2013.
-
Quaternionic Formulation of the Dirac Equation
Authors:
Don Colladay,
Patrick McDonald,
David Mullins
Abstract:
The Dirac equation with Lorentz violation involves additional coefficients and yields a fourth-order polynomial that must be solved to yield the dispersion relation. The conventional method of taking the determinant of $4\times 4$ matrices of complex numbers often yields unwieldy dispersion relations. By using quaternions, the Dirac equation may be reduced to $2 \times 2$ form in which the structu…
▽ More
The Dirac equation with Lorentz violation involves additional coefficients and yields a fourth-order polynomial that must be solved to yield the dispersion relation. The conventional method of taking the determinant of $4\times 4$ matrices of complex numbers often yields unwieldy dispersion relations. By using quaternions, the Dirac equation may be reduced to $2 \times 2$ form in which the structure of the dispersion relations become more transparent. In particular, it is found that there are two subsets of Lorentz-violating parameter sets for which the dispersion relation is easily solvable. Each subset contains half of the parameter space so that all parameters are included.
△ Less
Submitted 6 August, 2010;
originally announced August 2010.
-
Factoring the Dispersion Relation in the Presence of Lorentz Violation
Authors:
Don Colladay,
Patrick McDonald,
David Mullins
Abstract:
We produce an explicit formula for the dispersion relation for the Dirac Equation in the Standard Model Extension (SME) in the presence of Lorentz violation. Our expression is obtained using a novel techniques which exploit the algebra of quaternions. The dispersion relation is found to conveniently factor in two special cases that each involve a mutually exclusive set of non-vanishing Lorentz-v…
▽ More
We produce an explicit formula for the dispersion relation for the Dirac Equation in the Standard Model Extension (SME) in the presence of Lorentz violation. Our expression is obtained using a novel techniques which exploit the algebra of quaternions. The dispersion relation is found to conveniently factor in two special cases that each involve a mutually exclusive set of non-vanishing Lorentz-violating parameters. This suggests that a useful approach to studies of Lorentz-violating models is to split the parameter space into two separate pieces, each of which yields a simple, tractable dispersion relation that can be used for analysis.
△ Less
Submitted 21 January, 2010;
originally announced January 2010.