Search | arXiv e-print repository

Energy Considerations of Large Language Model Inference and Efficiency Optimizations

Authors: Jared Fernandez, Clara Na, Vashisth Tiwari, Yonatan Bisk, Sasha Luccioni, Emma Strubell

Abstract: As large language models (LLMs) scale in size and adoption, their computational and environmental costs continue to rise. Prior benchmarking efforts have primarily focused on latency reduction in idealized settings, often overlooking the diverse real-world inference workloads that shape energy use. In this work, we systematically analyze the energy implications of common inference efficiency optim… ▽ More As large language models (LLMs) scale in size and adoption, their computational and environmental costs continue to rise. Prior benchmarking efforts have primarily focused on latency reduction in idealized settings, often overlooking the diverse real-world inference workloads that shape energy use. In this work, we systematically analyze the energy implications of common inference efficiency optimizations across diverse Natural Language Processing (NLP) and generative Artificial Intelligence (AI) workloads, including conversational AI and code generation. We introduce a modeling approach that approximates real-world LLM workflows through a binning strategy for input-output token distributions and batch size variations. Our empirical analysis spans software frameworks, decoding strategies, GPU architectures, online and offline serving settings, and model parallelism configurations. We show that the effectiveness of inference optimizations is highly sensitive to workload geometry, software stack, and hardware accelerators, demonstrating that naive energy estimates based on FLOPs or theoretical GPU utilization significantly underestimate real-world energy consumption. Our findings reveal that the proper application of relevant inference efficiency optimizations can reduce total energy use by up to 73% from unoptimized baselines. These insights provide a foundation for sustainable LLM deployment and inform energy-efficient design strategies for future AI infrastructure. △ Less

Submitted 24 April, 2025; originally announced April 2025.

Comments: 16 pages

arXiv:2504.13551 [pdf, other]

Q-FAKER: Query-free Hard Black-box Attack via Controlled Generation

Authors: CheolWon Na, YunSeok Choi, Jee-Hyong Lee

Abstract: Many adversarial attack approaches are proposed to verify the vulnerability of language models. However, they require numerous queries and the information on the target model. Even black-box attack methods also require the target model's output information. They are not applicable in real-world scenarios, as in hard black-box settings where the target model is closed and inaccessible. Even the rec… ▽ More Many adversarial attack approaches are proposed to verify the vulnerability of language models. However, they require numerous queries and the information on the target model. Even black-box attack methods also require the target model's output information. They are not applicable in real-world scenarios, as in hard black-box settings where the target model is closed and inaccessible. Even the recently proposed hard black-box attacks still require many queries and demand extremely high costs for training adversarial generators. To address these challenges, we propose Q-faker (Query-free Hard Black-box Attacker), a novel and efficient method that generates adversarial examples without accessing the target model. To avoid accessing the target model, we use a surrogate model instead. The surrogate model generates adversarial sentences for a target-agnostic attack. During this process, we leverage controlled generation techniques. We evaluate our proposed method on eight datasets. Experimental results demonstrate our method's effectiveness including high transferability and the high quality of the generated adversarial examples, and prove its practical in hard black-box settings. △ Less

Submitted 18 April, 2025; originally announced April 2025.

Comments: NAACL 2025 Findings

arXiv:2503.05804 [pdf, other]

Holistically Evaluating the Environmental Impact of Creating Language Models

Authors: Jacob Morrison, Clara Na, Jared Fernandez, Tim Dettmers, Emma Strubell, Jesse Dodge

Abstract: As the performance of artificial intelligence systems has dramatically increased, so too has the environmental impact of creating these systems. While many model developers release estimates of the power consumption and carbon emissions from the final training runs for their latest models, there is comparatively little transparency into the impact of model development, hardware manufacturing, and… ▽ More As the performance of artificial intelligence systems has dramatically increased, so too has the environmental impact of creating these systems. While many model developers release estimates of the power consumption and carbon emissions from the final training runs for their latest models, there is comparatively little transparency into the impact of model development, hardware manufacturing, and total water usage throughout. In this work, we estimate the real-world environmental impact of developing a series of language models, ranging from 20 million to 13 billion active parameters, trained on up to 5.6 trillion tokens each. When accounting for hardware manufacturing, model development, and our final training runs, we find that our series of models released 493 metric tons of carbon emissions, equivalent to powering about 98 homes in the United States for one year, and consumed 2.769 million liters of water, equivalent to about 24.5 years of water usage by a person in the United States, even though our data center is extremely water-efficient. We measure and report the environmental impact of our model development; to the best of our knowledge we are the first to do so for LLMs, and we find that model development, the impact of which is generally not disclosed by most model developers, amounted to ~50% of that of training. By looking at detailed time series data for power consumption, we also find that power usage throughout training is not consistent, fluctuating between ~15% and ~85% of our hardware's maximum power draw, with negative implications for grid-scale planning as demand continues to grow. We close with a discussion on the continued difficulty of estimating the environmental impact of AI systems, and key takeaways for model developers and the public at large. △ Less

Submitted 3 March, 2025; originally announced March 2025.

Comments: ICLR 2025 (spotlight)

arXiv:2410.15661 [pdf, other]

doi 10.18653/v1/2024.emnlp-main.1176

Scalable Data Ablation Approximations for Language Models through Modular Training and Merging

Authors: Clara Na, Ian Magnusson, Ananya Harsh Jha, Tom Sherborne, Emma Strubell, Jesse Dodge, Pradeep Dasigi

Abstract: Training data compositions for Large Language Models (LLMs) can significantly affect their downstream performance. However, a thorough data ablation study exploring large sets of candidate data mixtures is typically prohibitively expensive since the full effect is seen only after training the models; this can lead practitioners to settle for sub-optimal data mixtures. We propose an efficient metho… ▽ More Training data compositions for Large Language Models (LLMs) can significantly affect their downstream performance. However, a thorough data ablation study exploring large sets of candidate data mixtures is typically prohibitively expensive since the full effect is seen only after training the models; this can lead practitioners to settle for sub-optimal data mixtures. We propose an efficient method for approximating data ablations which trains individual models on subsets of a training corpus and reuses them across evaluations of combinations of subsets. In continued pre-training experiments, we find that, given an arbitrary evaluation set, the perplexity score of a single model trained on a candidate set of data is strongly correlated with perplexity scores of parameter averages of models trained on distinct partitions of that data. From this finding, we posit that researchers and practitioners can conduct inexpensive simulations of data ablations by maintaining a pool of models that were each trained on partitions of a large training corpus, and assessing candidate data mixtures by evaluating parameter averages of combinations of these models. This approach allows for substantial improvements in amortized training efficiency -- scaling only linearly with respect to new data -- by enabling reuse of previous training computation, opening new avenues for improving model performance through rigorous, incremental data assessment and mixing. △ Less

Submitted 21 October, 2024; originally announced October 2024.

Comments: EMNLP 2024. 17 pages

arXiv:2311.10267 [pdf, other]

doi 10.18653/v1/2023.findings-emnlp.607

Energy and Carbon Considerations of Fine-Tuning BERT

Authors: Xiaorong Wang, Clara Na, Emma Strubell, Sorelle Friedler, Sasha Luccioni

Abstract: Despite the popularity of the `pre-train then fine-tune' paradigm in the NLP community, existing work quantifying energy costs and associated carbon emissions has largely focused on language model pre-training. Although a single pre-training run draws substantially more energy than fine-tuning, fine-tuning is performed more frequently by many more individual actors, and thus must be accounted for… ▽ More Despite the popularity of the `pre-train then fine-tune' paradigm in the NLP community, existing work quantifying energy costs and associated carbon emissions has largely focused on language model pre-training. Although a single pre-training run draws substantially more energy than fine-tuning, fine-tuning is performed more frequently by many more individual actors, and thus must be accounted for when considering the energy and carbon footprint of NLP. In order to better characterize the role of fine-tuning in the landscape of energy and carbon emissions in NLP, we perform a careful empirical study of the computational costs of fine-tuning across tasks, datasets, hardware infrastructure and measurement modalities. Our experimental results allow us to place fine-tuning energy and carbon costs into perspective with respect to pre-training and inference, and outline recommendations to NLP researchers and practitioners who wish to improve their fine-tuning energy efficiency. △ Less

Submitted 16 October, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

Comments: EMNLP 2023 Findings; First two authors contributed equally; 12 pages

arXiv:2310.07715 [pdf, other]

doi 10.18653/v1/2023.emnlp-main.822

To Build Our Future, We Must Know Our Past: Contextualizing Paradigm Shifts in Natural Language Processing

Authors: Sireesh Gururaja, Amanda Bertsch, Clara Na, David Gray Widder, Emma Strubell

Abstract: NLP is in a period of disruptive change that is impacting our methodologies, funding sources, and public perception. In this work, we seek to understand how to shape our future by better understanding our past. We study factors that shape NLP as a field, including culture, incentives, and infrastructure by conducting long-form interviews with 26 NLP researchers of varying seniority, research area,… ▽ More NLP is in a period of disruptive change that is impacting our methodologies, funding sources, and public perception. In this work, we seek to understand how to shape our future by better understanding our past. We study factors that shape NLP as a field, including culture, incentives, and infrastructure by conducting long-form interviews with 26 NLP researchers of varying seniority, research area, institution, and social identity. Our interviewees identify cyclical patterns in the field, as well as new shifts without historical parallel, including changes in benchmark culture and software infrastructure. We complement this discussion with quantitative analysis of citation, authorship, and language use in the ACL Anthology over time. We conclude by discussing shared visions, concerns, and hopes for the future of NLP. We hope that this study of our field's past and present can prompt informed discussion of our community's implicit norms and more deliberate action to consciously shape the future. △ Less

Submitted 11 October, 2023; originally announced October 2023.

Comments: Accepted to EMNLP 2023

arXiv:2305.11474 [pdf, other]

Reciprocal Attention Mixing Transformer for Lightweight Image Restoration

Authors: Haram Choi, Cheolwoong Na, Jihyeon Oh, Seungjae Lee, Jinseop Kim, Subeen Choe, Jeongmin Lee, Taehoon Kim, Jihoon Yang

Abstract: Although many recent works have made advancements in the image restoration (IR) field, they often suffer from an excessive number of parameters. Another issue is that most Transformer-based IR methods focus only on either local or global features, leading to limited receptive fields or deficient parameter issues. To address these problems, we propose a lightweight IR network, Reciprocal Attention… ▽ More Although many recent works have made advancements in the image restoration (IR) field, they often suffer from an excessive number of parameters. Another issue is that most Transformer-based IR methods focus only on either local or global features, leading to limited receptive fields or deficient parameter issues. To address these problems, we propose a lightweight IR network, Reciprocal Attention Mixing Transformer (RAMiT). It employs our proposed dimensional reciprocal attention mixing Transformer (D-RAMiT) blocks, which compute bi-dimensional (spatial and channel) self-attentions in parallel with different numbers of multi-heads. The bi-dimensional attentions help each other to complement their counterpart's drawbacks and are then mixed. Additionally, we introduce a hierarchical reciprocal attention mixing (H-RAMi) layer that compensates for pixel-level information losses and utilizes semantic information while maintaining an efficient hierarchical structure. Furthermore, we revisit and modify MobileNet V1 and V2 to attach efficient convolutions to our proposed components. The experimental results demonstrate that RAMiT achieves state-of-the-art performance on multiple lightweight IR tasks, including super-resolution, color denoising, grayscale denoising, low-light enhancement, and deraining. Codes are available at https://github.com/rami0205/RAMiT. △ Less

Submitted 18 April, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

Comments: CVPR 2024 Workshop - NTIRE. Codes are available at https://github.com/rami0205/RAMiT

arXiv:2304.01805 [pdf, other]

Exploration of Lightweight Single Image Denoising with Transformers and Truly Fair Training

Authors: Haram Choi, Cheolwoong Na, Jinseop Kim, Jihoon Yang

Abstract: As multimedia content often contains noise from intrinsic defects of digital devices, image denoising is an important step for high-level vision recognition tasks. Although several studies have developed the denoising field employing advanced Transformers, these networks are too momory-intensive for real-world applications. Additionally, there is a lack of research on lightweight denosing (LWDN) w… ▽ More As multimedia content often contains noise from intrinsic defects of digital devices, image denoising is an important step for high-level vision recognition tasks. Although several studies have developed the denoising field employing advanced Transformers, these networks are too momory-intensive for real-world applications. Additionally, there is a lack of research on lightweight denosing (LWDN) with Transformers. To handle this, this work provides seven comparative baseline Transformers for LWDN, serving as a foundation for future research. We also demonstrate the parts of randomly cropped patches significantly affect the denoising performances during training. While previous studies have overlooked this aspect, we aim to train our baseline Transformers in a truly fair manner. Furthermore, we conduct empirical analyses of various components to determine the key considerations for constructing LWDN Transformers. Codes are available at https://github.com/rami0205/LWDN. △ Less

Submitted 4 April, 2023; originally announced April 2023.

Comments: Technical report. Will be further revised. Codes are available at https://github.com/rami0205/LWDN

arXiv:2302.06117 [pdf, other]

The Framework Tax: Disparities Between Inference Efficiency in NLP Research and Deployment

Authors: Jared Fernandez, Jacob Kahn, Clara Na, Yonatan Bisk, Emma Strubell

Abstract: Increased focus on the computational efficiency of NLP systems has motivated the design of efficient model architectures and improvements to underlying hardware accelerators. However, the resulting increases in computational throughput and reductions in floating point operations have not directly translated to improvements in wall-clock inference latency. We demonstrate that these discrepancies ca… ▽ More Increased focus on the computational efficiency of NLP systems has motivated the design of efficient model architectures and improvements to underlying hardware accelerators. However, the resulting increases in computational throughput and reductions in floating point operations have not directly translated to improvements in wall-clock inference latency. We demonstrate that these discrepancies can be largely attributed to bottlenecks introduced by deep learning frameworks. We denote this phenomenon as the \textit{framework tax}, and observe that the disparity is growing as hardware speed increases over time. In this work, we examine this phenomenon through a series of case studies analyzing the effects of model design decisions, framework paradigms, and hardware platforms on total model latency. Code is available at https://github.com/JaredFern/Framework-Tax. △ Less

Submitted 22 December, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

Comments: EMNLP 2023

arXiv:2211.06315 [pdf, other]

Fraudulent User Detection Via Behavior Information Aggregation Network (BIAN) On Large-Scale Financial Social Network

Authors: Hanyi Hu, Long Zhang, Shuan Li, Zhi Liu, Yao Yang, Chongning Na

Abstract: Financial frauds cause billions of losses annually and yet it lacks efficient approaches in detecting frauds considering user profile and their behaviors simultaneously in social network . A social network forms a graph structure whilst Graph neural networks (GNN), a promising research domain in Deep Learning, can seamlessly process non-Euclidean graph data . In financial fraud detection, the modu… ▽ More Financial frauds cause billions of losses annually and yet it lacks efficient approaches in detecting frauds considering user profile and their behaviors simultaneously in social network . A social network forms a graph structure whilst Graph neural networks (GNN), a promising research domain in Deep Learning, can seamlessly process non-Euclidean graph data . In financial fraud detection, the modus operandi of criminals can be identified by analyzing user profile and their behaviors such as transaction, loaning etc. as well as their social connectivity. Currently, most GNNs are incapable of selecting important neighbors since the neighbors' edge attributes (i.e., behaviors) are ignored. In this paper, we propose a novel behavior information aggregation network (BIAN) to combine the user behaviors with other user features. Different from its close "relatives" such as Graph Attention Networks (GAT) and Graph Transformer Networks (GTN), it aggregates neighbors based on neighboring edge attribute distribution, namely, user behaviors in financial social network. The experimental results on a real-world large-scale financial social network dataset, DGraph, show that BIAN obtains the 10.2% gain in AUROC comparing with the State-Of-The-Art models. △ Less

Submitted 26 March, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

Comments: 6 pages, 1 figure

arXiv:2205.12694 [pdf, other]

doi 10.18653/v1/2022.findings-emnlp.361

Train Flat, Then Compress: Sharpness-Aware Minimization Learns More Compressible Models

Authors: Clara Na, Sanket Vaibhav Mehta, Emma Strubell

Abstract: Model compression by way of parameter pruning, quantization, or distillation has recently gained popularity as an approach for reducing the computational requirements of modern deep neural network models for NLP. Inspired by prior works suggesting a connection between simpler, more generalizable models and those that lie within wider loss basins, we hypothesize that optimizing for flat minima shou… ▽ More Model compression by way of parameter pruning, quantization, or distillation has recently gained popularity as an approach for reducing the computational requirements of modern deep neural network models for NLP. Inspired by prior works suggesting a connection between simpler, more generalizable models and those that lie within wider loss basins, we hypothesize that optimizing for flat minima should lead to simpler parameterizations and thus more compressible models. We propose to combine sharpness-aware minimization (SAM) with various task-specific model compression methods, including iterative magnitude pruning (IMP), structured pruning with a distillation objective, and post-training dynamic quantization. Empirically, we show that optimizing for flatter minima consistently leads to greater compressibility of parameters compared to vanilla Adam when fine-tuning BERT models, with little to no loss in accuracy on the GLUE text classification and SQuAD question answering benchmarks. Moreover, SAM finds superior winning tickets during IMP that 1) are amenable to vanilla Adam optimization, and 2) transfer more effectively across tasks. △ Less

Submitted 24 October, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

Comments: EMNLP 2022 Findings, 28 pages

arXiv:2205.11322 [pdf, other]

doi 10.1145/3603378

Revisiting the role of heterophily in graph representation learning: An edge classification perspective

Authors: Jincheng Huang, Ping Li, Rui Huang, Chen Na, Acong Zhang

Abstract: Graph representation learning aim at integrating node contents with graph structure to learn nodes/graph representations. Nevertheless, it is found that many existing graph learning methods do not work well on data with high heterophily level that accounts for a large proportion of edges between different class labels. Recent efforts to this problem focus on improving the message passing mechanism… ▽ More Graph representation learning aim at integrating node contents with graph structure to learn nodes/graph representations. Nevertheless, it is found that many existing graph learning methods do not work well on data with high heterophily level that accounts for a large proportion of edges between different class labels. Recent efforts to this problem focus on improving the message passing mechanism. However, it remains unclear whether heterophily truly does harm to the performance of graph neural networks (GNNs). The key is to unfold the relationship between a node and its immediate neighbors, e.g., are they heterophilous or homophilious? From this perspective, here we study the role of heterophily in graph representation learning before/after the relationships between connected nodes are disclosed. In particular, we propose an end-to-end framework that both learns the type of edges (i.e., heterophilous/homophilious) and leverage edge type information to improve the expressiveness of graph neural networks. We implement this framework in two different ways. Specifically, to avoid messages passing through heterophilous edges, we can optimize the graph structure to be homophilious by dropping heterophilous edges identified by an edge classifier. Alternatively, it is possible to exploit the information about the presence of heterophilous neighbors for feature learning, so a hybrid message passing approach is devised to aggregate homophilious neighbors and diversify heterophilous neighbors based on edge classification. Extensive experiments demonstrate the remarkable performance improvement of GNNs with the proposed framework on multiple datasets across the full spectrum of homophily level. △ Less

Submitted 5 August, 2022; v1 submitted 23 May, 2022; originally announced May 2022.

arXiv:2010.09648 [pdf]

Agent-based Simulation Model and Deep Learning Techniques to Evaluate and Predict Transportation Trends around COVID-19

Authors: Ding Wang, Fan Zuo, Jingqin Gao, Yueshuai He, Zilin Bian, Suzana Duran Bernardes, Chaekuk Na, Jingxing Wang, John Petinos, Kaan Ozbay, Joseph Y. J. Chow, Shri Iyer, Hani Nassif, Xuegang Jeff Ban

Abstract: The COVID-19 pandemic has affected travel behaviors and transportation system operations, and cities are grappling with what policies can be effective for a phased reopening shaped by social distancing. This edition of the white paper updates travel trends and highlights an agent-based simulation model's results to predict the impact of proposed phased reopening strategies. It also introduces a re… ▽ More The COVID-19 pandemic has affected travel behaviors and transportation system operations, and cities are grappling with what policies can be effective for a phased reopening shaped by social distancing. This edition of the white paper updates travel trends and highlights an agent-based simulation model's results to predict the impact of proposed phased reopening strategies. It also introduces a real-time video processing method to measure social distancing through cameras on city streets. △ Less

Submitted 23 September, 2020; originally announced October 2020.

arXiv:2009.14019 [pdf]

NYC Recovery at a Glance: The Rise of Buses and Micromobility

Authors: Suzana Duran Bernardes, Zilin Bian, Siva Sooryaa Muruga Thambiran, Jingqin Gao, Chaekuk Na, Fan Zuo, Nick Hudanich, Abhinav Bhattacharyya, Kaan Ozbay, Shri Iyer, Joseph Y. J. Chow, Hani Nassif

Abstract: New York City (NYC) is entering Phase 4 of the state's reopening plan, starting July 20, 2020. This white paper updates travel trends observed during the first three reopening phases and highlights the spatial distributions in terms of bus speeds and Citi Bike trips, and further investigates the role of micro-mobility in the pandemic response. New York City (NYC) is entering Phase 4 of the state's reopening plan, starting July 20, 2020. This white paper updates travel trends observed during the first three reopening phases and highlights the spatial distributions in terms of bus speeds and Citi Bike trips, and further investigates the role of micro-mobility in the pandemic response. △ Less

Submitted 23 September, 2020; originally announced September 2020.

arXiv:2009.14018 [pdf]

Toward the "New Normal": A Surge in Speeding, New Volume Patterns, and Recent Trends in Taxis/For-Hire Vehicles

Authors: Jingqin Gao, Abhinav Bhattacharyya, Ding Wang, Nick Hudanich, Siva Sooryaa, Muruga Thambiran, Suzana Duran Bernardes, Chaekuk Na, Fan Zuo, Zilin Bian, Kaan Ozbay, Shri Iyer, Hani Nassif, Joseph Y. J. Chow

Abstract: Six months into the pandemic and one month after the phase four reopening in New York City (NYC), restrictions are lifting, businesses and schools are reopening, but global infections are still rising. This white paper updates travel trends observed in the aftermath of the COVID-19 outbreak in NYC and highlight some findings toward the "new normal." Six months into the pandemic and one month after the phase four reopening in New York City (NYC), restrictions are lifting, businesses and schools are reopening, but global infections are still rising. This white paper updates travel trends observed in the aftermath of the COVID-19 outbreak in NYC and highlight some findings toward the "new normal." △ Less

Submitted 23 September, 2020; originally announced September 2020.

arXiv:1608.05078 [pdf, ps, other]

doi 10.3847/0004-637X/826/2/155

The effects of individual metal contents on isochrones for C, N, O, Na, Mg, Al, Si, and Fe

Authors: Minje Beom, Chongsam Na, Jason W. Ferguson, Y. -C. Kim

Abstract: The individual characteristics of C, N, O, Na, Mg, Al, Si, and Fe on isochrones have been investigated in this study. Stellar models have been constructed for various mixtures in which the content of each element is changed up to the extreme value reported in recent studies, and the changes in isochrone shape have been analyzed for the various mixtures. To express the abundance variation of differ… ▽ More The individual characteristics of C, N, O, Na, Mg, Al, Si, and Fe on isochrones have been investigated in this study. Stellar models have been constructed for various mixtures in which the content of each element is changed up to the extreme value reported in recent studies, and the changes in isochrone shape have been analyzed for the various mixtures. To express the abundance variation of different elements with a single parameter, we have focused on the relative changes in the total number of metal ions. A review of the shape changes revealed that Na, Mg, and Al work the same way in stellar models, as the well-known fact that C, N, and O have the same reactions in the stellar interior. In addition, it was found that in high-metallicity conditions the influence of Si and Fe on the red giant branch becomes smaller than that of Na, Mg, and Al closer to the tip. Furthermore, the influence of Fe on the main sequence is larger than that of Na, Mg, Al, and even Si. △ Less

Submitted 17 August, 2016; originally announced August 2016.

Comments: published in ApJ (Aug 1. 2016) / accepted for publication in ApJ (May 25. 2016) / 11 pages, 9 figures, 5 tables / ads address: http://adsabs.harvard.edu/abs/2016ApJ...826..155B

Journal ref: 2016ApJ...826..155B

arXiv:1606.03897 [pdf, ps, other]

FM-index of Alignment with Gaps

Authors: Joong Chae Na, Hyunjoon Kim, Seunghwan Min, Heejin Park, Thierry Lecroq, Martine Leonard, Laurent Mouchardd, Kunsoo Park

Abstract: Recently, a compressed index for similar strings, called the FM-index of alignment (FMA), has been proposed with the functionalities of pattern search and random access. The FMA is quite efficient in space requirement and pattern search time, but it is applicable only for an alignment of similar strings without gaps. In this paper we propose the FM-index of alignment with gaps, a realistic index f… ▽ More Recently, a compressed index for similar strings, called the FM-index of alignment (FMA), has been proposed with the functionalities of pattern search and random access. The FMA is quite efficient in space requirement and pattern search time, but it is applicable only for an alignment of similar strings without gaps. In this paper we propose the FM-index of alignment with gaps, a realistic index for similar strings, which allows gaps in their alignment. For this, we design a new version of the suffix array of alignment by using alignment transformation and a new definition of the alignment-suffix. The new suffix array of alignment enables us to support the LF-mapping and backward search, the key functionalities of the FM-index, regardless of gap existence in the alignment. We experimentally compared our index with RLCSA due to Makinen et al. on 100 genome sequences from the 1000 Genomes Project. The index size of our index is less than one third of that of RLCSA. △ Less

Submitted 13 June, 2016; originally announced June 2016.

Comments: 15pages

arXiv:1404.7508 [pdf, ps, other]

doi 10.1093/mnrasl/slu064

Multiple Populations in Globular Clusters and the Origin of the Oosterhoff Period Groups

Authors: Sohee Jang, Young-Wook Lee, Seok-Joo Joo, Chongsam Na

Abstract: The presence of multiple populations is now well-established in most globular clusters in the Milky Way. In light of this progress, here we suggest a new model explaining the origin of the Sandage period-shift and the difference in mean period of type ab RR Lyrae variables between the two Oosterhoff groups. In our models, the instability strip in the metal-poor group II clusters, such as M15, is p… ▽ More The presence of multiple populations is now well-established in most globular clusters in the Milky Way. In light of this progress, here we suggest a new model explaining the origin of the Sandage period-shift and the difference in mean period of type ab RR Lyrae variables between the two Oosterhoff groups. In our models, the instability strip in the metal-poor group II clusters, such as M15, is populated by second generation stars (G2) with enhanced helium and CNO abundances, while the RR Lyraes in the relatively metal rich group I clusters like M3 are mostly produced by first generation stars (G1) without these enhancements. This population shift within the instability strip with metallicity can create the observed period-shift between the two groups, since both helium and CNO abundances play a role in increasing the period of RR Lyrae variables. The presence of more metal-rich clusters having Oosterhoff-intermediate characteristics, such as NGC 1851, as well as of most metal-rich clusters having RR Lyraes with longest periods (group III) can also be reproduced, as more helium-rich third and later generations of stars (G3) penetrate into the instability strip with further increase in metallicity. Therefore, for the most general cases, our models predict that the RR Lyraes are produced mostly by G1, G2, and G3, respectively, for the Oosterhoff groups I, II, and III. △ Less

Submitted 29 April, 2014; originally announced April 2014.

Comments: 5 pages, 4 figures, accepted for publication in MNRAS Letters

arXiv:1310.4499 [pdf, ps, other]

doi 10.1088/2041-8205/778/1/L13

Two distinct red giant branch populations in the globular cluster NGC 2419 as tracers of a merger event in the Milky Way

Authors: Young-Wook Lee, Sang-Il Han, Seok-Joo Joo, Sohee Jang, Chongsam Na, Sakurako Okamoto, Nobuo Arimoto, Dongwook Lim, Hak-Sub Kim, Suk-Jin Yoon

Abstract: Recent spectroscopic observations of the outer halo globular cluster (GC) NGC 2419 show that it is unique among GCs, in terms of chemical abundance patterns, and some suggest that it was originated in the nucleus of a dwarf galaxy. Here we show, from the Subaru narrow-band photometry employing a calcium filter, that the red giant-branch (RGB) of this GC is split into two distinct subpopulations. C… ▽ More Recent spectroscopic observations of the outer halo globular cluster (GC) NGC 2419 show that it is unique among GCs, in terms of chemical abundance patterns, and some suggest that it was originated in the nucleus of a dwarf galaxy. Here we show, from the Subaru narrow-band photometry employing a calcium filter, that the red giant-branch (RGB) of this GC is split into two distinct subpopulations. Comparison with spectroscopy has confirmed that the redder RGB stars in the $hk$[=(Ca$-b)-(b-y)$] index are enhanced in [Ca/H] by $\sim$0.2 dex compared to the bluer RGB stars. Our population model further indicates that the calcium-rich second generation stars are also enhanced in helium abundance by a large amount ($Δ$Y = 0.19). Our photometry, together with the results for other massive GCs (e.g., $ω$ Cen, M22, and NGC 1851), suggests that the discrete distribution of RGB stars in the $hk$ index might be a universal characteristic of this growing group of peculiar GCs. The planned narrow-band calcium photometry for the Local Group dwarf galaxies would help to establish an empirical connection between these GCs and the primordial building blocks in the hierarchical merging paradigm of galaxy formation. △ Less

Submitted 16 October, 2013; originally announced October 2013.

Comments: 4 pages, 4 figures, 1 table, accepted for the publication in ApJL

arXiv:1309.4290 [pdf, other]

Simple free-surface detection in two and three-dimensional SPH solver

Authors: Agra Barecasco, Hanifa Terissa, Christian Fredy Naa

Abstract: A simple free-surface particle detection method for two and three-dimensional SPH simulation has been implemented. The method uses sphere representation for the SPH particle. The fluid domain is covered by overlapping spheres. A sphere whose surface is not fully covered considered as boundary. To test particle boundary status, we used a sum of normalized relative position vectors from neighbouring… ▽ More A simple free-surface particle detection method for two and three-dimensional SPH simulation has been implemented. The method uses sphere representation for the SPH particle. The fluid domain is covered by overlapping spheres. A sphere whose surface is not fully covered considered as boundary. To test particle boundary status, we used a sum of normalized relative position vectors from neighbouring particles to the test particle. By checking the existence of un- covered sphere surface by this vector sum, boundary status of the test particle can be determined. This boundary detection method can be easily embedded in the SPH solver algorithm. △ Less

Submitted 17 September, 2013; originally announced September 2013.

Comments: 10 pages, 11 figures, Selected Paper from the International Symposium on Computational Science 2013

arXiv:1309.4234 [pdf, other]

Three-Dimensional Smoothed Particle Hydrodynamics Simulation for Liquid Metal Solidification Process

Authors: Raden Ahnaf Faqih S, Christian Fredy Naa

Abstract: The solidification behavior of liquid metal in a container under rapid cooling process is one of the major concerns to be analyzed. In order to analyze its fundamental behavior, a three- dimensional (3D) fluid dynamics simulation was developed using a particle-based method, known as the smoothed particle hydrodynamics (SPH) method. Governing equations that determine the fluid motion and heat trans… ▽ More The solidification behavior of liquid metal in a container under rapid cooling process is one of the major concerns to be analyzed. In order to analyze its fundamental behavior, a three- dimensional (3D) fluid dynamics simulation was developed using a particle-based method, known as the smoothed particle hydrodynamics (SPH) method. Governing equations that determine the fluid motion and heat transfer involving phase change process are solved by discretizing their gradient and Laplacian term with the moving particles and calculating the interaction with its neighboring particles. The results demonstrate that the SPH mehod can successfully reproduce the behavior and defect prediction of liquid metal solidification process. △ Less

Submitted 17 September, 2013; originally announced September 2013.

Comments: 10 pages, 7 figures, Selected Paper from the International Symposium on Computational Science 2013

arXiv:1309.4074 [pdf, other]

Three-Dimensional Smoothed Particle Hydrodynamics Method for Simulating Free Surface Flows

Authors: Rizal Dwi Prayogo, Christian Fredy Naa

Abstract: In this paper, we applied an improved Smoothing Particle Hydrodynamics (SPH) method by using gradient kernel renormalization in three-dimensional cases. The purpose of gradient kernel renormalization is to improve the accuracy of numerical simulation by improving gradient kernel approximation. This method is implemented for simulating free surface flows, in particular dam break case with rigid bal… ▽ More In this paper, we applied an improved Smoothing Particle Hydrodynamics (SPH) method by using gradient kernel renormalization in three-dimensional cases. The purpose of gradient kernel renormalization is to improve the accuracy of numerical simulation by improving gradient kernel approximation. This method is implemented for simulating free surface flows, in particular dam break case with rigid ball structures and the propagation of waves towards a slope in a rectangular tank. △ Less

Submitted 16 September, 2013; originally announced September 2013.

Comments: 9 pages, 5 figures, Selected Paper from the International Symposium on Computational Science 2013

arXiv:1309.3868 [pdf]

Three-Dimensional Smoothed Particle Hydrodynamics Simulation for Liquid Droplet with Surface Tension

Authors: Hanifa Terissa, Agra Barecasco, Christian Fredy Naa

Abstract: We provide a basic method of Smoothed Particle Hydrodynamics (SPH) to simulate liquid droplet with surface tension in three dimensions. Liquid droplet is a simple case for surface tension modeling. Surface tension works only on fluid surface. In SPH method, we simply apply the surface tension on the boundary particles of liquid. The particle on the 3D boundary was detected dynamically using Free-S… ▽ More We provide a basic method of Smoothed Particle Hydrodynamics (SPH) to simulate liquid droplet with surface tension in three dimensions. Liquid droplet is a simple case for surface tension modeling. Surface tension works only on fluid surface. In SPH method, we simply apply the surface tension on the boundary particles of liquid. The particle on the 3D boundary was detected dynamically using Free-Surface Detection algorithm. The normal vector and curvature of the boundary surface were calculated simultaneously with 3D boundary surface reconstruction using Moving Least-Squares (MLS) method. Before the reconstruction, the coordinate system was transformed into a local coordinate system. Afterwards, the surface tension force which depends on curvature of the surface, was calculated and applied on the boundary particles of the droplet. We present the simulation result of droplet motion with gravity force. By using the basic method of SPH for fluid modeling, and a combination of 3D Free-Surface Detection algorithm with MLS method, we can simulate droplet phenomena with expected result. △ Less

Submitted 16 September, 2013; originally announced September 2013.

Comments: 9 pages, 4 figure, Selected Paper from the International Symposium on Computational Science 2013

arXiv:1309.3632 [pdf, other]

3D Simulation of Dam-break effect on a Solid Wall using Smoothed Particle Hydrodynamics

Authors: Suprijadi, Ferry Faizal, Christian Naa, Anggy Trisnawan Putra

Abstract: Dam is built for water supply, water flow or flooding control and electricity energy storage, but in other hand, dam is one of the most dangerous natural disaster in many countries including in Indonesia. The impact of dam break in neighbour area and is huge and many flooding in remote area, as happen in Dam Situ Gintung in Tangerang (close to Jakarta) in 2009. Smoothed Particle Hydrodynamics (SPH… ▽ More Dam is built for water supply, water flow or flooding control and electricity energy storage, but in other hand, dam is one of the most dangerous natural disaster in many countries including in Indonesia. The impact of dam break in neighbour area and is huge and many flooding in remote area, as happen in Dam Situ Gintung in Tangerang (close to Jakarta) in 2009. Smoothed Particle Hydrodynamics (SPH), is one of numerical method based on Lagrangian grid which is ap- plied in astrophysical simulation may be used to solve the simulation on dam break effect. The development of SPH methods become alternative methods to solving Navier Stokes equation, which is main key in fluid dynamic simulation. In this paper, SPH is developed for supporting solid par- ticles in use for 3D dam break effect (3D-DBE) simulation. Solid particle have been treated same as fluid particles with additional calculation for converting gained position became translation and rotation of solid object in a whole body. With this capability, the result of 3D-DBE simulation has been varies and interesting. The goals of this simulation is for analyse fluid and solid particle interaction by using two different scenario. The first scenario relation between height of fluid to a solid wall barrier and second scenario, is to study relation between solid wall and its collapse time by dam break. The results show sliding distance of a solid wall is depend on a fluid height, and relation between fluid heght and wall dimension will be discussed too in this paper. △ Less

Submitted 14 September, 2013; originally announced September 2013.

Comments: 7 pages, 7 figures, Selected Paper from the International Symposium on Computational Science 2013

arXiv:1309.3524 [pdf, ps, other]

Stochastic moving particle semi-implicit for inviscid fluid wave simulation

Authors: Christian Fredy Naa, Seiro Omata, Masaki Kazama

Abstract: The present paper introduces stochastic velocity as improvement for moving particle semi-implicit (MPS) method. This improvement is to overcome energy loss caused by numerical dissipation in the basic MPS that brings about rapid decay of waves. Stochastic velocity is added in the explicit step of the basic MPS method. MPS with stochastic improvement is compared with the basic method in the case of… ▽ More The present paper introduces stochastic velocity as improvement for moving particle semi-implicit (MPS) method. This improvement is to overcome energy loss caused by numerical dissipation in the basic MPS that brings about rapid decay of waves. Stochastic velocity is added in the explicit step of the basic MPS method. MPS with stochastic improvement is compared with the basic method in the case of linear water waves, in particular dam break problem and standing wave in a rectangular tank. Surface detection and curve fitting are used to analyze the parameters of wave on the standing wave case. The surface detection and curved fitting was efficient to determine parameters of the wave and it was found that the stochastic improvement made the waves survived longer than in the basis method. △ Less

Submitted 13 September, 2013; originally announced September 2013.

Comments: 19 pages, 11 figures, International Symposium on Computational Science 2011

Journal ref: Selected Paper for Gakuto International Series Mathematical Science and Application Volume 34, 2011

arXiv:1309.3504 [pdf]

Simulation of Temperature Distribution In a Rectangular Cavity using Finite Element Method

Authors: Christian Naa, Suprijadi

Abstract: This paper presents the study and implementation of finite element method to find the temperature distribution in a rectangular cavity which contains a fluid substance. The fluid motion is driven by a sudden temperature difference applied to two opposite side walls of the cavity. The remaining walls were considered adiabatic. Fluid properties were assumed incompressible. The problem has been appro… ▽ More This paper presents the study and implementation of finite element method to find the temperature distribution in a rectangular cavity which contains a fluid substance. The fluid motion is driven by a sudden temperature difference applied to two opposite side walls of the cavity. The remaining walls were considered adiabatic. Fluid properties were assumed incompressible. The problem has been approached by two-dimensional transient conduction which applied on the heated sidewall and one-dimensional steady state convection-diffusion equation which applied inside the cavity. The parameters which investigated are time and velocity. These parameters were computed together with boundary conditions which result in temperature distribution in the cavity. The implementation of finite element method was resulted in algebraic equation which is in vector and matrix form. Therefore, MATLAB programs used to solve this algebraic equation. The final temperature distribution results were presented in contour map within the region. △ Less

Submitted 13 September, 2013; originally announced September 2013.

Comments: 7 pages, 9 figures, Proceedings of the 3rd Asian Physics Symposium, July 22-23, 2009, Bandung, Indonesia

arXiv:1309.1006 [pdf]

Novel Giant Magnetoresistance Model using Multiple Barrier Potential

Authors: Christian Naa, Suprijadi, Sparisoma Viridi, Mitra Djamal

Abstract: This paper proposes a new model for Giant Magnetoresistance (GMR) and calculates its typical graph qualitatively. The model foundation is the microscopic mechanism in GMR, where the GMR effect can be explained by intrinsic and extrinsic potential. The potentials are spin-dependent. The potentials determine the transmission probability then it will give conductance value. Here, the multiple barrier… ▽ More This paper proposes a new model for Giant Magnetoresistance (GMR) and calculates its typical graph qualitatively. The model foundation is the microscopic mechanism in GMR, where the GMR effect can be explained by intrinsic and extrinsic potential. The potentials are spin-dependent. The potentials determine the transmission probability then it will give conductance value. Here, the multiple barrier potential model is proposed as the new GMR model. The transmission probability is determined using transfer matrix method. It was found that this model is fit qualitatively with the typical GMR graph. △ Less

Submitted 6 September, 2013; v1 submitted 4 September, 2013; originally announced September 2013.

Comments: 4 pages, 9 figures, Submitted to ARPN Journal of Engineering and Applied Sciences 2013

arXiv:1305.1744 [pdf, ps, other]

Suffix Tree of Alignment: An Efficient Index for Similar Data

Authors: Joong Chae Na, Heejin Park, Maxime Crochemore, Jan Holub, Costas S. Iliopoulos, Laurent Mouchard, Kunsoo Park

Abstract: We consider an index data structure for similar strings. The generalized suffix tree can be a solution for this. The generalized suffix tree of two strings $A$ and $B$ is a compacted trie representing all suffixes in $A$ and $B$. It has $|A|+|B|$ leaves and can be constructed in $O(|A|+|B|)$ time. However, if the two strings are similar, the generalized suffix tree is not efficient because it does… ▽ More We consider an index data structure for similar strings. The generalized suffix tree can be a solution for this. The generalized suffix tree of two strings $A$ and $B$ is a compacted trie representing all suffixes in $A$ and $B$. It has $|A|+|B|$ leaves and can be constructed in $O(|A|+|B|)$ time. However, if the two strings are similar, the generalized suffix tree is not efficient because it does not exploit the similarity which is usually represented as an alignment of $A$ and $B$. In this paper we propose a space/time-efficient suffix tree of alignment which wisely exploits the similarity in an alignment. Our suffix tree for an alignment of $A$ and $B$ has $|A| + l_d + l_1$ leaves where $l_d$ is the sum of the lengths of all parts of $B$ different from $A$ and $l_1$ is the sum of the lengths of some common parts of $A$ and $B$. We did not compromise the pattern search to reduce the space. Our suffix tree can be searched for a pattern $P$ in $O(|P|+occ)$ time where $occ$ is the number of occurrences of $P$ in $A$ and $B$. We also present an efficient algorithm to construct the suffix tree of alignment. When the suffix tree is constructed from scratch, the algorithm requires $O(|A| + l_d + l_1 + l_2)$ time where $l_2$ is the sum of the lengths of other common substrings of $A$ and $B$. When the suffix tree of $A$ is already given, it requires $O(l_d + l_1 + l_2)$ time. △ Less

Submitted 8 May, 2013; originally announced May 2013.

Comments: 12 pages

arXiv:quant-ph/0703219 [pdf]

doi 10.1103/PhysRevA.77.031803

Strongly Correlated Polaritons in a Two-Dimensional Array of Photonic Crystal Microcavities

Authors: Y. C. Neil Na, Shoko Utsunomiya, Lin Tian, Yoshihisa Yamamoto

Abstract: We propose a practical scheme to observe the polaritonic quantum phase transition (QPT) from the superfluid (SF) to Bose-glass (BG) to Mott-insulator (MI) states. The system consists of a two-dimensional array of photonic crystal microcavities doped with substitutional donor/acceptor impurities. Using realistic parameters, we show that such strongly correlated polaritonic systems can be construc… ▽ More We propose a practical scheme to observe the polaritonic quantum phase transition (QPT) from the superfluid (SF) to Bose-glass (BG) to Mott-insulator (MI) states. The system consists of a two-dimensional array of photonic crystal microcavities doped with substitutional donor/acceptor impurities. Using realistic parameters, we show that such strongly correlated polaritonic systems can be constructed using the state-of-art semiconductor technology. △ Less

Submitted 12 October, 2007; v1 submitted 23 March, 2007; originally announced March 2007.

Comments: 14 pages, 5 figures

Journal ref: Phys. Rev. A 77, 031803(R) (2008)

arXiv:physics/0702216

Strongly Correlated Photons in a Photonic Crystal

Authors: Y. C. Neil Na, Shoko Utsunomiya, Yoshihisa Yamamoto

Abstract: This paper is withdrawn. This paper is withdrawn. △ Less

Submitted 4 April, 2007; v1 submitted 24 February, 2007; originally announced February 2007.

Comments: This paper is withdrawn. Please see quant-ph/0703219 for the revision

Showing 1–30 of 30 results for author: Na, C