-
S^4M: Boosting Semi-Supervised Instance Segmentation with SAM
Authors:
Heeji Yoon,
Heeseong Shin,
Eunbeen Hong,
Hyunwook Choi,
Hansang Cho,
Daun Jeong,
Seungryong Kim
Abstract:
Semi-supervised instance segmentation poses challenges due to limited labeled data, causing difficulties in accurately localizing distinct object instances. Current teacher-student frameworks still suffer from performance constraints due to unreliable pseudo-label quality stemming from limited labeled data. While the Segment Anything Model (SAM) offers robust segmentation capabilities at various g…
▽ More
Semi-supervised instance segmentation poses challenges due to limited labeled data, causing difficulties in accurately localizing distinct object instances. Current teacher-student frameworks still suffer from performance constraints due to unreliable pseudo-label quality stemming from limited labeled data. While the Segment Anything Model (SAM) offers robust segmentation capabilities at various granularities, directly applying SAM to this task introduces challenges such as class-agnostic predictions and potential over-segmentation. To address these complexities, we carefully integrate SAM into the semi-supervised instance segmentation framework, developing a novel distillation method that effectively captures the precise localization capabilities of SAM without compromising semantic recognition. Furthermore, we incorporate pseudo-label refinement as well as a specialized data augmentation with the refined pseudo-labels, resulting in superior performance. We establish state-of-the-art performance, and provide comprehensive experiments and ablation studies to validate the effectiveness of our proposed approach.
△ Less
Submitted 7 April, 2025;
originally announced April 2025.
-
Robust Deep Joint Source Channel Coding for Task-Oriented Semantic Communications
Authors:
Taewoo Park,
Eunhye Hong,
Yo-Seb Jeon,
Namyoon Lee,
Yongjune Kim
Abstract:
Semantic communications based on deep joint source-channel coding (JSCC) aim to improve communication efficiency by transmitting only task-relevant information. However, ensuring robustness to the stochasticity of communication channels remains a key challenge in learning-based JSCC. In this paper, we propose a novel regularization technique for learning-based JSCC to enhance robustness against ch…
▽ More
Semantic communications based on deep joint source-channel coding (JSCC) aim to improve communication efficiency by transmitting only task-relevant information. However, ensuring robustness to the stochasticity of communication channels remains a key challenge in learning-based JSCC. In this paper, we propose a novel regularization technique for learning-based JSCC to enhance robustness against channel noise. The proposed method utilizes the Kullback-Leibler (KL) divergence as a regularizer term in the training loss, measuring the discrepancy between two posterior distributions: one under noisy channel conditions (noisy posterior) and one for a noise-free system (noise-free posterior). Reducing this KL divergence mitigates the impact of channel noise on task performance by keeping the noisy posterior close to the noise-free posterior. We further show that the expectation of the KL divergence given the encoded representation can be analytically approximated using the Fisher information matrix and the covariance matrix of the channel noise. Notably, the proposed regularization is architecture-agnostic, making it broadly applicable to general semantic communication systems over noisy channels. Our experimental results validate that the proposed regularization consistently improves task performance across diverse semantic communication systems and channel conditions.
△ Less
Submitted 17 March, 2025;
originally announced March 2025.
-
Fourier-Based 3D Multistage Transformer for Aberration Correction in Multicellular Specimens
Authors:
Thayer Alshaabi,
Daniel E. Milkie,
Gaoxiang Liu,
Cyna Shirazinejad,
Jason L. Hong,
Kemal Achour,
Frederik Görlitz,
Ana Milunovic-Jevtic,
Cat Simmons,
Ibrahim S. Abuzahriyeh,
Erin Hong,
Samara Erin Williams,
Nathanael Harrison,
Evan Huang,
Eun Seok Bae,
Alison N. Killilea,
David G. Drubin,
Ian A. Swinburne,
Srigokul Upadhyayula,
Eric Betzig
Abstract:
High-resolution tissue imaging is often compromised by sample-induced optical aberrations that degrade resolution and contrast. While wavefront sensor-based adaptive optics (AO) can measure these aberrations, such hardware solutions are typically complex, expensive to implement, and slow when serially mapping spatially varying aberrations across large fields of view. Here, we introduce AOViFT (Ada…
▽ More
High-resolution tissue imaging is often compromised by sample-induced optical aberrations that degrade resolution and contrast. While wavefront sensor-based adaptive optics (AO) can measure these aberrations, such hardware solutions are typically complex, expensive to implement, and slow when serially mapping spatially varying aberrations across large fields of view. Here, we introduce AOViFT (Adaptive Optical Vision Fourier Transformer) -- a machine learning-based aberration sensing framework built around a 3D multistage Vision Transformer that operates on Fourier domain embeddings. AOViFT infers aberrations and restores diffraction-limited performance in puncta-labeled specimens with substantially reduced computational cost, training time, and memory footprint compared to conventional architectures or real-space networks. We validated AOViFT on live gene-edited zebrafish embryos, demonstrating its ability to correct spatially varying aberrations using either a deformable mirror or post-acquisition deconvolution. By eliminating the need for the guide star and wavefront sensing hardware and simplifying the experimental workflow, AOViFT lowers technical barriers for high-resolution volumetric microscopy across diverse biological samples.
△ Less
Submitted 23 May, 2025; v1 submitted 16 March, 2025;
originally announced March 2025.
-
MV2Cyl: Reconstructing 3D Extrusion Cylinders from Multi-View Images
Authors:
Eunji Hong,
Minh Hieu Nguyen,
Mikaela Angelina Uy,
Minhyuk Sung
Abstract:
We present MV2Cyl, a novel method for reconstructing 3D from 2D multi-view images, not merely as a field or raw geometry but as a sketch-extrude CAD model. Extracting extrusion cylinders from raw 3D geometry has been extensively researched in computer vision, while the processing of 3D data through neural networks has remained a bottleneck. Since 3D scans are generally accompanied by multi-view im…
▽ More
We present MV2Cyl, a novel method for reconstructing 3D from 2D multi-view images, not merely as a field or raw geometry but as a sketch-extrude CAD model. Extracting extrusion cylinders from raw 3D geometry has been extensively researched in computer vision, while the processing of 3D data through neural networks has remained a bottleneck. Since 3D scans are generally accompanied by multi-view images, leveraging 2D convolutional neural networks allows these images to be exploited as a rich source for extracting extrusion cylinder information. However, we observe that extracting only the surface information of the extrudes and utilizing it results in suboptimal outcomes due to the challenges in the occlusion and surface segmentation. By synergizing with the extracted base curve information, we achieve the optimal reconstruction result with the best accuracy in 2D sketch and extrude parameter estimation. Our experiments, comparing our method with previous work that takes a raw 3D point cloud as input, demonstrate the effectiveness of our approach by taking advantage of multi-view images. Our project page can be found at http://mv2cyl.github.io .
△ Less
Submitted 18 November, 2024; v1 submitted 16 June, 2024;
originally announced June 2024.
-
SST-GCN: The Sequential based Spatio-Temporal Graph Convolutional networks for Minute-level and Road-level Traffic Accident Risk Prediction
Authors:
Tae-wook Kim,
Han-jin Lee,
Hyeon-Jin Jung,
Ji-Woong Yang,
Ellen J. Hong
Abstract:
Traffic accidents are recognized as a major social issue worldwide, causing numerous injuries and significant costs annually. Consequently, methods for predicting and preventing traffic accidents have been researched for many years. With advancements in the field of artificial intelligence, various studies have applied Machine Learning and Deep Learning techniques to traffic accident prediction. M…
▽ More
Traffic accidents are recognized as a major social issue worldwide, causing numerous injuries and significant costs annually. Consequently, methods for predicting and preventing traffic accidents have been researched for many years. With advancements in the field of artificial intelligence, various studies have applied Machine Learning and Deep Learning techniques to traffic accident prediction. Modern traffic conditions change rapidly by the minute, and these changes vary significantly across different roads. In other words, the risk of traffic accidents changes minute by minute in various patterns for each road. Therefore, it is desirable to predict traffic accident risk at the Minute-Level and Road-Level. However, because roads have close and complex relationships with adjacent roads, research on predicting traffic accidents at the Minute-Level and Road-Level is challenging. Thus, it is essential to build a model that can reflect the spatial and temporal characteristics of roads for traffic accident prediction. Consequently, recent attempts have been made to use Graph Convolutional Networks to capture the spatial characteristics of roads and Recurrent Neural Networks to capture their temporal characteristics for predicting traffic accident risk. This paper proposes the Sequential based Spatio-Temporal Graph Convolutional Networks (SST-GCN), which combines GCN and LSTM, to predict traffic accidents at the Minute-Level and Road-Level using a road dataset constructed in Seoul, the capital of South Korea. Experiments have demonstrated that SST-GCN outperforms other state-of-the-art models in Minute-Level predictions.
△ Less
Submitted 3 June, 2024; v1 submitted 28 May, 2024;
originally announced May 2024.
-
HyperCLOVA X Technical Report
Authors:
Kang Min Yoo,
Jaegeun Han,
Sookyo In,
Heewon Jeon,
Jisu Jeong,
Jaewook Kang,
Hyunwook Kim,
Kyung-Min Kim,
Munhyong Kim,
Sungju Kim,
Donghyun Kwak,
Hanock Kwak,
Se Jung Kwon,
Bado Lee,
Dongsoo Lee,
Gichang Lee,
Jooho Lee,
Baeseong Park,
Seongjin Shin,
Joonsang Yu,
Seolki Baek,
Sumin Byeon,
Eungsup Cho,
Dooseok Choe,
Jeesung Han
, et al. (371 additional authors not shown)
Abstract:
We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t…
▽ More
We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment to responsible AI. The model is evaluated across various benchmarks, including comprehensive reasoning, knowledge, commonsense, factuality, coding, math, chatting, instruction-following, and harmlessness, in both Korean and English. HyperCLOVA X exhibits strong reasoning capabilities in Korean backed by a deep understanding of the language and cultural nuances. Further analysis of the inherent bilingual nature and its extension to multilingualism highlights the model's cross-lingual proficiency and strong generalization ability to untargeted languages, including machine translation between several language pairs and cross-lingual inference tasks. We believe that HyperCLOVA X can provide helpful guidance for regions or countries in developing their sovereign LLMs.
△ Less
Submitted 13 April, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
Closing the AI generalization gap by adjusting for dermatology condition distribution differences across clinical settings
Authors:
Rajeev V. Rikhye,
Aaron Loh,
Grace Eunhae Hong,
Preeti Singh,
Margaret Ann Smith,
Vijaytha Muralidharan,
Doris Wong,
Rory Sayres,
Michelle Phung,
Nicolas Betancourt,
Bradley Fong,
Rachna Sahasrabudhe,
Khoban Nasim,
Alec Eschholz,
Basil Mustafa,
Jan Freyberg,
Terry Spitz,
Yossi Matias,
Greg S. Corrado,
Katherine Chou,
Dale R. Webster,
Peggy Bui,
Yuan Liu,
Yun Liu,
Justin Ko
, et al. (1 additional authors not shown)
Abstract:
Recently, there has been great progress in the ability of artificial intelligence (AI) algorithms to classify dermatological conditions from clinical photographs. However, little is known about the robustness of these algorithms in real-world settings where several factors can lead to a loss of generalizability. Understanding and overcoming these limitations will permit the development of generali…
▽ More
Recently, there has been great progress in the ability of artificial intelligence (AI) algorithms to classify dermatological conditions from clinical photographs. However, little is known about the robustness of these algorithms in real-world settings where several factors can lead to a loss of generalizability. Understanding and overcoming these limitations will permit the development of generalizable AI that can aid in the diagnosis of skin conditions across a variety of clinical settings. In this retrospective study, we demonstrate that differences in skin condition distribution, rather than in demographics or image capture mode are the main source of errors when an AI algorithm is evaluated on data from a previously unseen source. We demonstrate a series of steps to close this generalization gap, requiring progressively more information about the new source, ranging from the condition distribution to training data enriched for data less frequently seen during training. Our results also suggest comparable performance from end-to-end fine tuning versus fine tuning solely the classification layer on top of a frozen embedding model. Our approach can inform the adaptation of AI algorithms to new settings, based on the information and resources available.
△ Less
Submitted 23 February, 2024;
originally announced February 2024.
-
CheX-GPT: Harnessing Large Language Models for Enhanced Chest X-ray Report Labeling
Authors:
Jawook Gu,
Kihyun You,
Han-Cheol Cho,
Jiho Kim,
Eun Kyoung Hong,
Byungseok Roh
Abstract:
Free-text radiology reports present a rich data source for various medical tasks, but effectively labeling these texts remains challenging. Traditional rule-based labeling methods fall short of capturing the nuances of diverse free-text patterns. Moreover, models using expert-annotated data are limited by data scarcity and pre-defined classes, impacting their performance, flexibility and scalabili…
▽ More
Free-text radiology reports present a rich data source for various medical tasks, but effectively labeling these texts remains challenging. Traditional rule-based labeling methods fall short of capturing the nuances of diverse free-text patterns. Moreover, models using expert-annotated data are limited by data scarcity and pre-defined classes, impacting their performance, flexibility and scalability. To address these issues, our study offers three main contributions: 1) We demonstrate the potential of GPT as an adept labeler using carefully designed prompts. 2) Utilizing only the data labeled by GPT, we trained a BERT-based labeler, CheX-GPT, which operates faster and more efficiently than its GPT counterpart. 3) To benchmark labeler performance, we introduced a publicly available expert-annotated test set, MIMIC-500, comprising 500 cases from the MIMIC validation set. Our findings demonstrate that CheX-GPT not only excels in labeling accuracy over existing models, but also showcases superior efficiency, flexibility, and scalability, supported by our introduction of the MIMIC-500 dataset for robust benchmarking. Code and models are available at https://github.com/Soombit-ai/CheXGPT.
△ Less
Submitted 5 November, 2024; v1 submitted 21 January, 2024;
originally announced January 2024.
-
CXR-CLIP: Toward Large Scale Chest X-ray Language-Image Pre-training
Authors:
Kihyun You,
Jawook Gu,
Jiyeon Ham,
Beomhee Park,
Jiho Kim,
Eun Kyoung Hong,
Woonhyunk Baek,
Byungseok Roh
Abstract:
A large-scale image-text pair dataset has greatly contributed to the development of vision-language pre-training (VLP) models, which enable zero-shot or few-shot classification without costly annotation. However, in the medical domain, the scarcity of data remains a significant challenge for developing a powerful VLP model. In this paper, we tackle the lack of image-text data in chest X-ray by exp…
▽ More
A large-scale image-text pair dataset has greatly contributed to the development of vision-language pre-training (VLP) models, which enable zero-shot or few-shot classification without costly annotation. However, in the medical domain, the scarcity of data remains a significant challenge for developing a powerful VLP model. In this paper, we tackle the lack of image-text data in chest X-ray by expanding image-label pair as image-text pair via general prompt and utilizing multiple images and multiple sections in a radiologic report. We also design two contrastive losses, named ICL and TCL, for learning study-level characteristics of medical images and reports, respectively. Our model outperforms the state-of-the-art models trained under the same conditions. Also, enlarged dataset improve the discriminative power of our pre-trained model for classification, while sacrificing marginal retrieval performance. Code is available at https://github.com/kakaobrain/cxr-clip.
△ Less
Submitted 20 October, 2023;
originally announced October 2023.
-
Double Descent Optimization Pattern and Aliasing: Caveats of Noisy Labels
Authors:
Florian Dubost,
Erin Hong,
Max Pike,
Siddharth Sharma,
Siyi Tang,
Nandita Bhaskhar,
Christopher Lee-Messer,
Daniel Rubin
Abstract:
Optimization plays a key role in the training of deep neural networks. Deciding when to stop training can have a substantial impact on the performance of the network during inference. Under certain conditions, the generalization error can display a double descent pattern during training: the learning curve is non-monotonic and seemingly diverges before converging again after additional epochs. Thi…
▽ More
Optimization plays a key role in the training of deep neural networks. Deciding when to stop training can have a substantial impact on the performance of the network during inference. Under certain conditions, the generalization error can display a double descent pattern during training: the learning curve is non-monotonic and seemingly diverges before converging again after additional epochs. This optimization pattern can lead to early stopping procedures to stop training before the second convergence and consequently select a suboptimal set of parameters for the network, with worse performance during inference. In this work, in addition to confirming that double descent occurs with small datasets and noisy labels as evidenced by others, we show that noisy labels must be present both in the training and generalization sets to observe a double descent pattern. We also show that the learning rate has an influence on double descent, and study how different optimizers and optimizer parameters influence the apparition of double descent. Finally, we show that increasing the learning rate can create an aliasing effect that masks the double descent pattern without suppressing it. We study this phenomenon through extensive experiments on variants of CIFAR-10 and show that they translate to a real world application: the forecast of seizure events in epileptic patients from continuous electroencephalographic recordings.
△ Less
Submitted 16 September, 2021; v1 submitted 3 June, 2021;
originally announced June 2021.
-
Semi-Supervised Learning for Sparsely-Labeled Sequential Data: Application to Healthcare Video Processing
Authors:
Florian Dubost,
Erin Hong,
Nandita Bhaskhar,
Siyi Tang,
Daniel Rubin,
Christopher Lee-Messer
Abstract:
Labeled data is a critical resource for training and evaluating machine learning models. However, many real-life datasets are only partially labeled. We propose a semi-supervised machine learning training strategy to improve event detection performance on sequential data, such as video recordings, when only sparse labels are available, such as event start times without their corresponding end time…
▽ More
Labeled data is a critical resource for training and evaluating machine learning models. However, many real-life datasets are only partially labeled. We propose a semi-supervised machine learning training strategy to improve event detection performance on sequential data, such as video recordings, when only sparse labels are available, such as event start times without their corresponding end times. Our method uses noisy guesses of the events' end times to train event detection models. Depending on how conservative these guesses are, mislabeled samples may be introduced into the training set. We further propose a mathematical model for explaining and estimating the evolution of the classification performance for increasingly noisier end time estimates. We show that neural networks can improve their detection performance by leveraging more training data with less conservative approximations despite the higher proportion of incorrect labels. We adapt sequential versions of CIFAR-10 and MNIST, and use the Berkeley MHAD and HMBD51 video datasets to empirically evaluate our method, and find that our risk-tolerant strategy outperforms conservative estimates by 3.5 points of mean average precision for CIFAR, 30 points for MNIST, 3 points for MHAD, and 14 points for HMBD51. Then, we leverage the proposed training strategy to tackle a real-life application: processing continuous video recordings of epilepsy patients, and show that our method outperforms baseline labeling methods by 17 points of average precision, and reaches a classification performance similar to that of fully supervised models. We share part of the code for this article.
△ Less
Submitted 1 October, 2022; v1 submitted 28 November, 2020;
originally announced November 2020.
-
Isometry-Dual Flags of AG Codes
Authors:
Maria Bras-Amorós,
Iwan Duursma,
Euijin Hong
Abstract:
Consider a complete flag $\{0\} = C_0 < C_1 < \cdots < C_n = \mathbb{F}^n$ of one-point AG codes of length $n$ over the finite field $\mathbb{F}$. The codes are defined by evaluating functions with poles at a given point $Q$ in points $P_1,\dots,P_n$ distinct from $Q$. A flag has the isometry-dual property if the given flag and the corresponding dual flag are the same up to isometry. For several c…
▽ More
Consider a complete flag $\{0\} = C_0 < C_1 < \cdots < C_n = \mathbb{F}^n$ of one-point AG codes of length $n$ over the finite field $\mathbb{F}$. The codes are defined by evaluating functions with poles at a given point $Q$ in points $P_1,\dots,P_n$ distinct from $Q$. A flag has the isometry-dual property if the given flag and the corresponding dual flag are the same up to isometry. For several curves, including the projective line, Hermitian curves, Suzuki curves, Ree curves, and the Klein curve over the field of eight elements, the maximal flag, obtained by evaluation in all rational points different from the point $Q$, is self-dual. More generally, we ask whether a flag obtained by evaluation in a proper subset of rational points is isometry-dual. In [3] it is shown, for a curve of genus $g$, that a flag of one-point AG codes defined with a subset of $n > 2g+2$ rational points is isometry-dual if and only if the last code $C_n$ in the flag is defined with functions of pole order at most $n+2g-1$. Using a different approach, we extend this characterization to all subsets of size $n \geq 2g+2$. Moreover we show that this is best possible by giving examples of isometry-dual flags with $n=2g+1$ such that $C_n$ is generated by functions of pole order at most $n+2g-2$. We also prove a necessary condition, formulated in terms of maximum sparse ideals of the Weierstrass semigroup of $Q$, under which a flag of punctured one-point AG codes inherits the isometry-dual property from the original unpunctured flag.
△ Less
Submitted 25 June, 2019;
originally announced June 2019.
-
Improved Methods for Computing Distances between Unordered Trees Using Integer Programming
Authors:
Eunpyeong Hong,
Yasuaki Kobayashi,
Akihiro Yamamoto
Abstract:
Kondo et al. (DS 2014) proposed methods for computing distances between unordered rooted trees by transforming an instance of the distance computing problem into an instance of the integer programming problem. They showed that the tree edit distance, segmental distance, and bottom-up segmental distance problem can be respectively transformed into an integer program which has $O(nm)$ variables and…
▽ More
Kondo et al. (DS 2014) proposed methods for computing distances between unordered rooted trees by transforming an instance of the distance computing problem into an instance of the integer programming problem. They showed that the tree edit distance, segmental distance, and bottom-up segmental distance problem can be respectively transformed into an integer program which has $O(nm)$ variables and $O(n^2m^2)$ constraints, where $n$ and $m$ are the number of nodes of input trees. In this work, we propose new integer programming formulations for these three distances and the bottom-up distance by applying dynamic programming approach. We divide the tree edit distance problem into $O(nm)$ subproblems each of which has only $O(n + m)$ constraints. For the other three distances, each subproblem can be reduced to a maximum weighted matching problem in a bipartite graph which can be solved in polynomial time. In order to evaluate our methods, we compare our method to the previous one due to Kondo et al. The experimental results show that the performance of our methods have been improved remarkably compared to that of the previous method.
△ Less
Submitted 12 June, 2017;
originally announced June 2017.
-
Downlink Power Optimization for Heterogeneous Networks with Time Reversal-based Transmission under Backhaul Limitation
Authors:
Ha-Vu Tran,
Georges Kaddoum,
Hung Tran,
Een-Kee Hong
Abstract:
In this paper, we investigate an application of two different beamforming techniques and propose a novel downlink power minimization scheme for a two-tier heterogeneous network (HetNet) model. In this context, we employ time reversal (TR) technique to a femtocell base station (FBS) whereas we assume that a macrocell base station (MBS) uses a zero-forcing-based algorithm and the communication chann…
▽ More
In this paper, we investigate an application of two different beamforming techniques and propose a novel downlink power minimization scheme for a two-tier heterogeneous network (HetNet) model. In this context, we employ time reversal (TR) technique to a femtocell base station (FBS) whereas we assume that a macrocell base station (MBS) uses a zero-forcing-based algorithm and the communication channels are subject to frequency selective fading. Additionally, HetNet's backhaul connection is unable to support a sufficient throughput for signaling information exchange between two tiers. Given the considered HetNet model, a downlink power minimization scheme is proposed, and closed-form expressions concerning the optimal solution are provided, taking this constraint into account. Furthermore, considering imperfect channel estimation at TR-employed femtocell, a worst-case robust power minimization problem is formulated. By devising TR worst-case analysis, this robust problem is transformed into an equivalent formulation that is tractable to solve. The results presented in our paper show that the TR technique outperforms the zero-forcing one in the perspective of beamforming methods for femtocell working environments. Finally, we validate the proposed power loading strategy for both cases of perfect and imperfect channel estimations.
△ Less
Submitted 6 April, 2017;
originally announced April 2017.
-
Conic Quadratic Formulations for Wireless Communications Design
Authors:
Quang-Doanh Vu,
Markku Juntti,
Een-Kee Hong,
Le-Nam Tran
Abstract:
As a wide class of resource management problems in wireless communications are nonconvex and even NP-hard in many cases, finding globally optimal solutions to these problems is of little practical interest. Towards more pragmatic approaches, there is a rich literature on iterative methods aiming at finding a solution satisfying necessary optimality conditions to these problems. These approaches ha…
▽ More
As a wide class of resource management problems in wireless communications are nonconvex and even NP-hard in many cases, finding globally optimal solutions to these problems is of little practical interest. Towards more pragmatic approaches, there is a rich literature on iterative methods aiming at finding a solution satisfying necessary optimality conditions to these problems. These approaches have been derived under several similar mathematical frameworks such as inner approximation algorithm, concave-convex procedure, majorization-minimization algorithm, and successive convex approximation (SCA). However, a large portion of existing algorithms arrive at a relatively generic program at each iteration, which is less computationally efficient compared to a more standard convex formulation. This paper proposes \emph{numerically efficient} transformations and approximations for SCA-based methods to deal with nonconvexity in wireless communications design. More specifically, the central goal is to show that various nonconvex problems in wireless communications can be iteratively solved by conic quadratic optimization. We revisit various examples to demonstrate the advantages of the proposed approximations. Theoretical complexity analysis and numerical results show the superior efficiency in terms of computational cost of our proposed solutions compared to the existing ones.
△ Less
Submitted 14 April, 2017; v1 submitted 18 October, 2016;
originally announced October 2016.
-
Time Reversal-based Transmissions with Distributed Power Allocation for Two-Tier Networks
Authors:
Vu Tran-Ha,
Quang-Doanh Vu,
Een-Kee Hong
Abstract:
Radio pollution and power consumption problems lead to innovative development of green heterogeneous networks (HetNet). Time reversal (TR) technique which has been validated from wide- to narrow-band transmissions is evaluated as one of most prominent linear precoders with superior capability of harvesting signal energy. In this paper, we consider a new HetNet model, in which TR-employed femtocell…
▽ More
Radio pollution and power consumption problems lead to innovative development of green heterogeneous networks (HetNet). Time reversal (TR) technique which has been validated from wide- to narrow-band transmissions is evaluated as one of most prominent linear precoders with superior capability of harvesting signal energy. In this paper, we consider a new HetNet model, in which TR-employed femtocell is proposed to attain saving power benefits whereas macrocell utilizes the beam-forming algorithm based on zero-forcing principle, over frequency selective channels. In the considered HetNet, the practical case of limited signaling information exchanged via backhaul connections is also taken under advisement. We hence organize a distributed power loading strategy, in which macrocell users are treated with a superior priority compared to femtocell users. By Monte-Carlo simulation, the obtained results show that TR is preferred to zero-forcing in the perspective of beamforming technique for femtocell environments due to very high achievable gain in saving energy, and the validity of power loading strategy is verified over multipath channels.
△ Less
Submitted 8 January, 2015;
originally announced January 2015.