-
SEDD-PCC: A Single Encoder-Dual Decoder Framework For End-To-End Learned Point Cloud Compression
Authors:
Kai Hsiang Hsieh,
Monyneath Yim,
Jui Chiu Chiang
Abstract:
To encode point clouds containing both geometry and attributes, most learning-based compression schemes treat geometry and attribute coding separately, employing distinct encoders and decoders. This not only increases computational complexity but also fails to fully exploit shared features between geometry and attributes. To address this limitation, we propose SEDD-PCC, an end-to-end learning-base…
▽ More
To encode point clouds containing both geometry and attributes, most learning-based compression schemes treat geometry and attribute coding separately, employing distinct encoders and decoders. This not only increases computational complexity but also fails to fully exploit shared features between geometry and attributes. To address this limitation, we propose SEDD-PCC, an end-to-end learning-based framework for lossy point cloud compression that jointly compresses geometry and attributes. SEDD-PCC employs a single encoder to extract shared geometric and attribute features into a unified latent space, followed by dual specialized decoders that sequentially reconstruct geometry and attributes. Additionally, we incorporate knowledge distillation to enhance feature representation learning from a teacher model, further improving coding efficiency. With its simple yet effective design, SEDD-PCC provides an efficient and practical solution for point cloud compression. Comparative evaluations against both rule-based and learning-based methods demonstrate its competitive performance, highlighting SEDD-PCC as a promising AI-driven compression approach.
△ Less
Submitted 22 May, 2025;
originally announced May 2025.
-
CAT-3DGS: A Context-Adaptive Triplane Approach to Rate-Distortion-Optimized 3DGS Compression
Authors:
Yu-Ting Zhan,
Cheng-Yuan Ho,
Hebi Yang,
Yi-Hsin Chen,
Jui Chiu Chiang,
Yu-Lun Liu,
Wen-Hsiao Peng
Abstract:
3D Gaussian Splatting (3DGS) has recently emerged as a promising 3D representation. Much research has been focused on reducing its storage requirements and memory footprint. However, the needs to compress and transmit the 3DGS representation to the remote side are overlooked. This new application calls for rate-distortion-optimized 3DGS compression. How to quantize and entropy encode sparse Gaussi…
▽ More
3D Gaussian Splatting (3DGS) has recently emerged as a promising 3D representation. Much research has been focused on reducing its storage requirements and memory footprint. However, the needs to compress and transmit the 3DGS representation to the remote side are overlooked. This new application calls for rate-distortion-optimized 3DGS compression. How to quantize and entropy encode sparse Gaussian primitives in the 3D space remains largely unexplored. Few early attempts resort to the hyperprior framework from learned image compression. But, they fail to utilize fully the inter and intra correlation inherent in Gaussian primitives. Built on ScaffoldGS, this work, termed CAT-3DGS, introduces a context-adaptive triplane approach to their rate-distortion-optimized coding. It features multi-scale triplanes, oriented according to the principal axes of Gaussian primitives in the 3D space, to capture their inter correlation (i.e. spatial correlation) for spatial autoregressive coding in the projected 2D planes. With these triplanes serving as the hyperprior, we further perform channel-wise autoregressive coding to leverage the intra correlation within each individual Gaussian primitive. Our CAT-3DGS incorporates a view frequency-aware masking mechanism. It actively skips from coding those Gaussian primitives that potentially have little impact on the rendering quality. When trained end-to-end to strike a good rate-distortion trade-off, our CAT-3DGS achieves the state-of-the-art compression performance on the commonly used real-world datasets.
△ Less
Submitted 7 March, 2025; v1 submitted 1 March, 2025;
originally announced March 2025.
-
Context Matters: An Empirical Study of the Impact of Contextual Information in Temporal Question Answering Systems
Authors:
Dan Schumacher,
Fatemeh Haji,
Tara Grey,
Niharika Bandlamudi,
Nupoor Karnik,
Gagana Uday Kumar,
Jason Cho-Yu Chiang,
Paul Rad,
Nishant Vishwamitra,
Anthony Rios
Abstract:
Large language models (LLMs) often struggle with temporal reasoning, crucial for tasks like historical event analysis and time-sensitive information retrieval. Despite advancements, state-of-the-art models falter in handling temporal information, especially when faced with irrelevant or noisy contexts. This paper addresses this gap by empirically examining the robustness of temporal question-answe…
▽ More
Large language models (LLMs) often struggle with temporal reasoning, crucial for tasks like historical event analysis and time-sensitive information retrieval. Despite advancements, state-of-the-art models falter in handling temporal information, especially when faced with irrelevant or noisy contexts. This paper addresses this gap by empirically examining the robustness of temporal question-answering (TQA) systems trained on various context types, including relevant, irrelevant, slightly altered, and no context. Our findings indicate that training with a mix of these contexts enhances model robustness and accuracy. Additionally, we show that the position of context relative to the question significantly impacts performance, with question-first positioning yielding better results. We introduce two new context-rich TQA datasets, ContextAQA and ContextTQE, and provide comprehensive evaluations and guidelines for training robust TQA models. Our work lays the foundation for developing reliable and context-aware temporal QA systems, with broader implications for enhancing LLM robustness against diverse and potentially adversarial information.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
Anomalous k-dependent spin splitting in wurtzite AlxGa1-xN/GaN heterostructures
Authors:
Ikai Lo,
M. H. Gau,
J. K. Tsai,
Y. L. Chen,
Z. J. Chang,
W. T. Wang,
J. C. Chiang,
T. Aggerstam
Abstract:
We have confirmed the k-dependent spin splitting in wurtzite AlxGa1-xN/GaN heterostructures. Anomalous beating pattern in Shubnikov-de Haas measurements arises from the interference of Rashba and Dresselhaus spin-orbit interactions. The dominant mechanism for the k-dependent spin splitting at high values of k is attributed to Dresselhaus term which is enhanced by the Delta C1-Delta C3 coupling o…
▽ More
We have confirmed the k-dependent spin splitting in wurtzite AlxGa1-xN/GaN heterostructures. Anomalous beating pattern in Shubnikov-de Haas measurements arises from the interference of Rashba and Dresselhaus spin-orbit interactions. The dominant mechanism for the k-dependent spin splitting at high values of k is attributed to Dresselhaus term which is enhanced by the Delta C1-Delta C3 coupling of wurtzite band folding effect.
△ Less
Submitted 9 November, 2006; v1 submitted 15 September, 2006;
originally announced September 2006.
-
Study of two-subband population in Fe-doped AlxGa1-xN/GaN heterostructures by persistent photoconductivity effect
Authors:
Ikai Lo,
J. K. Tsai,
M. H. Gau,
Y. L. Chen,
Z. J. Chang,
W. T. Wang,
J. C. Chiang,
K. R. Wang,
Chun-Nan Chen,
T. Aggerstam
Abstract:
The electronic properties of Fe-doped Al0.31Ga0.69N/GaN heterostructures have been studied by Shubnikov-de Haas measurement. Two subbands of the two-dimensional electron gas in the hetero-interface were populated. After the low temperature illumination, the electron density increases from 11.99 x 1012 cm-2 to 13.40 x 1012 cm-2 for the first subband and from 0.66 x 1012 cm-2 to 0.94 x 1012 cm-2 f…
▽ More
The electronic properties of Fe-doped Al0.31Ga0.69N/GaN heterostructures have been studied by Shubnikov-de Haas measurement. Two subbands of the two-dimensional electron gas in the hetero-interface were populated. After the low temperature illumination, the electron density increases from 11.99 x 1012 cm-2 to 13.40 x 1012 cm-2 for the first subband and from 0.66 x 1012 cm-2 to 0.94 x 1012 cm-2 for the second subband. The persistent photoconductivity effect (~13% increase) is mostly attributed to the Fe-related deep-donor level in GaN layer. The second subband starts to populate when the first subband is filled at a density n1 = 9.40 x 1012 cm-2. We calculate the energy separation between the first and second subbands to be 105 meV.
△ Less
Submitted 14 September, 2006;
originally announced September 2006.
-
Wurtzite Effects on Spin Splitting of GaN/AlN Quantum Wells
Authors:
Ikai Lo,
W. T. Wang,
M. H. Gau,
S. F. Tsay,
J. C. Chiang
Abstract:
A new mechanism (DeltaC1-DeltaC3 coupling) is accounted for the spin splitting of wurtzite GaN, which is originated from the intrinsic wurtzite effects (band folding and structure inversion asymmetry). The band-folding effect generates two conduction bands (DeltaC1 and DeltaC3), in which p-wave probability has tremendous change when kz approaches anti-crossing zone. The spin-splitting energy ind…
▽ More
A new mechanism (DeltaC1-DeltaC3 coupling) is accounted for the spin splitting of wurtzite GaN, which is originated from the intrinsic wurtzite effects (band folding and structure inversion asymmetry). The band-folding effect generates two conduction bands (DeltaC1 and DeltaC3), in which p-wave probability has tremendous change when kz approaches anti-crossing zone. The spin-splitting energy induced by the DeltaC1-DeltaC3 coupling and wurtzite structure inversion asymmetry is much larger than that evaluated by traditional Rashba or Dresselhaus effects. When we apply the coupling to GaN/AlN quantum wells, we find that the spin-splitting energy is sensitively controllable by an electric field. Based on the mechanism, we proposed a p-wave-enhanced spin-polarized field effect transistor, made of InxGa1-xN/InyAl1-yN, for spintronics application.
△ Less
Submitted 31 October, 2005;
originally announced October 2005.