-
Deep Symbolic Optimization: Reinforcement Learning for Symbolic Mathematics
Authors:
Conor F. Hayes,
Felipe Leno Da Silva,
Jiachen Yang,
T. Nathan Mundhenk,
Chak Shing Lee,
Jacob F. Pettit,
Claudio Santiago,
Sookyung Kim,
Joanne T. Kim,
Ignacio Aravena Solis,
Ruben Glatt,
Andre R. Goncalves,
Alexander Ladd,
Ahmet Can Solak,
Thomas Desautels,
Daniel Faissol,
Brenden K. Petersen,
Mikel Landajuela
Abstract:
Deep Symbolic Optimization (DSO) is a novel computational framework that enables symbolic optimization for scientific discovery, particularly in applications involving the search for intricate symbolic structures. One notable example is equation discovery, which aims to automatically derive mathematical models expressed in symbolic form. In DSO, the discovery process is formulated as a sequential decision-making task. A generative neural network learns a probabilistic model over a vast space of candidate symbolic expressions, while reinforcement learning strategies guide the search toward the most promising regions. This approach integrates gradient-based optimization with evolutionary and local search techniques, and it incorporates in-situ constraints, domain-specific priors, and advanced policy optimization methods. The result is a robust framework capable of efficiently exploring extensive search spaces to identify interpretable and physically meaningful models. Extensive evaluations on benchmark problems have demonstrated that DSO achieves state-of-the-art performance in both accuracy and interpretability. In this chapter, we provide a comprehensive overview of the DSO framework and illustrate its transformative potential for automating symbolic optimization in scientific discovery.
Submitted 15 May, 2025;
originally announced May 2025.
-
Do Looks Matter? Exploring Functional and Aesthetic Design Preferences for a Robotic Guide Dog
Authors:
Aviv L. Cohav,
A. Xinran Gong,
J. Taery Kim,
Clint Zeagler,
Sehoon Ha,
Bruce N. Walker
Abstract:
Dog guides offer an effective mobility solution for blind or visually impaired (BVI) individuals, but conventional dog guides have limitations including the need for care, potential distractions, societal prejudice, high costs, and limited availability. To address these challenges, we seek to develop a robot dog guide capable of performing the tasks of a conventional dog guide, enhanced with additional features. In this work, we focus on design research to identify functional and aesthetic design concepts to implement into a quadrupedal robot. The aesthetic design remains relevant even for BVI users due to their sensitivity toward societal perceptions and the need for smooth integration into society. We collected data through interviews and surveys to answer specific design questions pertaining to the appearance, texture, features, and method of controlling and communicating with the robot. Our study identified essential and preferred features for a future robot dog guide, which are supported by relevant statistics aligning with each suggestion. These findings will inform the future development of user-centered designs to effectively meet the needs of BVI individuals.
Submitted 18 February, 2025;
originally announced March 2025.
-
Understanding Expectations for a Robotic Guide Dog for Visually Impaired People
Authors:
J. Taery Kim,
Morgan Byrd,
Jack L. Crandell,
Bruce N. Walker,
Greg Turk,
Sehoon Ha
Abstract:
Robotic guide dogs hold significant potential to enhance the autonomy and mobility of blind or visually impaired (BVI) individuals by offering universal assistance over unstructured terrains at affordable costs. However, the design of robotic guide dogs remains underexplored, particularly in systematic aspects such as gait controllers, navigation behaviors, interaction methods, and verbal explanations. Our study addresses this gap by conducting user studies with 18 BVI participants, comprising 15 cane users and three guide dog users. Participants interacted with a quadrupedal robot and provided both quantitative and qualitative feedback. Our study revealed several design implications, such as a preference for a learning-based controller and a rigid handle, gradual turns with asymmetric speeds, semantic communication methods, and explainability. The study also highlighted the importance of customization to support users with diverse backgrounds and preferences, along with practical concerns such as battery life, maintenance, and weather issues. These findings offer valuable insights and design implications for future research and development of robotic guide dogs.
Submitted 8 January, 2025;
originally announced January 2025.
-
Modeling social interaction dynamics using temporal graph networks
Authors:
J. Taery Kim,
Archit Naik,
Isuru Jayarathne,
Sehoon Ha,
Jouh Yeong Chew
Abstract:
Integrating intelligent systems, such as robots, into dynamic group settings poses challenges due to the mutual influence of human behaviors and internal states. A robust representation of social interaction dynamics is essential for effective human-robot collaboration. Existing approaches often narrow their focus to facial expressions or speech, overlooking the broader context. We propose employing an adapted Temporal Graph Network to comprehensively represent social interaction dynamics while enabling its practical implementation. Our method incorporates temporal multi-modal behavioral data including gaze interaction, voice activity, and environmental context. This representation of social interaction dynamics is trained as a link prediction problem using annotated gaze interaction data. The F1-score outperformed the baseline model by 37.0%. This improvement is consistent for a secondary task of next speaker prediction, which achieves an improvement of 29.0%. Our contributions are two-fold. First, we provide a model for representing social interaction dynamics that can be used for many downstream human-robot interaction tasks, such as human state inference and next speaker prediction. More importantly, this is achieved using a more concise yet efficient message passing method, significantly reducing the message from 768 to 14 elements, while outperforming the baseline model.
Submitted 5 April, 2024;
originally announced April 2024.
-
Transforming a Quadruped into a Guide Robot for the Visually Impaired: Formalizing Wayfinding, Interaction Modeling, and Safety Mechanism
Authors:
J. Taery Kim,
Wenhao Yu,
Yash Kothari,
Jie Tan,
Greg Turk,
Sehoon Ha
Abstract:
This paper explores the principles for transforming a quadrupedal robot into a guide robot for individuals with visual impairments. A guide robot has great potential to resolve the limited availability of guide animals that are accessible to only two to three percent of the potential blind or visually impaired (BVI) users. To build a successful guide robot, our paper explores three key topics: (1) formalizing the navigation mechanism of a guide dog and a human, (2) developing a data-driven model of their interaction, and (3) improving user safety. First, we formalize the wayfinding task of the human-guide robot team using Markov Decision Processes based on the literature and interviews. Then we collect real human-robot interaction data from three visually impaired and six sighted people and develop an interaction model called the "Delayed Harness" to effectively simulate the navigation behaviors of the team. Additionally, we introduce an action shielding mechanism to enhance user safety by predicting and filtering out dangerous actions. We evaluate the developed interaction model and the safety mechanism in simulation, which greatly reduce the prediction errors and the number of collisions, respectively. We also demonstrate the integrated system on a quadrupedal robot with a rigid harness, by guiding users over 100+ m trajectories.
Submitted 24 June, 2023;
originally announced June 2023.
-
Spine-like Joint Link Mechanism to Design Wearable Assistive Devices with Comfort and Support
Authors:
Jungyeong Kim,
Jungsan Cho,
Jinhyeon Kim,
Jin Tak Kim,
Sangchul Han,
Sangshin Park,
Han Ul Yoon
Abstract:
When developing wearable assistive devices, comfort and support are the two main issues that need to be considered. In conventional design approaches, the degrees of freedom of the wearer's joint movement tend to be oversimplified. Accordingly, the wearer's motion becomes restrained, and bone or ligament injuries might occur in case of an unexpected fall. To mitigate these issues, this letter proposes a novel joint link mechanism inspired by the structure and functionality of the human spine. The key feature of the proposed spine-like joint link mechanism is that hemispherical blocks are concatenated via flexible synthetic fiber lines so that their concatenation stiffness can be adjusted according to a tensile force. This feature has great potential for designing wearable assistive devices that can support aged people's sit-to-stand action or augment spinal motion by regulating the concatenation stiffness. In addition, the concatenated hemispherical blocks enable the wearer to move his/her joint with the full degrees of freedom, which in turn increases the wearer's mobility and prevents joint misalignment. The experimental results with a testbed and a pilot wearer substantiated that the spine-like joint link mechanism can serve as a key component for designing wearable assistive devices with better mobility and safety.
Submitted 11 March, 2022; v1 submitted 27 November, 2021;
originally announced November 2021.
-
Learning Robot Structure and Motion Embeddings using Graph Neural Networks
Authors:
J. Taery Kim,
Jeongeun Park,
Sungjoon Choi,
Sehoon Ha
Abstract:
We propose a learning framework to find the representation of a robot's kinematic structure and motion embedding spaces using graph neural networks (GNN). Finding a compact and low-dimensional embedding space for complex phenomena is key to understanding their behaviors, which may lead to better learning performance, as observed in other domains such as images or languages. However, although numerous robotics applications deal with various types of data, the embedding of the generated data has been relatively less studied by roboticists. To this end, our work aims to learn embeddings for two types of robotic data: the robot's design structure, such as links, joints, and their relationships, and the motion data, such as kinematic joint positions. Our method exploits the tree structure of the robot to train appropriate embeddings for the given robot data. To avoid overfitting, we formulate multi-task learning to find a general representation of the embedding spaces. We evaluate the proposed learning method on a robot with a simple linear structure and visualize the learned embeddings using t-SNE. We also study a few design choices of the learning framework, such as network architectures and message passing schemes.
Submitted 15 September, 2021;
originally announced September 2021.
-
Distilling Wikipedia mathematical knowledge into neural network models
Authors:
Joanne T. Kim,
Mikel Landajuela,
Brenden K. Petersen
Abstract:
Machine learning applications to symbolic mathematics are becoming increasingly popular, yet the field lacks a centralized source of real-world symbolic expressions to be used as training data. In contrast, the field of natural language processing leverages resources like Wikipedia that provide enormous amounts of real-world textual data. Adopting the philosophy of "mathematics as language," we bridge this gap by introducing a pipeline for distilling mathematical expressions embedded in Wikipedia into symbolic encodings to be used in downstream machine learning tasks. We demonstrate that a mathematical language model trained on this "corpus" of expressions can be used as a prior to improve the performance of neural-guided search for the task of symbolic regression.
Submitted 13 April, 2021;
originally announced April 2021.
-
Observation Space Matters: Benchmark and Optimization Algorithm
Authors:
Joanne Taery Kim,
Sehoon Ha
Abstract:
Recent advances in deep reinforcement learning (deep RL) enable researchers to solve challenging control problems, from simulated environments to real-world robotic tasks. However, deep RL algorithms are known to be sensitive to the problem formulation, including observation spaces, action spaces, and reward functions. There exist numerous choices for observation spaces but they are often designed solely based on prior knowledge due to the lack of established principles. In this work, we conduct benchmark experiments to verify common design choices for observation spaces, such as Cartesian transformation, binary contact flags, a short history, or global positions. Then we propose a search algorithm to find the optimal observation spaces, which examines various candidate observation spaces and removes unnecessary observation channels with a Dropout-Permutation test. We demonstrate that our algorithm significantly improves learning speed compared to manually designed observation spaces. We also analyze the proposed algorithm by evaluating different hyperparameters.
Submitted 2 November, 2020;
originally announced November 2020.
-
A Strong XOR Lemma for Randomized Query Complexity
Authors:
Joshua Brody,
Jae Tak Kim,
Peem Lerdputtipongporn,
Hariharan Srinivasulu
Abstract:
We give a strong direct sum theorem for computing $xor \circ g$. Specifically, we show that for every function $g$ and every $k \geq 2$, the randomized query complexity of computing the xor of $k$ instances of $g$ satisfies $\overline{R}_{\varepsilon}(xor \circ g) = \Theta(k \overline{R}_{\varepsilon/k}(g))$. This matches the naive success amplification upper bound and answers a conjecture of Blais and Brody (CCC19).
As a consequence of our strong direct sum theorem, we give a total function $g$ for which $R(xor \circ g) = \Theta(k \log(k) \cdot R(g))$, answering an open question from Ben-David et al. (arXiv:2006.10957v1).
Submitted 17 July, 2020; v1 submitted 10 July, 2020;
originally announced July 2020.
-
Cars Can't Fly up in the Sky: Improving Urban-Scene Segmentation via Height-driven Attention Networks
Authors:
Sungha Choi,
Joanne T. Kim,
Jaegul Choo
Abstract:
This paper exploits the intrinsic features of urban-scene images and proposes a general add-on module, called height-driven attention networks (HANet), for improving semantic segmentation for urban-scene images. It emphasizes informative features or classes selectively according to the vertical position of a pixel. The pixel-wise class distributions are significantly different from each other among horizontally segmented sections in the urban-scene images. Likewise, urban-scene images have their own distinct characteristics, but most semantic segmentation networks do not reflect such unique attributes in the architecture. The proposed network architecture incorporates the capability to exploit these attributes to handle the urban-scene dataset effectively. We validate the consistent performance (mIoU) increase of various semantic segmentation models on two datasets when HANet is adopted. This extensive quantitative analysis demonstrates that adding our module to existing models is easy and cost-effective. Our method achieves a new state-of-the-art performance on the Cityscapes benchmark by a large margin among ResNet-101 based segmentation models. Also, we show that the proposed model is coherent with the facts observed in the urban scene by visualizing and interpreting the attention map. Our code and trained models are publicly available at https://github.com/shachoi/HANet
Submitted 6 April, 2020; v1 submitted 11 March, 2020;
originally announced March 2020.
-
Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients
Authors:
Brenden K. Petersen,
Mikel Landajuela,
T. Nathan Mundhenk,
Claudio P. Santiago,
Soo K. Kim,
Joanne T. Kim
Abstract:
Discovering the underlying mathematical expressions describing a dataset is a core challenge for artificial intelligence. This is the problem of symbolic regression. Despite recent advances in training neural networks to solve complex tasks, deep learning approaches to symbolic regression are underexplored. We propose a framework that leverages deep learning for symbolic regression via a simple idea: use a large model to search the space of small models. Specifically, we use a recurrent neural network to emit a distribution over tractable mathematical expressions and employ a novel risk-seeking policy gradient to train the network to generate better-fitting expressions. Our algorithm outperforms several baseline methods (including Eureqa, the gold standard for symbolic regression) in its ability to exactly recover symbolic expressions on a series of benchmark problems, both with and without added noise. More broadly, our contributions include a framework that can be applied to optimize hierarchical, variable-length objects under a black-box performance metric, with the ability to incorporate constraints in situ, and a risk-seeking policy gradient formulation that optimizes for best-case performance instead of expected performance.
Submitted 5 April, 2021; v1 submitted 10 December, 2019;
originally announced December 2019.
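The abstract above describes a risk-seeking policy gradient that optimizes for best-case rather than expected performance. The following is only a minimal sketch of that general idea, not the authors' implementation: it assumes the common formulation in which a batch of sampled rewards is filtered by its empirical (1 - epsilon) quantile, and that quantile doubles as the baseline. The function name `risk_seeking_weights`, the default `epsilon`, and the quantile indexing rule are all hypothetical choices made for illustration; normalization of the gradient estimate is omitted.

```python
import math


def risk_seeking_weights(rewards, epsilon=0.05):
    """Return (threshold, weights) for a batch of sampled rewards.

    Assumption for illustration: samples below the empirical
    (1 - epsilon) quantile get weight 0, and the rest are weighted by
    their margin over that quantile. In a policy-gradient update each
    weight would multiply the gradient of that sample's log-probability.
    """
    if not rewards:
        raise ValueError("rewards must be non-empty")
    if not 0.0 < epsilon <= 1.0:
        raise ValueError("epsilon must be in (0, 1]")

    ranked = sorted(rewards)
    # Index of the empirical (1 - epsilon) quantile, clamped to the batch.
    k = min(len(ranked) - 1, math.floor((1.0 - epsilon) * len(ranked)))
    threshold = ranked[k]

    # Only the top-performing samples contribute to the update.
    weights = [(r - threshold) if r >= threshold else 0.0 for r in rewards]
    return threshold, weights


if __name__ == "__main__":
    batch = [0.1, 0.9, 0.5, 0.7]
    thr, w = risk_seeking_weights(batch, epsilon=0.5)
    print(thr, w)
```

With a small `epsilon`, only the best few expressions in each batch influence training, which is one way to bias the search toward best-case performance.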
-
RetainVis: Visual Analytics with Interpretable and Interactive Recurrent Neural Networks on Electronic Medical Records
Authors:
Bum Chul Kwon,
Min-Je Choi,
Joanne Taery Kim,
Edward Choi,
Young Bin Kim,
Soonwook Kwon,
Jimeng Sun,
Jaegul Choo
Abstract:
We have recently seen many successful applications of recurrent neural networks (RNNs) on electronic medical records (EMRs), which contain histories of patients' diagnoses, medications, and other various events, in order to predict the current and future states of patients. Despite the strong performance of RNNs, it is often challenging for users to understand why the model makes a particular prediction. This black-box nature of RNNs can impede their wide adoption in clinical practice. Furthermore, we have no established methods to interactively leverage users' domain expertise and prior knowledge as inputs for steering the model. Therefore, our design study aims to provide a visual analytics solution to increase interpretability and interactivity of RNNs via a joint effort of medical experts, artificial intelligence scientists, and visual analytics researchers. Following the iterative design process between the experts, we design, implement, and evaluate a visual analytics tool called RetainVis, which couples a newly improved, interpretable, and interactive RNN-based model called RetainEX with visualizations for users' exploration of EMR data in the context of prediction tasks. Our study shows the effective use of RetainVis for gaining insights into how individual medical codes contribute to making risk predictions, using EMRs of patients with heart failure and cataract symptoms. Our study also demonstrates how we made substantial changes to the state-of-the-art RNN model called RETAIN in order to make use of temporal information and increase interactivity. This study will provide a useful guideline for researchers who aim to design an interpretable and interactive visual analytics tool for RNNs.
Submitted 23 October, 2018; v1 submitted 27 May, 2018;
originally announced May 2018.
-
Performance Analysis of License Assisted Access LTE with Asymmetric Hidden Terminals
Authors:
H. R. Lee,
H. Kim,
H. J. Yang,
J. T. Kim,
S. K. Baek
Abstract:
License Assisted Access (LAA) LTE (LTE-LAA) is a new type of LTE that aggregates the licensed LTE bands with the unlicensed bands via carrier aggregation. To operate in unlicensed bands, LTE-LAA adopts the listen-before-talk policy and designs its channel access mechanism similar to WLAN's DCF. In this paper, we consider an LAA network consisting of an LTE-LAA eNB coexisting with a Wi-Fi STA, and capture the asymmetric hidden terminal problem, where the eNB recognizes the STA while the opposite is not true, which is caused by the asymmetric CCA thresholds between them. We model the network as a joint Markov chain (MC) consisting of two individual MCs, and derive its steady-state probabilities, throughput, and channel access delay along with other key metrics like transmit, busy, collision, and doubling probabilities. Through extensive evaluation, we confirm that the proposed model predicts the dynamics of the LAA network well, and identify important design guidelines for fair coexistence between LTE-LAA and WLAN as follows. First, LTE-LAA should design its contention window (CW) doubling policy by considering Wi-Fi's packet duration and subframe-dependent collision probabilities. Second, there exists a tradeoff between throughput and channel access delay, according to which the CW doubling policy should be adapted.
Submitted 13 December, 2016;
originally announced December 2016.
-
Fractional Dynamical Behavior in Quantum Brownian Motion
Authors:
Kyungsik Kim,
Y. S. Kong,
M. K. Yum,
J. T. Kim
Abstract:
The dynamical behavior for a quantum Brownian particle is investigated under a random potential of the fractional iterative map on a one-dimensional lattice. For our case, the quantum expectation values can be obtained numerically from the wave function of the fractional Schrödinger equation. Particularly, the square of mean displacement which is ensemble-averaged over our configuration is found to be proportional approximately to $t^{\delta}$ in the long time limit, where $\delta = 0.96 \pm 0.02$. The power-law behavior with scaling exponents $\epsilon = 0.98 \pm 0.02$ and $\theta = 0.51 \pm 0.01$ is estimated for $\overline{\langle p(t) \rangle^2}$ and $\overline{\langle f(t) \rangle^2}$, and the result presented is compared with other numerical calculations.
Submitted 28 March, 2002;
originally announced March 2002.
-
Absence of non-linear Meissner effect in YBa2Cu3O6.95
Authors:
A. Carrington,
R. W. Giannetta,
J. T. Kim,
J. Giapintzakis
Abstract:
We present measurements of the field and temperature dependence of the penetration depth (lambda) in high purity, untwinned single crystals of YBa2Cu3O6.95 in all three crystallographic directions. The temperature dependence of lambda is linear down to low temperatures, showing that our crystals are extremely clean. Both the magnitude and temperature dependence of the field dependent correction to lambda, however, are considerably different from those predicted by the theory of the non-linear Meissner effect for a d-wave superconductor (Yip-Sauls theory). Our results suggest that the Yip-Sauls effect is either absent or is unobservably small in the Meissner state of YBa2Cu3O6.95.
Submitted 16 December, 1998;
originally announced December 1998.