Search | arXiv e-print repository

arXiv:2411.19906 [pdf, ps, other]

Classical and Quantum Algorithms for the Deterministic L-system Inductive Inference Problem

Authors: Ali Lotfi, Ian McQuillan, Steven Rayan

Abstract: L-systems can be made to model and create simulations of many biological processes, such as plant development. Finding an L-system for a given process is typically solved by hand, by experts, in a massively time-consuming process. It would be significant if this could be done automatically from data, such as from sequences of images. In this paper, we are interested in inferring a particular type… ▽ More L-systems can be made to model and create simulations of many biological processes, such as plant development. Finding an L-system for a given process is typically solved by hand, by experts, in a massively time-consuming process. It would be significant if this could be done automatically from data, such as from sequences of images. In this paper, we are interested in inferring a particular type of L-system, deterministic context-free L-system (D0L-system) from a sequence of strings. We introduce the characteristic graph of a sequence of strings, which we then utilize to translate our problem (inferring D0L-system) in polynomial time into the maximum independent set problem (MIS) and the SAT problem. After that, we offer a classical exact algorithm and an approximate quantum algorithm for the problem. △ Less

Submitted 30 December, 2024; v1 submitted 29 November, 2024; originally announced November 2024.

Comments: 18 pages, 1 figure

arXiv:2409.02259 [pdf, ps, other]

Optimal L-Systems for Stochastic L-system Inference Problems

Authors: Ali Lotfi, Ian McQuillan

Abstract: This paper presents two novel theorems that address two open problems in stochastic Lindenmayer-system (L-system) inference, specifically focusing on the construction of an optimal stochastic L-system capable of generating a given sequence of strings. The first theorem delineates a method for crafting a stochastic L-system that has the maximum probability of a derivation producing a given sequence… ▽ More This paper presents two novel theorems that address two open problems in stochastic Lindenmayer-system (L-system) inference, specifically focusing on the construction of an optimal stochastic L-system capable of generating a given sequence of strings. The first theorem delineates a method for crafting a stochastic L-system that has the maximum probability of a derivation producing a given sequence of words through a single derivation (noting that multiple derivations may generate the same sequence). Furthermore, the second theorem determines the stochastic L-systems with the highest probability of producing a given sequence of words with multiple possible derivations. From these, we introduce an algorithm to infer an optimal stochastic L-system from a given sequence. This algorithm incorporates advanced optimization techniques, such as interior point methods, to ensure the creation of a stochastic L-system that maximizes the probability of generating the given sequence (allowing for multiple derivations). This allows for the use of stochastic L-systems as a model for machine learning using only positive data for training. △ Less

Submitted 28 December, 2024; v1 submitted 3 September, 2024; originally announced September 2024.

Comments: 15 pages

arXiv:2405.08988 [pdf, other]

Techniques for Showing the Decidability of the Boundedness Problem of Language Acceptors

Authors: Oscar H. Ibarra, Ian McQuillan

Abstract: There are many types of automata and grammar models that have been studied in the literature, and for these models, it is common to determine whether certain problems are decidable. One problem that has been difficult to answer throughout the history of automata and formal language theory is to decide whether a given system $M$ accepts a bounded language (whether there exist words… ▽ More There are many types of automata and grammar models that have been studied in the literature, and for these models, it is common to determine whether certain problems are decidable. One problem that has been difficult to answer throughout the history of automata and formal language theory is to decide whether a given system $M$ accepts a bounded language (whether there exist words $w_1, \ldots,w_k$ such that $L(M) \subseteq w_1 \cdots w_k$?). Decidability of this problem has gone unanswered for the majority of automata/grammar models in the literature. Boundedness was only known to be decidable for regular and context-free languages until recently when it was shown to also be decidable for finite-automata and pushdown automata augmented with reversal-bounded counters, and for vector addition systems with states. In this paper, we develop new techniques to show that the boundedness problem is decidable for larger classes of one-way nondeterministic automata and grammar models, by reducing the problem to the decidability of boundedness for simpler classes of automata. One technique involves characterizing the models in terms of multi-tape automata. We give new characterizations of finite-turn Turing machines, finite-turn Turing machines augmented with various storage structures (like a pushdown, multiple reversal-bounded counters, partially-blind counters, etc.), and simple matrix grammars. The characterizations are then used to show that the boundedness problem for these models is decidable. Another technique uses the concept of the store language of an automaton. This is used to show that the boundedness problem is decidable for pushdown automata that can "flip" their pushdown a bounded number of times, and boundedness remains decidable even if we augment this device with additional stores. △ Less

Submitted 16 May, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

Comments: 23 pages,2 figures

ACM Class: F.4.2

arXiv:2404.05128 [pdf, other]

Importance of realism in procedurally-generated synthetic images for deep learning: case studies in maize and canola

Authors: Nazifa Azam Khan, Mikolaj Cieslak, Ian McQuillan

Abstract: Artificial neural networks are often used to identify features of crop plants. However, training their models requires many annotated images, which can be expensive and time-consuming to acquire. Procedural models of plants, such as those developed with Lindenmayer-systems (L-systems) can be created to produce visually realistic simulations, and hence images of plant simulations, where annotations… ▽ More Artificial neural networks are often used to identify features of crop plants. However, training their models requires many annotated images, which can be expensive and time-consuming to acquire. Procedural models of plants, such as those developed with Lindenmayer-systems (L-systems) can be created to produce visually realistic simulations, and hence images of plant simulations, where annotations are implicitly known. These synthetic images can either augment or completely replace real images in training neural networks for phenotyping tasks. In this paper, we systematically vary amounts of real and synthetic images used for training in both maize and canola to better understand situations where synthetic images generated from L-systems can help prediction on real images. This work also explores the degree to which realism in the synthetic images improves prediction. We have five different variants of a procedural canola model (these variants were created by tuning the realism while using calibration), and the deep learning results showed how drastically these results improve as the canola synthetic images are made to be more realistic. Furthermore, we see how neural network predictions can be used to help calibrate L-systems themselves, creating a feedback loop. △ Less

Submitted 15 May, 2024; v1 submitted 7 April, 2024; originally announced April 2024.

arXiv:2301.10198 [pdf, other]

Unboundedness problems for machines with reversal-bounded counters

Authors: Pascal Baumann, Flavio D'Alessandro, Moses Ganardi, Oscar Ibarra, Ian McQuillan, Lia Schütze, Georg Zetzsche

Abstract: We consider a general class of decision problems concerning formal languages, called ``(one-dimensional) unboundedness predicates'', for automata that feature reversal-bounded counters (RBCA). We show that each problem in this class reduces -- non-deterministically in polynomial time -- to the same problem for just finite automata. We also show an analogous reduction for automata that have access… ▽ More We consider a general class of decision problems concerning formal languages, called ``(one-dimensional) unboundedness predicates'', for automata that feature reversal-bounded counters (RBCA). We show that each problem in this class reduces -- non-deterministically in polynomial time -- to the same problem for just finite automata. We also show an analogous reduction for automata that have access to both a pushdown stack and reversal-bounded counters (PRBCA). This allows us to answer several open questions: For example, we show that it is coNP-complete to decide whether a given (P)RBCA language $L$ is bounded, meaning whether there exist words $w_1,\ldots,w_n$ with $L\subseteq w_1^*\cdots w_n^*$. For PRBCA, even decidability was open. Our methods also show that there is no language of a (P)RBCA of intermediate growth. This means, the number of words of each length grows either polynomially or exponentially. Part of our proof is likely of independent interest: We show that one can translate an RBCA into a machine with $\mathbb{Z}$-counters in logarithmic space, while preserving the accepted language. △ Less

Submitted 24 January, 2023; originally announced January 2023.

arXiv:2212.03992 [pdf, ps, other]

doi 10.1016/j.tcs.2019.06.024

State Grammars with Stores

Authors: Oscar H. Ibarra, Ian McQuillan

Abstract: State grammars are context-free grammars where the productions have states associated with them, and a production can only be applied to a nonterminal if the current state matches the state in the production. Once states are added to grammars, it is natural to add various stores, similar to machine models. With such extensions, productions can only be applied if both the state and the value read f… ▽ More State grammars are context-free grammars where the productions have states associated with them, and a production can only be applied to a nonterminal if the current state matches the state in the production. Once states are added to grammars, it is natural to add various stores, similar to machine models. With such extensions, productions can only be applied if both the state and the value read from each store matches between the current sentential form and the production. Here, generative capacity results are presented for different derivation modes, with and without additional stores. In particular, with the standard derivation relation, it is shown that adding reversal-bounded counters does not increase the capacity, and states are enough. Also, state grammars with reversal-bounded counters that operate using leftmost derivations are shown to coincide with languages accepted by one-way machines with a pushdown and reversal-bounded counters, and these are surprisingly shown to be strictly weaker than state grammars with the standard derivation relation (and no counters). The complexity of the emptiness problem involving state grammars with reversal-bounded counters is also studied. △ Less

Submitted 7 December, 2022; originally announced December 2022.

Comments: 21 pages

ACM Class: F.4.2

Journal ref: Theoretical Computer Science 798, 23-39 (2019)

arXiv:2212.03791 [pdf, other]

doi 10.1016/j.tcs.2019.09.045

On Families of Full Trios Containing Counter Machine Languages

Authors: Oscar H. Ibarra, Ian McQuillan

Abstract: We look at nondeterministic finite automata augmented with multiple reversal-bounded counters where, during an accepting computation, the behavior of the counters is specified by some fixed pattern. These patterns can serve as a useful "bridge" to other important automata and grammar models in the theoretical computer science literature, thereby helping in their study. Various pattern behaviors ar… ▽ More We look at nondeterministic finite automata augmented with multiple reversal-bounded counters where, during an accepting computation, the behavior of the counters is specified by some fixed pattern. These patterns can serve as a useful "bridge" to other important automata and grammar models in the theoretical computer science literature, thereby helping in their study. Various pattern behaviors are considered, together with characterizations and comparisons. For example, one such pattern defines exactly the smallest full trio containing all the bounded semilinear languages. Another pattern defines the smallest full trio containing all the bounded context-free languages. The "bridging" to other families is then applied, e.g. to certain Turing machine restrictions, as well as other families. Certain general decidability properties are also studied using this framework. △ Less

Submitted 7 December, 2022; originally announced December 2022.

Comments: 28 pages, 1 figure

ACM Class: F.4.3

Journal ref: Theoretical Computer Science 799, 71--93 (2019)

arXiv:2212.03359 [pdf, other]

doi 10.1016/j.tcs.2019.09.045

Relationships Between Bounded Languages, Counter Machines, Finite-Index Grammars, Ambiguity, and Commutative Regularity

Authors: Arturo Carpi, Flavio D'Alessandro, Oscar H. Ibarra, Ian McQuillan

Abstract: It is shown that for every language family that is a trio containing only semilinear languages, all bounded languages in it can be accepted by one-way deterministic reversal-bounded multicounter machines (DCM). This implies that for every semilinear trio (where these properties are effective), it is possible to decide containment, equivalence, and disjointness concerning its bounded languages. A c… ▽ More It is shown that for every language family that is a trio containing only semilinear languages, all bounded languages in it can be accepted by one-way deterministic reversal-bounded multicounter machines (DCM). This implies that for every semilinear trio (where these properties are effective), it is possible to decide containment, equivalence, and disjointness concerning its bounded languages. A condition is also provided for when the bounded languages in a semilinear trio coincide exactly with those accepted by DCM machines, and it is used to show that many grammar systems of finite index -- such as finite-index matrix grammars and finite-index ETOL -- have identical bounded languages as DCM. Then connections between ambiguity, counting regularity, and commutative regularity are made, as many machines and grammars that are unambiguous can only generate/accept counting regular or commutatively regular languages. Thus, such a system that can generate/accept a non-counting regular or non-commutatively regular language implies the existence of inherently ambiguous languages over that system. In addition, it is shown that every language generated by an unambiguous finite-index matrix grammar has a rational characteristic series in commutative variables, and is counting regular. This result plus the connections are used to demonstrate that finite-index matrix grammars and finite-index ETOL can generate inherently ambiguous languages (over their grammars), as do several machine models. It is also shown that all bounded languages generated by these two grammar systems (those in any semilinear trio) can be generated unambiguously within the systems. Finally, conditions on languages generated by finite-index matrix grammars and finite-index ETOL implying commutative regularity are obtained. In particular, it is shown that every finite-index EDOL language is commutatively regular. △ Less

Submitted 6 December, 2022; originally announced December 2022.

Comments: 28 pages, 4 images

ACM Class: F.4.2

Journal ref: Theoretical Computer Science 862, 97--118 (2021)

arXiv:2212.01301 [pdf, ps, other]

doi 10.1142/S0129054120420095

Semilinearity of Families of Languages

Authors: Oscar H. Ibarra, Ian McQuillan

Abstract: Techniques are developed for creating new and general language families of only semilinear languages, and for showing families only contain semilinear languages. It is shown that for language families L that are semilinear full trios, the smallest full AFL containing L that is also closed under intersection with languages in NCM (where NCM is the family of languages accepted by NFAs augmented with… ▽ More Techniques are developed for creating new and general language families of only semilinear languages, and for showing families only contain semilinear languages. It is shown that for language families L that are semilinear full trios, the smallest full AFL containing L that is also closed under intersection with languages in NCM (where NCM is the family of languages accepted by NFAs augmented with reversal-bounded counters), is also semilinear. If these closure properties are effective, this also immediately implies decidability of membership, emptiness, and infiniteness for these general families. From the general techniques, new grammar systems are given that are extensions of well-known families of semilinear full trios, whereby it is implied that these extensions must only describe semilinear languages. This also implies positive decidability properties for the new systems. Some characterizations of the new families are also given. △ Less

Submitted 2 December, 2022; originally announced December 2022.

Comments: 20 pages

ACM Class: F.4.3

Journal ref: International Journal of Foundations of Computer Science, 31 (8), 1179-1198 (2020)

arXiv:2212.00897 [pdf, ps, other]

doi 10.1142/S0129054121410045

Generalizations of Checking Stack Automata: Characterizations and Hierarchies

Authors: Oscar H. Ibarra, Ian McQuillan

Abstract: We examine different generalizations of checking stack automata by allowing multiple input heads and multiple stacks, and characterize their computing power in terms of two-way multi-head finite automata and space-bounded Turing machines. For various models, we obtain hierarchies in terms of their computing power. Our characterizations and hierarchies expand or tighten some previously known result… ▽ More We examine different generalizations of checking stack automata by allowing multiple input heads and multiple stacks, and characterize their computing power in terms of two-way multi-head finite automata and space-bounded Turing machines. For various models, we obtain hierarchies in terms of their computing power. Our characterizations and hierarchies expand or tighten some previously known results. We also discuss some decidability questions and the space/time complexity of the models. △ Less

Submitted 1 December, 2022; originally announced December 2022.

Comments: 28 pages

ACM Class: F.4.3

Journal ref: International Journal of Foundations of Computer Science, 32 (5), 481-508, (2021)

arXiv:2212.00891 [pdf, ps, other]

doi 10.1142/S0129054121420090

Space Complexity of Stack Automata Models

Authors: Oscar H. Ibarra, Jozef Jirásek, Ian McQuillan, Luca Prigioniero

Abstract: This paper examines several measures of space complexity of variants of stack automata: non-erasing stack automata and checking stack automata. These measures capture the minimum stack size required to accept every word in the language of the automaton (weak measure), the maximum stack size used in any accepting computation on any accepted word (accept measure),and the maximum stack size used in a… ▽ More This paper examines several measures of space complexity of variants of stack automata: non-erasing stack automata and checking stack automata. These measures capture the minimum stack size required to accept every word in the language of the automaton (weak measure), the maximum stack size used in any accepting computation on any accepted word (accept measure),and the maximum stack size used in any computation (strong measure). We give a detailed characterization of the accept and strong space complexity measures for checking stack automata. Exactly one of three cases can occur: the complexity is either bounded by a constant, behaves like a linear function, or it can not be bounded by any function of the length of the input word (and it is decidable which case occurs). However, this result does not hold for non-erasing stack automata; we provide an example where the space complexity grows proportionally to the square root of the length of the input. Furthermore, we study the complexity bounds of machines which accept a given language, and decidability of space complexity properties. △ Less

Submitted 1 December, 2022; originally announced December 2022.

Comments: 23 pages, 1 figure, 2 tables

ACM Class: F.4.3

Journal ref: International Journal of Foundations of Computer Science, 32 (6), 801--823 (2021)

arXiv:2010.00523 [pdf, other]

doi 10.1016/j.ic.2019.03.003

On Store Languages and Applications

Authors: Oscar H. Ibarra, Ian McQuillan

Abstract: The store language of a machine of some arbitrary type is the set of all store configurations (state plus store contents but not the input) that can appear in an accepting computation. New algorithms and characterizations of store languages are obtained, such as the result that any nondeterministic pushdown automaton augmented with reversal-bounded counters, where the pushdown can "flip" its conte… ▽ More The store language of a machine of some arbitrary type is the set of all store configurations (state plus store contents but not the input) that can appear in an accepting computation. New algorithms and characterizations of store languages are obtained, such as the result that any nondeterministic pushdown automaton augmented with reversal-bounded counters, where the pushdown can "flip" its contents up to a bounded number of times, can be accepted by a machine with only reversal-bounded counters. Then, connections are made between store languages and several model checking and reachability problems, such as accepting the set of all predecessor and successor configurations from a given set of configurations, and determining whether there are at least one, or infinitely many, common configurations between accepting computations of two machines. These are explored for a variety of different machine models often containing multiple parallel data stores. Many of the machine models studied can accept the set of predecessor configurations (of a regular set of configurations), the set of successor configurations, and the set of common configurations between two machines, with a machine model that is simpler than itself, with a decidable emptiness, infiniteness, and disjointness property. Store languages are key to showing these properties. △ Less

Submitted 1 October, 2020; originally announced October 2020.

Journal ref: Information and Computation, 267, 28-48, 2019

arXiv:2005.09022 [pdf, other]

A Novel Technique Combining Image Processing, Plant Development Properties, and the Hungarian Algorithm, to Improve Leaf Detection in Maize

Authors: Nazifa Khan, Oliver A. S. Lyon, Mark Eramian, Ian McQuillan

Abstract: Manual determination of plant phenotypic properties such as plant architecture, growth, and health is very time consuming and sometimes destructive. Automatic image analysis has become a popular approach. This research aims to identify the position (and number) of leaves from a temporal sequence of high-quality indoor images consisting of multiple views, focussing in particular of images of maize.… ▽ More Manual determination of plant phenotypic properties such as plant architecture, growth, and health is very time consuming and sometimes destructive. Automatic image analysis has become a popular approach. This research aims to identify the position (and number) of leaves from a temporal sequence of high-quality indoor images consisting of multiple views, focussing in particular of images of maize. The procedure used a segmentation on the images, using the convex hull to pick the best view at each time step, followed by a skeletonization of the corresponding image. To remove skeleton spurs, a discrete skeleton evolution pruning process was applied. Pre-existing statistics regarding maize development was incorporated to help differentiate between true leaves and false leaves. Furthermore, for each time step, leaves were matched to those of the previous and next three days using the graph-theoretic Hungarian algorithm. This matching algorithm can be used to both remove false positives, and also to predict true leaves, even if they were completely occluded from the image itself. The algorithm was evaluated using an open dataset consisting of 13 maize plants across 27 days from two different views. The total number of true leaves from the dataset was 1843, and our proposed techniques detect a total of 1690 leaves including 1674 true leaves, and only 16 false leaves, giving a recall of 90.8%, and a precision of 99.0%. △ Less

Submitted 18 May, 2020; originally announced May 2020.

Comments: to be published in the IEEE CVPR 2020 Workshop Proceedings, and accepted by The 1st International Workshop and Prize Challenge on Agriculture-Vision: Challenges & Opportunities for Computer Vision in Agriculture in conjunction with IEEE/CVF CVPR 2020

arXiv:2004.01819 [pdf]

doi 10.1109/ICHI.2018.00057

DNA Methylation Data to Predict Suicidal and Non-Suicidal Deaths: A Machine Learning Approach

Authors: Rifat Zahan, Ian McQuillan, Nathaniel D. Osgood

Abstract: The objective of this study is to predict suicidal and non-suicidal deaths from DNA methylation data using a modern machine learning algorithm. We used support vector machines to classify existing secondary data consisting of normalized values of methylated DNA probe intensities from tissues of two cortical brain regions to distinguish suicide cases from control cases. Before classification, we em… ▽ More The objective of this study is to predict suicidal and non-suicidal deaths from DNA methylation data using a modern machine learning algorithm. We used support vector machines to classify existing secondary data consisting of normalized values of methylated DNA probe intensities from tissues of two cortical brain regions to distinguish suicide cases from control cases. Before classification, we employed Principal component analysis (PCA) and t-distributed Stochastic Neighbor Embedding (t-SNE) to reduce the dimension of the data. In comparison to PCA, the modern data visualization method t-SNE performs better in dimensionality reduction. t-SNE accounts for the possible non-linear patterns in low-dimensional data. We applied four-fold cross-validation in which the resulting output from t-SNE was used as training data for the Support Vector Machine (SVM). Despite the use of cross-validation, the nominally perfect prediction of suicidal deaths for BA11 data suggests possible over-fitting of the model. The study also may have suffered from 'spectrum bias' since the individuals were only studied from two extreme scenarios. This research constitutes a baseline study for classifying suicidal and non-suicidal deaths from DNA methylation data. Future studies with larger sample size, while possibly incorporating methylation data from living individuals, may reduce the bias and improve the accuracy of the results. △ Less

Submitted 3 April, 2020; originally announced April 2020.

Journal ref: In 2018 IEEE International Conference on Healthcare Informatics (ICHI) (pp. 363-365). IEEE (2018, June)

arXiv:2001.10922 [pdf, other]

Stochastic L-system Inference from Multiple String Sequence Inputs

Authors: Jason Bernard, Ian McQuillan

Abstract: Lindenmayer systems (L-systems) are a grammar system that consist of string rewriting rules. The rules replace every symbol in a string in parallel with a successor to produce the next string, and this procedure iterates. In a stochastic context-free L-system (S0L-system), every symbol may have one or more rewriting rule, each with an associated probability of selection. Properly constructed rewri… ▽ More Lindenmayer systems (L-systems) are a grammar system that consist of string rewriting rules. The rules replace every symbol in a string in parallel with a successor to produce the next string, and this procedure iterates. In a stochastic context-free L-system (S0L-system), every symbol may have one or more rewriting rule, each with an associated probability of selection. Properly constructed rewriting rules have been found to be useful for modeling and simulating some natural and human engineered processes where each derived string describes a step in the simulation. Typically, processes are modeled by experts who meticulously construct the rules based on measurements or domain knowledge of the process. This paper presents an automated approach to finding stochastic L-systems, given a set of string sequences as input. The implemented tool is called the Plant Model Inference Tool for S0L-systems (PMIT-S0L). PMIT-S0L is evaluated using 960 procedurally generated S0L-systems in a test suite, which are each used to generate input strings, and PMIT-S0L is then used to infer the system from only the sequences. The evaluation shows that PMIT-S0L infers S0L-systems with up to 9 rewriting rules each in under 12 hours. Additionally, it is found that 3 sequences of strings is sufficient to find the correct original rewriting rules in 100% of the cases in the test suite, and 6 sequences of strings reduces the difference in the associated probabilities to approximately 1% or less. △ Less

Submitted 29 January, 2020; originally announced January 2020.

Comments: 24 pages, 5 figures, submitted to Applied Soft Computing

arXiv:1906.08860 [pdf, other]

doi 10.1016/j.swevo.2021.100893

Techniques for Inferring Context-Free Lindenmayer Systems With Genetic Algorithm

Authors: Jason Bernard, Ian McQuillan

Abstract: Lindenmayer systems (L-systems) are a formal grammar system, where the most notable feature is a set of rewriting rules that are used to replace every symbol in a string in parallel; by repeating this process, a sequence of strings is produced. Some symbols in the strings may be interpreted as instructions for simulation software. Thus, the sequence can be used to model the steps of a process. Cur… ▽ More Lindenmayer systems (L-systems) are a formal grammar system, where the most notable feature is a set of rewriting rules that are used to replace every symbol in a string in parallel; by repeating this process, a sequence of strings is produced. Some symbols in the strings may be interpreted as instructions for simulation software. Thus, the sequence can be used to model the steps of a process. Currently, creating an L-system for a specific process is done by hand by experts through much effort. The inductive inference problem attempts to infer an L-system from such a sequence of strings generated by an unknown system; this can be thought of as an intermediate step to inferring from a sequence of images. This paper evaluates and analyzes different genetic algorithm encoding schemes and mathematical properties for the L-system inductive inference problem. A new tool, the Plant Model Inference Tool for Context-Free L-systems (PMIT-D0L) is implemented based on these techniques. PMIT-D0L has been successfully evaluated on 28 known L-systems, with alphabets up to 31 symbols and a total sum of 281 symbols across the rewriting rules. PMIT-D0L can infer even the largest of these L-systems in less than a few seconds. △ Less

Submitted 29 April, 2021; v1 submitted 15 May, 2019; originally announced June 2019.

Comments: 28 pages, 3 figures

arXiv:1903.03518 [pdf, ps, other]

doi 10.1016/j.jcss.2018.02.003

Insertion Operations on Deterministic Reversal-Bounded Counter Machines

Authors: Joey Eremondi, Oscar H. Ibarra, Ian McQuillan

Abstract: Several insertion operations are studied applied to languages accepted by one-way and two-way deterministic reversal-bounded multicounter machines. These operations are defined by the ideals obtained from relations such as the prefix, infix, suffix, and outfix relations, as well as operations defined from inverses of a type of deterministic transducer with reversal-bounded counters attached. The q… ▽ More Several insertion operations are studied applied to languages accepted by one-way and two-way deterministic reversal-bounded multicounter machines. These operations are defined by the ideals obtained from relations such as the prefix, infix, suffix, and outfix relations, as well as operations defined from inverses of a type of deterministic transducer with reversal-bounded counters attached. The question of whether the resulting languages can always be accepted by deterministic machines with the same number (or larger number) of input-turns (resp., counters, counter-reversals, etc.) is investigated. △ Less

Submitted 8 March, 2019; originally announced March 2019.

arXiv:1903.03504 [pdf, other]

doi 10.1016/j.tcs.2018.11.021

On counting functions and slenderness of languages

Authors: Oscar H. Ibarra, Ian McQuillan, Bala Ravikumar

Abstract: We study counting-regular languages -- these are languages $L$ for which there is a regular language $L'$ such that the number of strings of length $n$ in $L$ and $L'$ are the same for all $n$. We show that the languages accepted by unambiguous nondeterministic Turing machines with a one-way read-only input tape and a reversal-bounded worktape are counting-regular. Many one-way acceptors are a spe… ▽ More We study counting-regular languages -- these are languages $L$ for which there is a regular language $L'$ such that the number of strings of length $n$ in $L$ and $L'$ are the same for all $n$. We show that the languages accepted by unambiguous nondeterministic Turing machines with a one-way read-only input tape and a reversal-bounded worktape are counting-regular. Many one-way acceptors are a special case of this model, such as reversal-bounded deterministic pushdown automata, reversal-bounded deterministic queue automata, and many others, and therefore all languages accepted by these models are counting-regular. This result is the best possible in the sense that the claim does not hold for either $2$-ambiguous PDA's, unambiguous PDA's with no reversal-bound, and other models. We also study closure properties of counting-regular languages, and we study decidability problems in regards to counting-regularity. For example, it is shown that the counting-regularity of even some restricted subclasses of PDA's is undecidable. Lastly, $k$-slender languages -- where there are at most $k$ words of any length -- are also studied. Amongst other results, it is shown that it is decidable whether a language in any semilinear full trio is $k$-slender. △ Less

Submitted 8 March, 2019; originally announced March 2019.

arXiv:1903.03405 [pdf, other]

doi 10.25596/jalc-2018-127

From Helmut Jürgensen's Former Students: The Game of Informatics Research

Authors: Mark Daley, Mark Eramian, Christopher Power, Ian McQuillan

Abstract: Personal reflections are given on being students of Helmut Jürgensen. Then, we attempt to address his hypothesis that informatics follows trend-like behaviours through the use of a content analysis of university job advertisements, and then via simulation techniques from the area of quantitative economics. Personal reflections are given on being students of Helmut Jürgensen. Then, we attempt to address his hypothesis that informatics follows trend-like behaviours through the use of a content analysis of university job advertisements, and then via simulation techniques from the area of quantitative economics. △ Less

Submitted 7 March, 2019; originally announced March 2019.

Journal ref: Journal of Automata, Languages and Combinatorics, 23, 127-141, 2018

arXiv:1903.03018 [pdf, ps, other]

doi 10.25596/jalc-2018-189

On the Density of Languages Accepted by Turing Machines and Other Machine Models

Authors: Oscar H. Ibarra, Ian McQuillan

Abstract: A language is dense if the set of all infixes (or subwords) of the language is the set of all words. Here, it is shown that it is decidable whether the language accepted by a nondeterministic Turing machine with a one-way read-only input and a reversal-bounded read/write worktape (the read/write head changes direction at most some fixed number of times) is dense. From this, it is implied that it i… ▽ More A language is dense if the set of all infixes (or subwords) of the language is the set of all words. Here, it is shown that it is decidable whether the language accepted by a nondeterministic Turing machine with a one-way read-only input and a reversal-bounded read/write worktape (the read/write head changes direction at most some fixed number of times) is dense. From this, it is implied that it is also decidable for one-way reversal-bounded queue automata, one-way reversal-bounded stack automata, and one-way reversal-bounded $k$-flip pushdown automata (machines that can "flip" their pushdowns up to $k$ times). However, it is undecidable for deterministic Turing machines with two 1-reversal-bounded worktapes (even when the two tapes are restricted to operate as 1-reversal-bounded pushdown stacks). △ Less

Submitted 7 March, 2019; originally announced March 2019.

Journal ref: Journal of Automata, Languages and Combinatorics, 23, 189-199, 2018

arXiv:1903.03001 [pdf, ps, other]

doi 10.1142/S0129054118400051

On the Density of Context-Free and Counter Languages

Authors: Joey Eremondi, Oscar H. Ibarra, Ian McQuillan

Abstract: A language $L$ is said to be dense if every word in the universe is an infix of some word in $L$. This notion has been generalized from the infix operation to arbitrary word operations $\varrho$ in place of the infix operation ($\varrho$-dense, with infix-dense being the standard notion of dense). It is shown here that it is decidable, for a language $L$ accepted by a one-way nondeterministic reve… ▽ More A language $L$ is said to be dense if every word in the universe is an infix of some word in $L$. This notion has been generalized from the infix operation to arbitrary word operations $\varrho$ in place of the infix operation ($\varrho$-dense, with infix-dense being the standard notion of dense). It is shown here that it is decidable, for a language $L$ accepted by a one-way nondeterministic reversal-bounded pushdown automaton, whether $L$ is infix-dense. However, it becomes undecidable for both deterministic pushdown automata (with no reversal-bound), and for nondeterministic one-counter automata. When examining suffix-density, it is undecidable for more restricted families such as deterministic one-counter automata that make three reversals on the counter, but it is decidable with less reversals. Other decidability results are also presented on dense languages, and contrasted with a marked version called $\varrho$-marked-density. Also, new languages are demonstrated to be outside various deterministic language families after applying different deletion operations from smaller families. Lastly, bounded-dense languages are defined and examined. △ Less

Submitted 7 March, 2019; originally announced March 2019.

Journal ref: International Journal of Foundations of Computer Science, 29(2), 233-250, 2018

arXiv:1712.00180 [pdf, other]

New Techniques for Inferring L-Systems Using Genetic Algorithm

Authors: Jason Bernard, Ian McQuillan

Abstract: Lindenmayer systems (L-systems) are a formal grammar system that iteratively rewrites all symbols of a string, in parallel. When visualized with a graphical interpretation, the images have self-similar shapes that appear frequently in nature, and they have been particularly successful as a concise, reusable technique for simulating plants. The L-system inference problem is to find an L-system to s… ▽ More Lindenmayer systems (L-systems) are a formal grammar system that iteratively rewrites all symbols of a string, in parallel. When visualized with a graphical interpretation, the images have self-similar shapes that appear frequently in nature, and they have been particularly successful as a concise, reusable technique for simulating plants. The L-system inference problem is to find an L-system to simulate a given plant. This is currently done mainly by experts, but this process is limited by the availability of experts, the complexity that may be solved by humans, and time. This paper introduces the Plant Model Inference Tool (PMIT) that infers deterministic context-free L-systems from an initial sequence of strings generated by the system using a genetic algorithm. PMIT is able to infer more complex systems than existing approaches. Indeed, while existing approaches are limited to L-systems with a total sum of 20 combined symbols in the productions, PMIT can infer almost all L-systems tested where the total sum is 140 symbols. This was validated using a test bed of 28 previously developed L-system models, in addition to models created artificially by bootstrapping larger models. △ Less

Submitted 4 December, 2017; v1 submitted 30 November, 2017; originally announced December 2017.

Comments: 18 pages. 2 figures

arXiv:1705.09732 [pdf, ps, other]

doi 10.1016/j.tcs.2018.04.024

Variations of Checking Stack Automata: Obtaining Unexpected Decidability Properties

Authors: Oscar H. Ibarra, Ian McQuillan

Abstract: We introduce a model of one-way language acceptors (a variant of a checking stack automaton) and show the following decidability properties: (1) The deterministic version has a decidable membership problem but has an undecidable emptiness problem. (2) The nondeterministic version has an undecidable membership problem and emptiness problem. There are many models of accepting devices for which there… ▽ More We introduce a model of one-way language acceptors (a variant of a checking stack automaton) and show the following decidability properties: (1) The deterministic version has a decidable membership problem but has an undecidable emptiness problem. (2) The nondeterministic version has an undecidable membership problem and emptiness problem. There are many models of accepting devices for which there is no difference with these problems between deterministic and nondeterministic versions, and the same holds for the emptiness problem. As far as we know, the model we introduce above is the first one-way model to exhibit properties (1) and (2). We define another family of one-way acceptors where the nondeterministic version has an undecidable emptiness problem, but the deterministic version has a decidable emptiness problem. We also know of no other model with this property in the literature. We also investigate decidability properties of other variations of checking stack automata (e.g., allowing multiple stacks, two-way input, etc.). Surprisingly, two-way deterministic machines with multiple checking stacks and multiple reversal-bounded counters are shown to have a decidable membership problem, a very general model with this property. △ Less

Submitted 7 March, 2019; v1 submitted 26 May, 2017; originally announced May 2017.

Journal ref: Theoretical Computer Science, 738, 1-12, 2018

arXiv:1702.07388 [pdf, ps, other]

doi 10.1016/j.tcs.2018.05.036

On Store Languages of Language Acceptors

Authors: Oscar H. Ibarra, Ian McQuillan

Abstract: It is well known that the "store language" of every pushdown automaton -- the set of store configurations (state and stack contents) that can appear as an intermediate step in accepting computations -- is a regular language. Here many models of language acceptors with various data structures are examined, along with a study of their store languages. For each model, an attempt is made to find the s… ▽ More It is well known that the "store language" of every pushdown automaton -- the set of store configurations (state and stack contents) that can appear as an intermediate step in accepting computations -- is a regular language. Here many models of language acceptors with various data structures are examined, along with a study of their store languages. For each model, an attempt is made to find the simplest model that accepts their store languages. Some connections between store languages of one-way and two-way machines generally are demonstrated, as with connections between nondeterministic and deterministic machines. A nice application of these store language results is also presented, showing a general technique for proving families accepted by many deterministic models are closed under right quotient with regular languages, resolving some open questions (and significantly simplifying proofs for others that are known) in the literature. Lower bounds on the space complexity for recognizing store languages for the languages to be non-regular are obtained. △ Less

Submitted 7 March, 2019; v1 submitted 23 February, 2017; originally announced February 2017.

Comments: 19 pages, preprint to be submitted to a journal

Journal ref: Theoretical Computer Science, 745, 114-132, 2018

arXiv:1610.06366 [pdf, ps, other]

doi 10.1016/j.ic.2020.104613

On Finite-Index Indexed Grammars and Their Restrictions

Authors: Flavio D'Alessandro, Oscar H. Ibarra, Ian McQuillan

Abstract: The family, L(INDLIN), of languages generated by linear indexed grammars has been studied in the literature. It is known that the Parikh image of every language in L(INDLIN) is semi-linear. However, there are bounded semi linear languages that are not in L(INDLIN). Here, we look at larger families of (restricted) indexed languages and study their properties, their relationships, and their decidabi… ▽ More The family, L(INDLIN), of languages generated by linear indexed grammars has been studied in the literature. It is known that the Parikh image of every language in L(INDLIN) is semi-linear. However, there are bounded semi linear languages that are not in L(INDLIN). Here, we look at larger families of (restricted) indexed languages and study their properties, their relationships, and their decidability properties. △ Less

Submitted 7 December, 2022; v1 submitted 20 October, 2016; originally announced October 2016.

Comments: 16 pages, latest version

MSC Class: 68Q45

Journal ref: Information and Computation, Vol. 279, 2021, p. 1-13

arXiv:1607.00931 [pdf, ps, other]

doi 10.1016/j.ic.2017.07.009

Deletion Operations on Deterministic Families of Automata

Authors: Joey Eremondi, Oscar H. Ibarra, Ian McQuillan

Abstract: Many different deletion operations are investigated applied to languages accepted by one-way and two-way deterministic reversal-bounded multicounter machines, deterministic pushdown automata, and finite automata. Operations studied include the prefix, suffix, infix and outfix operations, as well as left and right quotient with languages from different families. It is often expected that language f… ▽ More Many different deletion operations are investigated applied to languages accepted by one-way and two-way deterministic reversal-bounded multicounter machines, deterministic pushdown automata, and finite automata. Operations studied include the prefix, suffix, infix and outfix operations, as well as left and right quotient with languages from different families. It is often expected that language families defined from deterministic machines will not be closed under deletion operations. However, here, it is shown that one-way deterministic reversal-bounded multicounter languages are closed under right quotient with languages from many different language families; even those defined by nondeterministic machines such as the context-free languages. Also, it is shown that when starting with one-way deterministic machines with one counter that makes only one reversal, taking the left quotient with languages from many different language families -- again including those defined by nondeterministic machines such as the context-free languages -- yields only one-way deterministic reversal-bounded multicounter languages (by increasing the number of counters). However, if there are two more reversals on the counter, or a second 1-reversal-bounded counter, taking the left quotient (or even just the suffix operation) yields languages that can neither be accepted by deterministic reversal-bounded multicounter machines, nor by 2-way nondeterministic machines with one reversal-bounded counter. △ Less

Submitted 18 October, 2016; v1 submitted 4 July, 2016; originally announced July 2016.

Comments: 20 pages, accepted version to Information and Computation

arXiv:1606.01199 [pdf, ps, other]

doi 10.1016/j.ic.2017.09.002

On the Complexity and Decidability of Some Problems Involving Shuffle

Authors: Joey Eremondi, Oscar H. Ibarra, Ian McQuillan

Abstract: The complexity and decidability of various decision problems involving the shuffle operation are studied. The following three problems are all shown to be $NP$-complete: given a nondeterministic finite automaton (NFA) $M$, and two words $u$ and $v$, is $L(M)$ not a subset of $u$ shuffled with $v$, is $u$ shuffled with $v$ not a subset of $L(M)$, and is $L(M)$ not equal to $u$ shuffled with $v$? It… ▽ More The complexity and decidability of various decision problems involving the shuffle operation are studied. The following three problems are all shown to be $NP$-complete: given a nondeterministic finite automaton (NFA) $M$, and two words $u$ and $v$, is $L(M)$ not a subset of $u$ shuffled with $v$, is $u$ shuffled with $v$ not a subset of $L(M)$, and is $L(M)$ not equal to $u$ shuffled with $v$? It is also shown that there is a polynomial-time algorithm to determine, for $NFA$s $M_1, M_2$ and a deterministic pushdown automaton $M_3$, whether $L(M_1)$ shuffled with $L(M_2)$ is a subset of $L(M_3)$. The same is true when $M_1, M_2,M_3$ are one-way nondeterministic $l$-reversal-bounded $k$-counter machines, with $M_3$ being deterministic. Other decidability and complexity results are presented for testing whether given languages $L_1, L_2$ and $R$ from various languages families satisfy $L_1$ shuffled with $L_2$ is a subset of $R$, and $R$ is a subset of $L_1$ shuffled with $L_2$. Several closure results on shuffle are also shown. △ Less

Submitted 7 March, 2019; v1 submitted 3 June, 2016; originally announced June 2016.

Comments: Preprint submitted to Information and Computation

Journal ref: Information and Computation 259: 214-224, 2018

arXiv:1505.00456 [pdf, other]

A Remark on Baserunning risk: Waiting Can Cost You the Game

Authors: Peter MacDonald, Dan McQuillan, Ian McQuillan

Abstract: We address the value of a baserunner at first base waiting to see if a ball in play falls in for a hit, before running. When a ball is hit in the air, the baserunner will usually wait, to gather additional information as to whether a ball will fall for a hit before deciding to run aggressively. This additional information guarantees that there will not be a double play and an "unnecessary out". Ho… ▽ More We address the value of a baserunner at first base waiting to see if a ball in play falls in for a hit, before running. When a ball is hit in the air, the baserunner will usually wait, to gather additional information as to whether a ball will fall for a hit before deciding to run aggressively. This additional information guarantees that there will not be a double play and an "unnecessary out". However, waiting could potentially cost the runner the opportunity to reach third base, or even scoring on the play if the ball falls for a hit. This in turn affects the probability of scoring at least one run henceforth in the inning. We create a new statistic, the baserunning risk threshold (BRT), which measures the minimum probability with which the baserunner should be sure that a ball in play will fall in for a hit, before running without waiting to see if the ball will be caught, with the goal of scoring at least one run in the inning. We measure a 0-out and a 1-out version of BRT, both in aggregate, and also in high leverage situations, where scoring one run is particularly important. We show a drop in BRT for pitchers who pitch in more high leverage innings, and a very low BRT on average for "elite closers". It follows that baserunners should be frequently running without waiting, and getting thrown out in double plays regularly to maximize their chances of scoring at least one run. △ Less

Submitted 3 May, 2015; originally announced May 2015.

Comments: 9 pages, 4 figures

arXiv:1008.1270

doi 10.4204/EPTCS.31

Proceedings Twelfth Annual Workshop on Descriptional Complexity of Formal Systems

Authors: Ian McQuillan, Giovanni Pighizzini

Abstract: The 12th annual workshop, Descriptional Complexity of Formal Systems 2010, is taking place in Saskatoon, Canada, on August 8-10, 2010. It is jointly organized by the IFIP Working Group 1.2 on Descriptional Complexity and by the Department of Computer Science at the University of Saskatchewan. This volume contains the papers of the invited lectures and the accepted contributions. The 12th annual workshop, Descriptional Complexity of Formal Systems 2010, is taking place in Saskatoon, Canada, on August 8-10, 2010. It is jointly organized by the IFIP Working Group 1.2 on Descriptional Complexity and by the Department of Computer Science at the University of Saskatchewan. This volume contains the papers of the invited lectures and the accepted contributions. △ Less

Submitted 6 August, 2010; originally announced August 2010.

Comments: These proceedings are dedicated to Chandra M. R. Kintala, who passed away on November 05, 2009, at the age of 61. We, the DCFS community, remember him as Chair of the IFIP Working Group 1.2 on Descriptional Complexity, as a co-initiator of the area of descriptional complexity with limited resources and as a co-founder of the workshops DCAGRS and DCFS. He was on the program committee for this DCFS2010 and did unfortunately not live to see the results of the submission and reviewing process

ACM Class: F.1.1; F.1.2; F.4.1; F.4.2; F.4.3

Journal ref: EPTCS 31, 2010

arXiv:0907.5111 [pdf, ps, other]

doi 10.4204/EPTCS.3.7

On the Shuffle Automaton Size for Words

Authors: Franziska Biegler, Mark Daley, Ian McQuillan

Abstract: We investigate the state size of DFAs accepting the shuffle of two words. We provide words u and v, such that the minimal DFA for u shuffled with v requires an exponential number of states. We also show some conditions for the words u and v which ensure a quadratic upper bound on the state size of u shuffled with v. Moreover, switching only two letters within one of u or v is enough to trigger t… ▽ More We investigate the state size of DFAs accepting the shuffle of two words. We provide words u and v, such that the minimal DFA for u shuffled with v requires an exponential number of states. We also show some conditions for the words u and v which ensure a quadratic upper bound on the state size of u shuffled with v. Moreover, switching only two letters within one of u or v is enough to trigger the change from quadratic to exponential. △ Less

Submitted 29 July, 2009; originally announced July 2009.

Journal ref: EPTCS 3, 2009, pp. 79-89

Showing 1–30 of 30 results for author: McQuillan, I