Skip to main content

Showing 1–50 of 55 results for author: Turney, P D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2104.01242  [pdf

    cs.NE nlin.CG

    Evolution of Symbiosis in the Game of Life: Three Characteristics of Successful Symbiotes

    Authors: Peter D. Turney

    Abstract: In past work, we developed a computational model of the evolution of symbiotic entities (Model-S), based on Conway's Game of Life. In this article, we examine three trends that biologists have observed in the evolution of symbiotes. (1) Management: If one partner is able to control the symbiotic relation, this control can reduce conflict; thus, evolutionary selection favours symbiotes that have a… ▽ More

    Submitted 26 September, 2022; v1 submitted 2 April, 2021; originally announced April 2021.

  2. arXiv:2010.08431  [pdf

    cs.NE nlin.CG

    Measuring Behavioural Similarity of Cellular Automata

    Authors: Peter D. Turney

    Abstract: Conway's Game of Life is the best-known cellular automaton. It is a classic model of emergence and self-organization, it is Turing-complete, and it can simulate a universal constructor. The Game of Life belongs to the set of semi-totalistic cellular automata, a family with 262,144 members. Many of these automata may deserve as much attention as the Game of Life, if not more. The challenge we addre… ▽ More

    Submitted 17 December, 2020; v1 submitted 16 October, 2020; originally announced October 2020.

    Journal ref: Artificial Life, 27(1), 62-71 (2021)

  3. Evolution of Autopoiesis and Multicellularity in the Game of Life

    Authors: Peter D. Turney

    Abstract: Recently we introduced a model of symbiosis, Model-S, based on the evolution of seed patterns in Conway's Game of Life. In the model, the fitness of a seed pattern is measured by one-on-one competitions in the Immigration Game, a two-player variation of the Game of Life. Our previous article showed that Model-S can serve as a highly abstract, simplified model of biological life: (1) The initial se… ▽ More

    Submitted 11 January, 2021; v1 submitted 23 September, 2020; originally announced September 2020.

    ACM Class: I.6.3; I.6.8; J.3

    Journal ref: Artificial Life, 27(1), 26-43 (2021)

  4. arXiv:2004.02720  [pdf

    cs.NE cs.AI

    Conditions for Open-Ended Evolution in Immigration Games

    Authors: Peter D. Turney

    Abstract: The Immigration Game (invented by Don Woods in 1971) extends the solitaire Game of Life (invented by John Conway in 1970) to enable two-player competition. The Immigration Game can be used in a model of evolution by natural selection, where fitness is measured with competitions. The rules for the Game of Life belong to the family of semitotalistic rules, a family with 262,144 members. Woods' metho… ▽ More

    Submitted 6 April, 2020; originally announced April 2020.

    ACM Class: I.6.3; I.6.8; J.3

  5. arXiv:1908.07034  [pdf

    cs.NE q-bio.PE

    Symbiosis Promotes Fitness Improvements in the Game of Life

    Authors: Peter D. Turney

    Abstract: We present a computational simulation of evolving entities that includes symbiosis with shifting levels of selection. Evolution by natural selection shifts from the level of the original entities to the level of the new symbiotic entity. In the simulation, the fitness of an entity is measured by a series of one-on-one competitions in the Immigration Game, a two-player variation of Conway's Game of… ▽ More

    Submitted 16 June, 2020; v1 submitted 19 August, 2019; originally announced August 2019.

    Comments: Changes to Sections 1, 3, 4, 5, and 6. Figures and tables appear at the end of the document

    ACM Class: I.6.3; I.6.8; J.3

    Journal ref: Artificial Life, 26(3): 338-365 (2020)

  6. The Natural Selection of Words: Finding the Features of Fitness

    Authors: Peter D. Turney, Saif M. Mohammad

    Abstract: We introduce a dataset for studying the evolution of words, constructed from WordNet and the Google Books Ngram Corpus. The dataset tracks the evolution of 4,000 synonym sets (synsets), containing 9,000 English words, from 1800 AD to 2000 AD. We present a supervised learning algorithm that is able to predict the future leader of a synset: the word in the synset that will have the highest frequency… ▽ More

    Submitted 19 August, 2019; originally announced August 2019.

    ACM Class: I.2.6; I.2.7

    Journal ref: Published in PLOS ONE, 14(1), e0211512, January 28, 2019

  7. arXiv:1806.07941  [pdf, ps, other

    cs.NE

    Conditions for Major Transitions in Biological and Cultural Evolution

    Authors: Peter D. Turney

    Abstract: Evolution by natural selection can be seen an algorithm for generating creative solutions to difficult problems. More precisely, evolution by natural selection is a class of algorithms that share a set of properties. The question we address here is, what are the conditions that define this class of algorithms? There is a standard answer to this question: Briefly, the conditions are variation, here… ▽ More

    Submitted 20 June, 2018; originally announced June 2018.

    Comments: To be presented at the Third Workshop on Open-Ended Evolution (OEE3), Tokyo, Japan, July 2018 (hosted by the 2018 Conference on Artificial Life)

  8. arXiv:1704.03543  [pdf, ps, other

    cs.IR cs.CL cs.LG

    Leveraging Term Banks for Answering Complex Questions: A Case for Sparse Vectors

    Authors: Peter D. Turney

    Abstract: While open-domain question answering (QA) systems have proven effective for answering simple questions, they struggle with more complex questions. Our goal is to answer more complex questions reliably, without incurring a significant cost in knowledge resource construction to support the QA. One readily available knowledge resource is a term bank, enumerating the key concepts in a domain. We have… ▽ More

    Submitted 11 April, 2017; originally announced April 2017.

    Comments: Related datasets can be found at http://allenai.org/data.html

    ACM Class: H.3.1; I.2.6; I.2.7

  9. arXiv:1405.7908  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Semantic Composition and Decomposition: From Recognition to Generation

    Authors: Peter D. Turney

    Abstract: Semantic composition is the task of understanding the meaning of text by composing the meanings of the individual words in the text. Semantic decomposition is the task of understanding the meaning of an individual word by decomposing it into various aspects (factors, constituents, components) that are latent in the meaning of the word. We take a distributional approach to semantics, in which a wor… ▽ More

    Submitted 30 May, 2014; originally announced May 2014.

    Comments: National Research Council Canada - Technical Report

    ACM Class: H.3.1; I.2.6; I.2.7

  10. arXiv:1401.8269  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Experiments with Three Approaches to Recognizing Lexical Entailment

    Authors: Peter D. Turney, Saif M. Mohammad

    Abstract: Inference in natural language often involves recognizing lexical entailment (RLE); that is, identifying whether one word entails another. For example, "buy" entails "own". Two general strategies for RLE have been proposed: One strategy is to manually construct an asymmetric similarity measure for context vectors (directional similarity) and another is to treat RLE as a problem of learning to recog… ▽ More

    Submitted 31 January, 2014; originally announced January 2014.

    Comments: to appear in Natural Language Engineering

    ACM Class: H.3.1; I.2.6; I.2.7

    Journal ref: Natural Language Engineering, 21 (3), (2015), 437-476

  11. arXiv:1310.5042  [pdf, ps, other

    cs.LG cs.AI cs.CL cs.IR

    Distributional semantics beyond words: Supervised learning of analogy and paraphrase

    Authors: Peter D. Turney

    Abstract: There have been several efforts to extend distributional semantics beyond individual words, to measure the similarity of word pairs, phrases, and sentences (briefly, tuples; ordered sets of words, contiguous or noncontiguous). One way to extend beyond words is to compare two tuples using a function that combines pairwise similarities between the component words in the tuples. A strength of this ap… ▽ More

    Submitted 18 October, 2013; originally announced October 2013.

    ACM Class: H.3.1; I.2.6; I.2.7

    Journal ref: Transactions of the Association for Computational Linguistics (TACL), (2013), 1, 353-366

  12. arXiv:1309.4035  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Domain and Function: A Dual-Space Model of Semantic Relations and Compositions

    Authors: Peter D. Turney

    Abstract: Given appropriate representations of the semantic relations between carpenter and wood and between mason and stone (for example, vectors in a vector space model), a suitable algorithm should be able to recognize that these relations are highly similar (carpenter is to wood as mason is to stone; the relations are analogous). Likewise, with representations of dog, house, and kennel, an algorithm sho… ▽ More

    Submitted 16 September, 2013; originally announced September 2013.

    ACM Class: H.3.1; I.2.6; I.2.7

    Journal ref: Journal of Artificial Intelligence Research (JAIR), (2012), 44, 533-585

  13. arXiv:1308.6300  [pdf, ps, other

    cs.CL

    Computing Lexical Contrast

    Authors: Saif M. Mohammad, Bonnie J. Dorr, Graeme Hirst, Peter D. Turney

    Abstract: Knowing the degree of semantic contrast between words has widespread application in natural language processing, including machine translation, information retrieval, and dialogue systems. Manually-created lexicons focus on opposites, such as {\rm hot} and {\rm cold}. Opposites are of many kinds such as antipodals, complementaries, and gradable. However, existing lexicons often do not classify opp… ▽ More

    Submitted 28 August, 2013; originally announced August 2013.

    Journal ref: Computational Linguistics, 39 (3), 555-590, 2013

  14. arXiv:1308.6297  [pdf, other

    cs.CL

    Crowdsourcing a Word-Emotion Association Lexicon

    Authors: Saif M. Mohammad, Peter D. Turney

    Abstract: Even though considerable attention has been given to the polarity of words (positive and negative) and the creation of large polarity lexicons, research in emotion analysis has had to rely on limited and small emotion lexicons. In this paper we show how the combined strength and wisdom of the crowds can be used to generate a large, high-quality, word-emotion and word-polarity association lexicon q… ▽ More

    Submitted 28 August, 2013; originally announced August 2013.

    Journal ref: Computational Intelligence, 29 (3), 436-465, Wiley Blackwell Publishing Ltd, 2013

  15. arXiv:1107.4573  [pdf, ps, other

    cs.AI cs.CL cs.LG

    Analogy perception applied to seven tests of word comprehension

    Authors: Peter D. Turney

    Abstract: It has been argued that analogy is the core of cognition. In AI research, algorithms for analogy are often limited by the need for hand-coded high-level representations as input. An alternative approach is to use high-level perception, in which high-level representations are automatically generated from raw data. Analogy perception is the process of recognizing analogies using high-level perceptio… ▽ More

    Submitted 22 July, 2011; originally announced July 2011.

    Comments: related work available at http://purl.org/peter.turney/

    ACM Class: H.3.1; I.2.6; I.2.7

    Journal ref: Journal of Experimental & Theoretical Artificial Intelligence (JETAI), 2011, Volume 23, Issue 3, pages 343-362

  16. arXiv:1003.1141  [pdf, ps, other

    cs.CL cs.IR cs.LG

    From Frequency to Meaning: Vector Space Models of Semantics

    Authors: Peter D. Turney, Patrick Pantel

    Abstract: Computers understand very little of the meaning of human language. This profoundly limits our ability to give instructions to computers, the ability of computers to explain their actions to us, and the ability of computers to analyse and process text. Vector space models (VSMs) of semantics are beginning to address these limits. This paper surveys the use of VSMs for semantic processing of text.… ▽ More

    Submitted 4 March, 2010; originally announced March 2010.

    ACM Class: H.3.1; I.2.6; I.2.7

    Journal ref: Journal of Artificial Intelligence Research, (2010), 37, 141-188

  17. arXiv:0812.4446  [pdf, ps, other

    cs.CL cs.AI cs.LG

    The Latent Relation Mapping Engine: Algorithm and Experiments

    Authors: Peter D. Turney

    Abstract: Many AI researchers and cognitive scientists have argued that analogy is the core of cognition. The most influential work on computational modeling of analogy-making is Structure Mapping Theory (SMT) and its implementation in the Structure Mapping Engine (SME). A limitation of SME is the requirement for complex hand-coded representations. We introduce the Latent Relation Mapping Engine (LRME), w… ▽ More

    Submitted 23 December, 2008; originally announced December 2008.

    Comments: related work available at http://purl.org/peter.turney/

    Report number: NRC-50738 ACM Class: H.3.1, I.2.6, I.2.7

    Journal ref: Journal of Artificial Intelligence Research, (2008), 33, 615-655

  18. arXiv:0809.0124  [pdf, ps, other

    cs.CL cs.IR cs.LG

    A Uniform Approach to Analogies, Synonyms, Antonyms, and Associations

    Authors: Peter D. Turney

    Abstract: Recognizing analogies, synonyms, antonyms, and associations appear to be four distinct tasks, requiring distinct NLP algorithms. In the past, the four tasks have been treated independently, using a wide variety of algorithms. These four semantic classes, however, are a tiny sample of the full range of semantic phenomena, and we cannot afford to create ad hoc algorithms for each semantic phenomen… ▽ More

    Submitted 31 August, 2008; originally announced September 2008.

    Comments: related work available at http://purl.org/peter.turney/

    Report number: NRC 50398 ACM Class: H.3.1; I.2.6; I.2.7

    Journal ref: Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008), August 2008, Manchester, UK, Pages 905-912

  19. arXiv:0711.2023  [pdf, ps, other

    cs.LG cs.CL cs.IR

    Empirical Evaluation of Four Tensor Decomposition Algorithms

    Authors: Peter D. Turney

    Abstract: Higher-order tensor decompositions are analogous to the familiar Singular Value Decomposition (SVD), but they transcend the limitations of matrices (second-order tensors). SVD is a powerful tool that has achieved impressive results in information retrieval, collaborative filtering, computational linguistics, computational vision, and other fields. However, SVD is limited to two-dimensional array… ▽ More

    Submitted 13 November, 2007; originally announced November 2007.

    Comments: related work available at http://purl.org/peter.turney/

    Report number: ERB-1152, NRC-49877 ACM Class: H.3.1; I.2.6; I.2.7; E.1; G.1.3

  20. Similarity of Semantic Relations

    Authors: Peter D. Turney

    Abstract: There are at least two kinds of similarity. Relational similarity is correspondence between relations, in contrast with attributional similarity, which is correspondence between attributes. When two words have a high degree of attributional similarity, we call them synonyms. When two pairs of words have a high degree of relational similarity, we say that their relations are analogous. For exampl… ▽ More

    Submitted 25 August, 2006; originally announced August 2006.

    Comments: related work available at http://purl.org/peter.turney/

    Report number: NRC-48775 ACM Class: H.3.1; I.2.6; I.2.7

    Journal ref: Computational Linguistics, (2006), 32(3), 379-416

  21. Self-Replication and Self-Assembly for Manufacturing

    Authors: Robert Ewaschuk, Peter D. Turney

    Abstract: It has been argued that a central objective of nanotechnology is to make products inexpensively, and that self-replication is an effective approach to very low-cost manufacturing. The research presented here is intended to be a step towards this vision. We describe a computational simulation of nanoscale machines floating in a virtual liquid. The machines can bond together to form strands (chain… ▽ More

    Submitted 27 July, 2006; originally announced July 2006.

    Comments: Java code available at http://purl.org/net/johnnyvon/

    Report number: NRC-48760 ACM Class: I.6.3; I.6.8; J.2; J.3

    Journal ref: Artificial Life, (2006), 12, 411-433

  22. arXiv:cs/0607120  [pdf

    cs.CL cs.AI cs.IR cs.LG

    Expressing Implicit Semantic Relations without Supervision

    Authors: Peter D. Turney

    Abstract: We present an unsupervised learning algorithm that mines large text corpora for patterns that express implicit semantic relations. For a given input word pair X:Y with some unspecified semantic relations, the corresponding output list of patterns <P1,...,Pm> is ranked according to how well each pattern Pi expresses the relations between X and Y. For example, given X=ostrich and Y=bird, the two h… ▽ More

    Submitted 27 July, 2006; originally announced July 2006.

    Comments: 8 pages, related work available at http://purl.org/peter.turney/

    Report number: NRC-48761 ACM Class: H.3.1; I.2.6; I.2.7

    Journal ref: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics (ACL-06), (2006), Sydney, Australia, 313-320

  23. arXiv:cs/0508103  [pdf, ps, other

    cs.LG cs.CL cs.IR

    Corpus-based Learning of Analogies and Semantic Relations

    Authors: Peter D. Turney, Michael L. Littman

    Abstract: We present an algorithm for learning from unlabeled text, based on the Vector Space Model (VSM) of information retrieval, that can solve verbal analogy questions of the kind found in the SAT college entrance exam. A verbal analogy has the form A:B::C:D, meaning "A is to B as C is to D"; for example, mason:stone::carpenter:wood. SAT analogy questions provide a word pair, A:B, and the problem is t… ▽ More

    Submitted 23 August, 2005; originally announced August 2005.

    Comments: related work available at http://purl.org/peter.turney/ and http://www.cs.rutgers.edu/~mlittman/

    Report number: NRC-48273 ACM Class: H.3.1; I.2.6; I.2.7

    Journal ref: Machine Learning, (2005), 60(1-3), 251-278

  24. arXiv:cs/0508053  [pdf

    cs.LG cs.CL cs.IR

    Measuring Semantic Similarity by Latent Relational Analysis

    Authors: Peter D. Turney

    Abstract: This paper introduces Latent Relational Analysis (LRA), a method for measuring semantic similarity. LRA measures similarity in the semantic relations between two pairs of words. When two pairs have a high degree of relational similarity, they are analogous. For example, the pair cat:meow is analogous to the pair dog:bark. There is evidence from cognitive science that relational similarity is fun… ▽ More

    Submitted 10 August, 2005; originally announced August 2005.

    Comments: 6 pages, related work available at http://purl.org/peter.turney/

    Report number: NRC-48255 ACM Class: H.3.1; I.2.6; I.2.7

    Journal ref: Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence (IJCAI-05), (2005), Edinburgh, Scotland, 1136-1141

  25. arXiv:cs/0501018  [pdf, ps, other

    cs.LG cs.CL cs.IR

    Combining Independent Modules in Lexical Multiple-Choice Problems

    Authors: Peter D. Turney, Michael L. Littman, Jeffrey Bigham, Victor Shnayder

    Abstract: Existing statistical approaches to natural language problems are very coarse approximations to the true complexity of language processing. As such, no single technique will be best for all problem instances. Many researchers are examining ensemble methods that combine the output of multiple modules to create more accurate solutions. This paper examines three merging rules for combining probabili… ▽ More

    Submitted 10 January, 2005; originally announced January 2005.

    Comments: 10 pages, related work available at http://www.cs.rutgers.edu/~mlittman/ and http://purl.org/peter.turney/

    Report number: NRC-47434 ACM Class: I.2.6; I.2.7; H.3.1; J.5

    Journal ref: Recent Advances in Natural Language Processing III: Selected Papers from RANLP 2003, Eds: N. Nicolov, K. Botcheva, G. Angelova, and R. Mitkov, (2004), Current Issues in Linguistic Theory (CILT), 260, John Benjamins, 101-110

  26. arXiv:cs/0412024  [pdf

    cs.CL cs.IR cs.LG

    Human-Level Performance on Word Analogy Questions by Latent Relational Analysis

    Authors: Peter D. Turney

    Abstract: This paper introduces Latent Relational Analysis (LRA), a method for measuring relational similarity. LRA has potential applications in many areas, including information extraction, word sense disambiguation, machine translation, and information retrieval. Relational similarity is correspondence between relations, in contrast with attributional similarity, which is correspondence between attribu… ▽ More

    Submitted 6 December, 2004; originally announced December 2004.

    Comments: 32 pages, issued 2004, related work available at http://purl.org/peter.turney

    Report number: NRC-47422 ACM Class: H.3.1; I.2.6; I.2.7

  27. arXiv:cs/0407065  [pdf, ps, other

    cs.CL cs.IR cs.LG

    Word Sense Disambiguation by Web Mining for Word Co-occurrence Probabilities

    Authors: Peter D. Turney

    Abstract: This paper describes the National Research Council (NRC) Word Sense Disambiguation (WSD) system, as applied to the English Lexical Sample (ELS) task in Senseval-3. The NRC system approaches WSD as a classical supervised machine learning problem, using familiar tools such as the Weka machine learning software and Brill's rule-based part-of-speech tagger. Head words are represented as feature vect… ▽ More

    Submitted 29 July, 2004; originally announced July 2004.

    Comments: related work available at http://purl.org/peter.turney/

    ACM Class: H.3.1; H.3.3; I.2.6; I.2.7; J.5

    Journal ref: Proceedings of the Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text (SENSEVAL-3), (2004), Barcelona, Spain, 239-242

  28. arXiv:cs/0309035  [pdf, ps, other

    cs.CL cs.IR cs.LG

    Combining Independent Modules to Solve Multiple-choice Synonym and Analogy Problems

    Authors: Peter D. Turney, Michael L. Littman, Jeffrey Bigham, Victor Shnayder

    Abstract: Existing statistical approaches to natural language problems are very coarse approximations to the true complexity of language processing. As such, no single technique will be best for all problem instances. Many researchers are examining ensemble methods that combine the output of successful, separately developed modules to create more accurate solutions. This paper examines three merging rules… ▽ More

    Submitted 19 September, 2003; originally announced September 2003.

    Comments: 8 pages, related work available at http://www.cs.rutgers.edu/~mlittman/ and http://purl.org/peter.turney/

    Report number: NRC-46506 ACM Class: I.2.6; I.2.7; H.3.1; J.5

    Journal ref: Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP-03), (2003), Borovets, Bulgaria, 482-489

  29. arXiv:cs/0309034  [pdf

    cs.CL cs.IR cs.LG

    Measuring Praise and Criticism: Inference of Semantic Orientation from Association

    Authors: Peter D. Turney, Michael L. Littman

    Abstract: The evaluative character of a word is called its semantic orientation. Positive semantic orientation indicates praise (e.g., "honest", "intrepid") and negative semantic orientation indicates criticism (e.g., "disturbing", "superfluous"). Semantic orientation varies in both direction (positive or negative) and degree (mild to strong). An automated system for measuring semantic orientation would h… ▽ More

    Submitted 19 September, 2003; originally announced September 2003.

    Comments: 37 pages, related work available at http://www.cs.rutgers.edu/~mlittman/ and http://purl.org/peter.turney/

    Report number: NRC-46516 ACM Class: H.3.1; H.3.3; I.2.6; I.2.7

    Journal ref: ACM Transactions on Information Systems (TOIS), (2003), 21 (4), 315-346

  30. arXiv:cs/0308033  [pdf

    cs.LG cs.CL cs.IR

    Coherent Keyphrase Extraction via Web Mining

    Authors: Peter D. Turney

    Abstract: Keyphrases are useful for a variety of purposes, including summarizing, indexing, labeling, categorizing, clustering, highlighting, browsing, and searching. The task of automatic keyphrase extraction is to select keyphrases from within the text of a given document. Automatic keyphrase extraction makes it feasible to generate keyphrases for the huge number of documents that do not have manually a… ▽ More

    Submitted 20 August, 2003; originally announced August 2003.

    Comments: 6 pages, related work available at http://purl.org/peter.turney/

    Report number: NRC-46496 ACM Class: H.3.1; H.3.3; I.2.6; I.2.7

    Journal ref: Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence (IJCAI-03), (2003), Acapulco, Mexico, 434-439

  31. arXiv:cs/0307055  [pdf

    cs.LG cs.CL cs.IR

    Learning Analogies and Semantic Relations

    Authors: Peter D. Turney, Michael L. Littman

    Abstract: We present an algorithm for learning from unlabeled text, based on the Vector Space Model (VSM) of information retrieval, that can solve verbal analogy questions of the kind found in the Scholastic Aptitude Test (SAT). A verbal analogy has the form A:B::C:D, meaning "A is to B as C is to D"; for example, mason:stone::carpenter:wood. SAT analogy questions provide a word pair, A:B, and the problem… ▽ More

    Submitted 24 July, 2003; originally announced July 2003.

    Comments: 28 pages, issued 2003

    Report number: NRC-46488 ACM Class: H.3.1; I.2.6; I.2.7

  32. arXiv:cs/0212042  [pdf

    cs.NE cs.CE q-bio.PE

    Increasing Evolvability Considered as a Large-Scale Trend in Evolution

    Authors: Peter D. Turney

    Abstract: Evolvability is the capacity to evolve. This paper introduces a simple computational model of evolvability and demonstrates that, under certain conditions, evolvability can increase indefinitely, even when there is no direct selection for evolvability. The model shows that increasing evolvability implies an accelerating evolutionary pace. It is suggested that the conditions for indefinitely incr… ▽ More

    Submitted 12 December, 2002; originally announced December 2002.

    Comments: 4 pages

    Report number: NRC-43583 ACM Class: I.6.3; I.6.8; J.3

    Journal ref: Proceedings of the 1999 Genetic and Evolutionary Computation Conference Workshop Program, (1999), 43-46

  33. arXiv:cs/0212041  [pdf

    cs.LG cs.CV

    Robust Classification with Context-Sensitive Features

    Authors: Peter D. Turney

    Abstract: This paper addresses the problem of classifying observations when features are context-sensitive, especially when the testing set involves a context that is different from the training set. The paper begins with a precise definition of the problem, then general strategies are presented for enhancing the performance of classification algorithms on this type of problem. These strategies are tested… ▽ More

    Submitted 12 December, 2002; originally announced December 2002.

    Comments: 9 pages

    Report number: NRC-35074 ACM Class: I.2.6; I.5.2; I.5.4

    Journal ref: Proceedings of the Sixth International Conference on Industrial and Engineering Applications of Artificial Intelligence and Expert Systems, Edinburgh, Scotland, (1993), 268-276

  34. arXiv:cs/0212040  [pdf

    cs.LG cs.CE cs.CV

    Data Engineering for the Analysis of Semiconductor Manufacturing Data

    Authors: Peter D. Turney

    Abstract: We have analyzed manufacturing data from several different semiconductor manufacturing plants, using decision tree induction software called Q-YIELD. The software generates rules for predicting when a given product should be rejected. The rules are intended to help the process engineers improve the yield of the product, by helping them to discover the causes of rejection. Experience with Q-YIELD… ▽ More

    Submitted 12 December, 2002; originally announced December 2002.

    Comments: 10 pages

    Report number: NRC-39163 ACM Class: I.2.6; I.5.2; I.5.4; J.2

    Journal ref: Proceedings of the IJCAI-95 Workshop on Data Engineering for Inductive Learning, Montreal, Quebec, (1995), 50-59

  35. arXiv:cs/0212039  [pdf

    cs.LG cs.NE

    Low Size-Complexity Inductive Logic Programming: The East-West Challenge Considered as a Problem in Cost-Sensitive Classification

    Authors: Peter D. Turney

    Abstract: The Inductive Logic Programming community has considered proof-complexity and model-complexity, but, until recently, size-complexity has received little attention. Recently a challenge was issued "to the international computing community" to discover low size-complexity Prolog programs for classifying trains. The challenge was based on a problem first proposed by Ryszard Michalski, 20 years ago.… ▽ More

    Submitted 12 December, 2002; originally announced December 2002.

    Comments: 17 pages

    Report number: NRC-39164 ACM Class: I.2.6; I.2.8

    Journal ref: Proceedings of the Fifth International Inductive Logic Programming Workshop, Leuven, Belgium, (1995), 247-263

  36. arXiv:cs/0212038  [pdf

    cs.LG cs.CV

    The Identification of Context-Sensitive Features: A Formal Definition of Context for Concept Learning

    Authors: Peter D. Turney

    Abstract: A large body of research in machine learning is concerned with supervised learning from examples. The examples are typically represented as vectors in a multi-dimensional feature space (also known as attribute-value descriptions). A teacher partitions a set of training examples into a finite number of classes. The task of the learning algorithm is to induce a concept from the training examples.… ▽ More

    Submitted 12 December, 2002; originally announced December 2002.

    Comments: 7 pages

    Report number: NRC-39222 ACM Class: I.2.6; I.5.2

    Journal ref: 13th International Conference on Machine Learning, Workshop on Learning in Context-Sensitive Domains, Bari, Italy, (1996), 53-59

  37. arXiv:cs/0212037  [pdf

    cs.LG cs.CV

    The Management of Context-Sensitive Features: A Review of Strategies

    Authors: Peter D. Turney

    Abstract: In this paper, we review five heuristic strategies for handling context-sensitive features in supervised machine learning from examples. We discuss two methods for recovering lost (implicit) contextual information. We mention some evidence that hybrid strategies can have a synergetic effect. We then show how the work of several machine learning researchers fits into this framework. While we do n… ▽ More

    Submitted 12 December, 2002; originally announced December 2002.

    Comments: 7 pages

    Report number: NRC-39221 ACM Class: I.2.6; I.5.2

    Journal ref: 13th International Conference on Machine Learning, Workshop on Learning in Context-Sensitive Domains, Bari, Italy, (1996), 60-66

  38. arXiv:cs/0212036  [pdf

    cs.LG cs.NE

    Myths and Legends of the Baldwin Effect

    Authors: Peter D. Turney

    Abstract: This position paper argues that the Baldwin effect is widely misunderstood by the evolutionary computation community. The misunderstandings appear to fall into two general categories. Firstly, it is commonly believed that the Baldwin effect is concerned with the synergy that results when there is an evolving population of learning individuals. This is only half of the story. The full story is mo… ▽ More

    Submitted 11 December, 2002; originally announced December 2002.

    Comments: 8 pages

    Report number: NRC-39220 ACM Class: I.2.6; I.2.8

    Journal ref: 13th International Conference on Machine Learning, Workshop on Evolutionary Computation and Machine Learning, Bari, Italy, (1996), 135-142

  39. arXiv:cs/0212035  [pdf

    cs.LG cs.CV

    Exploiting Context When Learning to Classify

    Authors: Peter D. Turney

    Abstract: This paper addresses the problem of classifying observations when features are context-sensitive, specifically when the testing set involves a context that is different from the training set. The paper begins with a precise definition of the problem, then general strategies are presented for enhancing the performance of classification algorithms on this type of problem. These strategies are test… ▽ More

    Submitted 12 December, 2002; originally announced December 2002.

    Comments: 6 pages

    Report number: NRC-35058 ACM Class: I.2.6; I.5.2; I.5.4

    Journal ref: Proceedings of the European Conference on Machine Learning, Vienna, Austria, (1993), 402-407

  40. arXiv:cs/0212034  [pdf

    cs.LG cs.CV

    Types of Cost in Inductive Concept Learning

    Authors: Peter D. Turney

    Abstract: Inductive concept learning is the task of learning to assign cases to a discrete set of classes. In real-world applications of concept learning, there are many different types of cost involved. The majority of the machine learning literature ignores all types of cost (unless accuracy is interpreted as a type of cost measure). A few papers have investigated the cost of misclassification errors. V… ▽ More

    Submitted 11 December, 2002; originally announced December 2002.

    Comments: 7 pages

    Report number: NRC-43671 ACM Class: I.2.6; I.5.2

    Journal ref: Workshop on Cost-Sensitive Learning at the Seventeenth International Conference on Machine Learning, (2000), Stanford University, California, 15-21

  41. arXiv:cs/0212033  [pdf

    cs.LG cs.CL cs.IR

    Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL

    Authors: Peter D. Turney

    Abstract: This paper presents a simple unsupervised learning algorithm for recognizing synonyms, based on statistical data acquired by querying a Web search engine. The algorithm, called PMI-IR, uses Pointwise Mutual Information (PMI) and Information Retrieval (IR) to measure the similarity of pairs of words. PMI-IR is empirically evaluated using 80 synonym test questions from the Test of English as a For… ▽ More

    Submitted 11 December, 2002; originally announced December 2002.

    Comments: 12 pages

    Report number: NRC-44893 ACM Class: I.2.6; I.2.7; H.3.1; H.3.3

    Journal ref: Proceedings of the Twelfth European Conference on Machine Learning, (2001), Freiburg, Germany, 491-502

  42. arXiv:cs/0212032  [pdf

    cs.LG cs.CL cs.IR

    Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews

    Authors: Peter D. Turney

    Abstract: This paper presents a simple unsupervised learning algorithm for classifying reviews as recommended (thumbs up) or not recommended (thumbs down). The classification of a review is predicted by the average semantic orientation of the phrases in the review that contain adjectives or adverbs. A phrase has a positive semantic orientation when it has good associations (e.g., "subtle nuances") and a n… ▽ More

    Submitted 11 December, 2002; originally announced December 2002.

    Comments: 8 pages

    Report number: NRC-44946 ACM Class: I.2.6; I.2.7; H.3.1; H.3.3

    Journal ref: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, (2002), Philadelphia, Pennsylvania, 417-424

  43. arXiv:cs/0212031  [pdf

    cs.LG cs.CE cs.CV

    Contextual Normalization Applied to Aircraft Gas Turbine Engine Diagnosis

    Authors: Peter D. Turney, Michael Halasz

    Abstract: Diagnosing faults in aircraft gas turbine engines is a complex problem. It involves several tasks, including rapid and accurate interpretation of patterns in engine sensor data. We have investigated contextual normalization for the development of a software tool to help engine repair technicians with interpretation of sensor data. Contextual normalization is a new strategy for employing machine… ▽ More

    Submitted 11 December, 2002; originally announced December 2002.

    Comments: 45 pages

    Report number: NRC-35028 ACM Class: I.2.6; I.5.4; J.2

    Journal ref: Journal of Applied Intelligence, (1993), 3, 109-129

  44. arXiv:cs/0212030  [pdf

    cs.LG cs.CV

    Theoretical Analyses of Cross-Validation Error and Voting in Instance-Based Learning

    Authors: Peter D. Turney

    Abstract: This paper begins with a general theory of error in cross-validation testing of algorithms for supervised learning from examples. It is assumed that the examples are described by attribute-value pairs, where the values are symbolic. Cross-validation requires a set of training examples and a set of testing examples. The value of the attribute that is to be predicted is known to the learner in the… ▽ More

    Submitted 11 December, 2002; originally announced December 2002.

    Comments: 48 pages

    Report number: NRC-35073 ACM Class: I.2.6; I.5.2

    Journal ref: Journal of Experimental and Theoretical Artificial Intelligence, (1994), 6, 331-360

  45. arXiv:cs/0212029  [pdf

    cs.LG cs.CV

    A Theory of Cross-Validation Error

    Authors: Peter D. Turney

    Abstract: This paper presents a theory of error in cross-validation testing of algorithms for predicting real-valued attributes. The theory justifies the claim that predicting real-valued attributes requires balancing the conflicting demands of simplicity and accuracy. Furthermore, the theory indicates precisely how these conflicting demands must be balanced, in order to minimize cross-validation error. A… ▽ More

    Submitted 11 December, 2002; originally announced December 2002.

    Comments: 48 pages

    Report number: NRC-35072 ACM Class: I.2.6; I.5.2

    Journal ref: Journal of Experimental and Theoretical Artificial Intelligence, (1994), 6, 361-391

  46. arXiv:cs/0212028  [pdf

    cs.LG cs.CV

    Technical Note: Bias and the Quantification of Stability

    Authors: Peter D. Turney

    Abstract: Research on bias in machine learning algorithms has generally been concerned with the impact of bias on predictive accuracy. We believe that there are other factors that should also play a role in the evaluation of bias. One such factor is the stability of the algorithm; in other words, the repeatability of the results. If we obtain two sets of data from the same phenomenon, with the same underl… ▽ More

    Submitted 11 December, 2002; originally announced December 2002.

    Comments: 14 pages

    Report number: NRC-38313 ACM Class: I.2.6; I.5.2

    Journal ref: Machine Learning, (1995), 20, 23-33

  47. arXiv:cs/0212023  [pdf

    cs.LG cs.NE

    How to Shift Bias: Lessons from the Baldwin Effect

    Authors: Peter D. Turney

    Abstract: An inductive learning algorithm takes a set of data as input and generates a hypothesis as output. A set of data is typically consistent with an infinite number of hypotheses; therefore, there must be factors other than the data that determine the output of the learning algorithm. In machine learning, these other factors are called the bias of the learner. Classical learning algorithms have a fi… ▽ More

    Submitted 10 December, 2002; originally announced December 2002.

    Comments: 36 pages

    Report number: NRC-40180 ACM Class: I.2.6; I.2.8

    Journal ref: Evolutionary Computation, (1996), 4 (3), 271-295

  48. arXiv:cs/0212021  [pdf

    cs.NE cs.CE q-bio.PE

    A Simple Model of Unbounded Evolutionary Versatility as a Largest-Scale Trend in Organismal Evolution

    Authors: Peter D. Turney

    Abstract: The idea that there are any large-scale trends in the evolution of biological organisms is highly controversial. It is commonly believed, for example, that there is a large-scale trend in evolution towards increasing complexity, but empirical and theoretical arguments undermine this belief. Natural selection results in organisms that are well adapted to their local environments, but it is not cl… ▽ More

    Submitted 10 December, 2002; originally announced December 2002.

    Comments: 32 pages

    Report number: NRC-43672 ACM Class: I.6.3; I.6.8; J.3

    Journal ref: Artificial Life, (2000), 6, 109-128

  49. arXiv:cs/0212020  [pdf

    cs.LG cs.CL cs.IR

    Learning Algorithms for Keyphrase Extraction

    Authors: Peter D. Turney

    Abstract: Many academic journals ask their authors to provide a list of about five to fifteen keywords, to appear on the first page of each article. Since these key words are often phrases of two or more words, we prefer to call them keyphrases. There is a wide variety of tasks for which keyphrases are useful, as we discuss in this paper. We approach the problem of automatically extracting keyphrases from… ▽ More

    Submitted 10 December, 2002; originally announced December 2002.

    Comments: 46 pages

    Report number: NRC-44105 ACM Class: H.3.1; H.3.3; I.2.6; I.2.7

    Journal ref: Information Retrieval, (2000), 2 (4), 303-336

  50. arXiv:cs/0212015  [pdf

    cs.CL

    Answering Subcognitive Turing Test Questions: A Reply to French

    Authors: Peter D. Turney

    Abstract: Robert French has argued that a disembodied computer is incapable of passing a Turing Test that includes subcognitive questions. Subcognitive questions are designed to probe the network of cultural and perceptual associations that humans naturally develop as we live, embodied and embedded in the world. In this paper, I show how it is possible for a disembodied computer to answer subcognitive que… ▽ More

    Submitted 9 December, 2002; originally announced December 2002.

    Comments: 15 pages

    Report number: NRC-44898 ACM Class: I.2.7

    Journal ref: Journal of Experimental and Theoretical Artificial Intelligence, (2001), 13 (4), 409-419