Skip to main content

Showing 1–8 of 8 results for author: Campbell, W M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2409.16497  [pdf, other

    cs.AI

    Unsupervised Text Representation Learning via Instruction-Tuning for Zero-Shot Dense Retrieval

    Authors: Qiuhai Zeng, Zimeng Qiu, Dae Yon Hwang, Xin He, William M. Campbell

    Abstract: Dense retrieval systems are commonly used for information retrieval (IR). They rely on learning text representations through an encoder and usually require supervised modeling via labelled data which can be costly to obtain or simply unavailable. In this study, we introduce a novel unsupervised text representation learning technique via instruction-tuning the pre-trained encoder-decoder large lang… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

    Comments: Accepted at DCAI24 workshop@CIKM2024

  2. arXiv:1910.04196  [pdf, other

    cs.CL

    Efficient Semi-Supervised Learning for Natural Language Understanding by Optimizing Diversity

    Authors: Eunah Cho, He Xie, John P. Lalor, Varun Kumar, William M. Campbell

    Abstract: Expanding new functionalities efficiently is an ongoing challenge for single-turn task-oriented dialogue systems. In this work, we explore functionality-specific semi-supervised learning via self-training. We consider methods that augment training data automatically from unlabeled data sets in a functionality-targeted manner. In addition, we examine multiple techniques for efficient selection of a… ▽ More

    Submitted 9 October, 2019; originally announced October 2019.

    Comments: IEEE Copyright. To appear at ASRU 2019

  3. arXiv:1704.05516  [pdf, other

    cs.SI

    Graph Model Selection via Random Walks

    Authors: Lin Li, William M. Campbell, Rajmonda S. Caceres

    Abstract: In this paper, we present a novel approach based on the random walk process for finding meaningful representations of a graph model. Our approach leverages the transient behavior of many short random walks with novel initialization mechanisms to generate model discriminative features. These features are able to capture a more comprehensive structural signature of the underlying graph model. The re… ▽ More

    Submitted 10 May, 2018; v1 submitted 18 April, 2017; originally announced April 2017.

  4. arXiv:1704.05505  [pdf, other

    cs.SI

    Making Sense of Unstructured Text Data

    Authors: Lin Li, William M. Campbell, Cagri Dagli, Joseph P. Campbell

    Abstract: Many network analysis tasks in social sciences rely on pre-existing data sources that were created with explicit relations or interactions between entities under consideration. Examples include email logs, friends and followers networks on social media, communication networks, etc. In these data, it is relatively easy to identify who is connected to whom and how they are connected. However, most o… ▽ More

    Submitted 18 April, 2017; originally announced April 2017.

  5. arXiv:1702.07680  [pdf, other

    cs.CL cs.IR stat.ML

    Consistent Alignment of Word Embedding Models

    Authors: Cem Safak Sahin, Rajmonda S. Caceres, Brandon Oselio, William M. Campbell

    Abstract: Word embedding models offer continuous vector representations that can capture rich contextual semantics based on their word co-occurrence patterns. While these word vectors can provide very effective features used in many NLP tasks such as clustering similar words and inferring learning relationships, many challenges and open research questions remain. In this paper, we propose a solution that al… ▽ More

    Submitted 24 February, 2017; originally announced February 2017.

    Comments: 4 pages, 2 figures

  6. arXiv:1609.04859  [pdf, other

    cs.SI physics.soc-ph

    Model Selection Framework for Graph-based data

    Authors: Rajmonda S. Caceres, Leah Weiner, Matthew C. Schmidt, Benjamin A. Miller, William M. Campbell

    Abstract: Graphs are powerful abstractions for capturing complex relationships in diverse application settings. An active area of research focuses on theoretical models that define the generative mechanism of a graph. Yet given the complexity and inherent noise in real datasets, it is still very challenging to identify the best model for a given observed graph. We discuss a framework for graph model selecti… ▽ More

    Submitted 15 September, 2016; originally announced September 2016.

    Comments: 7 pages

  7. arXiv:1608.01386  [pdf, other

    cs.SI

    Cross-Domain Entity Resolution in Social Media

    Authors: W. M. Campbell, Lin Li, C. Dagli, J. Acevedo-Aviles, K. Geyer, J. P. Campbell, C. Priebe

    Abstract: The challenge of associating entities across multiple domains is a key problem in social media understanding. Successful cross-domain entity resolution provides integration of information from multiple sites to create a complete picture of user and community activities, characteristics, and trends. In this work, we examine the problem of entity resolution across Twitter and Instagram using general… ▽ More

    Submitted 3 August, 2016; originally announced August 2016.

    Journal ref: The 4th International Workshop on Natural Language Processing for Social Media, 2016

  8. arXiv:1608.01373  [pdf, other

    cs.SI physics.soc-ph

    Matching Community Structure Across Online Social Networks

    Authors: Lin Li, W. M. Campbell

    Abstract: The discovery of community structure in networks is a problem of considerable interest in recent years. In online social networks, often times, users are simultaneously involved in multiple social media sites, some of which share common social relationships. It is of great interest to uncover a shared community structure across these networks. However, in reality, users typically identify themselv… ▽ More

    Submitted 3 August, 2016; originally announced August 2016.

    Journal ref: Workshop on Networks in the Social and Information Sciences, NIPS 2015