Skip to main content

Showing 1–4 of 4 results for author: Jo, U E S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.14174  [pdf, ps, other

    cs.CL cs.LG

    Cheaper, Better, Faster, Stronger: Robust Text-to-SQL without Chain-of-Thought or Fine-Tuning

    Authors: Yusuf Denizay Dönder, Derek Hommel, Andrea W Wen-Yi, David Mimno, Unso Eun Seo Jo

    Abstract: LLMs are effective at code generation tasks like text-to-SQL, but is it worth the cost? Many state-of-the-art approaches use non-task-specific LLM techniques including Chain-of-Thought (CoT), self-consistency, and fine-tuning. These methods can be costly at inference time, sometimes requiring over a hundred LLM calls with reasoning, incurring average costs of up to \… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  2. arXiv:2504.00289  [pdf, other

    cs.CL cs.AI cs.CY

    Do Chinese models speak Chinese languages?

    Authors: Andrea W Wen-Yi, Unso Eun Seo Jo, David Mimno

    Abstract: The release of top-performing open-weight LLMs has cemented China's role as a leading force in AI development. Do these models support languages spoken in China? Or do they speak the same languages as Western models? Comparing multilingual capabilities is important for two reasons. First, language ability provides insights into pre-training data curation, and thus into resource allocation and deve… ▽ More

    Submitted 7 April, 2025; v1 submitted 31 March, 2025; originally announced April 2025.

    Comments: First and second author contribute equally

  3. arXiv:2407.09652  [pdf, other

    cs.CL

    How Chinese are Chinese Language Models? The Puzzling Lack of Language Policy in China's LLMs

    Authors: Andrea W Wen-Yi, Unso Eun Seo Jo, Lu Jia Lin, David Mimno

    Abstract: Contemporary language models are increasingly multilingual, but Chinese LLM developers must navigate complex political and business considerations of language diversity. Language policy in China aims at influencing the public discourse and governing a multi-ethnic society, and has gradually transitioned from a pluralist to a more assimilationist approach since 1949. We explore the impact of these… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Wen-Yi and Jo contributed equally to this work

  4. arXiv:2209.11055  [pdf, other

    cs.CL

    Efficient Few-Shot Learning Without Prompts

    Authors: Lewis Tunstall, Nils Reimers, Unso Eun Seo Jo, Luke Bates, Daniel Korat, Moshe Wasserblat, Oren Pereg

    Abstract: Recent few-shot methods, such as parameter-efficient fine-tuning (PEFT) and pattern exploiting training (PET), have achieved impressive results in label-scarce settings. However, they are difficult to employ since they are subject to high variability from manually crafted prompts, and typically require billion-parameter language models to achieve high accuracy. To address these shortcomings, we pr… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.