Skip to main content

Showing 1–50 of 56 results for author: Hashemi, H

.
  1. arXiv:2506.10257  [pdf, ps, other

    physics.med-ph

    Enhancing Ultrasound Molecular Imaging: Toward Real-Time RPCA-Based Filtering to Differentiate Bound and Free Microbubbles

    Authors: Hoda S. Hashemi, Dongwoon Hyun, Nathan Nguyen, Jihye Baek, Arutselvan Natarajan, Farbod Tabesh, Andrew Andrzejek, Ramasamy Paulmurugan, Jeremy J. Dahl

    Abstract: Ultrasound molecular imaging (UMI) is an advanced imaging modality that shows promise in detecting cancer at early stages. It uses microbubbles as contrast agents, which are functionalized to bind to cancer biomarkers overexpressed on endothelial cells. A major challenge in UMI is isolating bound microbubble signal, which represents the molecular imaging signal, from that of free-floating microbub… ▽ More

    Submitted 11 June, 2025; originally announced June 2025.

  2. arXiv:2505.06027  [pdf, other

    cs.CL cs.LG

    Unilogit: Robust Machine Unlearning for LLMs Using Uniform-Target Self-Distillation

    Authors: Stefan Vasilev, Christian Herold, Baohao Liao, Seyyed Hadi Hashemi, Shahram Khadivi, Christof Monz

    Abstract: This paper introduces Unilogit, a novel self-distillation method for machine unlearning in Large Language Models. Unilogit addresses the challenge of selectively forgetting specific information while maintaining overall model utility, a critical task in compliance with data privacy regulations like GDPR. Unlike prior methods that rely on static hyperparameters or starting model outputs, Unilogit d… ▽ More

    Submitted 9 May, 2025; originally announced May 2025.

    Comments: 16 pages, 6 figures, 5 tables, under review at ACL

    MSC Class: 68T50 ACM Class: I.2.7

  3. arXiv:2504.14690  [pdf

    cs.CL cs.AI

    FarsEval-PKBETS: A new diverse benchmark for evaluating Persian large language models

    Authors: Mehrnoush Shamsfard, Zahra Saaberi, Mostafa Karimi manesh, Seyed Mohammad Hossein Hashemi, Zahra Vatankhah, Motahareh Ramezani, Niki Pourazin, Tara Zare, Maryam Azimi, Sarina Chitsaz, Sama Khoraminejad, Morteza Mahdavi Mortazavi, Mohammad Mahdi Chizari, Sahar Maleki, Seyed Soroush Majd, Mostafa Masumi, Sayed Ali Musavi Khoeini, Amir Mohseni, Sogol Alipour

    Abstract: Research on evaluating and analyzing large language models (LLMs) has been extensive for resource-rich languages such as English, yet their performance in languages such as Persian has received considerably less attention. This paper introduces FarsEval-PKBETS benchmark, a subset of FarsEval project for evaluating large language models in Persian. This benchmark consists of 4000 questions and answ… ▽ More

    Submitted 20 April, 2025; originally announced April 2025.

    Comments: 24 pages, 3 figures, 3 tables

    MSC Class: 68T50 ACM Class: I.2.7; E.0

  4. arXiv:2503.19786  [pdf, other

    cs.CL cs.AI

    Gemma 3 Technical Report

    Authors: Gemma Team, Aishwarya Kamath, Johan Ferret, Shreya Pathak, Nino Vieillard, Ramona Merhej, Sarah Perrin, Tatiana Matejovicova, Alexandre Ramé, Morgane Rivière, Louis Rouillard, Thomas Mesnard, Geoffrey Cideron, Jean-bastien Grill, Sabela Ramos, Edouard Yvinec, Michelle Casbon, Etienne Pot, Ivo Penchev, Gaël Liu, Francesco Visin, Kathleen Kenealy, Lucas Beyer, Xiaohai Zhai, Anton Tsitsulin , et al. (191 additional authors not shown)

    Abstract: We introduce Gemma 3, a multimodal addition to the Gemma family of lightweight open models, ranging in scale from 1 to 27 billion parameters. This version introduces vision understanding abilities, a wider coverage of languages and longer context - at least 128K tokens. We also change the architecture of the model to reduce the KV-cache memory that tends to explode with long context. This is achie… ▽ More

    Submitted 25 March, 2025; originally announced March 2025.

  5. arXiv:2503.13089  [pdf, ps, other

    cs.CL cs.AI

    ClusComp: A Simple Paradigm for Model Compression and Efficient Finetuning

    Authors: Baohao Liao, Christian Herold, Seyyed Hadi Hashemi, Stefan Vasilev, Shahram Khadivi, Christof Monz

    Abstract: As large language models (LLMs) scale, model compression is crucial for edge deployment and accessibility. Weight-only quantization reduces model size but suffers from performance degradation at lower bit widths. Moreover, standard finetuning is incompatible with quantized models, and alternative methods often fall short of full finetuning. In this paper, we propose ClusComp, a simple yet effectiv… ▽ More

    Submitted 1 June, 2025; v1 submitted 17 March, 2025; originally announced March 2025.

    Comments: ACL camera-ready version

  6. arXiv:2503.02967  [pdf

    cs.CV cs.LG

    Revolutionizing Traffic Management with AI-Powered Machine Vision: A Step Toward Smart Cities

    Authors: Seyed Hossein Hosseini DolatAbadi, Sayyed Mohammad Hossein Hashemi, Mohammad Hosseini, Moein-Aldin AliHosseini

    Abstract: The rapid urbanization of cities and increasing vehicular congestion have posed significant challenges to traffic management and safety. This study explores the transformative potential of artificial intelligence (AI) and machine vision technologies in revolutionizing traffic systems. By leveraging advanced surveillance cameras and deep learning algorithms, this research proposes a system for real… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

    Comments: 6 pages, 1 figure, 2 tables, accepted to 1th AITC conference in University Of Isfahan

  7. arXiv:2501.09706  [pdf, other

    cs.CL

    Domain Adaptation of Foundation LLMs for e-Commerce

    Authors: Christian Herold, Michael Kozielski, Tala Bazazo, Pavel Petrushkov, Patrycja Cieplicka, Dominika Basaj, Yannick Versley, Seyyed Hadi Hashemi, Shahram Khadivi

    Abstract: We present the e-Llama models: 8 billion and 70 billion parameter large language models that are adapted towards the e-commerce domain. These models are meant as foundation models with deep knowledge about e-commerce, that form a base for instruction- and fine-tuning. The e-Llama models are obtained by continuously pretraining the Llama 3.1 base models on 1 trillion tokens of domain-specific data.… ▽ More

    Submitted 25 May, 2025; v1 submitted 16 January, 2025; originally announced January 2025.

    Comments: Accepted at ACL25 (Industry )

  8. LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts

    Authors: Helia Hashemi, Jason Eisner, Corby Rosset, Benjamin Van Durme, Chris Kedzie

    Abstract: This paper introduces a framework for the automated evaluation of natural language texts. A manually constructed rubric describes how to assess multiple dimensions of interest. To evaluate a text, a large language model (LLM) is prompted with each rubric question and produces a distribution over potential responses. The LLM predictions often fail to agree well with human judges -- indeed, the huma… ▽ More

    Submitted 30 December, 2024; originally announced January 2025.

    Comments: Updated version of 17 June 2024

    ACM Class: I.2.1; I.2.6; I.2.7

    Journal ref: Proceedings of ACL 2024 (Volume 1: Long Papers), pp. 13806-13834

  9. arXiv:2410.12380  [pdf, other

    cs.CL

    Evaluation of Attribution Bias in Retrieval-Augmented Large Language Models

    Authors: Amin Abolghasemi, Leif Azzopardi, Seyyed Hadi Hashemi, Maarten de Rijke, Suzan Verberne

    Abstract: Attributing answers to source documents is an approach used to enhance the verifiability of a model's output in retrieval augmented generation (RAG). Prior work has mainly focused on improving and evaluating the attribution quality of large language models (LLMs) in RAG, but this may come at the expense of inducing biases in the attribution of answers. We define and examine two aspects in the eval… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

  10. arXiv:2408.00118  [pdf, other

    cs.CL cs.AI

    Gemma 2: Improving Open Language Models at a Practical Size

    Authors: Gemma Team, Morgane Riviere, Shreya Pathak, Pier Giuseppe Sessa, Cassidy Hardin, Surya Bhupatiraju, Léonard Hussenot, Thomas Mesnard, Bobak Shahriari, Alexandre Ramé, Johan Ferret, Peter Liu, Pouya Tafti, Abe Friesen, Michelle Casbon, Sabela Ramos, Ravin Kumar, Charline Le Lan, Sammy Jerome, Anton Tsitsulin, Nino Vieillard, Piotr Stanczyk, Sertan Girgin, Nikola Momchev, Matt Hoffman , et al. (173 additional authors not shown)

    Abstract: In this work, we introduce Gemma 2, a new addition to the Gemma family of lightweight, state-of-the-art open models, ranging in scale from 2 billion to 27 billion parameters. In this new version, we apply several known technical modifications to the Transformer architecture, such as interleaving local-global attentions (Beltagy et al., 2020a) and group-query attention (Ainslie et al., 2023). We al… ▽ More

    Submitted 2 October, 2024; v1 submitted 31 July, 2024; originally announced August 2024.

  11. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1112 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 16 December, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  12. arXiv:2401.03302  [pdf

    eess.IV cs.AI cs.CV cs.LG stat.ML

    Realism in Action: Anomaly-Aware Diagnosis of Brain Tumors from Medical Images Using YOLOv8 and DeiT

    Authors: Seyed Mohammad Hossein Hashemi, Leila Safari, Amirhossein Dadashzadeh Taromi

    Abstract: In the field of medical sciences, reliable detection and classification of brain tumors from images remains a formidable challenge due to the rarity of tumors within the population of patients. Therefore, the ability to detect tumors in anomaly scenarios is paramount for ensuring timely interventions and improved patient outcomes. This study addresses the issue by leveraging deep learning (DL) tec… ▽ More

    Submitted 25 September, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

    Comments: This work has been submitted to the Elsevier for possible publication

  13. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1326 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 9 May, 2025; v1 submitted 18 December, 2023; originally announced December 2023.

  14. arXiv:2311.11816  [pdf, other

    eess.SY

    Hybrid Controller for Robot Manipulators in Task-Space with Visual-Inertial Feedback

    Authors: Seyed Hamed Hashemi, Jouni Mattila

    Abstract: This paper presents a visual-inertial-based control strategy to address the task space control problem of robot manipulators. To this end, an observer-based hybrid controller is employed to control end-effector motion. In addition, a hybrid observer is introduced for a visual-inertial navigation system to close the control loop directly at the Cartesian space by estimating the end-effector pose. A… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  15. arXiv:2309.00002  [pdf, other

    physics.med-ph eess.IV

    3D Ultrafast Shear Wave Absolute Vibro-Elastography using a Matrix Array Transducer

    Authors: Hoda S. Hashemi, Shahed K. Mohammed, Qi Zeng, Reza Zahiri Azar, Robert N. Rohling, Septimiu E. Salcudean

    Abstract: 3D ultrasound imaging provides more spatial information compared to conventional 2D frames by considering the volumes of data. One of the main bottlenecks of 3D imaging is the long data acquisition time which reduces practicality and can introduce artifacts from unwanted patient or sonographer motion. This paper introduces the first shear wave absolute vibro-elastography (S-WAVE) method with real-… ▽ More

    Submitted 22 May, 2023; originally announced September 2023.

  16. arXiv:2307.11749  [pdf, other

    cs.LG cs.CR

    Differentially Private Heavy Hitter Detection using Federated Analytics

    Authors: Karan Chadha, Junye Chen, John Duchi, Vitaly Feldman, Hanieh Hashemi, Omid Javidbakht, Audra McMillan, Kunal Talwar

    Abstract: In this work, we study practical heuristics to improve the performance of prefix-tree based algorithms for differentially private heavy hitter detection. Our model assumes each user has multiple data points and the goal is to learn as many of the most frequent data points as possible across all users' data with aggregate and local differential privacy. We propose an adaptive hyperparameter tuning… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

  17. arXiv:2307.05925  [pdf, other

    cs.IT eess.SP

    A Tractable Statistical Representation of IFTR Fading with Applications

    Authors: Maryam Olyaee, Hadi Hashemi, Juan M. Romero-Jerez

    Abstract: The recently introduced independent fluctuating two-ray (IFTR) fading model, consisting of two specular components fluctuating independently plus a diffuse component, has proven to provide an excellent fit to different wireless environments, including the millimeter-wave band. However, the original formulations of the probability density function (PDF) and cumulative distribution function (CDF) of… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: This work was submitted to the IEEE for publication

  18. arXiv:2307.02740  [pdf, other

    cs.IR cs.CL

    Dense Retrieval Adaptation using Target Domain Description

    Authors: Helia Hashemi, Yong Zhuang, Sachith Sri Ram Kothur, Srivas Prasad, Edgar Meij, W. Bruce Croft

    Abstract: In information retrieval (IR), domain adaptation is the process of adapting a retrieval model to a new domain whose data distribution is different from the source domain. Existing methods in this area focus on unsupervised domain adaptation where they have access to the target document collection or supervised (often few-shot) domain adaptation where they additionally have access to (limited) labe… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

  19. arXiv:2305.10403  [pdf, other

    cs.CL cs.AI

    PaLM 2 Technical Report

    Authors: Rohan Anil, Andrew M. Dai, Orhan Firat, Melvin Johnson, Dmitry Lepikhin, Alexandre Passos, Siamak Shakeri, Emanuel Taropa, Paige Bailey, Zhifeng Chen, Eric Chu, Jonathan H. Clark, Laurent El Shafey, Yanping Huang, Kathy Meier-Hellstern, Gaurav Mishra, Erica Moreira, Mark Omernick, Kevin Robinson, Sebastian Ruder, Yi Tay, Kefan Xiao, Yuanzhong Xu, Yujing Zhang, Gustavo Hernandez Abrego , et al. (103 additional authors not shown)

    Abstract: We introduce PaLM 2, a new state-of-the-art language model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. PaLM 2 is a Transformer-based model trained using a mixture of objectives. Through extensive evaluations on English and multilingual language, and reasoning tasks, we demonstrate that PaLM 2 has significantly improved quality on… ▽ More

    Submitted 13 September, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  20. arXiv:2302.04163  [pdf, ps, other

    eess.SY cs.RO

    Task Space Control of Robot Manipulators based on Visual SLAM

    Authors: Seyed Hamed Hashemi, Jouni Mattila

    Abstract: This paper aims to address the open problem of designing a globally stable vision-based controller for robot manipulators. Accordingly, based on a hybrid mechanism, this paper proposes a novel task-space control law attained by taking the gradient of a potential function in SE(3). The key idea is to employ the Visual Simultaneous Localization and Mapping (VSLAM) algorithm to estimate a robot pose.… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

  21. arXiv:2212.06712  [pdf, other

    cs.IT

    Analysis of the Outage Probability of Ground-Based Relaying for Satellite Systems

    Authors: Hadi Hashemi, Beatriz Soret, M. Carmen Aguayo-Torres

    Abstract: This paper investigates the theoretical basis for using ground relaying in multi-antenna satellites exposed to blocking situations. Inactive and unobstructed User Equipments (UEs) located on ground are the relaying nodes of UEs that are not in the field of view of the satellite. Exact closed-form relationships of the Signal-to-Noise Ratio (SNR) and the outage probability are obtained for the case… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

  22. arXiv:2212.06264  [pdf, other

    cs.CE cs.CR cs.DC cs.LG

    Data Leakage via Access Patterns of Sparse Features in Deep Learning-based Recommendation Systems

    Authors: Hanieh Hashemi, Wenjie Xiong, Liu Ke, Kiwan Maeng, Murali Annavaram, G. Edward Suh, Hsien-Hsin S. Lee

    Abstract: Online personalized recommendation services are generally hosted in the cloud where users query the cloud-based model to receive recommended input such as merchandise of interest or news feed. State-of-the-art recommendation models rely on sparse and dense features to represent users' profile information and the items they interact with. Although sparse features account for 99% of the total model… ▽ More

    Submitted 12 December, 2022; originally announced December 2022.

  23. Color Image steganography using Deep convolutional Autoencoders based on ResNet architecture

    Authors: Seyed Hesam Odin Hashemi, Mohammad-Hassan Majidi, Saeed Khorashadizadeh

    Abstract: In this paper, a deep learning color image steganography scheme combining convolutional autoencoders and ResNet architecture is proposed. Traditional steganography methods suffer from some critical defects such as low capacity, security, and robustness. In recent decades, image hiding and image extraction were realized by autoencoder convolutional neural networks to solve the aforementioned challe… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

  24. arXiv:2207.00083  [pdf, other

    cs.CR cs.AR cs.LG

    DarKnight: An Accelerated Framework for Privacy and Integrity Preserving Deep Learning Using Trusted Hardware

    Authors: Hanieh Hashemi, Yongqin Wang, Murali Annavaram

    Abstract: Privacy and security-related concerns are growing as machine learning reaches diverse application domains. The data holders want to train or infer with private data while exploiting accelerators, such as GPUs, that are hosted in the cloud. Cloud systems are vulnerable to attackers that compromise the privacy of data and integrity of computations. Tackling such a challenge requires unifying theoret… ▽ More

    Submitted 30 June, 2022; originally announced July 2022.

    Comments: MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture. arXiv admin note: text overlap with arXiv:2105.00334

  25. A Global Asymptotic Convergent Observer for SLAM

    Authors: Seyed Hamed Hashemi, Jouni Mattila

    Abstract: This paper examines the global convergence problem of SLAM algorithms, an issue that faces topological obstructions. This is because the state-space of attitude dynamics is defined on a non-contractible manifold: the special orthogonal group of order three SO(3). Therefore, this paper presents a novel, gradient-based hybrid observer to overcome these topological obstacles. The Lyapunov stability t… ▽ More

    Submitted 4 May, 2022; originally announced May 2022.

    Comments: 7 pages, 8 figures, conference

  26. arXiv:2203.13949  [pdf, other

    eess.IV physics.med-ph

    Ultrafast Ultrasound Imaging for 3D Shear Wave Absolute Vibro-Elastography

    Authors: Hoda S. Hashemi, Reza Zahiri Azar, Septimiu E. Salcudean, Robert N. Rohling

    Abstract: Shear wave absolute vibro-elastography (S-WAVE) is an imaging technique that generates steady-state shear waves inside the tissue using multi-frequency excitation from an external vibration source. In this work, plane wave imaging is introduced to reduce total acquisition time while retaining the benefit of 3D formulation. Plane wave imaging with a frame rate of 3000 frames/s is followed by 3D abs… ▽ More

    Submitted 25 July, 2023; v1 submitted 25 March, 2022; originally announced March 2022.

  27. arXiv:2112.13416  [pdf, other

    cs.CR cs.LG cs.MM

    Attribute Inference Attack of Speech Emotion Recognition in Federated Learning Settings

    Authors: Tiantian Feng, Hanieh Hashemi, Rajat Hebbar, Murali Annavaram, Shrikanth S. Narayanan

    Abstract: Speech emotion recognition (SER) processes speech signals to detect and characterize expressed perceived emotions. Many SER application systems often acquire and transmit speech data collected at the client-side to remote cloud platforms for inference and decision making. However, speech data carry rich information not only about emotions conveyed in vocal expressions, but also other sensitive dem… ▽ More

    Submitted 22 December, 2022; v1 submitted 26 December, 2021; originally announced December 2021.

  28. arXiv:2111.12179  [pdf, other

    eess.SP

    Multifrequency 3D Elasticity Reconstruction withStructured Sparsity and ADMM

    Authors: Shahed Mohammed, Mohammad Honarvar, Qi Zeng, Hoda Hashemi, Robert Rohling, Piotr Kozlowski, Septimiu Salcudean

    Abstract: We introduce a model-based iterative method to obtain shear modulus images of tissue using magnetic resonance elastography. The method jointly finds the displacement field that best fits multifrequency tissue displacement data and the corresponding shear modulus. The displacement satisfies a viscoelastic wave equation constraint, discretized using the finite element method. Sparsifying regularizat… ▽ More

    Submitted 23 November, 2021; originally announced November 2021.

  29. arXiv:2107.12958  [pdf, other

    cs.DC cs.CR cs.IT cs.LG

    Adaptive Verifiable Coded Computing: Towards Fast, Secure and Private Distributed Machine Learning

    Authors: Tingting Tang, Ramy E. Ali, Hanieh Hashemi, Tynan Gangwani, Salman Avestimehr, Murali Annavaram

    Abstract: Stragglers, Byzantine workers, and data privacy are the main bottlenecks in distributed cloud computing. Some prior works proposed coded computing strategies to jointly address all three challenges. They require either a large number of workers, a significant communication cost or a significant computational complexity to tolerate Byzantine workers. Much of the overhead in prior schemes comes from… ▽ More

    Submitted 22 March, 2022; v1 submitted 27 July, 2021; originally announced July 2021.

  30. arXiv:2106.15085  [pdf, other

    cs.CL

    Automatic Construction of Enterprise Knowledge Base

    Authors: Junyi Chai, Yujie He, Homa Hashemi, Bing Li, Daraksha Parveen, Ranganath Kondapally, Wenjin Xu

    Abstract: In this paper, we present an automatic knowledge base construction system from large scale enterprise documents with minimal efforts of human intervention. In the design and deployment of such a knowledge mining system for enterprise, we faced several challenges including data distributional shift, performance evaluation, compliance requirements and other practical issues. We leveraged state-of-th… ▽ More

    Submitted 29 June, 2021; originally announced June 2021.

  31. arXiv:2106.09227  [pdf, other

    cs.IR

    Current Challenges and Future Directions in Podcast Information Access

    Authors: Rosie Jones, Hamed Zamani, Markus Schedl, Ching-Wei Chen, Sravana Reddy, Ann Clifton, Jussi Karlgren, Helia Hashemi, Aasish Pappu, Zahra Nazari, Longqi Yang, Oguz Semerci, Hugues Bouchard, Ben Carterette

    Abstract: Podcasts are spoken documents across a wide-range of genres and styles, with growing listenership across the world, and a rapidly lowering barrier to entry for both listeners and creators. The great strides in search and recommendation in research and industry have yet to see impact in the podcast space, where recommendations are still largely driven by word of mouth. In this perspective paper, we… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: SIGIR 2021

  32. arXiv:2105.02295  [pdf, other

    cs.CR cs.AR cs.LG

    Byzantine-Robust and Privacy-Preserving Framework for FedML

    Authors: Hanieh Hashemi, Yongqin Wang, Chuan Guo, Murali Annavaram

    Abstract: Federated learning has emerged as a popular paradigm for collaboratively training a model from data distributed among a set of clients. This learning setting presents, among others, two unique challenges: how to protect privacy of the clients' data during training, and how to ensure integrity of the trained model. We propose a two-pronged solution that aims to address both challenges under a singl… ▽ More

    Submitted 5 May, 2021; originally announced May 2021.

    Journal ref: Security and Safety in Machine Learning Systems Workshop in ICLR 2021

  33. arXiv:2105.00334  [pdf, other

    cs.CR cs.AR cs.LG

    Privacy and Integrity Preserving Training Using Trusted Hardware

    Authors: Hanieh Hashemi, Yongqin Wang, Murali Annavaram

    Abstract: Privacy and security-related concerns are growing as machine learning reaches diverse application domains. The data holders want to train with private data while exploiting accelerators, such as GPUs, that are hosted in the cloud. However, Cloud systems are vulnerable to attackers that compromise the privacy of data and integrity of computations. This work presents DarKnight, a framework for large… ▽ More

    Submitted 1 May, 2021; originally announced May 2021.

    Journal ref: Distributed and Private Machine Learning ICLR 2021 Workshop

  34. arXiv:2103.03221  [pdf, ps, other

    cs.LG q-bio.QM

    GenoML: Automated Machine Learning for Genomics

    Authors: Mary B. Makarious, Hampton L. Leonard, Dan Vitale, Hirotaka Iwaki, David Saffo, Lana Sargent, Anant Dadu, Eduardo Salmerón Castaño, John F. Carter, Melina Maleknia, Juan A. Botia, Cornelis Blauwendraat, Roy H. Campbell, Sayed Hadi Hashemi, Andrew B. Singleton, Mike A. Nalls, Faraz Faghri

    Abstract: GenoML is a Python package automating machine learning workflows for genomics (genetics and multi-omics) with an open science philosophy. Genomics data require significant domain expertise to clean, pre-process, harmonize and perform quality control of the data. Furthermore, tuning, validation, and interpretation involve taking into account the biology and possibly the limitations of the underlyin… ▽ More

    Submitted 4 March, 2021; originally announced March 2021.

  35. arXiv:2010.07541  [pdf, other

    cs.DC

    Secure and Fault Tolerant Decentralized Learning

    Authors: Saurav Prakash, Hanieh Hashemi, Yongqin Wang, Murali Annavaram, Salman Avestimehr

    Abstract: Federated learning (FL) is a promising paradigm for training a global model over data distributed across multiple data owners without centralizing clients' raw data. However, sharing of local model updates can also reveal information of clients' local datasets. Trusted execution environments (TEEs) within the FL server have been recently deployed by companies like Meta for secure aggregation. Howe… ▽ More

    Submitted 13 September, 2022; v1 submitted 15 October, 2020; originally announced October 2020.

  36. arXiv:2006.07548  [pdf, other

    cs.IR cs.CL cs.LG

    Guided Transformer: Leveraging Multiple External Sources for Representation Learning in Conversational Search

    Authors: Helia Hashemi, Hamed Zamani, W. Bruce Croft

    Abstract: Asking clarifying questions in response to ambiguous or faceted queries has been recognized as a useful technique for various information retrieval systems, especially conversational search systems with limited bandwidth interfaces. Analyzing and generating clarifying questions have been studied recently but the accurate utilization of user responses to clarifying questions has been relatively les… ▽ More

    Submitted 12 June, 2020; originally announced June 2020.

    Comments: To appear in the Proceedings of ACM SIGIR 2020. 10 pages

  37. arXiv:2006.01300  [pdf, other

    cs.CR

    DarKnight: A Data Privacy Scheme for Training and Inference of Deep Neural Networks

    Authors: Hanieh Hashemi, Yongqin Wang, Murali Annavaram

    Abstract: Protecting the privacy of input data is of growing importance as machine learning methods reach new application domains. In this paper, we provide a unified training and inference framework for large DNNs while protecting input privacy and computation integrity. Our approach called DarKnight uses a novel data blinding strategy using matrix masking to create input obfuscation within a trusted execu… ▽ More

    Submitted 15 October, 2020; v1 submitted 1 June, 2020; originally announced June 2020.

  38. arXiv:2004.14020  [pdf, other

    cs.NI cs.DC cs.LG

    Caramel: Accelerating Decentralized Distributed Deep Learning with Computation Scheduling

    Authors: Sayed Hadi Hashemi, Sangeetha Abdu Jyothi, Brighten Godfrey, Roy Campbell

    Abstract: The method of choice for parameter aggregation in Deep Neural Network (DNN) training, a network-intensive task, is shifting from the Parameter Server model to decentralized aggregation schemes (AllReduce) inspired by theoretical guarantees of better performance. However, current implementations of AllReduce overlook the interdependence of communication and computation, resulting in significant per… ▽ More

    Submitted 29 April, 2020; originally announced April 2020.

  39. arXiv:1905.08957  [pdf, other

    cs.IR cs.CL

    ANTIQUE: A Non-Factoid Question Answering Benchmark

    Authors: Helia Hashemi, Mohammad Aliannejadi, Hamed Zamani, W. Bruce Croft

    Abstract: Considering the widespread use of mobile and voice search, answer passage retrieval for non-factoid questions plays a critical role in modern information retrieval systems. Despite the importance of the task, the community still feels the significant lack of large-scale non-factoid question answering collections with real questions and comprehensive relevance judgments. In this paper, we develop a… ▽ More

    Submitted 19 August, 2019; v1 submitted 22 May, 2019; originally announced May 2019.

  40. arXiv:1905.04264  [pdf, other

    cs.DC

    PartitionedVC: Partitioned External Memory Graph Analytics Framework for SSDs

    Authors: Kiran Kumar Matam, Hanieh Hashemi, Murali Annavaram

    Abstract: Graph analytics are at the heart of a broad range of applications such as drug discovery, page ranking, and recommendation systems. When graph size exceeds memory size, out-of-core graph processing is needed. For the widely used external memory graph processing systems, accessing storage becomes the bottleneck. We make the observation that nearly all graph algorithms have a dynamically varying num… ▽ More

    Submitted 11 February, 2020; v1 submitted 10 May, 2019; originally announced May 2019.

    Comments: 13 pages

  41. arXiv:1904.06578  [pdf

    physics.comp-ph cs.LG

    Deep-learning PDEs with unlabeled data and hardwiring physics laws

    Authors: S. Mohammad H. Hashemi, Demetri Psaltis

    Abstract: Providing fast and accurate solutions to partial differential equations is a problem of continuous interest to the fields of applied mathematics and physics. With the recent advances in machine learning, the adoption learning techniques in this domain is being eagerly pursued. We build upon earlier works on linear and homogeneous PDEs, and develop convolutional deep neural networks that can accura… ▽ More

    Submitted 13 April, 2019; originally announced April 2019.

  42. arXiv:1812.04401  [pdf, other

    cs.CC

    Output-Oblivious Stochastic Chemical Reaction Networks

    Authors: Ben Chugg, Anne Condon, Hooman Hashemi

    Abstract: We classify the functions $f:\mathbb{N}^2 \rightarrow \mathbb{N}$ which are stably computable by output-oblivious Stochastic Chemical Reaction Networks (CRNs), i.e., systems of reactions in which output species are never reactants. While it is known that precisely the semilinear functions are stably computable by CRNs, such CRNs sometimes rely on initially producing too many output species and the… ▽ More

    Submitted 30 August, 2022; v1 submitted 7 December, 2018; originally announced December 2018.

    Comments: Published in OPODIS 2018. Latest version adds appendix containing all proofs

  43. arXiv:1810.09819  [pdf, other

    astro-ph.SR astro-ph.GA

    3D mapping of young stars in the solar neighbourhood with Gaia DR2

    Authors: E. Zari, H. Hashemi, A. G. A. Brown, K. Jardine, P. T. de Zeeuw

    Abstract: We study the three dimensional arrangement of young stars in the solar neighbourhood using the second release of the Gaia mission (Gaia DR2) and we provide a new, original view of the spatial configuration of the star forming regions within 500 pc from the Sun. By smoothing the star distribution through a gaussian filter, we construct three dimensional density maps for early-type stars (upper-main… ▽ More

    Submitted 6 November, 2018; v1 submitted 23 October, 2018; originally announced October 2018.

    Comments: 17 pages, 17 figures, 6 appendixes; accepted for publication in A&A; image quality decreased to comply with the arXiv.org rules on file size

    Journal ref: A&A 620, A172 (2018)

  44. arXiv:1810.01953  [pdf

    cond-mat.mtrl-sci

    The Effects Of Longitudinal And Circumferential Cracks On The Torsional Dynamic Response Of Shafts

    Authors: Mohsen Nabian, Hamid Nayeb Hashemi

    Abstract: Turbo-generators shafts are manufactured through the extrusion process. This results in the formation of weak planes along the extrusion process. It has been observed that large longitudinal cracks often form in these shafts before any circumferential cracks when these shafts are subjected to cyclic torsion due to electrical line faults. The presence of these cracks could severely compromise the s… ▽ More

    Submitted 11 August, 2018; originally announced October 2018.

  45. arXiv:1803.03288  [pdf, other

    cs.DC cs.LG cs.PF

    TicTac: Accelerating Distributed Deep Learning with Communication Scheduling

    Authors: Sayed Hadi Hashemi, Sangeetha Abdu Jyothi, Roy H. Campbell

    Abstract: State-of-the-art deep learning systems rely on iterative distributed training to tackle the increasing complexity of models and input data. The iteration time in these communication-heavy systems depends on the computation time, communication time and the extent of overlap of computation and communication. In this work, we identify a shortcoming in systems with graph representation for computati… ▽ More

    Submitted 3 October, 2018; v1 submitted 8 March, 2018; originally announced March 2018.

  46. arXiv:1710.00112  [pdf

    cs.DC cs.LG stat.ML

    Toward Scalable Machine Learning and Data Mining: the Bioinformatics Case

    Authors: Faraz Faghri, Sayed Hadi Hashemi, Mohammad Babaeizadeh, Mike A. Nalls, Saurabh Sinha, Roy H. Campbell

    Abstract: In an effort to overcome the data deluge in computational biology and bioinformatics and to facilitate bioinformatics research in the era of big data, we identify some of the most influential algorithms that have been widely used in the bioinformatics community. These top data mining and machine learning algorithms cover classification, clustering, regression, graphical model-based learning, and d… ▽ More

    Submitted 29 September, 2017; originally announced October 2017.

  47. arXiv:1710.00110  [pdf, other

    cs.CR

    Decentralized User-Centric Access Control using PubSub over Blockchain

    Authors: Sayed Hadi Hashemi, Faraz Faghri, Roy H Campbell

    Abstract: We present a mechanism that puts users in the center of control and empowers them to dictate the access to their collections of data. Revisiting the fundamental mechanisms in security for providing protection, our solution uses capabilities, access lists, and access rights following well-understood formal notions for reasoning about access. This contribution presents a practical, correct, auditabl… ▽ More

    Submitted 29 September, 2017; originally announced October 2017.

  48. arXiv:1612.00521  [pdf, other

    cs.DC

    Performance Modeling of Distributed Deep Neural Networks

    Authors: Sayed Hadi Hashemi, Shadi A. Noghabi, William Gropp, Roy H Campbell

    Abstract: During the past decade, machine learning has become extremely popular and can be found in many aspects of our every day life. Nowayadays with explosion of data while rapid growth of computation capacity, Distributed Deep Neural Networks (DDNNs) which can improve their performance linearly with more computation resources, have become hot and trending. However, there has not been an in depth study o… ▽ More

    Submitted 14 December, 2016; v1 submitted 1 December, 2016; originally announced December 2016.

  49. arXiv:1607.04768  [pdf, other

    math.CO cs.DM

    Hoffmann-Ostenhof's conjecture for traceable cubic graphs

    Authors: F. Abdolhosseini, S. Akbari, H. Hashemi, M. S. Moradian

    Abstract: It was conjectured by Hoffmann-Ostenhof that the edge set of every connected cubic graph can be decomposed into a spanning tree, a matching and a family of cycles. In this paper, we show that this conjecture holds for traceable cubic graphs.

    Submitted 16 July, 2016; originally announced July 2016.

    MSC Class: 05C45; 05C70 (Primary)

  50. arXiv:1409.7637  [pdf

    cs.NI

    Experimental Demonstration of Nanosecond Accuracy Wireless Network Synchronization

    Authors: Marcelo Segura, S. Niranjayan, Hossein Hashemi, Andreas F. Molisch

    Abstract: Accurate wireless timing synchronization has been an extremely important topic in wireless sensor networks, required in applications ranging from distributed beam forming to precision localization and navigation. However, it is very challenging to realize, in particular when the required accuracy should be better than the runtime between the nodes. This work presents, to our knowledge for the firs… ▽ More

    Submitted 26 September, 2014; originally announced September 2014.

    Comments: Submitted to ICC 2015