Showing 1–46 of 46 results for author: Bose, J

Searching in archive cs.
  1. arXiv:2506.04439  [pdf, ps, other]

    cs.LG

    RETRO SYNFLOW: Discrete Flow Matching for Accurate and Diverse Single-Step Retrosynthesis

    Authors: Robin Yadav, Qi Yan, Guy Wolf, Avishek Joey Bose, Renjie Liao

    Abstract: A fundamental problem in organic chemistry is identifying and predicting the series of reactions that synthesize a desired target product molecule. Due to the combinatorial nature of the chemical search space, single-step reactant prediction -- i.e. single-step retrosynthesis -- remains challenging even for existing state-of-the-art template-free generative approaches to produce an accurate yet di…

    Submitted 4 June, 2025; originally announced June 2025.

  2. arXiv:2506.01158  [pdf, ps, other]

    cs.LG cs.AI stat.ML

    FORT: Forward-Only Regression Training of Normalizing Flows

    Authors: Danyal Rehman, Oscar Davis, Jiarui Lu, Jian Tang, Michael Bronstein, Yoshua Bengio, Alexander Tong, Avishek Joey Bose

    Abstract: Simulation-free training frameworks have been at the forefront of the generative modelling revolution in continuous spaces, leading to neural dynamical systems that encompass modern large-scale diffusion and flow matching models. Despite the scalability of training, the generation of high-quality samples and their corresponding likelihood under the model requires expensive numerical simulation --…

    Submitted 1 June, 2025; originally announced June 2025.

    Comments: Preprint

  3. A Hybrid Framework for Real-Time Data Drift and Anomaly Identification Using Hierarchical Temporal Memory and Statistical Tests

    Authors: Subhadip Bandyopadhyay, Joy Bose, Sujoy Roy Chowdhury

    Abstract: Data Drift is the phenomenon where the generating model behind the data changes over time. Due to data drift, any model built on the past training data becomes less relevant and inaccurate over time. Thus, detecting and controlling for data drift is critical in machine learning models. Hierarchical Temporal Memory (HTM) is a machine learning model developed by Jeff Hawkins, inspired by how the hum…

    Submitted 24 April, 2025; originally announced April 2025.

    Comments: 26 pages, 9 figures

    MSC Class: 62M10; 62P30; 68T07 ACM Class: G.3; I.2.6; I.2.7; H.2.8; H.3.3

    Journal ref: International Journal of Mathematical, Engineering and Management Sciences, Vol. 10, No. 3, 777-796, 2025

  4. arXiv:2503.05394  [pdf, other]

    cs.SE cs.AI

    Static Program Analysis Guided LLM Based Unit Test Generation

    Authors: Sujoy Roychowdhury, Giriprasad Sridhara, A K Raghavan, Joy Bose, Sourav Mazumdar, Hamender Singh, Srinivasan Bajji Sugumaran, Ricardo Britto

    Abstract: We describe a novel approach to automating unit test generation for Java methods using large language models (LLMs). Existing LLM-based approaches rely on sample usage(s) of the method to test (focal method) and/or provide the entire class of the focal method as input prompt and context. The former approach is often not viable due to the lack of sample usages, especially for newly written focal me…

    Submitted 7 March, 2025; originally announced March 2025.

  5. arXiv:2502.18462  [pdf, ps, other]

    cs.LG cs.AI

    Scalable Equilibrium Sampling with Sequential Boltzmann Generators

    Authors: Charlie B. Tan, Avishek Joey Bose, Chen Lin, Leon Klein, Michael M. Bronstein, Alexander Tong

    Abstract: Scalable sampling of molecular states in thermodynamic equilibrium is a long-standing challenge in statistical physics. Boltzmann generators tackle this problem by pairing normalizing flows with importance sampling to obtain uncorrelated samples under the target distribution. In this paper, we extend the Boltzmann generator framework with two key contributions, denoting our framework Sequential Bo…

    Submitted 10 June, 2025; v1 submitted 25 February, 2025; originally announced February 2025.

    Comments: Presented at ICML 2025

  6. arXiv:2502.03540  [pdf, other]

    cs.LG cs.AI

    Path Planning for Masked Diffusion Model Sampling

    Authors: Fred Zhangzhi Peng, Zachary Bezemek, Sawan Patel, Jarrid Rector-Brooks, Sherwood Yao, Avishek Joey Bose, Alexander Tong, Pranam Chatterjee

    Abstract: Any order generation of discrete data using masked diffusion models (MDMs) offers a compelling alternative to traditional autoregressive models, especially in domains that lack a natural causal ordering of data. However, current popular MDMs depart from their successful continuous diffusion model counterparts with simplified masked inference wherein unmasked tokens cannot be iteratively refined --…

    Submitted 27 May, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

  7. arXiv:2412.17762  [pdf, other]

    cs.LG

    The Superposition of Diffusion Models Using the Itô Density Estimator

    Authors: Marta Skreta, Lazar Atanackovic, Avishek Joey Bose, Alexander Tong, Kirill Neklyudov

    Abstract: The Cambrian explosion of easily accessible pre-trained diffusion models suggests a demand for methods that combine multiple different pre-trained diffusion models without incurring the significant computational burden of re-training a larger combined model. In this paper, we cast the problem of combining multiple pre-trained diffusion models at the generation stage under a novel proposed framewor…

    Submitted 28 February, 2025; v1 submitted 23 December, 2024; originally announced December 2024.

    Comments: Accepted as a Spotlight Presentation at the International Conference on Learning Representations 2025

  8. arXiv:2410.08134  [pdf, other]

    cs.LG cs.AI

    Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction

    Authors: Jarrid Rector-Brooks, Mohsin Hasan, Zhangzhi Peng, Zachary Quinn, Chenghao Liu, Sarthak Mittal, Nouha Dziri, Michael Bronstein, Yoshua Bengio, Pranam Chatterjee, Alexander Tong, Avishek Joey Bose

    Abstract: Generative modeling of discrete data underlies important applications spanning text-based agents like ChatGPT to the design of the very building blocks of life in protein sequences. However, application domains need to exert control over the generated data by steering the generative process - typically via RLHF - to satisfy a specified property, reward, or affinity metric. In this paper, we study…

    Submitted 10 October, 2024; originally announced October 2024.

  9. arXiv:2407.09499  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Self-Consuming Generative Models with Curated Data Provably Optimize Human Preferences

    Authors: Damien Ferbach, Quentin Bertrand, Avishek Joey Bose, Gauthier Gidel

    Abstract: The rapid progress in generative models has resulted in impressive leaps in generation quality, blurring the lines between synthetic and real data. Web-scale datasets are now prone to the inevitable contamination by synthetic data, directly impacting the training of future generated models. Already, some theoretical results on self-consuming generative models (a.k.a., iterative retraining) have em…

    Submitted 12 June, 2024; originally announced July 2024.

    MSC Class: 68T10 ACM Class: I.2.6

  10. arXiv:2405.20313  [pdf, other]

    cs.LG q-bio.BM

    Sequence-Augmented SE(3)-Flow Matching For Conditional Protein Backbone Generation

    Authors: Guillaume Huguet, James Vuckovic, Kilian Fatras, Eric Thibodeau-Laufer, Pablo Lemos, Riashat Islam, Cheng-Hao Liu, Jarrid Rector-Brooks, Tara Akhound-Sadegh, Michael Bronstein, Alexander Tong, Avishek Joey Bose

    Abstract: Proteins are essential for almost all biological processes and derive their diverse functions from complex 3D structures, which are in turn determined by their amino acid sequences. In this paper, we exploit the rich biological inductive bias of amino acid sequences and introduce FoldFlow-2, a novel sequence-conditioned SE(3)-equivariant flow matching model for protein structure generation. FoldFl…

    Submitted 11 December, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: Presented at NeurIPS 2024

  11. arXiv:2405.14780  [pdf, other]

    cs.LG stat.ML

    Metric Flow Matching for Smooth Interpolations on the Data Manifold

    Authors: Kacper Kapuśniak, Peter Potaptchik, Teodora Reu, Leo Zhang, Alexander Tong, Michael Bronstein, Avishek Joey Bose, Francesco Di Giovanni

    Abstract: Matching objectives underpin the success of modern generative models and rely on constructing conditional paths that transform a source distribution into a target distribution. Despite being a fundamental building block, conditional paths have been designed principally under the assumption of Euclidean geometry, resulting in straight interpolations. However, this can be particularly restrictive fo…

    Submitted 4 November, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  12. arXiv:2405.14664  [pdf, other]

    cs.LG cs.AI

    Fisher Flow Matching for Generative Modeling over Discrete Data

    Authors: Oscar Davis, Samuel Kessler, Mircea Petrache, İsmail İlkan Ceylan, Michael Bronstein, Avishek Joey Bose

    Abstract: Generative modeling over discrete data has recently seen numerous success stories, with applications spanning language modeling, biological sequence design, and graph-structured molecular data. The predominant generative modeling paradigm for discrete data is still autoregressive, with more recent alternatives based on diffusion or flow-matching falling short of their impressive performance in con…

    Submitted 30 October, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: NeurIPS 2024

  13. arXiv:2402.06121  [pdf, other]

    cs.LG stat.ML

    Iterated Denoising Energy Matching for Sampling from Boltzmann Densities

    Authors: Tara Akhound-Sadegh, Jarrid Rector-Brooks, Avishek Joey Bose, Sarthak Mittal, Pablo Lemos, Cheng-Hao Liu, Marcin Sendera, Siamak Ravanbakhsh, Gauthier Gidel, Yoshua Bengio, Nikolay Malkin, Alexander Tong

    Abstract: Efficiently generating statistically independent samples from an unnormalized probability distribution, such as equilibrium samples of many-body systems, is a foundational problem in science. In this paper, we propose Iterated Denoising Energy Matching (iDEM), an iterative algorithm that uses a novel stochastic score matching objective leveraging solely the energy function and its gradient -- and…

    Submitted 26 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: Published at ICML 2024. Code for iDEM is available at https://github.com/jarridrb/dem

  14. arXiv:2310.02391  [pdf, other]

    cs.LG cs.AI

    SE(3)-Stochastic Flow Matching for Protein Backbone Generation

    Authors: Avishek Joey Bose, Tara Akhound-Sadegh, Guillaume Huguet, Kilian Fatras, Jarrid Rector-Brooks, Cheng-Hao Liu, Andrei Cristian Nica, Maksym Korablyov, Michael Bronstein, Alexander Tong

    Abstract: The computational design of novel protein structures has the potential to impact numerous scientific disciplines greatly. Toward this goal, we introduce FoldFlow, a series of novel generative models of increasing modeling power based on the flow-matching paradigm over $3\mathrm{D}$ rigid motions -- i.e. the group $\text{SE}(3)$ -- enabling accurate modeling of protein backbones. We first introduce…

    Submitted 11 April, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 Spotlight

  15. arXiv:2310.00429  [pdf, other]

    cs.LG stat.ML

    On the Stability of Iterative Retraining of Generative Models on their own Data

    Authors: Quentin Bertrand, Avishek Joey Bose, Alexandre Duplessis, Marco Jiralerspong, Gauthier Gidel

    Abstract: Deep generative models have made tremendous progress in modeling complex data, often exhibiting generation quality that surpasses a typical human's ability to discern the authenticity of samples. Undeniably, a key driver of this success is enabled by the massive amounts of web-scale data consumed by these models. Due to these models' striking performance and ease of availability, the web will inev…

    Submitted 2 April, 2024; v1 submitted 30 September, 2023; originally announced October 2023.

  16. arXiv:2308.12601  [pdf]

    cs.HC

    Effect of Mindfulness and Mindful Art on Beginners and Experienced Meditators

    Authors: Koonlin Eunice Chan, Joy Bose

    Abstract: Mindfulness meditation has been proven to be effective in treating a range of mental and physical conditions. Mindful Art is a type of mindfulness meditation that comprises sessions of drawing, painting and sculpturing with mindfulness for a given length of time. To date, the efficacy of mindful art has not been systematically studied. In this paper, we describe an experimental pilot study on two…

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: 5 pages, 15 figures

    ACM Class: H.5

  17. arXiv:2303.12410  [pdf, other]

    cs.LG cs.RO stat.ML

    EDGI: Equivariant Diffusion for Planning with Embodied Agents

    Authors: Johann Brehmer, Joey Bose, Pim de Haan, Taco Cohen

    Abstract: Embodied agents operate in a structured world, often solving tasks with spatial, temporal, and permutation symmetries. Most algorithms for planning and model-based reinforcement learning (MBRL) do not take this rich geometric structure into account, leading to sample inefficiency and poor generalization. We introduce the Equivariant Diffuser for Generating Interactions (EDGI), an algorithm for MBR…

    Submitted 19 October, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: Accepted at NeurIPS 2023. v2: matches camera-ready version

  18. arXiv:2302.04440  [pdf, other]

    cs.LG cs.CV

    Feature Likelihood Divergence: Evaluating the Generalization of Generative Models Using Samples

    Authors: Marco Jiralerspong, Avishek Joey Bose, Ian Gemp, Chongli Qin, Yoram Bachrach, Gauthier Gidel

    Abstract: The past few years have seen impressive progress in the development of deep generative models capable of producing high-dimensional, complex, and photo-realistic data. However, current methods for evaluating such models remain incomplete: standard likelihood-based metrics do not always apply and rarely correlate with perceptual fidelity, while sample-based metrics, such as FID, are insensitive to…

    Submitted 12 March, 2024; v1 submitted 8 February, 2023; originally announced February 2023.

    Comments: FLD code: https://github.com/marcojira/fld

  19. arXiv:2208.07949  [pdf, other]

    cs.LG

    Riemannian Diffusion Models

    Authors: Chin-Wei Huang, Milad Aghajohari, Avishek Joey Bose, Prakash Panangaden, Aaron Courville

    Abstract: Diffusion models are recent state-of-the-art methods for image generation and likelihood estimation. In this work, we generalize continuous-time diffusion models to arbitrary Riemannian manifolds and derive a variational framework for likelihood estimation. Computationally, we propose new methods for computing the Riemannian divergence which is needed in the likelihood estimation. Moreover, in gen…

    Submitted 16 August, 2022; originally announced August 2022.

  20. arXiv:2207.04711  [pdf, other]

    stat.ML cs.LG

    Matching Normalizing Flows and Probability Paths on Manifolds

    Authors: Heli Ben-Hamu, Samuel Cohen, Joey Bose, Brandon Amos, Aditya Grover, Maximilian Nickel, Ricky T. Q. Chen, Yaron Lipman

    Abstract: Continuous Normalizing Flows (CNFs) are a class of generative models that transform a prior distribution to a model distribution by solving an ordinary differential equation (ODE). We propose to train CNFs on manifolds by minimizing probability path divergence (PPD), a novel family of divergences between the probability density path generated by the CNF and a target probability density path. PPD i…

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: ICML 2022

  21. arXiv:2110.08649  [pdf, other]

    cs.LG cs.AI

    Equivariant Finite Normalizing Flows

    Authors: Avishek Joey Bose, Marcus Brubaker, Ivan Kobyzev

    Abstract: Generative modeling seeks to uncover the underlying factors that give rise to observed data that can often be modeled as the natural symmetries that manifest themselves through invariances and equivariances to certain transformation laws. However, current approaches to representing these symmetries are couched in the formalism of continuous normalizing flows that require the construction of equiva…

    Submitted 12 August, 2022; v1 submitted 16 October, 2021; originally announced October 2021.

    Comments: Preprint

  22. arXiv:2110.03422  [pdf]

    cs.LG cs.AI

    Modeling Effect of Lockdowns and Other Effects on India Covid-19 Infections Using SEIR Model and Machine Learning

    Authors: Sathiyanarayanan Sampath, Joy Bose

    Abstract: The SEIR model is a widely used epidemiological model used to predict the rise in infections. This model has been widely used in different countries to predict the number of Covid-19 cases. But the original SEIR model does not take into account the effect of factors such as lockdowns, vaccines, and re-infections. In India the first wave of Covid started in March 2020 and the second wave in April 2…

    Submitted 4 October, 2021; originally announced October 2021.

    Comments: 6 pages, 8 figures

    MSC Class: 68T01 ACM Class: I.2.1

  23. arXiv:2109.03111  [pdf]

    cs.NE cs.IR

    Sparse Distributed Memory using Spiking Neural Networks on Nengo

    Authors: Rohan Deepak Ajwani, Arshika Lalan, Basabdatta Sen Bhattacharya, Joy Bose

    Abstract: We present a Spiking Neural Network (SNN) based Sparse Distributed Memory (SDM) implemented on the Nengo framework. We have based our work on previous work by Furber et al, 2004, implementing SDM using N-of-M codes. As an integral part of the SDM design, we have implemented Correlation Matrix Memory (CMM) using SNN on Nengo. Our SNN implementation uses Leaky Integrate and Fire (LIF) spiking neuron…

    Submitted 3 December, 2021; v1 submitted 7 September, 2021; originally announced September 2021.

    Comments: 8 pages, 11 figures, accepted as poster in Bernstein Conference 2021

    ACM Class: H.3.2; I.5.5

  24. arXiv:2104.08455  [pdf, other]

    cs.CL

    Neural Path Hunter: Reducing Hallucination in Dialogue Systems via Path Grounding

    Authors: Nouha Dziri, Andrea Madotto, Osmar Zaiane, Avishek Joey Bose

    Abstract: Dialogue systems powered by large pre-trained language models (LM) exhibit an innate ability to deliver fluent and natural-looking responses. Despite their impressive generation performance, these models can often generate factually incorrect statements impeding their widespread adoption. In this paper, we focus on the task of improving the faithfulness -- and thus reduce hallucination -- of Neura…

    Submitted 14 September, 2021; v1 submitted 17 April, 2021; originally announced April 2021.

    Comments: EMNLP 2021, 18 pages

  25. arXiv:2103.02014  [pdf, other]

    cs.LG cs.CR cs.DS

    Online Adversarial Attacks

    Authors: Andjela Mladenovic, Avishek Joey Bose, Hugo Berard, William L. Hamilton, Simon Lacoste-Julien, Pascal Vincent, Gauthier Gidel

    Abstract: Adversarial attacks expose important vulnerabilities of deep learning models, yet little attention has been paid to settings where data arrives as a stream. In this paper, we formalize the online adversarial attack problem, emphasizing two key elements found in real-world use-cases: attackers must operate under partial knowledge of the target model, and the decisions made by the attacker are irrev…

    Submitted 22 March, 2022; v1 submitted 2 March, 2021; originally announced March 2021.

    Comments: ICLR 2022

  26. arXiv:2009.11355  [pdf, other]

    cs.LG cs.CL stat.ML

    Structure Aware Negative Sampling in Knowledge Graphs

    Authors: Kian Ahrabian, Aarash Feizi, Yasmin Salehi, William L. Hamilton, Avishek Joey Bose

    Abstract: Learning low-dimensional representations for entities and relations in knowledge graphs using contrastive estimation represents a scalable and effective method for inferring connectivity patterns. A crucial aspect of contrastive learning approaches is the choice of corruption distribution that generates hard negative samples, which force the embedding model to learn discriminative representations…

    Submitted 6 October, 2020; v1 submitted 23 September, 2020; originally announced September 2020.

    Comments: Accepted to EMNLP 2020. Camera-ready submission

  27. arXiv:2007.00720  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Adversarial Example Games

    Authors: Avishek Joey Bose, Gauthier Gidel, Hugo Berard, Andre Cianflone, Pascal Vincent, Simon Lacoste-Julien, William L. Hamilton

    Abstract: The existence of adversarial examples capable of fooling trained neural network classifiers calls for a much better understanding of possible attacks to guide the development of safeguards against them. This includes attack methods in the challenging non-interactive blackbox setting, where adversarial attacks are generated without any access, including queries, to the target model. Prior attacks i…

    Submitted 8 January, 2021; v1 submitted 1 July, 2020; originally announced July 2020.

    Comments: Appears in: Advances in Neural Information Processing Systems 33 (NeurIPS 2020)

  28. arXiv:2002.06336  [pdf, other]

    cs.LG stat.ML

    Latent Variable Modelling with Hyperbolic Normalizing Flows

    Authors: Avishek Joey Bose, Ariella Smofsky, Renjie Liao, Prakash Panangaden, William L. Hamilton

    Abstract: The choice of approximate posterior distributions plays a central role in stochastic variational inference (SVI). One effective solution is the use of normalizing flows to construct flexible posterior distributions. However, one key limitation of existing normalizing flows is that they are restricted to the Euclidean space and are ill-equipped to model data with a…

    Submitted 13 August, 2020; v1 submitted 15 February, 2020; originally announced February 2020.

    Comments: Preprint, work under review

  29. arXiv:2001.04338  [pdf]

    cs.IR

    Extraction of Relevant Images for Boilerplate Removal in Web Browsers

    Authors: Joy Bose

    Abstract: Boilerplate refers to unwanted and repeated parts of a webpage (such as ads or table of contents) that distract the user from reading the core content of the webpage, such as a news article. Accurate detection and removal of boilerplate content from a webpage can enable the users to have a clutter free view of the webpage or news article. This can be useful in features like reader mode in web bro…

    Submitted 13 January, 2020; v1 submitted 17 December, 2019; originally announced January 2020.

    Comments: 4 pages, 3 figures, 1 table

    MSC Class: I.2 ACM Class: I.2

  30. arXiv:1912.12144  [pdf]

    cs.CV cs.LG stat.ML

    Evaluating Usage of Images for App Classification

    Authors: Kushal Singla, Niloy Mukherjee, Hari Manassery Koduvely, Joy Bose

    Abstract: App classification is useful in a number of applications such as adding apps to an app store or building a user model based on the installed apps. Presently there are a number of existing methods to classify apps based on a given taxonomy on the basis of their text metadata. However, text based methods for app classification may not work in all cases, such as when the text descriptions are in a di…

    Submitted 16 December, 2019; originally announced December 2019.

    Comments: 5 pages, 3 figures, 3 tables, INDICON conference

    ACM Class: I.2.1

  31. arXiv:1912.09867  [pdf, other]

    cs.LG cs.SI stat.ML

    Meta-Graph: Few Shot Link Prediction via Meta Learning

    Authors: Avishek Joey Bose, Ankit Jain, Piero Molino, William L. Hamilton

    Abstract: We consider the task of few shot link prediction on graphs. The goal is to learn from a distribution over graphs so that a model is able to quickly infer missing edges in a new graph after a small amount of training. We show that current link prediction methods are generally ill-equipped to handle this task. They cannot effectively transfer learned knowledge from one graph to another and are unabl…

    Submitted 1 March, 2020; v1 submitted 20 December, 2019; originally announced December 2019.

  32. arXiv:1912.08809  [pdf]

    cs.HC cs.LG stat.ML

    Field Label Prediction for Autofill in Web Browsers

    Authors: Joy Bose

    Abstract: Automatic form fill is an important productivity related feature present in major web browsers, which predicts the field labels of a web form and automatically fills values in a new form based on the values previously filled for the same field in other forms. This feature increases the convenience and efficiency of users who have to fill similar information in fields in multiple forms. In this pap…

    Submitted 17 December, 2019; originally announced December 2019.

    Comments: 3 pages, 5 figures

  33. arXiv:1912.07323  [pdf]

    cs.SE cs.LG

    Analysis of Software Engineering for Agile Machine Learning Projects

    Authors: Kushal Singla, Joy Bose, Chetan Naik

    Abstract: The number of machine learning, artificial intelligence or data science related software engineering projects using Agile methodology is increasing. However, there are very few studies on how such projects work in practice. In this paper, we analyze project issues tracking data taken from Scrum (a popular tool for Agile) for several machine learning projects. We compare this data with correspondin…

    Submitted 16 December, 2019; originally announced December 2019.

    Comments: 5 pages, 8 figures, INDICON conference

    ACM Class: D.2

  34. arXiv:1911.02991  [pdf]

    stat.ML cs.IR cs.LG

    Semi-Supervised Method using Gaussian Random Fields for Boilerplate Removal in Web Browsers

    Authors: Joy Bose, Sumanta Mukherjee

    Abstract: Boilerplate removal refers to the problem of removing noisy content from a webpage such as ads and extracting relevant content that can be used by various services. This can be useful in several features in web browsers such as ad blocking, accessibility tools such as read out loud, translation, summarization etc. In order to create a training dataset to train a model for boilerplate detection and…

    Submitted 7 November, 2019; originally announced November 2019.

    Comments: 4 pages, 1 figure, IEEE INDICON conference

  35. arXiv:1906.02771  [pdf, other]

    cs.LG cs.AI stat.ML

    Improving Exploration in Soft-Actor-Critic with Normalizing Flows Policies

    Authors: Patrick Nadeem Ward, Ariella Smofsky, Avishek Joey Bose

    Abstract: Deep Reinforcement Learning (DRL) algorithms for continuous action spaces are known to be brittle toward hyperparameters as well as sample inefficient. Soft Actor Critic (SAC) proposes an off-policy deep actor critic algorithm within the maximum entropy RL framework which offers greater stability and empirical gains. The choice of policy distribution, a factored Gaussian, is motivated b…

    Submitted 6 June, 2019; originally announced June 2019.

    Comments: INNF workshop, International Conference on Machine Learning 2019, Long Beach CA, USA

  36. arXiv:1905.11912  [pdf, other]

    cs.CL

    A Cross-Domain Transferable Neural Coherence Model

    Authors: Peng Xu, Hamidreza Saghir, Jin Sung Kang, Teng Long, Avishek Joey Bose, Yanshuai Cao, Jackie Chi Kit Cheung

    Abstract: Coherence is an important aspect of text quality and is crucial for ensuring its readability. One important limitation of existing coherence models is that training on one domain does not easily generalize to unseen categories of text. Previous work advocates for generative models for cross-domain generalization, because for discriminative models, the space of incoherent sentence orderings to disc…

    Submitted 9 July, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: Accepted at ACL 2019

  37. arXiv:1905.10864  [pdf, other]

    cs.LG cs.CR stat.ML

    Generalizable Adversarial Attacks with Latent Variable Perturbation Modelling

    Authors: Avishek Joey Bose, Andre Cianflone, William L. Hamilton

    Abstract: Adversarial attacks on deep neural networks traditionally rely on a constrained optimization paradigm, where an optimization procedure is used to obtain a single adversarial perturbation for a given input example. In this work we frame the problem as learning a distribution of adversarial perturbations, enabling us to generate diverse adversarial distributions given an unperturbed input. We show t…

    Submitted 20 January, 2020; v1 submitted 26 May, 2019; originally announced May 2019.

  38. arXiv:1905.10674  [pdf, other]

    cs.LG cs.AI stat.ML

    Compositional Fairness Constraints for Graph Embeddings

    Authors: Avishek Joey Bose, William L. Hamilton

    Abstract: Learning high-quality node embeddings is a key building block for machine learning models that operate on graph data, such as social networks and recommender systems. However, existing graph embedding techniques are unable to cope with fairness constraints, e.g., ensuring that the learned representations do not correlate with certain attributes, such as age or gender. Here, we introduce an adversa…

    Submitted 16 July, 2019; v1 submitted 25 May, 2019; originally announced May 2019.

    Comments: Proceedings of the 36th International Conference on Machine Learning, Long Beach, California, PMLR 97, 2019

  39. Identifying Implementation Bugs in Machine Learning based Image Classifiers using Metamorphic Testing

    Authors: Anurag Dwarakanath, Manish Ahuja, Samarth Sikand, Raghotham M. Rao, R. P. Jagadeesh Chandra Bose, Neville Dubash, Sanjay Podder

    Abstract: We have recently witnessed tremendous success of Machine Learning (ML) in practical applications. Computer vision, speech recognition and language translation have all seen a near human level performance. We expect, in the near future, most business applications will have some form of ML. However, testing such applications is extremely challenging and would be very expensive if we follow today's m…

    Submitted 16 August, 2018; originally announced August 2018.

    Comments: Published at 27th ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA 2018)

  40. arXiv:1805.12302  [pdf, other]

    cs.CV cs.LG

    Adversarial Attacks on Face Detectors using Neural Net based Constrained Optimization

    Authors: Avishek Joey Bose, Parham Aarabi

    Abstract: Adversarial attacks involve adding small, often imperceptible perturbations to inputs with the goal of getting a machine learning model to misclassify them. While many different adversarial attack strategies have been proposed on image classification models, object detection pipelines have been much harder to break. In this paper, we propose a novel strategy to craft adversarial examples by s…

    Submitted 30 May, 2018; originally announced May 2018.

    Comments: Accepted to IEEE MMSP

  41. arXiv:1805.07907  [pdf]

    cs.HC cs.AI cs.NE cs.NI

    IoT2Vec: Identification of Similar IoT Devices via Activity Footprints

    Authors: Kushal Singla, Joy Bose

    Abstract: We consider a smart home or smart office environment with a number of IoT devices connected and passing data between one another. The footprints of the data transferred can provide valuable information about the devices, which can be used to (a) identify the IoT devices and (b) in case of failure, to identify the correct replacements for these devices. In this paper, we generate the embeddings for…

    Submitted 21 May, 2018; originally announced May 2018.

    Comments: 5 pages, 4 figures

    ACM Class: I.2.6

  42. arXiv:1805.03642  [pdf, other]

    cs.CL cs.AI cs.LG

    Adversarial Contrastive Estimation

    Authors: Avishek Joey Bose, Huan Ling, Yanshuai Cao

    Abstract: Learning by contrasting positive and negative samples is a general strategy adopted by many methods. Noise contrastive estimation (NCE) for word embeddings and translating embeddings for knowledge graphs are examples in NLP employing this approach. In this work, we view contrastive learning as an abstraction of all such methods and augment the negative sampler into a mixture distribution containin…

    Submitted 2 August, 2018; v1 submitted 9 May, 2018; originally announced May 2018.

    Comments: Association for Computational Linguistics, 2018

  43. arXiv:1803.03430  [pdf]

    cs.HC

    VR Content Capture using Aligned Smartphones

    Authors: Ramanujam R Srinivasa, Joy Bose, Dipin KP

    Abstract: There are a number of dedicated 3D capture devices in the market, but generally they are unaffordable and do not make use of existing smartphone cameras, which are generally of decent quality. Due to this, while there are several means to consume 3D or VR content, there is currently a lack of means to capture 3D content, resulting in very few 3D videos being publicly available. Some mobile applicati…

    Submitted 9 March, 2018; originally announced March 2018.

    Comments: 6 pages, 12 figures

    ACM Class: H.5.2

  44. arXiv:1803.03428  [pdf]

    cs.IR cs.HC

    A Bias Aware News Recommendation System

    Authors: Anish Anil Patankar, Joy Bose, Harshit Khanna

    Abstract: In this era of fake news and political polarization, it is desirable to have a system to enable users to access balanced news content. Current solutions focus on top down, server based approaches to decide whether a news article is fake or biased, and display only trusted news to the end users. In this paper, we follow a different approach to help the users make informed choices about which news t…

    Submitted 9 March, 2018; originally announced March 2018.

    Comments: 11 pages, 7 figures

    ACM Class: H.5.2

  45. arXiv:1601.01092  [pdf]

    cs.HC

    Attention Sensitive Web Browsing

    Authors: Joy Bose, Amit Singhai, Anish Patankar, Ankit Kumar

    Abstract: With a number of cheap commercial dry EEG kits available today, it is possible to look at user attention driven scenarios for interaction with the web browser. Using EEG to determine the user's attention level is preferable to using methods such as gaze tracking or time spent on the webpage. In this paper we use the attention level in three different ways. First, as a control mechanism, to control…

    Submitted 6 January, 2016; originally announced January 2016.

    Comments: 5 pages, 7 figures

    ACM Class: H.5.2

  46. An associative memory for the on-line recognition and prediction of temporal sequences

    Authors: J. Bose, S. B. Furber, J. L. Shapiro

    Abstract: This paper presents the design of an associative memory with feedback that is capable of on-line temporal sequence learning. A framework for on-line sequence learning has been proposed, and different sequence learning models have been analysed according to this framework. The network model is an associative memory with a separate store for the sequence context of a symbol. A sparse distributed m…

    Submitted 4 November, 2006; originally announced November 2006.

    Comments: Published in IJCNN 2005, Montreal, Canada