-
FlashHack: Reflections on the Usage of a Micro Hackathon as an Assessment Tool in a Machine Learning Course
Authors:
R Indra,
PD Parthasarathy,
Jatin Ambasana,
Spruha Satavlekar
Abstract:
Machine learning (ML) courses for undergraduates face challenges in assessing student learning and providing practical exposure. Group project-based learning, an increasingly popular form of experiential learning in CS education, encounters certain limitations in participation, including non-participation from a few students. Studies also suggest that students find longer programming assignments and project-based assessments distracting and struggle to maintain focus when they coincide with other courses. To tackle these issues, we introduced FlashHack: a monitored, incremental, in-classroom micro Hackathon that combines project-based learning with Hackathon elements. Engaging 229 third-year CS undergraduate students in teams of four, FlashHack prompted them to tackle predefined challenges using machine learning techniques within a set timeframe. Assessment criteria emphasized machine learning application, problem-solving, collaboration, and creativity. Our results indicate high student engagement and satisfaction, alongside simplified assessment processes for instructors. This experience report outlines the Hackathon design and implementation and highlights successes and areas for improvement, making it feasible for interested computing educators to replicate.
Submitted 7 October, 2024;
originally announced October 2024.
-
Online Universal Dirichlet Factor Portfolios
Authors:
Purushottam Parthasarathy,
Avinash Bhardwaj,
Manjesh K. Hanawal
Abstract:
We revisit the online portfolio allocation problem and propose universal portfolios that use factor weighting to produce portfolios that outperform uniform Dirichlet allocation schemes. We show a few analytical results on the lower bounds of portfolio growth when the returns are known to follow a factor model. We also show analytically that factor-weighted Dirichlet-sampled portfolios dominate the wealth generated by uniformly sampled Dirichlet portfolios. We corroborate our analytical results with empirical studies on equity markets that are known to be driven by factors.
Submitted 7 November, 2023; v1 submitted 15 August, 2023;
originally announced August 2023.
-
Fast and accurate factorized neural transducer for text adaption of end-to-end speech recognition models
Authors:
Rui Zhao,
Jian Xue,
Partha Parthasarathy,
Veljko Miljanic,
Jinyu Li
Abstract:
The neural transducer is now the most popular end-to-end model for speech recognition, due to its natural streaming ability. However, it is challenging to adapt it with text-only data. The factorized neural transducer (FNT) model was proposed to mitigate this problem. The improved adaptation ability of FNT on text-only adaptation data came at the cost of lowered accuracy compared to the standard neural transducer model. We propose several methods to improve the performance of the FNT model: adding a CTC criterion during training, adding a KL divergence loss during adaptation, using a pre-trained language model to seed the vocabulary predictor, and an efficient adaptation approach that interpolates the vocabulary predictor with the n-gram language model. A combination of these approaches results in a relative word-error-rate reduction of 9.48% from the standard FNT model. Furthermore, n-gram interpolation with the vocabulary predictor greatly improves the adaptation speed with satisfactory adaptation performance.
Submitted 23 February, 2023; v1 submitted 4 December, 2022;
originally announced December 2022.
-
Almost Exact Risk Budgeting with Return Forecasts for Portfolio Allocation
Authors:
Avinash Bhardwaj,
Manjesh K Hanawal,
Purushottam Parthasarathy
Abstract:
In this paper, we revisit the portfolio allocation problem with a designated risk budget [Qian, 2005]. We generalize the problem of arbitrary risk budgets with unequal correlations to one that includes return forecasts and transaction costs while keeping the no-shorting (long-only positions) constraint. We offer a convex second-order cone formulation that scales well with the number of assets and explore solutions to the problem in different settings. In particular, the problem is solved for a few practical cases: equity and bond asset allocation problems, as well as formulating index constituents for the NASDAQ100 index, illustrating the benefits of this approach.
Submitted 3 October, 2022;
originally announced October 2022.
-
Advances in Online Audio-Visual Meeting Transcription
Authors:
Takuya Yoshioka,
Igor Abramovski,
Cem Aksoylar,
Zhuo Chen,
Moshe David,
Dimitrios Dimitriadis,
Yifan Gong,
Ilya Gurvich,
Xuedong Huang,
Yan Huang,
Aviv Hurvitz,
Li Jiang,
Sharon Koubi,
Eyal Krupka,
Ido Leichter,
Changliang Liu,
Partha Parthasarathy,
Alon Vinnikov,
Lingfeng Wu,
Xiong Xiao,
Wayne Xiong,
Huaming Wang,
Zhenghao Wang,
Jun Zhang,
Yong Zhao,
et al. (1 additional author not shown)
Abstract:
This paper describes a system that generates speaker-annotated transcripts of meetings by using a microphone array and a 360-degree camera. The hallmark of the system is its ability to handle overlapped speech, which has been an unsolved problem in realistic settings for over a decade. We show that this problem can be addressed by using a continuous speech separation approach. In addition, we describe an online audio-visual speaker diarization method that leverages face tracking and identification, sound source localization, speaker identification, and, if available, prior speaker information for robustness to various real-world challenges. All components are integrated in a meeting transcription framework called SRD, which stands for "separate, recognize, and diarize". Experimental results using recordings of natural meetings involving up to 11 attendees are reported. The continuous speech separation improves the word error rate (WER) by 16.1% compared with a highly tuned beamformer. When a complete list of meeting attendees is available, the discrepancy between WER and speaker-attributed WER is only 1.0%, indicating accurate word-to-speaker association. This increases marginally to 1.6% when 50% of the attendees are unknown to the system.
Submitted 10 December, 2019;
originally announced December 2019.