-
A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets
Authors:
Md Tahmid Rahman Laskar,
M Saiful Bari,
Mizanur Rahman,
Md Amran Hossen Bhuiyan,
Shafiq Joty,
Jimmy Xiangji Huang
Abstract:
The development of large language models (LLMs) such as ChatGPT has attracted a lot of attention recently. However, their evaluation on benchmark academic datasets remains under-explored due to the difficulty of evaluating the generative outputs produced by these models against the ground truth. In this paper, we aim to present a thorough evaluation of ChatGPT's performance on diverse academic datasets, covering tasks like question-answering, text summarization, code generation, commonsense reasoning, mathematical problem-solving, machine translation, bias detection, and ethical considerations. Specifically, we evaluate ChatGPT across 140 tasks and analyze 255K responses it generates in these datasets. This makes our work the largest evaluation of ChatGPT in NLP benchmarks. In short, our study aims to validate the strengths and weaknesses of ChatGPT in various tasks and provide insights for future research using LLMs. We also report a new emergent ability to follow multi-query instructions that we mostly found in ChatGPT and other instruction-tuned models. Our extensive evaluation shows that even though ChatGPT is capable of performing a wide variety of tasks, and may obtain impressive performance in several benchmark datasets, it is still far from achieving the ability to reliably solve many challenging tasks. By providing a thorough assessment of ChatGPT's performance across diverse NLP tasks, this paper sets the stage for a targeted deployment of ChatGPT-like LLMs in real-world applications.
Submitted 5 July, 2023; v1 submitted 29 May, 2023;
originally announced May 2023.
-
TiN-GST-TiN All-Optical Reflection Modulator for 2 $μ$m Waveband Reaching 85% Efficiency
Authors:
Md Asif Hossain Bhuiyan,
Shamima Akter Mitu,
Sajid Muhaimin Choudhury
Abstract:
In this study, we present an all-optical reflection modulator for the 2 $μ$m communication band exploiting a nano-gear-array metasurface and the phase-change material Ge$_2$Sb$_2$Te$_5$ (GST). The reflectance of the structure can be manipulated by altering the phase of GST with optical stimuli. The paper details the optical and opto-thermal modeling techniques for GST. Numerical investigation reveals that the metastructure exhibits a conspicuous changeover from $\sim$99% absorption to very poor interaction with the operating light depending on the switching state of the GST, resulting in 85% modulation depth and only 0.58 dB insertion loss. Owing to these noticeable differences in optical response, we demonstrate a high extinction ratio of 28 dB and a commendable figure of merit (FOM) of 49, so far the best modulation performance in this wavelength window. In addition, real-time tracking of the reflectance during phase transition shows high-speed switching at low energy per cycle, on the order of sub-nJ. Hence, given its overall performance, the device will be a paradigm for optical modulators in upcoming 2 $μ$m communication technology.
Submitted 30 October, 2022;
originally announced October 2022.
-
T Grating on Nano-Cavity Array based Refractive Index Sensor
Authors:
Yasir Fatha Abed,
Md Asif Hossain Bhuiyan,
Sajid Muhaimin Choudhury
Abstract:
We report a refractive index sensor comprising a unique T grating on top of periodic nano-cavities. The sensor has two resonant modes sensitive to different regions of the structure with low inter-region interference, and hence allows simultaneous detection of two different analytes or more accurate detection of a single analyte. The sensor also provides a self-referencing feature for a broad range of refractive indices, from 1.3 to 1.5. Using the FDTD method, sensitivities of 801.7 nm/RIU and 1386.8 nm/RIU have been recorded for the two modes, respectively. The versatility of the structure makes the sensor a prominent candidate for biochemical and other sensing applications.
Submitted 2 August, 2021; v1 submitted 20 May, 2021;
originally announced May 2021.