Robust breast cancer detection in mammography and digital breast tomosynthesis using annotation-efficient deep learning approach
Authors:
William Lotter,
Abdul Rahman Diab,
Bryan Haslam,
Jiye G. Kim,
Giorgia Grisot,
Eric Wu,
Kevin Wu,
Jorge Onieva Onieva,
Jerrold L. Boxerman,
Meiyun Wang,
Mack Bandler,
Gopal Vijayaraghavan,
A. Gregory Sorensen
Abstract:
Breast cancer remains a global challenge, causing over 1 million deaths globally in 2018. To achieve earlier breast cancer detection, screening x-ray mammography is recommended by health organizations worldwide and has been estimated to decrease breast cancer mortality by 20-40%. Nevertheless, significant false positive and false negative rates, as well as high interpretation costs, leave opportun…
▽ More
Breast cancer remains a global challenge, causing over 1 million deaths globally in 2018. To achieve earlier breast cancer detection, screening x-ray mammography is recommended by health organizations worldwide and has been estimated to decrease breast cancer mortality by 20-40%. Nevertheless, significant false positive and false negative rates, as well as high interpretation costs, leave opportunities for improving quality and access. To address these limitations, there has been much recent interest in applying deep learning to mammography; however, obtaining large amounts of annotated data poses a challenge for training deep learning models for this purpose, as does ensuring generalization beyond the populations represented in the training dataset. Here, we present an annotation-efficient deep learning approach that 1) achieves state-of-the-art performance in mammogram classification, 2) successfully extends to digital breast tomosynthesis (DBT; "3D mammography"), 3) detects cancers in clinically-negative prior mammograms of cancer patients, 4) generalizes well to a population with low screening rates, and 5) outperforms five-out-of-five full-time breast imaging specialists by improving absolute sensitivity by an average of 14%. Our results demonstrate promise towards software that can improve the accuracy of and access to screening mammography worldwide.
△ Less
Submitted 27 December, 2019; v1 submitted 23 December, 2019;
originally announced December 2019.
Apache Hive: From MapReduce to Enterprise-grade Big Data Warehousing
Authors:
Jesús Camacho-Rodríguez,
Ashutosh Chauhan,
Alan Gates,
Eugene Koifman,
Owen O'Malley,
Vineet Garg,
Zoltan Haindrich,
Sergey Shelukhin,
Prasanth Jayachandran,
Siddharth Seth,
Deepak Jaiswal,
Slim Bouguerra,
Nishant Bangarwa,
Sankar Hariappan,
Anishek Agarwal,
Jason Dere,
Daniel Dai,
Thejas Nair,
Nita Dembla,
Gopal Vijayaraghavan,
Günther Hagleitner
Abstract:
Apache Hive is an open-source relational database system for analytic big-data workloads. In this paper we describe the key innovations on the journey from batch tool to fully fledged enterprise data warehousing system. We present a hybrid architecture that combines traditional MPP techniques with more recent big data and cloud concepts to achieve the scale and performance required by today's anal…
▽ More
Apache Hive is an open-source relational database system for analytic big-data workloads. In this paper we describe the key innovations on the journey from batch tool to fully fledged enterprise data warehousing system. We present a hybrid architecture that combines traditional MPP techniques with more recent big data and cloud concepts to achieve the scale and performance required by today's analytic applications. We explore the system by detailing enhancements along four main axis: Transactions, optimizer, runtime, and federation. We then provide experimental results to demonstrate the performance of the system for typical workloads and conclude with a look at the community roadmap.
△ Less
Submitted 26 March, 2019;
originally announced March 2019.