Nearly 10,000 Global Problem Solvers Work to Improve Detection of Lung Cancer, Winning Solutions Can Aid Doctors

May 3, 2017 2:05 PM ET

This year, nearly 10,000 people around the world harnessed data science principles in a massive effort to improve the detection of lung cancer – the deadliest, and most common cancer. The third annual Data Science Bowl, hosted by Booz Allen Hamilton and Kaggle, challenged data scientists to help medical professionals detect lung cancer earlier, and with better accuracy. A record number of participants rose to the challenge, spending more than an estimated 150,000 hours of collective work to produce nearly 18,000 algorithms.

CT. . .But More Can Be Done
Lung cancer is the most common type of cancer worldwide, affecting nearly 225,000 people each year in the United States alone. Low-dose computed tomography (CT) is a breakthrough technology for early detection, with the potential to reduce lung cancer deaths by 20 percent. But, the technology must overcome a relatively high false positive rate. “Reducing the false positive rate of low-dose CT scans is a critical step in improving the accuracy of CT screening of lung cancer and having a positive impact on public health,” said Keyvan Farahani, Program Director, National Cancer Institute.

Data Science Bowl participants created algorithms that can improve lung cancer screening technology by accurately predicting when lesions in the lungs are cancerous, thereby dramatically decreasing the false positive rate of current low-dose CT technology.

Collective Ingenuity for the Greater Good
Josh Sullivan, senior vice president at Booz Allen, said, “The Data Science Bowl shows that the power of collective ingenuity, data science and advanced analytics can be harnessed to tackle society’s toughest challenges like eradicating cancer. This year’s complex problem required the diversity of perspectives and approaches that only a crowd-sourced challenge like the Data Science Bowl can provide. We look forward to advancing these solutions in the fight against cancer.”

 “This is one of the most important competitions Kaggle has ever hosted, and the results are incredibly promising,” said Anthony Goldbloom, CEO, Kaggle.

The winners of the 2017 Data Science Bowl include:

  • First Place: Liao Fangzhou and Zhe Li, two researchers from China’s Tsinghua University who have no formal medical background but were able to apply their analytics skills to an unfamiliar but challenging area of research.
  • Second Place: Julian de Wit and Daniel Hammack, both software and machine learning engineers based in the Netherlands. Julian came in third in the Data Science Bowl 2016.
  • Third Place: Team Aidence, members of which work for a Netherlands-based company that applies deep learning to medical image interpretation.

Want to learn more?