Machine Learning 101: Simplifying It One Term at a Time

Machine learning is one of the hottest topics in radiology and all of healthcare, but reading the latest and greatest ML research can be difficult, even for experienced medical professionals. A new analysis written by a team at Northern Ireland’s Belfast City Hospital and published in the American Journal of Roentgenology was written with that very problem in mind.

Here’s a quick primer:

Cross Validation: How a lot of ML algorithms generate various performance measures. When researchers begin work on their algorithm, they separate subjects into two groups: a training dataset and a testing dataset. The training dataset is used to create the algorithm, training it so that it can make predictions. The testing dataset is used as an initial test of the algorithm’s accuracy. The program can compare them, see what’s best, alter overall predictive capability and “improve the generalizability of the results,” the authors wrote.

ROC Curve: By “plotting the effect of different levels of sensitivity on specificity,” researchers can help readers understand the performance of their algorithm. “Algorithms that perform better will have a higher sensitivity and specificity and thus the area under the plotted line will be greater than those that perform worse. The metric termed the ‘area under the ROC curve’ or ‘AUROC’ is commonly quoted and offers a quick way to compare algorithms.”

Confusion Matrix: This helps readers locate information about a specific term or metric and compare an algorithm with others. It is largely comprised of true-positive and false-positive rate, specificity, accuracy, positive predictive value, likelihood ratios and diagnostic odds ratio. A study may mention an algorithm’s accuracy, but what if there are more important metrics a specific instance than accuracy? The confusion matrix helps the reader locate those other metrics.

Mean squared error and mean absolute error: The relationship between variables in ML—regression—are expressed through an equation which minimizes the distance between a fitted line and data point. The degree of regression and its reliability to make predictions is represented by the mean squared error (MSE). “Smaller is better” except in the case of coefficient of determination (R2) metric.

Image Segmentation Evaluation: When the algorithm is designed to detect the presence of something, for instance, it’s not just about detecting the finding; it’s about looking at its location and size. “The predicted area of interest generated by the algorithm is compared against an ideal or completely accurate evaluation image,” the authors wrote.

View more features from this issue:

Building Foundations to Build Better Care

Embracing AI: Why Now Is the Time for Medical Imaging

Leveraging Technology, Data and Patient Care: How Geisinger Is Interjecting Insight & Action

Bullish on AI: The Wisconsin Way: Reengineering Imaging & Image Strategy

ML’s Role in Building Confidence and Value in Breast Imaging

Will ‘Smart’ Solutions Really Transform Cardiology?

Matching Machine Learning and Medical Imaging: Predictions for 2019

NYU’s Daniel Sodickson on AI, Facebook and Why Faster MR Scans Could Improve Healthcare

AI in Healthcare

Around the web

Cardiovascular Business

FDA says years-long tirzepatide shortage is resolved, will give limited leeway to compounders

The tirzepatide shortage that first began in 2022 has been resolved. Drug companies distributing compounded versions of the popular drug now have two to three more months to distribute their remaining supply.

AI in Healthcare

From Capitol Hill to a hospital near you? 5 federal recommendations for healthcare AI policy

The 24 members of the House Task Force on AI—12 reps from each party—have posted a 253-page report detailing their bipartisan vision for encouraging innovation while minimizing risks.

Cardiovascular Business

Merck spends up to $2B to license new weight loss drug with potential heart benefits

Merck sent Hansoh Pharma, a Chinese biopharmaceutical company, an upfront payment of $112 million to license a new investigational GLP-1 receptor agonist. There could be many more payments to come if certain milestones are met.

Machine Learning 101: Simplifying It One Term at a Time

Related Content

Around the web