JAMIA: Making EHR data extraction more exact is possible

interoperability - 53.70 Kb
The medical information contained within EHRs is undoubtedly valuable to many of healthcare’s stakeholders, but its presentation as unstructured text often makes locating necessary information difficult, according to a study published July 6 in the Journal of the American Medical Informatics Association.

Using a novel method to detect named entities within EHRs, however, researchers demonstrated that constructing more effective information extraction systems is possible at low costs.

Most named entity recognition methods for collecting information from datasets use conditional random field (CRF) recognizers, which rely heavily on local contexts surrounding named entities and assume that similar local contexts lead to the same judgements, according to the study’s author, Eric I-Chao Chang, MD, of Microsoft Research Asia. The problem with data extraction methods relying solely on CRF recognizers is that they often deliver contradictory information.

Using 20,000 radiology reports as a dataset to test multiple extraction systems’ ability to parse through records and deliver relevant patient follow-up information through an EHR system, Chang determined that pairing a labeled sequential pattern (LSP) classifier with a CRF recognizer was an effective method for filtering out irrelevant information.

The radiology reports contained a total of 121,748 sentences of which only 3,997 contained follow-up information. A data extraction method using both an LSP classifier and a CRF recognizer was able to reduce inexact matching by 6 percent compared to a data extraction method using just a CRF recognizer.

“In our method, LSP captures global patterns to choose candidate sentences before CRF identifies NEs [named entities] or relevant phrases,” Chang wrote. “The experiment shows that filtering out a large number of negative examples from the training set by an LSP classifier can significantly improve the performance of a CRF recognizer.”

Around the web

The American College of Cardiology has shared its perspective on new CMS payment policies, highlighting revenue concerns while providing key details for cardiologists and other cardiology professionals. 

As debate simmers over how best to regulate AI, experts continue to offer guidance on where to start, how to proceed and what to emphasize. A new resource models its recommendations on what its authors call the “SETO Loop.”

FDA Commissioner Robert Califf, MD, said the clinical community needs to combat health misinformation at a grassroots level. He warned that patients are immersed in a "sea of misinformation without a compass."

Trimed Popup
Trimed Popup