Journal of biomedical informatics | 2019

An interpretable natural language processing system for written medical examination assessment

 
 
 
 
 

Abstract


OBJECTIVE\nThe assessment of written medical examinations is a tedious and expensive process, requiring significant amounts of time from medical experts. Our objective was to develop a natural language processing (NLP) system that can expedite the assessment of unstructured answers in medical examinations by automatically identifying relevant concepts in the examinee responses.\n\n\nMATERIALS AND METHODS\nOur NLP system, Intelligent Clinical Text Evaluator (INCITE), is semi-supervised in nature. Learning from a limited set of fully annotated examples, it sequentially applies a series of customized text comparison and similarity functions to determine if a text span represents an entry in a given reference standard. Combinations of fuzzy matching and set intersection-based methods capture inexact matches and also fragmented concepts. Customizable, dynamic similarity-based matching thresholds allow the system to be tailored for examinee responses of different lengths.\n\n\nRESULTS\nINCITE achieved an average F1-score of 0.89 (precision=0.87, recall=0.91) against human annotations over held-out evaluation data. Fuzzy text matching, dynamic thresholding and the incorporation of supervision using annotated data resulted in the biggest jumps in performances.\n\n\nDISCUSSION\nLong and non-standard expressions are difficult for INCITE to detect, but the problem is mitigated by the use of dynamic thresholding (i.e., varying the similarity threshold for a text span to be considered a match). Annotation variations within exams and disagreements between annotators were the primary causes for false positives. Small amounts of annotated data can significantly improve system performance.\n\n\nCONCLUSIONS\nThe high performance and interpretability of INCITE will likely significantly aid the assessment process and also help mitigate the impact of manual assessment inconsistencies.

Volume None
Pages \n 103268\n
DOI 10.1016/j.jbi.2019.103268
Language English
Journal Journal of biomedical informatics

Full Text