TY - JOUR
T1 - Identifying main finding sentences in clinical case reports
AU - Luo, Mengqi
AU - Cohen, Aaron M.
AU - Addepalli, Sidharth
AU - Smalheiser, Neil R.
N1 - Publisher Copyright:
© 2020 The Author(s) 2020. Published by Oxford University Press.
PY - 2020
Y1 - 2020
N2 - Clinical case reports are the 'eyewitness reports' of medicine and provide a valuable, unique, albeit noisy and underutilized type of evidence. Generally, a case report has a single main finding that represents the reason for writing up the report in the first place. However, no one has previously created an automatic way of identifying main finding sentences in case reports. We previously created a manual corpus of main finding sentences extracted from the abstracts and full text of clinical case reports. Here, we have utilized the corpus to create a machine learning-based model that automatically predicts which sentence(s) from abstracts state the main finding. The model has been evaluated on a separate manual corpus of clinical case reports and found to have good performance. This is a step toward setting up a retrieval system in which, given one case report, one can find other case reports that report the same or very similar main findings. The code and necessary files to run the main finding model can be downloaded from https://github.com/qi29/main- finding-recognition, released under the Apache License, Version 2.0.
AB - Clinical case reports are the 'eyewitness reports' of medicine and provide a valuable, unique, albeit noisy and underutilized type of evidence. Generally, a case report has a single main finding that represents the reason for writing up the report in the first place. However, no one has previously created an automatic way of identifying main finding sentences in case reports. We previously created a manual corpus of main finding sentences extracted from the abstracts and full text of clinical case reports. Here, we have utilized the corpus to create a machine learning-based model that automatically predicts which sentence(s) from abstracts state the main finding. The model has been evaluated on a separate manual corpus of clinical case reports and found to have good performance. This is a step toward setting up a retrieval system in which, given one case report, one can find other case reports that report the same or very similar main findings. The code and necessary files to run the main finding model can be downloaded from https://github.com/qi29/main- finding-recognition, released under the Apache License, Version 2.0.
UR - http://www.scopus.com/inward/record.url?scp=85086355787&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85086355787&partnerID=8YFLogxK
U2 - 10.1093/database/baaa041
DO - 10.1093/database/baaa041
M3 - Article
C2 - 32525207
AN - SCOPUS:85086355787
SN - 1758-0463
VL - 2020
JO - Database : the journal of biological databases and curation
JF - Database : the journal of biological databases and curation
M1 - baaa041
ER -