Abstract
This paper describes the participation of the LIMSI-MIROR team at CLEF eHealth 2017, task 2. The task addresses the automatic ranking of articles in order to assist with the screening process of Diagnostic Test Accuracy (DTA) Systematic Reviews. We used a logistic regression classifier and handled class imbalance using a combination of class reweighting and undersampling. We also experimented with two strategies for relevance feedback. Our best run obtained an overall Average Precision of 0.179 and Work Saved over Sampling @95% Recall of 0.650. This run uses stochastic gradient descent for training but no feature selection or relevance feedback. We observe high performance variation within the queries in the test set. Nonetheless, our results suggest that automatic assistance is promising for ranking the DTA literature as it could reduce the screening workload for review writer by 65% on average.
Original language | English |
---|---|
Publication status | Published - 2017 |
Event | 18th Working Notes of CLEF Conference and Labs of the Evaluation Forum, CLEF 2017 - Dublin, Ireland Duration: 11 Sept 2017 → 14 Sept 2017 |
Conference
Conference | 18th Working Notes of CLEF Conference and Labs of the Evaluation Forum, CLEF 2017 |
---|---|
Country/Territory | Ireland |
City | Dublin |
Period | 11/09/2017 → 14/09/2017 |
Keywords
- Evidence based medicine
- Information storage and retrieval
- Review literature as topic
- Supervised machine learning