LIMSI@CLEF eHealth 2017 task 2: Logistic regression for automatic article ranking

Christopher Norman, Mariska Leeflang, Aurélie Névéol

Research output: Contribution to conferencePaperAcademic

2 Citations (Scopus)

Abstract

This paper describes the participation of the LIMSI-MIROR team at CLEF eHealth 2017, task 2. The task addresses the automatic ranking of articles in order to assist with the screening process of Diagnostic Test Accuracy (DTA) Systematic Reviews. We used a logistic regression classifier and handled class imbalance using a combination of class reweighting and undersampling. We also experimented with two strategies for relevance feedback. Our best run obtained an overall Average Precision of 0.179 and Work Saved over Sampling @95% Recall of 0.650. This run uses stochastic gradient descent for training but no feature selection or relevance feedback. We observe high performance variation within the queries in the test set. Nonetheless, our results suggest that automatic assistance is promising for ranking the DTA literature as it could reduce the screening workload for review writer by 65% on average.

Original languageEnglish
Publication statusPublished - 2017
Event18th Working Notes of CLEF Conference and Labs of the Evaluation Forum, CLEF 2017 - Dublin, Ireland
Duration: 11 Sept 201714 Sept 2017

Conference

Conference18th Working Notes of CLEF Conference and Labs of the Evaluation Forum, CLEF 2017
Country/TerritoryIreland
CityDublin
Period11/09/201714/09/2017

Keywords

  • Evidence based medicine
  • Information storage and retrieval
  • Review literature as topic
  • Supervised machine learning

Cite this