Receive a weekly summary and discussion of the top papers of the week by leading researchers in the field.

In bioRxiv : the preprint server for biology

MOTIVATION : The increasing number of peer-reviewed publications constitutes a challenge for biocuration. For example, NeuroMorpho.Org, a sharing platform for digital reconstructions of neural morphology, must evaluate more than 6000 potentially relevant articles per year to identify data of interest. Here, we describe a tool that uses natural language processing and deep learning to assess the likelihood of a publication to be relevant for the project.

RESULTS : The tool automatically identifies articles describing digitally reconstructed neural morphologies with high accuracy. Its processing rate of 900 publications per hour is not only amply sufficient to autonomously track new research, but also allowed the successful evaluation of older publications backlogged due to limited human resources. The number of bio-entities found since launching the tool almost doubled while greatly reducing manual labor. The classification tool is open source, configurable, and simple to use, making it extensible to other biocuration projects.

AVAILABILITY : https://github.com/Joindbre/TextRelevancy.

CONTACT : ascoli@gmu.edu.

SUPPLEMENTARY INFORMATION : Supplementary information, tool installation, and API usage are available at https://docs.joindbre.com.

Maraver Patricia, Tecuatl Carolina, Ascoli Giorgio A

2023-Feb-15