Welcome to our directory of biomedical corpus.

The scripts to obtain all corpora in CoNLL-2003 format can be found in the HUNER/ner_scripts repository. Updated results for these corpora can be found in this publication

CorporaEntitiesPublication
Arizona disease disease link to paper
BioCreative 2 Gene Mention genes/proteins link to paper
BioInfer proteins link to paper
Biosemantics diseases, chemicals link to paper
CDR diseases, chemicals link to paper
CellFinder gene/proteins, species, cells, anatomy link to paper
CEMP chemicals link to paper
CHEBI chemicals link to paper
CHEMDNER chemicals link to paper
CLL cell lines link to paper
DECA gene/proteins link to paper
DDI drugs link to paper
FSU-PRGE genes/proteins link to paper
GELLUS cell lines link to paper
Genia various link to paper
GPRO genes/proteins link to paper
HPRD50 proteins link to paper
IEPA proteins link to paper
JNLPBA cell lines, and others link to paper
Linneaus species link to paper
Loctext genes/proteins link to paper
NCBI disease diseases link to paper
OSIRIS genes and variants link to paper
PennBioIE various link to paper
S800 species link to paper
SCAI chemicals chemicals link to paper
SCAI disease diseases link to paper
Variome various link to paper