Corposaurus
Welcome to our directory of biomedical corpora.
The scripts to obtain all corpora in CoNLL-2003 format can be found in the HUNER/ner_scripts repository. Updated results for these corpora can be found in this publication.
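For readers unfamiliar with the CoNLL-2003 layout, the following is a minimal sketch of how such a file could be read in Python. It assumes the usual conventions of that format: one token per line, whitespace-separated columns with the token first and the entity tag last, blank lines between sentences, and optional `-DOCSTART-` marker lines. The function name `read_conll`, the sample sentence, and the tag names `B-Gene` and `B-Disease` are made up for illustration. The exact column layout and tag set written by the conversion scripts may differ.

```python
from typing import Iterable, List, Tuple

# One sentence is a list of (token, tag) pairs.
Sentence = List[Tuple[str, str]]


def read_conll(lines: Iterable[str]) -> List[Sentence]:
    """Parse CoNLL-2003-style lines into sentences of (token, tag) pairs."""
    sentences: List[Sentence] = []
    current: Sentence = []
    for raw in lines:
        line = raw.strip()
        # A blank line or a -DOCSTART- marker closes the current sentence.
        if not line or line.startswith("-DOCSTART-"):
            if current:
                sentences.append(current)
                current = []
            continue
        columns = line.split()
        # Assumption: the token is the first column, the entity tag the last.
        current.append((columns[0], columns[-1]))
    # Flush the last sentence if the input does not end with a blank line.
    if current:
        sentences.append(current)
    return sentences


# Hypothetical sample, not taken from any of the corpora listed here.
sample = """-DOCSTART- -X- -X- O

Mutations NNS O
in IN O
BRCA1 NN B-Gene
cause VBP O
breast NN B-Disease
cancer NN I-Disease
. . O
""".splitlines()

sentences = read_conll(sample)
for token, tag in sentences[0]:
    print(f"{token}\t{tag}")
```

To read a converted corpus from disk, pass an open file handle instead of the sample lines, for example `read_conll(open(path, encoding="utf-8"))`, where `path` points to a file produced by the scripts.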
| Corpora | Entities | Publication |
| --- | --- | --- |
| Arizona disease | diseases | link to paper |
| BioCreative 2 Gene Mention | genes/proteins | link to paper |
| BioInfer | proteins | link to paper |
| Biosemantics | diseases, chemicals | link to paper |
| CDR | diseases, chemicals | link to paper |
| CellFinder | genes/proteins, species, cells, anatomy | link to paper |
| CEMP | chemicals | link to paper |
| CHEBI | chemicals | link to paper |
| CHEMDNER | chemicals | link to paper |
| CLL | cell lines | link to paper |
| DECA | genes/proteins | link to paper |
| DDI | drugs | link to paper |
| FSU-PRGE | genes/proteins | link to paper |
| GELLUS | cell lines | link to paper |
| Genia | various | link to paper |
| GPRO | genes/proteins | link to paper |
| HPRD50 | proteins | link to paper |
| IEPA | proteins | link to paper |
| JNLPBA | cell lines and others | link to paper |
| Linnaeus | species | link to paper |
| Loctext | genes/proteins | link to paper |
| NCBI disease | diseases | link to paper |
| OSIRIS | genes and variants | link to paper |
| PennBioIE | various | link to paper |
| S800 | species | link to paper |
| SCAI chemicals | chemicals | link to paper |
| SCAI disease | diseases | link to paper |
| Variome | various | link to paper |