The GENIA corpus is being developed to provide reference materials to let NLP techniques work for bio-textmining and has been released with more than 400 000 words and almost 100 000 annotations for biological terms.
Jin-Dong Kim
1 papers
Tomoko Ohta
Yuka Tateisi
Junichi Tsujii
2 papers