Nanopublications for Data Curation and Text Annotation

From Master Projects
Jump to: navigation, search

About Nanopublications for Data Curation and Text Annotation

  • This project has not yet been fulfilled.
  • This project fits in the following Bachelor programs: {{#arraymap:|, |xXx|bachelorproject within::xXx|,}}
  • This project fits in the following masterareas: {{#arraymap:Bioinformatics,

Computer Science and Communication, Human Ambience, Information Sciences, Information and Communication Technology, Internet and Web Technology, Knowledge Technology and Intelligent Internet Applications,|, |xXx|project within::xXx|,}}


Problem: Annotating text (for example scientific publications) with standardized terms to store them in databases is a costly and very demanding task, and so is the curation of data that is already captured.

Solution: We could allow everybody (e.g. scientists reading a paper anyways) to contribute to such datasets, but then we have to make sure that we know who provided what data entry to distinguish good from bad contributions. By capturing and publishing this authorship information of annotations, we can also motivate volunteers to contribute because they can get public recognition from that.

Method: We can build upon the nanopublication approach ( to format such small data entries in a formal way, and to record all provenance information (who created it when, with which tools, etc.). Like that we can record who published what, and give credit to those who deserve it, and thereby provide motivation to contribute valuable data.