Scientific Databases and Visualization - BioReader
The main focus of this project is the development and application of natural
language processing (NLP) methods for the systematic
analysis of chemical compound names in order to identify synonymic notations of
compounds and to distinguish between different chemical compounds based on
variations in their names. A chemical compound can have many different names;
it can have several trivial names as well as several systematic names, even
when following naming recommendations as those of the International Union of
Pure and Applied Chemistry (IUPAC).
The methods and tools developed under this project are to be used by curators
of the SABIO-RKdatabase for the identification
of compounds.