The GNSuite System


BioCreative Team#: 93, Team leader: Rune Sætre, University of Tokyo


General description

Our GNSuite system addresses the Gene Normalization, and therefore also the Gene Indexing, tasks.
The front page of the system (Figure 1) lets the user enter or chose from a list of PubMed Central (PMC) or PubMed identifiers (PMID) for all the provided full text articles.
The two lists contain the number of normalized gene mentions for each given paper.
The user can click on one of the PMC-IDs to view that paper, and any recognized gene name will be highlighted in the text.
At the top of each paper's visualization page is a summary of all the genes in the paper. There is one tab for the genes recognized by MEDIE in the abstract, and one tab for all the genes recognized by NERSuite and GNSuite in the full text.

Finally there is a species-tab with taxonomy information recognized by LINNAEUS.
The user can click on a gene symbol to look up the corresponding gene entry in Entrez Gene. The naming information is shown in the right column (Figure 2).
Behind each gene symbol is the number of mentions in the current paper. The user can see each mention in the text, by clicking on the gene-count number to jump to the first occurrence, and then clicking that occurrence to jump to the next occurrences, and so on.

18.000 articles are available online, integrated with the Gene Normalization (GN) results from the MEDIE system.
MEDIE uses the GENA dictionary, since the entries are normalized using Entrez-Gene, Swiss-Prot, TREMBL, Fly-base and several other major Gene/Protein databases.
The MEDIE system contains only the genes found in the abstract, so we processed the full papers with our NER- and GN-Suite systems. The normalized gene entries from GNSuite are visible from the "gnsuite" tab for each individual paper.
The names for a specific gene entry are mapped to the text. For this we use a fast web service providing cached information from Entrez Gene: The same web-service is also used to find alternative names for the species for each gene, and to highlight these species names in the text as well.

We are still missing the following feature:
1) Make the list of all the genes for each paper editable.
This will be added silently in the very near future...


The GNSuite system is developed and tested using the FireFox Web Browser. In case you are using a different browser like MS-IE, Conqueror, Opera, Chrome etc. there is a chance that the system will not work as expected. It is recommended to download the latest version of FireFox from here
Once FireFox is installed, you can just launch the URL below in the FireFox Web-Browser, and you will see the front page of the system. Please note that Java-Scripts must be enabled in the browser for all the functionality to work.


If you want to contribute to this project, you need access to Tsujii-labs SVN-server. You can check out the code from SVN (for example from mason):
svn co file:///home/svn/satre/biocreative/IAT IAT

Front Page:

Notice: Undefined index: HTTP_X_FORWARDED_FOR in /var/www/html/satre/biocreative/IAT/docs/GNSuiteSystemDescription.php on line 144
MENU: [Links] [BussTUC] [Guest] [Jp-Eng] [Log] [Map] [Photo] [180] [SMS] [Summer19] [Tl_Chili] [Spam] [Top]
WORK: [CV] [Kalender] [ENDORS] [FastlegeVakten] [C++] [C++RefGroup] [IDIemner] [Projects] [Publications] [TDT44] [Tekna] [UbiCompFA] [Wiki] [Yast]

A computer scientist is someone who fixes things that aren't broken.