The International Nucleotide Sequence Database (INSD) consists of a joint effort to collect and disseminate databases about DNA and RNA sequences. It involves the following computerized databases: DNA Data Bank of Japan (Japan), GenBank (USA -- now called Entrez Nucleotide, which integrates GenBank, RefSeq and Protein Data Bank) and the EMBL Nucleotide Sequence Database (European Molecular Biology Laboratory, Germany). New and updated data on nucleotide sequences contributed by research teams to each of the three databases are synchronized on a daily basis, thanks to continuous interactions and meetings between the staffs of the organizations in charge.
The DDBJ/EMBL/GenBank synchronization is possible thanks to a number of guidelines which are produced and published by an International Advisory. These consist in a common definition of the feature tables [1] for the databases, which regulate the content and syntax [2] of the database entries, in the form of a common DTD or Document Type Definition.
The syntax is called INSDSeq and its core consists of the letter sequence of the gene expression (aminoacid sequence) and the letter sequence for nucleotide bases in the gene or decoded segment. In [3] a DBFetch operation shows a typical INSD entry.
Related topics
External links