Ultra high-throughput sequencing technologies allow biologists to obtain larger amounts of nucleotide sequence data. Reliable database operation and high-quality annotation supply are essential. As the core part of DNA Data Bank of Japan (DDBJ), Nakamura laboratory attempts 1) to develop advanced database management systems, and 2) to improve quality of annotations in DDBJ databases. We have been constructing an automatic analytical system “DDBJ Read Annotation Pipeline” in NIG supercomputers, and “TogoAnnotation” system as the integrated support tool for manual curations. Structural and functional annotations by automatic and manual processing are evaluated by using proposed statistical methods.
A screenshot of reference mapping tools on a NGS automatic analytical system
Kodama, Y., Mashima, J., Kosuge, T., Kaminuma, E., Ogasawara, O., Okubo, K., Nakamura, Y., and Takagi, T. (2018). DNA Data Bank of Japan: 30th anniversary. Nucleic Acids Res 46, D30-D35.
Shimizu, T., Tanizawa, Y., Mochizuki, T., Nagasaki, H., Yoshioka, T., Toyoda, A., Fujiyama, A., Kaminuma, E., and Nakamura, Y. (2017). Draft sequencing of the heterozygous diploid genome of Satsuma (Citrus unshiu Marc.) using a hybrid assembly approach. Front Genet 8, 180.
Tanizawa, Y., Fujisawa, T., and Nakamura, Y. (2017). DFAST: a flexible prokaryotic genome annotation pipeline for faster genome publication. Bioinformatics, doi: 10.1093/bioinformatics/btx713.