Laboratory for Research and Development of Biological Databases • Takagi Group

Studies on a Large-Scale Data Processing of Biomedical Knowledge

Faculty



Research Summary

We have been conducting application study of paralleldistributed computing technology and wide area distributed computing technology to genome data processing.

We conduct feasibility study for applying new parallel-distributed computing technology such as Hadoop and distributed key-value store to genome data processing. We conduct research to handle large genome data in distributed-memory type parallel cluster computer which has elasticity for rapid data growth in bioinformatics.

We conduct feasibility study for applying new paralleldistributed computing technology such as Hadoop and distributed key-value store to genome data processing. We conduct research to handle large genome data in distributedmemory type parallel cluster computer which has elasticity for rapid data growth in bioinformatics.

Data processing tests using Hadoop distributed environment

Publications

Mashima, J., Kodama, Y., Fujisawa, T., Katayama, T., Okuda, Y., Kaminuma, E., Ogasawara, O., Okubo, K., Nakamura, Y., and Takagi, T. (2017). DNA data bank of Japan. Nucleic Acids Res 45, D25-D31.

Cochrane, G., Karsch-Mizrachi, I., Takagi, T., and International Nucleotide Sequence Database Collaboration. (2016). The international nucleotide sequence database collaboration. Nucleic Acids Res 44, D48-D50.

Kodama, Y., Mashima, J., Kosuge, T., Katayama, T., Fujisawa, T., Kaminuma, E., Ogasawara, O., Okubo, K., Takagi, T., and Nakamura, Y. (2014). The DDBJ Japanese Genotype-phenotype Archive for genetic and phenotypic human data. Nucleic Acids Res 43, D18-D22.


  • Twitter
  • facebook
  • youtube