Takagi Group • Laboratory for Research and Development of Biological Databases

Studies on Large-Scale Data Processing of Biomedical Knowledge

Faculty



Research Summary

We study the application of parallel-distributed and wide-area distributed computing technologies to genome data processing.

We conduct feasibility studies on applying new parallel-distributed computing technologies, such as Hadoop and distributed key-value stores, to genome data processing. We also investigate how to handle large genome datasets on distributed-memory parallel cluster computers, which can scale elastically with the rapid growth of data in bioinformatics.

Figure: Data processing tests using a Hadoop distributed environment

Publications

Kodama, Y., Mashima, J., Kosuge, T., Kaminuma, E., Ogasawara, O., Okubo, K., Nakamura, Y., and Takagi, T. (2018). DNA data bank of Japan: 30th anniversary. Nucleic Acids Res 46, D30-D35.

Mashima, J., Kodama, Y., Fujisawa, T., Katayama, T., Okuda, Y., Kaminuma, E., Ogasawara, O., Okubo, K., Nakamura, Y., and Takagi, T. (2017). DNA data bank of Japan. Nucleic Acids Res 45, D25-D31.

Cochrane, G., Karsch-Mizrachi, I., Takagi, T., and International Nucleotide Sequence Database Collaboration. (2016). The international nucleotide sequence database collaboration. Nucleic Acids Res 44, D48-D50.

