John Hartman, associate professor of Computer Science, will present "Data-Driven Science in the Cloud."
At one time, data collection was a large and difficult part of science. However, modern scientific instruments collect seemingly limitless amounts of information, while modern electronic data storage ("the Cloud") makes it possible to store these data essentially forever. This has led to the rise of data-driven science - "big data" scientific experiments that analyze large-scale datasets. In this talk, Hartman will discuss how the cloud computing facilities originally developed to support the web are now being re-purposed to support data-driven science. He will cover the infrastructure provided by the cloud, how it is and is not suited for running scientific experiments, and will describe a metagenomics research project he is involved in that is using the cloud to study ocean viruses collected by the Tara Oceans Expedition.