The medical science DMZ: a network design pattern for data-intensive medical science
- Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA, Department of Computer Science, University of California Davis, Davis, CA, USA, Corporation for Education Network Initiatives in California (CENIC), Berkeley, CA, USA
- ESnet, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Indiana Clinical and Translational Sciences Institute and Regenstrief Institute, Indiana University, Indianapolis, IN, USA
- Global Research Network Operations Center, Indiana University, Bloomington, IN, USA
- Research Computing, Harvard University, Cambridge, MA, USA
- Center for Data Intensive Science, University of Chicago, Chicago, USA
- BioTeam, Middleton, MA, USA
- Pervasive Technology Institute, Indiana University, Bloomington, IN, USA
Abstract Objective We describe a detailed solution for maintaining high-capacity, data-intensive network flows (eg, 10, 40, 100 Gbps+) in a scientific, medical context while still adhering to security and privacy laws and regulations. Materials and Methods High-end networking, packet-filter firewalls, network intrusion-detection systems. Results We describe a “Medical Science DMZ” concept as an option for secure, high-volume transport of large, sensitive datasets between research institutions over national research networks, and give 3 detailed descriptions of implemented Medical Science DMZs. Discussion The exponentially increasing amounts of “omics” data, high-quality imaging, and other rapidly growing clinical datasets have resulted in the rise of biomedical research “Big Data.” The storage, analysis, and network resources required to process these data and integrate them into patient diagnoses and treatments have grown to scales that strain the capabilities of academic health centers. Some data are not generated locally and cannot be sustained locally, and shared data repositories such as those provided by the National Library of Medicine, the National Cancer Institute, and international partners such as the European Bioinformatics Institute are rapidly growing. The ability to store and compute using these data must therefore be addressed by a combination of local, national, and industry resources that exchange large datasets. Maintaining data-intensive flows that comply with the Health Insurance Portability and Accountability Act (HIPAA) and other regulations presents a new challenge for biomedical research. We describe a strategy that marries performance and security by borrowing from and redefining the concept of a Science DMZ, a framework that is used in physical sciences and engineering research to manage high-capacity data flows. Conclusion By implementing a Medical Science DMZ architecture, biomedical researchers can leverage the scale provided by high-performance computer and cloud storage facilities and national high-speed research networks while preserving privacy and meeting regulatory requirements.
- Sponsoring Organization:
- USDOE
- OSTI ID:
- 1779352
- Journal Information:
- Journal of the American Medical Informatics Association, Journal Name: Journal of the American Medical Informatics Association Vol. 25 Journal Issue: 3; ISSN 1067-5027
- Publisher:
- Oxford University PressCopyright Statement
- Country of Publication:
- United Kingdom
- Language:
- English
Web of Science
Leveraging the national cyberinfrastructure for biomedical research
|
journal | March 2014 |
The Science DMZ: a network design pattern for data-intensive science
|
conference | January 2013 |
Bro: a system for detecting network intruders in real-time
|
journal | December 1999 |
The Medical Science DMZ
|
journal | May 2016 |
OpenFlow: enabling innovation in campus networks
|
journal | March 2008 |
Similar Records
The Medical Science DMZ
The Science DMZ: A Network Design Pattern for Data-Intensive Science