PAPERS

Swift Technology and SwiftScript Application Papers

Zhao, Y., Dobson, J., Moreau, L., Foster, I. and Wilde, M. A Notation and System for Expressing and Executing Cleanly Typed Workflows on Messy Scientific Data. SIGMOD 2005. [ pdf ]
Raicu, I., Zhao, Y., Dumitrescu, C., Foster, I. and Wilde, M. Falkon: a Fast and Light-weight tasK executiON framework. Supercomputing Conference 2007. [ pdf ]
Zhao, Y., Hategan, M., Clifford, B., Foster, I., von Laszewski, G., Raicu, I., Stef-Praun, T. and Wilde, M. Swift: Fast, Reliable, Loosely Coupled Parallel Computation. IEEE International Workshop on Scientific Workflows 2007. [ pdf ]
Stef-Praun, T., Clifford, B., Foster, I., Hasson, U., Hategan, M., Small, S., Wilde, M. and Zhao, Y. Accelerating Medical Research using the Swift Workflow System. Health Grid 2007. [ pdf ]
Stef-Praun, T., Madeira, G., Foster, I. and Townsend, R. Accelerating solution of a moral hazard problem with Swift. e-Social Science 2007. [ pdf ]

Research leading to Swift and SwiftScript

Zhao, Y., Wilde, M. and Foster, I. Virtual Data Language: A Typed Workflow Notation for Diversely Structured Scientific Data. Taylor, I.J., Deelman, E., Gannon, D.B. and Shields, M. eds. Workflows for eScience, Springer, 2007, 258-278.
Zhao, Y., Dobson, J., Foster, I., Moreau, L. and Wilde, M. A Notation and System for Expressing and Executing Cleanly Typed Workflows on Messy Scientific Data. SIGMOD Record 34 (3), 37-43. [ pdf ]
Moreau, L., Zhao, Y., Foster, I., Voeckler, J. and Wilde, M., XDTM: XML Data Type and Mapping for Specifying Datasets. European Grid Conference, 2005. [ pdf ]
Foster, I., Voeckler, J., Wilde, M. and Zhao, Y., The Virtual Data Grid: A New Model and Architecture for Data-Intensive Collaboration. Conference on Innovative Data Systems Research, 2003. [ pdf ]

Karajan Technology used in Swift

von Laszewski, G., Hategan, M. and Kodeboyina, D. Java CoG Kit Workflow. Taylor, I.J., Deelman, E., Gannon, D.B. and Shields, M. eds. Workflows for eScience, Springer, 2007, 340-356. [ pdf ]

Virtual Data Language and Virtual Data System - predecessors to Swift

Zhao, Y., Wilde, M. and Foster, I., Applying the Virtual Data Provenance Model. International Provenance and Annotation Workshop, Chicago, Illinois, 2006. [ pdf ]
Foster, I., Voeckler, J., Wilde, M. and Zhao, Y., Chimera: A Virtual Data System for Representing, Querying, and Automating Data Derivation. 14th Intl. Conf. on Scientific and Statistical Database Management, Edinburgh, Scotland, 2002. [ pdf ]
Vöckler, J.-S., Mehta, G., Zhao, Y., Deelman, E. and Wilde, M., Kickstarting Remote Applications. 2nd International Workshop on Grid Computing Environments, 2006. [ pdf ]
Vöckler, J.-S., Wilde, M. and Foster, I. The GriPhyN Virtual Data System. Technical Report GriPhyN-2002-02, 2002.
Zhao, Y., Wilde, M., Foster, I., Voeckler, J., Dobson, J., Gilbert, E., Jordan, T. and Quigg, E. Virtual Data Grid Middleware Services for Data-intensive Science. Concurrency and Computation: Practice and Experience, 18 (6), 595-608. [ pdf ]
Zhao, Y., Wilde, M., Foster, I., Voeckler, J., Jordan, T., Quigg, E. and Dobson, J., Grid Middleware Services for Virtual Data Discovery, Composition, and Integration. 2nd International Workshop on Middleware for Grid Computing, 2004. [ pdf ]

VDL Applications - predecessors to Swift

Annis, J., Zhao, Y., Voeckler, J., Wilde, M., Kent, S. and Foster, I., Applying Chimera Virtual Data Concepts to Cluster Finding in the Sloan Sky Survey. SC2002, Baltimore, MD, 2002. [ pdf ]
Arbree, A., Avery, P., Bourilkov, D., Cavanaugh, R., Katageri, S., Graham, G., Rodriguez, J., Voeckler, J. and Wilde, M., Virtual Data in CMS Production. Computing in High Energy and Nuclear Physics, 2003. [ pdf ]
Arbree, A., Avery, P., Bourilkov, D., Cavanaugh, R., Rodriguez, J., Graham, G., Wilde, M. and Zhao, Y., Virtual Data in CMS Analysis. Computing in High Energy and Nuclear Physics, 2003. [ pdf ]
Bardeen, M., Gilbert, E., Jordan, T., Nepywoda, P., Quigg, E., Wilde, M. and Zhao, Y. The QuarkNet/Grid Collaborative Learning e-Lab. Future Generation Computer Systems, 22 (6), 700-708. [ pdf ]
Horn, J.V., Dobson, J., Woodward, J., Wilde, M., Zhao, Y., Voeckler, J. and Foster, I. Grid-Based Computing and the Future of Neuroscience Computation. Methods in Mind, MIT Press, 2006.
Nefedova, V., Jacob, R., Foster, I., Liu, Y., Liu, Z., Deelman, E., Mehta, G. and Vahi, K., Automating Climate Science: Large Ensemble Simulations on the TeraGrid with the GriPhyN Virtual Data System. 2nd IEEE International Conference on eScience and Grid Computing, 2006. [ pdf ]
Sulakhe, D., Rodriguez, A., D'Souza, M., Wilde, M., Nefedova, V., Foster, I. and Maltsev, N. GNARE: An Environment for Grid-Based High-Throughput Genome Analysis. Journal of Clinical Monitoring and Computing. [ pdf ]
Sulakhe, D., Rodriguez, A., Wilde, M., Foster, I. and Maltsev, N., Using Multiple Grid Resources for Bioinformatics Applications in GADU. IEEE/ACM International Symposium on Cluster Computing and Grid, 2006. [ pdf ]
Zhao, Y. Virtual Galaxy Clusters: An Application of the GriPhyN Virtual Data Toolkit to Sloan Digital Sky Survey Data. MS thesis, University of Chicago, GriPhyN-2002-06, 2002.

Related Workflow Scheduling and Provenance Research

Malewicz, G., Foster, I., Rosenberg, A. and Wilde, M., A Tool for Prioritizing DAGMan Jobs and Its Evaluation. IEEE International Symposium on High Performance Distributed Computing, 2006. [ pdf ]
Meyer, L., Scheftner, D., Voeckler, J., Mattoso, M., Wilde, M. and Foster, I., An Opportunistic Algorithm for Scheduling Workflows on Grids. VECPAR'06, Rio de Janeiro, 2006. [ pdf ]
Moreau, L. and others, The First Provenance Challenge. Concurrency and Computation: Practice and Experience. [ pdf ]