Efficient Execution of Multi-Query Data Analysis Batches Using Compiler Optimization Strategies
Title | Efficient Execution of Multi-Query Data Analysis Batches Using Compiler Optimization Strategies |
Publication Type | Journal Articles |
Year of Publication | 2004 |
Authors | Andrade H, Aryangat S, Kurc T, Saltz J, Sussman A |
Journal | Languages and Compilers for Parallel Computing |
Date Published | 2004/// |
Abstract | This work investigates the leverage that can be obtained from compiler optimization techniques for efficient execution of multi-query workloads in data analysis applications. Our approach is to address multi-query optimization at the algorithmic level, by transforming a declarative specification of scientific data analysis queries into a high-level imperative program that can be made more efficient by applying compiler optimization techniques. These techniques – including loop fusion, common subexpression elimination and dead code elimination – are employed to allow data and computation reuse across queries. We describe a preliminary experimental analysis on a real remote sensing application that analyzes very large quantities of satellite data. The results show our techniques achieve sizable reductions in the amount of computation and I/O necessary for executing query batches and in average execution times for the individual queries in a given batch. |
DOI | 10.1007/978-3-540-24644-2_33 |