Exact and Inexact Methods for Selecting Views and Indexes for OLAP Performance Improvement
dc.contributor.advisor | Dr. Matthias Stallmann, Committee Member | en_US |
dc.contributor.advisor | Dr. Carla Savage, Committee Member | en_US |
dc.contributor.advisor | Dr. Rada Chirkova, Committee Co-Chair | en_US |
dc.contributor.advisor | Dr. Yahya Fathi, Committee Co-Chair | en_US |
dc.contributor.author | Asgharzadeh Talebi, Zohreh | en_US |
dc.date.accessioned | 2010-08-19T18:14:45Z | |
dc.date.available | 2010-08-19T18:14:45Z | |
dc.date.issued | 2010-04-28 | en_US |
dc.degree.discipline | Operations Research | en_US |
dc.degree.level | dissertation | en_US |
dc.degree.name | PhD | en_US |
dc.description.abstract | In on-line analytical processing (OLAP), precomputing (materializing as views) and indexing auxiliary data aggregations is a common way of reducing query-evaluation time (cost) for important data-analysis queries. We consider an OLAP view- and index-selection problem as an optimization problem, where (i) the input includes the data-warehouse schema, a set of data-analysis queries of interest, and a storage-limit constraint, and (ii) the output is a set of views and indexes that minimizes the total cost of evaluating the input queries, subject to the storage limit. While greedy and other heuristic strategies for choosing views or indexes might have some success in reducing the cost, it is highly nontrivial to arrive at a globally optimal solution, one that reduces the processing cost of typical OLAP queries as much as is theoretically possible. In this dissertation we present a systematic study of the OLAP view- and indexselection problem. Our specific contributions are: (1) we introduce an integer programming model for OLAP view- and index-selection problem; (2) we develop an algorithm that effectively and efficiently prunes the space of potentially beneficial views and indexes of the problem, and provide formal proofs that our pruning algorithm keeps at least one globally optimal solution in the search space, thus the resulting integer-programming model is guaranteed to find an optimal solution; this allows us to solve realistic-size instances of the problem within reasonable execution time. (3) we develop a family of algorithms to further reduce the size of the search space so that we are able to solve larger instances of the problem, although we no longer guarantee global optimality of the resulting solution; and (4) we present an experimental comparison of our proposed approach with other approaches discussed in the open literature. Our experiments show that our proposed approach to view and index selection results in high-quality solutions — in fact, in the global optimal solutions for many realistic-size problem instances. Thus, it compares favorably with the well-known OLAP-centered approach of [13] and provides for a winning combination with the end-toend framework of [2] for generic view and index selection. | en_US |
dc.identifier.other | etd-02192010-165023 | en_US |
dc.identifier.uri | http://www.lib.ncsu.edu/resolver/1840.16/6202 | |
dc.rights | I hereby certify that, if appropriate, I have obtained and attached hereto a written permission statement from the owner(s) of each third party copyrighted matter to be included in my thesis, dis sertation, or project report, allowing distribution as specified below. I certify that the version I submitted is the same as that approved by my advisory committee. I hereby grant to NC State University or its agents the non-exclusive license to archive and make accessible, under the conditions specified below, my thesis, dissertation, or project report in whole or in part in all forms of media, now or hereafter known. I retain all other ownership rights to the copyright of the thesis, dissertation or project report. I also retain the right to use in future works (such as articles or books) all or part of this thesis, dissertation, or project report. | en_US |
dc.subject | Heuristics | en_US |
dc.subject | OLAP | en_US |
dc.subject | View and Index Selection | en_US |
dc.subject | Integer Programming | en_US |
dc.title | Exact and Inexact Methods for Selecting Views and Indexes for OLAP Performance Improvement | en_US |
Files
Original bundle
1 - 1 of 1