Query Size Estimation through Sampling

Show full item record

Title: Query Size Estimation through Sampling
Author: Kim, Kyoung-Hwa
Advisors: Jaewoo Kang, Committee Member
Xiaosong Ma, Committee Member
Rada Y. Chirkova, Committee Chair
Abstract: Current Database management systems (DBMS) handle huge amounts of data and need fast query response time. DBMSs apply several strategies to execute user queries. Query optimizers in DBMSs compare costs for these strategies and choose the cheapest one. Materialized views are suggested to enhance query response time as one of the strategies in DBMSs. Cost of each strategy has to be accurately estimated to choose right strategy. Because the materialized view is stored as a one table, we consider that sequential scan is used for executing of the materialized view. Therefore, I/O and CPU costs to execute materialized views depend on the number of tuples for the result. Hence, Our focus will be the accuracy of estimation of the number of tuples in materialized views; that is a query-size estimation. Many researches have been proposed to find methods to estimate the cost of query. This thesis reviews these researches and compares good and bad aspects for each cost estimation method. We choose size estimation methods that are more accurate than others to implement. We suggest various query environments for experiments. We suggest a guideline by experimental results.
Date: 2005-01-06
Degree: MS
Discipline: Computer Science
URI: http://www.lib.ncsu.edu/resolver/1840.16/1274

Files in this item

Files Size Format View
etd.pdf 356.5Kb PDF View/Open

This item appears in the following Collection(s)

Show full item record