Constructing Strata of Primary Sampling Units for the Residential Energy Consumption Survey [report]

Rachel M. Harter, Pinliang (Patrick) Chen, Joseph P. McMichael, Edgardo S. Cureg, Samson A. Adeshiyan, Katherine B. Morton
2017 unpublished
This PDF document was made available from as a public service of RTI International. More information about RTI Press can be found at RTI International is an independent, nonprofit research organization dedicated to improving the human condition by turning knowledge into practice. The RTI Press mission is to disseminate information about RTI research, analytic tools, and technical expertise to a national and international audience. RTI Press publications
more » ... Press publications are peerreviewed by at least two independent substantive experts and one or more Press editors. Suggested Citation Abstract The 2015 Residential Energy Consumption Survey design called for stratification of primary sampling units to improve estimation. Two methods of defining strata from multiple stratification variables were proposed, leading to this investigation. All stratification methods use stratification variables available for the entire frame. We reviewed textbook guidance on the general principles and desirable properties of stratification variables and the assumptions on which the two methods were based. Using principal components combined with cluster analysis on the stratification variables to define strata focuses on relationships among stratification variables. Decision trees, regressions, and correlation approaches focus more on relationships between the stratification variables and prior outcome data, which may be available for just a sample of units. Using both principal components/cluster analysis and decision trees, we stratified primary sampling units for the 2009 Residential Energy Consumption Survey and compared the resulting strata.
doi:10.3768/rtipress.2017.op.0041.1705 fatcat:tp3t5yeoxjauvjvbzjanbzqxpm