Evidence-Based Stratification Methodology for Non-Probabilistic Sampling Surveys

Gazala, Ali

Evidence-Based Stratification Methodology for Non-Probabilistic Sampling Surveys

aut.embargo	No	en_NZ
aut.thirdpc.contains	No	en_NZ
dc.contributor.advisor	Narayanan, Ajit
dc.contributor.advisor	Pears, Russel
dc.contributor.author	Gazala, Ali
dc.date.accessioned	2018-08-29T22:39:02Z
dc.date.available	2018-08-29T22:39:02Z
dc.date.copyright	2018
dc.date.issued	2018
dc.date.updated	2018-08-29T17:45:35Z
dc.description.abstract	There is increasing use of non-probability sampling methods in large-scale surveys due to the costs involved in ensuring that the sample chosen is representative of the population, as is the case with probability sampling. Conventionally, it has been believed that non-probability sampling does not permit precise estimates of how the statistical properties of the sample differ from the statistical properties of the population due to possible biases in the non-probability sample. However, the increasing growth of big data survey data using non-probability sampling methods may provide an opportunity for researchers to use novel methods for quantifying the amount of bias that may exist in different strata so that within each stratum it may be possible to select respondents through probability sampling or random sampling to create pseudo-controlled samples for estimating population parameters. In this thesis, we use one of the largest survey databases ever collected in healthcare (Improving Practice Questionnaire IPQ for patients visiting their doctor in UK) through convenience sampling to show it is possible to adopt different stratification strategies in conjunction with machine learning techniques to help researchers to decide on the most appropriate stratification method for estimating population parameters from the chosen strata. Such strategies can enrich our knowledge for an evidence-based stratification methodology to reveal similarities and differences in feedback experience among different smaller sub-populations. This research combines standard statistical and machine learning techniques into a systematic stratification methodology to analysis survey data collected through non-probability sampling. In summary, the traditional statistical problem of how to estimate population parameters from a study that does not use probability sampling is shown in this thesis to be possible through the use of big data and appropriate use of measures and metrics from machine learning as well as standard statistical methods for analysing population parameters. The implication of this thesis are that it will be possible, in the age of big data, to overcome traditional statistical concerns about the quality of data not obtained through traditional probabilistic techniques and that outcomes of statistical analysis using non-probability sampling methods can be as reliable as from probability sampling, provided that a clear methodology is used to quantify bias at various stratification levels.	en_NZ
dc.identifier.uri	https://hdl.handle.net/10292/11784
dc.language.iso	en	en_NZ
dc.publisher	Auckland University of Technology
dc.rights.accessrights	OpenAccess
dc.subject	Healthcare	en_NZ
dc.subject	Stratification	en_NZ
dc.subject	Survey	en_NZ
dc.subject	Feedback	en_NZ
dc.title	Evidence-Based Stratification Methodology for Non-Probabilistic Sampling Surveys	en_NZ
dc.type	Thesis	en_NZ
thesis.degree.grantor	Auckland University of Technology
thesis.degree.level	Doctoral Theses
thesis.degree.name	Doctor of Philosophy	en_NZ

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Complete Thesis.pdf
Size:: 3.44 MB
Format:: Adobe Portable Document Format
Description:: Thesis

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 889 B
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Doctoral Theses