An Alternative Approach to Designing Clinical Trials: Budgeted Learning of
Probabilistic Classifiers
Speaker: Russ Greiner (University of Alberta)
Researchers often use clinical trials to collect the data needed to evaluate
some hypothesis, or produce a classifier. During this process, they have to
pay the cost of performing each test. Many studies will run a comprehensive
battery of tests on each subject, for as many subjects as their budget will
allow -- ie, "round robin" (RR). We consider a more general model, where the
researcher can sequentially decide which single test to perform on which
specific individual; again subject to spending only the available funds. Our
goal here is to use these funds most effectively, to collect the data that
allows us to learn the most accurate classifier.
We first explore the simplified "coins version" of this task. After observing
that this is NP-hard, we consider a range of heuristic algorithms, both
standard and novel, and observe that our "biased robin" approach is both
efficient and much more effective than most other approaches, including the
standard RR approach. We then apply these ideas to learning a naive-bayes
classifier, and see similar behavior. Finally, we consider the most realistic
model, where both the researcher gathering data to build the classifier, and
the user (eg, physician) applying this classifier to an instance (patient)
must pay for the features used --- eg, the researcher has $10,000 to acquire
the feature values needed to produce an optimal $30/patient classifier. Again,
we see that our novel approaches are almost always much more effective that
the standard RR model.
This is joint work with Aloak Kapoor, Dan Lizotte and Omid Madani.