Department Seminars 1998
DATE/PLACE:
Thursday, December, 1998, 16:00
Computer Science 301
TYPE:
To be announced
TITLE:
To be announced
SPEAKER:
Penny Brasher, Alberta Cancer Board
| DATE/PLACE: | Thursday, December 17, 1998, 14:30 NOTE DIFFERENT DAY AND TIME Computer Science 301 |
| TITLE: | Visible Management: A Perspective on Quality, Productivity, and the Design of Work Processes |
| SPEAKER: | John C. Nash, Faculty of Administration, University of Ottawa, Ottawa, ON |
| Managers are expected to address quality, productivity and work processes within their organizational units. Visible Management is an approach to these issues that accents the importance of rendering work visible and hence manageable. It provides a perspective on various popular movements in managerial philosophy, for Taylor's scientific management, through Deming's quality improvement, to Hammer's business process re-engineering and beyond. This talk will present a brief and generally light-hearted look at Visible Management and some of its applications.
The speaker will, in particular, highlight some recent attempts to address the issue of improving productivity of "creative" work such as book writing, examination marking, and software engineering. |
|
In previous works, the transition rate into the preclinical detectable phase is assumed to be constant over time for all ages. This does not lead to an increasing incidence of breast cancer with age but observations in several populations indicate that incidence increases at approximately the third power of age. This relationship is induced in the model by introducing a transition rate that is age dependent.
DATE/PLACE:
Thursday, November 26, 1998, 16:00
Computer Science 301
TYPE:
Research Seminar
TITLE:
A semi Markov model for mammographic detection of breast cancer
SPEAKER:
Keith Chan, St. Paul's Hospital, Vancouver, B.C.
The presence of binary covariates requires some elements of a covariance matrix to be fixed. We develop a general approach for sampling such a constrained covariance matrix. The Bayesian inference in this context now demands the use of a Gibbs sampling algorithm.
DATE/PLACE:
Thursday, November 19, 1998, 16:00
Computer Science 301
TYPE:
Research Seminar
TITLE:
Bayesian analysis of case-control data with imprecise exposure measurements
SPEAKER:
Marc Vallee, Department of Statistics, University of British Columbia, Vancouver, B.C.
| DATE/PLACE: | Tuesday, November 17, 1998, 16:00 Computer Science 301 |
| TITLE: | Diagnosing Dementia: Statistical Issues in Detecting an Elusive Group of Diseases |
| SPEAKER: | Alan Donald, Department of Statistics, University of British Columbia, Vancouver, B.C. |
| In epidemiological studies of dementia, the detection of a condition such as Alzheimer's disease is easily subject to error. A clinician treating an individual patient can usually afford to wait and watch the progress of the disease and the patient's reaction to treatment before reaching a definitive diagnosis. But research settings generally allow only "one shot" at a diagnosis. Often, in order to avoid error, researchers will give subjects two or more diagnostic tests, believing that having both tests yield the same diagnosis lends support to the diagnosis. They justify this by appealing to the kappa coefficient, a chance-corrected measure of agreement between two dichotomous tests. In this talk, I will discuss some of the controversy and confusion about the kappa coefficient that has recently concerned both medical researchers and biostatisticians. I will also present an original probabilistic model that explains some odd paradoxes about the behaviour of this popular measure of diagnostic test agreement. | |
| JOINT SEMINAR WITH PETER WALL INSTITUTE FOR ADVANCED STUDIES | |
| DATE/PLACE: | Tuesday, November 10, 1998, 16:00 Computer Science 301 |
| TITLE: | Spatial statistics, hierarchical models and massive datasets |
| SPEAKER: | Douglas Nychka, Geophysical Statistics Project, National Center for Atmospheric Research, Boulder, CO |
| A different perspective on modeling spatial data provides a route to handling large problems. Standard, statistical methods for analyzing spatial fields focus on the covariance of the spatial process. The problem with this approach for geophysical problems is the difficulty in formulating non stationary fields and, even when this is successful, computing spatial estimates using large covariance matrices. This talk considers the advantages of modeling the process directly instead of short cutting to the second order moments. This basic change of emphasis from covariance function to the process is also the key ingredient of a hierarchal model for spatial or space/time data. In the simplest case the idea is to expand the spatial field with respect to a basis and then model the variances of the basis coefficients. This alone is not a new idea. But recent developments in multiresolution bases such as wavelets allow one flexibility in capturing nonstationary structure and also permit rapid evaluation of the basis functions. The spatial estimates for a large number of locations can be found using iterative techniques, such as the conjugate gradient method, in place of standard solutions of linear systems. Such methods are common in the field of meteorological data assimilation, but have had minor impact in statistics. Here the use of a multiresolution basis is important, making the matrix multiplications in the iterations efficient. As a motivating example, we consider monthly precipitation records at approximately 5000 sites in the continental US. Here the goal is to produce spatial predictions on a regular grid and also to impute readings at sites where data is missing.
|
|
| PACIFIC NORTHWEST STATISTICS CONFERENCE | Friday, November 6, 1998 Simon Fraser University See the conference web site for more information. |
| DATE/PLACE: | Thursday, November 5, 1998, 16:00 Computer Science 301 |
| TYPE: | Research Seminar |
| TITLE: | The challenge of biometrical practice: how do we teach the right people the right things? |
| SPEAKER: | Ray Littler, University of Waikato |
|
[Joint work with Larry Weldon, Simon Fraser University] We take a critical look at conventional statistical theory and training when confronted with statistical practice. We assert that there are compelling reasons to concentrate on the "big ideas" of statistics in introductory courses, whether for potential statistics specialists or others. Making use of practical examples, we also discuss some proposed strategies for conveying essential ideas of statistical thinking; and some characteristics of statisticians of the future. |
|
| JOINT SEMINAR WITH PETER WALL INSTITUTE FOR ADVANCED STUDIES | |
| DATE/PLACE: | Tuesday, November 3, 1998, 16:00 Hennings 318 CANCELLED |
| TITLE: | The effect of latency on the course of an epidemic |
| SPEAKER: | Steve Marion, Health Care and Epidemiology, University of British Columbia, Vancouver, B.C. |
| DATE/PLACE: | Tuesday, October 27, 1998, 16:00 Computer Science 301 |
| TITLE: | Predictive inference for the elliptical linear model |
| SPEAKER: | Golam Kibria, Department of Statistics, University of British Columbia, Vancouver, B.C. |
| The prediction distribution of future responses from the linear model with errors having an elliptical distribution with known covariance parameters has been considered. For unknown covariance parameters, the marginal likelihood function of the parameters has been obtained and then the prediction distribution has been modified by replacing the covariance parameters by their estimates obtained from the marginal likelihood function. It is observed that the prediction distribution with elliptical error has a multivariate Student t-distribution with appropriate degrees of freedom. The results for some special cases such as Intra-class correlation model, AR(1), MA(1) and ARMA(1,1) models have been obtained from the general results. As an application, the beta-expectation tolerance region has been constructed. An example has been added.
|
|
| DATE/PLACE: | Thursday, October 22, 1998, 16:00 Computer Science 301 |
| TYPE: | Research Seminar |
| TITLE: | Searching for the meaning of life: coding regions in genomic DNA |
| SPEAKER: | Peter Hooper, University of Alberta |
|
A fundamental problem in DNA sequence analysis is to determine which parts of a sequence are used to code proteins. I will describe a recently developed algorithm for prediction of coding regions. More precisely, the algorithm parses a DNA sequence into intergenic regions (intervals between genes), introns (intervals spliced out during construction of mRNA) and exons (intervals used to code proteins). The exons can then be assembled to predict complete genes. The algorithm's accuracy and speed are competitive with the best existing methods. The algorithm is based upon a generalized Hidden Markov model, an approach used by Burge and Karlin (1997) and others. This general approach combines several kinds of information to predict genetic structure. The two most important sources of information are models for coding and noncoding regions, and models for functional sites (translation initiation sites, translation termination sites, 5' splice sites, and 3' splice sites). Log-linear models are used to estimate conditional hexamer probabilities, give C+G content, which are applied to calculate likelihoods for coding region and noncoding region models. A novel pattern recognition technique, called reference point logistic classification, is used to estimate functional site probabilities. The likelihood scores and functional site probabilities are combined with additional information to determine a score for each possible parse of a sequence. The parse maximizing the combined score is obtained via a dynamic programming algorithm. |
|
| DATE/PLACE: | Tuesday, October 13, 1998, 16:00 Computer Science 301 |
| TITLE: | ISO* property of the two-parameter compound Poisson distribution |
| SPEAKER: | Chunsheng Ma, Department of Statistics, University of British Columbia, Vancouver, B.C. |
| The two-parameter compound Poisson distribution is introduced via an insurance model, where the total aggregate claims consist of two parts: a fixed overhead and a random sum of claims. It contains the usual compound Poisson distribution and noncentral chi-square distribution as special cases. An ISO* property of this distribution is obtained, and applied to consider the order restricted statistical inference for the noncentrality parameter. | |
| DATE/PLACE: | Tuesday, October 6, 1998, 16:00 Computer Science 301 |
| TITLE: | Relevance Weighted Likelihood Estimates of a Normal Mean |
| SPEAKERS: | James V. Zidek and Constance van Eeden, Department of Statistics, University of British Columbia, Vancouver, B.C. |
| In this paper we answer a question asked by Professor James Berger following a lecture given by the second author in 1997 at the Centre de recherches mathématiques. That question is concerned with the estimation of (theta)1 when Yi ~ N(thetai, sigmai2, i=1,2 , are observed and (theta)1 <= (theta)2. Clearly in this case Y2 contains information about (theta)1 and Professor Berger asked how the relevance weights in the so-called relevance weighted likelihood might be selected so that Y2 can be used together with Y1 for effective likelihood-based inference about (theta)1.
Our answer to this question uses the Akaike entropy maximization criterion to find the relevance weights empirically. Although the problem of estimating (theta)1 under these conditions has a long history, our estimator appears to be new. Unlike the MLE it is continuously differentiable. Unlike the Pitman estimator for this problem, but like the MLE, it has a simple form. We will describe the derivation of our estimator, present some of its properties and compare it with some obvious competitors. Finally we present a number of open questions. For completeness we will review the idea of the relevance weighted likelihood and give illustrative applications in addition to those described above that are central to the talk. In particular, we will show how to forecast the number of goals in NHL hockey as an illustrative digression.
|
|
DATE/PLACE:
Thursday, October 8, 1998
CANCELLED
| DATE/PLACE: | Tuesday, September 29, 1998, 16:00 Computer Science 301 |
| TITLE: | Reference Point Logistic Classification |
| SPEAKER: | Peter Hooper, Department of Mathematical Sciences, University of Alberta |
| I will describe a new method for pattern recognition based on ideas from statistics and neural networks. Reference point logistic classification uses normalized exponential functions of squared distance from reference points in the feature space to construct piecewise linear classification boundaries. Reference points and other parameters are determined by minimizing a smoothed training risk; i.e., the expected loss based on a smoothed nonparametric estimate of the distribution. A general loss function can be specified. The risk is minimized by stochastic approximation, using importance sampling to reduce computations. The number of reference points and the smoothing parameter are selected, using test data or cross- validation, to provide an appropriate level of complexity and avoid overfitting.
The method performs well in comparison with 22 other classification methods on ten data sets from the European (ESPRIT) project StatLog. Examples include letter recognition and DNA sequence analysis. |
|
| DATE/PLACE: | Thursday, September 24, 1998, 16:00 Computer Science 301 |
| TYPE: | Research Seminar (to motivate a Journal Club session) |
| TITLE: | Multivariate probit and logit models for multivariate binary and ordinal response data with covariates |
| SPEAKER: | Harry Joe, Department of Statistics, University of British Columbia, Vancouver, B.C. |
| I will discuss models which generalize the logit and probit models for univariate binary and ordinal response (with covariates) to multivariate response or repeated measures response. These useful models, based on a multivariate normal or multivariate logistic distribution, may not be widely known.
I will present specific models from the statistical literature, discuss the computations for the models, give examples where I have used the models, and mention some open research problems. References: 1. Joe, H. (1997). Multivariate Models and Dependence Concepts, Chapman & Hall. 2. Xu, J.J. (1996). Statistical Modelling and Inference for Multivariate and Longitudinal Discrete Response Data. 3. Molenberghs, G. and Lesaffre, E. (1994). Marginal modelling of correlated ordinal data using a multivariate Plackett distribution. J. Amer. Statist. Assoc., 89, 633-644. Note: The speaker gives this talk in hope that someone will lead a follow-up journal club session based on Glonek, G.F.V. (1996). A class of regression models for multivariate categorical responses. Biometrika v. 83, pp. 15-28. |
|
| DATE/PLACE: | Tuesday, September 22, 1998, 16:00 Computer Science 301 |
| TITLE: | An Orthodox BLUP Approach to Generalized Linear Mixed Models |
| SPEAKER: | Renjun Ma, Department of Statistics, University of British Columbia, Vancouver, B.C. |
| Research in the area of generalized linear mixed models has advanced rapidly in recent years, due to the recent increasing demands for random effects modeling methodology to analyze clustered data from sample survey, repeated measures, multivariate and longitudinal data analysis from clinical trials, biomedical,agricultural research,industry and economic study. However, the presence of random effects in generalized linear models complicates the estimation problem considerably compared with the independence case, because maximum likelihood estimation involves intractable numerical integration. Ways to overcome this are the Gibbs sampler method of Zeger and Karim, the penalized quasi-likelihood approach of Breslow and Clayton and hierarchical likelihood approach of Lee and Nelder. The advantages and disadvantages of these approaches will be discussed. We consider an orthodox BLUP approach based on a predictor of the random effects that is truly best linear and unbiased, in contrast to the conventional BLUP which is the conditional mode. We assume Tweedie exponential dispersion model distributions for both the response and the random effects, accommodating a wide range of discrete, continuous and mixed data. This approach facilitates analysis of residuals, and allows justification of asymptotic results under realistic conditions. While fully parametric, the model is to some extent robust against misspecification of the random effects distributions. | |
| DATE/PLACE: | Thursday, September 17, 1998, 16:00 NOTE DIFFERENT DAY Computer Science 301 |
| TITLE: | Smoothing Parameter Selection when Errors are Correlated and Application to Ozone Data |
| SPEAKER: | Robert St. Aubin, Department of Statistics, University of British Columbia, Vancouver, B.C. |
| Automatic smoothing parameter selection methods for nonparametric regression like cross-validation and generalized cross-validation are known to be severely affected by dependence in the regression errors. We proposed, in this work, to modify some of the ideas used in the cross-validation criterion in kernel regression with dependent errors and apply them to smoothing splines with model based penalty. Model based penalty smoothing permits us to keep the flexibility of the nonparametric methods while it also allows us to specify a favoured parametric model which can help improve on the estimate of the regression function.
We consider the ``modified cross-validation'' (also known as Leave-2l+1 out) and the ``blockwise cross-validation'' smoothing parameter selection techniques which were initially proposed by Wehrly and Hart (1988) and Hardle and Vieu (1992) respectively. These two smoothing parameter selection techniques take correlation into account and alleviate its effect on the regression function estimation. We use a simulation study to evaluate the performance of our twosmoothing parameter selection techniques. We compare the results with a few commonly used parametric techniques. Our techniques are also applied to an air pollution data set where we estimate the underlying trend of daily and monthly ground ozone levels in southern Ontario. |
|
| DATE/PLACE: | Tuesday, September 15, 1998, 16:00 Computer Science 301 |
| TITLE: | Some Statistical Properties of Multivariate Proper Dispersion Models, with Special Reference to a Multivariate Gamma Model |
| SPEAKER: | Jeevanantham Rajeswaran, Department of Statistics, University of British Columbia, Vancouver, B.C. |
| A broad class of error distributions for generalized linear models is provided by the class of dispersion models which was introduced by Jorgensen (1987a, 1997a) and a detailed study on dispersion models was made by Jorgensen (1997b). In this thesis we study multivariate proper dispersion models.
Our aim is to do multivariate analysis for non-normal data, particularly data from the multivariate gamma distribution which is an example of a multivariate proper dispersion model, introduced by Jorgensen and Lauritzen (1998). This class provides a multivariate extension of the dispersion model density, following the spirit of the multivariate normal density. We consider the saddlepoint approximation for small dispersion matrices, which, in turn, implies that the multivariate proper dispersion model is approximately multivariate normal for small dispersion matrices. We want to mimic the basic technique of testing in multivariate normal, Hotelling's T-squared. Our version of the T-squared test applies asymptotically, for either small dispersion or large samples. We also consider estimating the normalizing constant of the bivariate gamma by Monte Carlo simulation and we investigate the marginal density by using numerical integration. We also investigate the distribution of the T-squared statistic by Monte Carlo simulation. |
|
