01 Opening Handout

Statistics 343

Applied Regression Methods, Autumn 1995

Instructor:	Ronald Thisted
Office:	Eckhart 126
Phone:	702-8332 (voice mail)
email:	r-thisted@uchicago.edu

Prerequisites

Mathematical analysis (advanced calculus)
Introduction to statistical theory (Stat 244-245)
Matrix linear algebra (Math 250)

Required Textbooks

Venables, WN and Ripley, BD (1994). Modern Applied Statistics with S-Plus. Springer-Verlag: New York. [Ordered at Seminary Coop Bookstore]

Weisberg, Sanford (1985). Applied Linear Regression, Second edition. New York: Wiley. Abbreviation: ALR. [Ordered at Seminary Coop Bookstore]

Additional Resources

Becker, Richard A., Chambers, John M., and Wilks, Allan R. (1988). The New S Language. Pacific Grove: Wadsworth & Brooks/Cole. Abbreviation: NSL.

Chambers, John M., and Hastie, Trevor J., eds. (1992). Statistical Models in S. Pacific Grove: Wadsworth & Brooks/Cole. Abbreviation: SMS.

McCullagh, Peter, and Nelder, John (1989). Generalized Linear Models, Second Edition. London: Chapman & Hall.

Mosteller, Frederick and Tukey, John W. (1977). Data Analysis and Regression: A Second Course in Statistics. Reading: Addison-Wesley.

Rao, C. R. (1973). Linear Statistical Inference and its Applications, Second Edition. New York: Wiley.

Thisted, Ronald A. (1988). Elements of Statistical Computing: Numerical Computation. New York: Chapman & Hall.

Computational Information

1. We shall be using S-Plus for most purposes in this course. This program is available on the Statistics Department computers, as well as on the SUN Cluster. If you are not a member of the Statistics Department, you should use the SUN Cluster for access to S-Plus and to the data sets used in class. Important: If you are not using the Statistics Department computers, you are responsible for obtaining an account and for learning the computing environment on the SUN Cluster, or another computer system.

2. All of the data sets from Weisberg's book, Applied Linear Regression, are available in the directory /ga/thisted/343 on galton. The files names are of the form ALRnnn, where nnn denotes the three-digit page number on which the data set appears. Note the UPPER CASE letters in the file names. These data are also available on the Sun Cluster in the directory /nfs/quads/q2/rats/343.

3. If you will be working on the Sun Cluster instead of galton, read the document entitled, "Using S+ on the Sun Cluster."

Example. Here are some computations in S to help you get started. The data are from exercise 1.2 in ALR. The statement "S UCINIT" should be done once and only once in each directory in which you plan to use S. It sets up a file called .Data, and turns off an invisible file .Audit that would otherwise grow without bound.