CPH 636-001 (Data Mining in Public Health)

 

            Syllabus (DOC), posted 09 January 2009

            Entrance Survey (DOC), posted 09 January 2009

 

            Lecture 1 (PS PDF), posted 10 January 2009

            Lecture 2 (PS PDF), posted 03 February 2009

            Lecture 3 (PS PDF), posted 03 February 2009

                        data for Lecture 3 examples (SAS), posted 03 February 2009

                        example of UNIVAR macro output (RTF), posted 03 February 2009

                        example of FREQ macro output (RTF), posted 03 February 2009

            Lecture 4 (PS PDF), posted 06 February 2009

                        data for Lecture 4 example (XLS), posted 06 February 2009

                        training subset (SAS), posted 06 February 2009

                        validation subset (SAS), posted 06 February 2009

                        test subset (SAS), posted 06 February 2009

                        example of REGDIAG macro output (RTF), posted 06 February 2009

            Lecture 5 (PS PDF), posted 13 February 2009

            Lecture 6 (PS PDF), posted 18 February 2009

                        data for Lecture 6 examples (SAS), posted 18 February 2009

                        training subset (SAS), posted 18 February 2009

                        validation subset (SAS), posted 18 February 2009

                        test subset (SAS), posted 18 February 2009

                        example of LOGISTIC macro output with main results and suggestions for variable selection (DOC), posted 18 February 2009

                        example of LOGISTIC macro output with diagnostics for quadratic and interaction terms (DOC), posted 18 February 2009

            Lecture 7 (PS PDF), posted 06 March 2009

                        training subset for Lecture 7 example (SAS), posted 06 March 2009

                        test subset for Lecture 7 example (SAS), posted 06 March 2009

                        example of DISCRIM macro output providing suggestions for variable selection (DOC), posted 06 March 2009

                        example of DISCRIM macro output providing results for variables of our choosing (DOC), posted 06 March 2009

            Lecture 8 (PS PDF), posted 15 March 2009

                        FEV example: regression tree output (HTML) and visualization (MDI), posted 15 March 2009

                        SA example: classification tree output (HTML) and visualization (MDI), posted 15 March 2009

            Lecture 9 (PS PDF), posted 27 March 2009

                        Regression problem example output (HTML), posted 25 March 2009

                        Classification problem example output (HTML), posted 25 March 2009

            Lecture 10 (PS PDF), posted 01 April 2009

                        Regression problem example output (HTML), posted 25 March 2009

                        Classification problem example output (HTML), posted 25 March 2009

            Lecture 11 (PS PDF), posted 08 April 2009

                        Data for principal components analysis example (SAS), posted 08 April 2009

                        Output from exploratory data analysis (DOC), posted 08 April 2009

                        Output from principal components analysis (DOC), posted 08 April 2009

                        Data for factor analysis example: will not be posted (not for public consumption)

                        Output from exploratory data analysis (DOC), posted 08 April 2009

                        Output from factor analysis (DOC), posted 08 April 2009

                        Instructions for FACTOR macro (TXT), posted 08 April 2009

            Lecture 12 (PS PDF), posted 22 April 2009

                        Output from exploratory data analysis (DOC), posted 22 April 2009

                        Output from cluster analysis (DOC), posted 22 April 2009

                        Instructions for DISJCLUS macro (TXT), posted 22 April 2009

 

            Written Assignment 1 (PS PDF), posted 10 January 2009

                        data file (XLS) and information file (XLS), posted 10 January 2009

                        instructions for EXCELSAS and RANSPLIT macros (TXT), posted 10 January 2009

                        solutions (PS PDF), posted 03 February 2009

            Written Assignment 2 (PS PDF), posted 03 February 2009

                        full data set (SAS), posted 03 February 2009

                        training subset (SAS), posted 03 February 2009

                        validation subset (SAS), posted 03 February 2009

                        test subset (SAS), posted 03 February 2009

                        instructions for UNIVAR and FREQ macros (TXT), posted 03 February 2009

                        instructions for REGDIAG macro (TXT), posted 03 February 2009

                        solutions (PS PDF), posted 17 February 2009

            Written Assignment 3 (PS PDF), posted 13 February 2009

                        SAS code to replace RSCORE macro (TXT), posted 13 February 2009

                        instructions for LOGISTIC macro (TXT), posted 13 February 2009

                        solutions (PS PDF), posted 10 March 2009

            Written Assignment 4 (PS PDF), posted 06 March 2009

                        instructions for DISCRIM macro (TXT), posted 06 March 2009

                        instructions for making trees in Enterprise Miner (TXT), posted 06 March 2009

                        solutions (PS PDF), posted 04 April 2009

            Written Assignment 5 (PS PDF), posted 25 March 2009

                        instructions for making neural networks in Enterprise Miner (TXT), posted 25 March 2009

                        instructions for performing nearest neighbors analyses in Enterprise Miner (TXT), posted 25 March 2009

                        solutions (PS PDF), posted 18 April 2009

 

            Instructions for Final Project (DOC), posted 15 March 2009

            Oral Presentation Score Sheet (DOC), posted 15 March 2009