Leuven STATistics STATe of the Art Training Initiative

DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 1
Leuven STATistics
STATe of the Art
Training Initiative
2014-2015
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 2
Course timetable 2014-2015
DATE
TITLE
PRESENTERS
LEVEL AND
LANGUAGE
September 2014
29, 30 September,
1, 2 October 2014
Flames: Essential Tools for R
Goele Bossaert
Basic (English)
5
October 2014
6, 7, 8 October 2014
Fundamentele statistische
methoden
Marlies Lacante
Basis (Nederlands)
6
14 October 2014
Fundamentele statistische
methoden, toegepast met
SAS Eguide
Martine Beullens
Basis (Nederlands)
8
14 October 2014
Fundamentele statistische
Marlies Lacante
methoden, toegepast met SPSS
Basis (Nederlands)
8
14 October 2014
Fundamental Statistical Resarch Anna Ivanova
Methods, applications with R
Basic (English)
9
13, 14 and 30, 31
October 2013
Optimization and Numerical
Methods in Statistics
Francis Tuerlinckx
Geert Molenberghs
Katrijn Van Deun
Tom Wilderjans
Advanced (English)
7
22, 23, 24 October 2014 Models for Longitudinal
17, 18, 19 November 2014 and Incomplete data
Geert Molenberghs,
Geert Verbeke
Advanced (English)
10
3, 4 November 2014
Advanced programming in R
Jan Wijfels
Intermediate (English)
12
3, 4 November 2014
Regression and Analysis
of Variance
Anna Ivanova
Marlies Lacante
Basic (English)
13
14 November 2014
Regressie- en variantieanalyse, An Carbonez
toegepast met SPSS
Basis (Nederlands)
14
14 November 2014
Regressie- en variantieanalyse, Martine Beullens
toegepast met SAS Eguide
Basis (Nederlands)
14
13, 18 November 2014
Regression and Analysis of
Variance, applications with R
Anna Ivanova
Basic (English)
15
24, 25 November 2014
Uitbreiding bij Regressieen variantieanalyse
An Carbonez
Marlies Lacante
Verdiepend (Nederlands) 16
27, 28 November 2014
Statistical Machine
Learning with R
Jan Wijfels
Advanced (English)
17
November 2014
MORE
ON PAGE
December 2014
10 December 2014
Niet-parametrische statistiek
Marlies Lacante
Basis (Nederlands)
18
February 2015
10, 17, 24 February
3, 10, 17, 31 March
and 28 April 2015
Experimental Design
Peter Goos
Intermediate (English)
19
11, 25 February,
4, 11, 18 March 2015
Chemometrics
Wouter Saeys
Advanced (English)
20
10, 11, 12 March 2015
Cluster analysis, principal
component analysis and
exploratory factor analysis
with SAS, SPSS and R
Anne Marie De Meyer
An Carbonez
Martine Buellens
Intermediate (English)
21
March 2015
April 2015
May 2015
17, 18, 19 March 2015
Fundamental statistical methods Marlies Lacante
Basic (English)
22
19, 20 March 2015
Introduction to the analysis
of contingency tables
An Carbonez
Basic (English)
23
10, 31 March, 28 April
and 5, 12 May 2015
Concepts of multilevel,
longitudinal and mixed models
Geert Verbeke
Advanced (English)
24
23, 24 March 2015
Introduction to correspondence Anne-Marie De Meyer
analysis and multiple
correspondence analysis with
SAS, SPSS and R
Intermediate (English)
25
23 April 2015
Weblecture on Sampling Theory Geert Molenberghs
Intermediate (English)
26
28, 29 April 2015
Logistic Regression Models
with SAS
Anne Marie De Meyer
Intermediate (English)
27
28, 30 April 2015
Logistic Regression Models
with SPSS
Anne Marie De Meyer
Intermediate (English)
27
data to be announced
in 2015
Nonparametic Smoothing
Techniques and Applications
to be announced
Advanced (English)
28
6, 7 May 2015
Poisson regression with SAS
Anne-Marie De Meyer
Intermediate (English)
29
6, 8 May 2015
Poisson regression with SPSS
Anne-Marie De Meyer
Intermediate (English)
29
6, 7, 8 May 2015
Inleiding tot enquêtering
Marlies Lacante
Kristel Hoydonckx
Basis (Nederlands)
30
8, 9, 11, 12 June 2015
Flames: Essential Tools for R
Goele Bossart
Basic (English)
5
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 1
Preface
I take pleasure and pride in welcoming you to the Leuven STATistics STATe of the Art Training Initiative, a scientific
and educational project of the Leuven Statistics Research Centre (LStat), offering a range of short courses.
Statistics in Leuven is varied and broad based. Statisticians are active throughout the university, in mathematics, computer
science, economy, psychology, education, bio-engineering, engineering, biology, chemistry, medicine, pharmacy, iphysical
education, psychology, social science, linguistics, etc. Many colleagues combine an excellent international scientific
reputation with highly effective teaching skills. At the same time, statistical consulting for internal and external clients
is a wholesome component of LStat’s mission.
It is therefore not surprising that the short course programme has been highly successful and in great demand.
Celebrating this success, we are shifting into higher gear and the time-honoured programme of short courses is
gradually being expanded with further highly relevant topics, many located at the heart of our faculty’s expertise.
Due to increasing demand, some courses are offered more than once per academic year.
A selected set of courses is offered in an open educational concept, in the sense that, for example, also contingents
of students of our highly successful MSc in Statistics partake in them. This ensures stimulating interaction.
Courses take place in one of the university’s campuses, dotted around the beautiful college town of Leuven.
Should your company or institute be looking for a tailor-made training initiative, perhaps on-site, then we will be
delighted to explore options and work towards an individualized proposal.
Professor Marlies Lacante
2013-2015 chair of Lstat
Leuven Statistics Research Centre
Celestijnenlaan 200 B bus 5307
3001 HEVERLEE
+ 32 16 32 22 14
[email protected]
www.lstat.kuleuven.be
1
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 2
Presenters
Martine Beullens
graduated in Mathematics at KU Leuven. From 1990
onwards she has been working at KU Leuven and the
Federal Police on projects commissioned by the Belgian
government on the development and statistical exploitation
of federal databanks containing judicial or police information.
Currently she is working at KU Leuven at the Teaching
and Learning Vision and Quality department in the Data
management section.
Goele Bossaert
is FLAMES coordinator of the KU Leuven. She obtained
her Phd degree in Educational Sciences in 2012 and a
Master degree in Statistics the following year. She organizes,
together with the other FLAMES coordinators at the four
other Flemish universities, the inter-universitary and local
FLAMES events @ KU Leuven.
An Carbonez
is professor aan het Leuven Statistics Research Centre
(LStat) van de KU Leuven. Ze behaalde haar doctoraat
wiskunde aan de KU Leuven in 1992. Ze is coördinator
van het MSc in Statistics programma van de KU Leuven.
Ze is ook betrokken bij statistische consulting projecten
en het geven van statistische opleidingen binnen bedrijven.
Anne-Marie De Meyer
is Professor at the KU Leuven in the Faculty of Science,
Department of Mathematics. She received her PHD in
Mathematics (Applied Probability) in 1979 and is currently
teaching statistics in the MSc in Statistics and in the
Bachelor of Criminology. Since more than 25 years she
has been involved in the short course program for a
variety of courses in applied statistics and is also active in
the statistical consulting of LStat.
Peter Goos
is a full professor at the Department of Biosystems of the
Faculty of Bioscience Engineering of the University of Leuven
and the Department of Environment, Technology and
Technology Management of the Faculty of Applied Economics
of the University of Antwerp. He has been a guest professor
at the Econometric Institute of the Erasmus School of
Economics (Erasmus University of Rotterdam), the Faculty
of Business and Economics of the University of Leuven,
the Antwerp Management School and the International
School of Management in Saint-Petersburg. Peter Goos
has received the Shewell Award and the Lloyd S. Nelson
Award of the American Society for Quality, the Ziegel Award
and the Statistics in Chemistry Award of the American
Statistical Association, and the Young Statistician Award
of the European Network for Business and Industrial
Statistics. In 2013, Peter Goos was ranked 7th in the top
40 of economists in the Netherlands.
Kristel Hoydonckx
is werkzaam aan de afdeling “Faciliteiten voor onderzoek”
van de KU Leuven en staat daar ondermeer in voor de
enquêteservice.
2
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 3
Anna Ivanova
at the Harvard School of Public Health (Boston, MA). He is
founding director of the Center for Statistics at Hasselt
University and currently the director of the Interuniversity
Institute for Biostatistics and statistical Bioinformatics,
I-BioStat, a joint initiative of the Hasselt and Leuven
universities.
is a research assistant at L-BioStat of the KU Leuven.
She obtained her Master degree in Statistics from the
KU Leuven in 2004. She carries out statistical consulting
and participates in statistical consulting projects.
Wouter Saeys
Marlies Lacante
is sedert 1974 verbonden aan de onderzoekseenheid
Psychologie van de KU Leuven. Gedurende meer dan
20 jaar was zij betrokken bij het statistiekonderwijs in
de opleiding Psychologie. Momenteel doceert zij binnen
de academische Lerarenopleiding, binnen het Leuven
Statistics Research Centre (Lstat) en binnen de MSc in de
Psychologie. Ze is ook actief in het onderwijsonderzoek,
met focus op survey onderzoek en met speciale aandacht
voor de onderzoeksmethodologie.
Geert Molenberghs
is Professor of Biostatistics at the Universiteit Hasselt and
KU Leuven in Belgium. He received the B.S. degree in
mathematics (1988) and a Ph.D. in biostatistics (1993)
from the Universiteit Antwerpen. Dr Molenberghs published
methodological work on surrogate markers in clinical trials,
categorical data, longitudinal data analysis, and on the
analysis of non-response in clinical and epidemiological
studies. He served as Joint Editor for Applied Statistics
(2001-2004), Co-editor for Biometrics (2007–2009) and
as President of the International Biometric Society (20042005). He currently is Co-editor for Biostatistics (2010–
2013). He was elected Fellow of the American Statistical
Association and received the Guy Medal in Bronze from
the Royal Statistical Society. He has held visiting positions
is assistant Professor at the Department of Biosystems at
the KU Leuven in Belgium. He received his Master degree
in Bioscience Engineering (2002) and a PhD in Bioscience
Engineering (2006) from the KU Leuven. He was a
postdoctoral researcher at the School for Chemical
Engineering and Advanced Materials of the University of
Newcastle upon Tyne (UK) and at the Norwegian Food
Research Institute – Matforsk (Ås, Norway). In 2013,
he received the Young Statistician Award of the European
Network for Business and Industrial Statistics. In general
Wouter’s research deals with light transport modeling and
optical characterization of biological materials, multivariate
data analysis and chemometrics, process monitoring and
control. He is author of 80+ research articles (ISI).
Francis Tuerlinckx
is Professor of Psychology at the KU Leuven in Belgium.
He received the Master degree in psychology (1996) and
a Ph.D. in psychology (2000) from the KU Leuven. He was
a postdoc at the Department of Statistics of Columbia
University (New York). In general, Francis Tuerlinckx’
research deals with the mathematical modeling of various
aspects of human behavior. More specifically, he works
on item response theory, reaction time modeling, and
dynamical systems data analysis for human emotions.
3
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 4
Presenters
Katrijn Van Deun
is assistant professor in Methodology and Statistics at
Tilburg University and a research fellow of the KU Leuven.
She obtained a Master in psychology, a Master’s degree
in statistics and a PhD in psychology. Her main area of
expertise is scaling, clustering and component analysis
techniques, which she applies in the fields of psychology,
chemometrics and bioinformatics. She has various publications in both methodological and substantive journals
in psychometrics, chemometrics and bioinformatics.
Katrijn is secretary of the Dutch/Flemish Classification society.
Jan Wijffels
is the founder of www.bnosac.be - a consultancy company
specialised in statistical analysis and data mining. He holds
a Master in Commercial Engineering, a MSc in Statistics
and a Master in Artificial Intelligence and has been using
R for 10 years, developing and deploying R-based
solutions for clients in the private sector. He has developed
and co-developed the R packages ffbase, ETLUtils, RMOA
and RMyrrix.
Tom Wilderjans
Geert Verbeke
is Professor in Biostatistics at KU Leuven and Universiteit
Hasselt. He received the B.S. degree in mathematics (1989)
from the KU Leuven, the M.S. in biostatistics (1992) from
Universiteit Hasselt, and earned a Ph.D. in biostatistics
(1995) from the KU Leuven. Geert Verbeke has published
extensively on longitudinal data analyses. He has held
visiting positions at the Gerontology Research Center and
the Johns Hopkins University (Baltimore, MD). Geert Verbeke
is Past President of the Belgian Region of the International
Biometric Society, International Program Chair for the
International Biometric Conference in Montreal (2006),
Board Member of the American Statistical Association.
He is past Joint Editor of the Journal of the Royal Statistical
Society, Series A (2005–2008) and currently editor of
Biometrics (2010– 2013). He is the director of the Leuven
Center for Biostatistics and statistical Bioinformatics
(L-BioStat), and vice-director of the Interuniversity
Institute for Biostatistics and statistical Bioinformatics
(I-BioStat), a joint initiative of the Hasselt and Leuven
universities in Belgium.
4
is a post-doctoral researcher at the Fund for Scientific
Research (FWO-Flanders). He obtained a Master’s degree
(2005) and a PhD (2009) in Mathematical Psychology
from the KU Leuven. Tom’s research deals with multivariate data analysis (component analysis, clustering, and
combinations thereof) and model selection.
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 5
Flames: Essential tools for R
Course outline
This course gives an introduction to the use of the statistical
software language R. R is a language for data analysis and
graphics. This introduction course to R is aimed at beginners.
The course covers data handling, graphics, mathematical
functions and some statistical techniques. R is for free and for
more information you can visit the site http://cran.r-project.org/
Target audience
Everybody who is interested in using the R programming
language. You will learn how to write and manage your
R scripts.
Prerequisites
There are no prerequisites.
Presenter
Goele Bossaert
Tourse Material
A .pdf file with the course material will be made available.
Dates
29, 30 September and 1, 2 October 2014 from 9.00 hr
to 12.00 hr
or
8, 9, 11, 12 June 2015 from 9.00 hr to 12.00 hr
Language
English
FLAMES
Flanders Training Network for Methodology and Statistics
(FLAMES) is an inter-university training network rooted in
the five Flemish universities: Free University of Brussels,
Ghent University, Hasselt University, University of Antwerp,
and KU Leuven.
This network aims to support doctoral students and young
empirical researchers in their pursuit of best-in-class training
in methodology and statistics. FLAMES seeks to optimize,
intensify, and extend the methodological and statistical
training currently offered by Flemish universities and bring
existing training to new audiences. It also develops new
training modules for complex and advanced statistical
methods taught by experts, a series of seminars on qualitative
methods, organizes a yearly summer school, and delivers
specialized workshops.
The FLAMES initiative originated in 2013 with the financial
support of the Pact 2020 ('Vlaanderen in Actie', translated:
Flanders in action) and the Flemish Minister of Innovation
Ingrid Lieten.
Flames Price
•
•
•
•
PhDs and postdocs of a Flemish University: free
Other academics: € 120
Non profit/social sector € 200
Private sector € 400
5
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 6
Fundamentele statistische
methoden
Beschrijving
Deze basiscursus statistiek richt zich op het kiezen van
geschikte statistische methoden en het trekken van de
correcte conclusies uit de verkregen resultaten. Wiskundige
grondslagen van de gebruikte methoden komen in deze
cursus slechts beknopt ter sprake. De nadruk ligt op toepassing in de praktijk. Men krijgt inzicht in het adequaat
gebruik van basis-statistieken: centrummaten, spreidingsmaten, tabellen, box-plots, enz. Daarnaast worden
betrouwbaarheidsintervallen opgesteld en krijgt men de
grondslagen van toetsen van hypothesen.
Inhoud van de cursus:
• Beschrijvende grootheden: grafische en numerische
samenvatting van de data
• Verdelingen: Binomiale, Poisson, Normale, T-verdeling
• Steekproefverdeling van het gemiddelde
• Betrouwbaarheidsintervallen
• Hypothese testen omtrent een gemiddelde (één en
twee steekproeven)
• Gepaarde t-test
• Schatten en testen van proporties
Cursusmateriaal
Cursusmateriaal wordt als .pdf file ter beschikking gesteld.
Datum
6, 7, 8 oktober 2014 telkens van 9 u. tot 12 u.
Doelgroep
Iedereen die een opfrissing van fundamentele statistische
technieken wenst.
Taal
Nederlands
Voorkennis
Er wordt geen voorkennis ondersteld.
Lesgever
Prof. Marlies Lacante
6
Prijs
• Personeel en studenten KU Leuven en Associatie
KU Leuven: zie: https://icts.kuleuven.be/cursus/
• PhD studenten, niet KU Leuven € 120
• Non profit/sociale sector € 187,50
• Private sector € 450
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 7
Optimization & Numerical
Methods in Statistics
Course outline
Course Materials
Numerical problems are frequently encountered by statisticians. Prominently, the estimation of the parameters of a
statistical model requires the solution of an optimization
problem. In a few simple cases, closed-form solutions exist
but for many probability models the optimal parameter
estimates have to be determined by means of an iterative
algorithm. The goal of this course is threefold. First, we want
to offer the readers an overview of some frequently used
optimization algorithms in (applied) statistics. Second, we want
to provide a framework for understanding the connections
among several optimization algorithms as well as between
optimization and aspects of statistical inference. Third,
although very common, optimization is not the only numerical
problem and therefore some important related topics such
as numerical differentiation and integration will be covered.
A .pdf file with the course material will be made available.
Target audience
The intended target audience includes PhD students and
researchers in a variety of fields, including biostatistics,
psychometrics, educational measurement, public health,
sociology. We aim at readers who apply and possibly develop
statistical models and who wish to learn more about the
basic concepts of numerical techniques, with an emphasis
on optimization problems, and their use in statistics.
Background reading:
• Everitt, B.S. (1987). Introduction to Optimization Methods
and Their Application in Statistics. London: Chapman
& Hall.
• Lange, K. (1999). Numerical Analysis for Statisticians.
New York: Springer.
• Lange, K. (2004). Optimization. New York: Springer.
Dates
13 - 14 and 30 - 31 October 2014: 9.00 hr - 12.30 hr;
13.30 hr - 17.00 hr
Language
English
Price
• Staff and students KU Leuven and Association
KU Leuven: go to: https://icts.kuleuven.be/cursus/
• PhD students, non KU Leuven € 320
• Non profit/social sector € 500
• Private sector € 1200
Prerequisites
Participants should have a basic knowledge of the principles
of statistical inference. This includes some familiarity with
the concept of a likelihood function and likelihood-based
inference for linear, binomial, multinomial, and logistic
regression models. Readers should also have a basic
understanding of matrix algebra. A working knowledge of
the basic elements of univariate calculus is also a prerequisite, including (the concepts of continuity of a function,
derivative and integration).
Presenters
Francis Tuerlinckx, Geert Molenberghs, Katrijn Van Deun,
Tom Wilderjans
7
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 8
Fundamentele statistische methoden, t
CURSUS 1: FUNDAMENTELE STATISTISCHE METHODEN, TOEGEPAST MET SPSS
Beschrijving
Lesgever
Dit is een inleidende cursus tot het gebruik van SPSS.
Aan de hand van cases wordt geïllustreerd hoe men met
SPSS tot exploratie van gegevens komt. Hierbij wordt de
nodige aandacht besteed aan het interpreteren van de
verkregen output. Hypothesetesten voor onafhankelijke en
gepaarde groepen worden uitgevoerd en besproken. Er is
tijd om zelf te werken met deze software.
Prof. Marlies Lacante
Cursusmateriaal
Cursusmateriaal wordt als .pdf file ter beschikking gesteld.
Datum
Doelgroep
14 oktober 2014, 9 u. - 12 u. en 13 u. - 16 u.
Iedereen die gegevens wenst te exploreren met SPSS.
Taal
Voorkennis
Nederlands
De technieken die aangeleerd werden bij Fundamentele
Statistische Methoden.
CURSUS 2: FUNDAMENTELE STATISTISCHE METHODEN, TOEGEPAST
MET SAS EGUIDE
Beschrijving
Lesgever
Dit is een inleidende cursus tot het gebruik van SAS
Enterprise Guide. Aan de hand van cases wordt geïllustreerd
hoe men met de SAS Eguide tot exploratie van gegevens
komt. Hierbij wordt de nodige aandacht besteed aan het
interpreteren van de verkregen output. Hypothesetesten
voor onafhankelijke en gepaarde groepen worden uitgevoerd
en besproken. Er is tijd om zelf te werken met deze software.
Martine Beullens
Cursusmateriaal
Cursusmateriaal wordt als .pdf file ter beschikking gesteld.
Datum
Doelgroep
Iedereen die gegevens wenst te exploreren met SAS
Eguide.
14 oktober 2014, 9 u. - 12 u. en 13 u. - 16 u.
Taal
Nederlands
Voorkennis
De technieken die aangeleerd werden bij Fundamentele
Statistische Methoden.
8
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 9
n, toegepast met SAS Eguide, SPSS en R
COURSE 3: FUNDAMENTAL STATISTICAL METHODS, APPLICATIONS WITH R
Course outline
Course Materials
By using cases, one explores data by using R. Attention
is paid to the interpretation of the output. Topics as exploring
data, construction of confidence intervals and hypothesis
testing is covered. This is a hands-on session.
A .pdf file with the course material will be made available.
Target audience
Date
14 October 2014, 9.00 hr - 12.00 hr
and 13.00 hr - 16.00 hr.
Everybody who wants to explore data by using R
Language
Prerequisites
English
Fundamental Statistical Methods (distributions, confidence
intervals, hypothesis testing) and Introduction to R.
Presenter
Anna Ivanova
PRICE COURSE 1, 2 or 3:
• Staff and students KU Leuven and Association
KU Leuven: go to: https://icts.kuleuven.be/cursus/
• PhD students, non KU Leuven € 80
• Non profit/social sector € 125
• Private sector € 300
9
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 10
Models for Longitudinal and Incomplete D
CONCEPTS, MODELS AND HANDS-ON APPLICATION WITH THE OPTION
TO ANALYSE ONE’S OWN DATA
10
Course outline
Target audience
We first present linear mixed models for continuous
hierarchical data. The focus lies on the modeler’s
perspective and on applications. Emphasis will be on
model formulation, parameter estimation, and hypothesis
testing, as well as on the distinction between the randomeffects (hierarchical) model and the implied marginal
model. Apart from classical model building strategies,
many of which have been implemented in standard
statistical software, a number of flexible extensions and
additional tools for model diagnosis will be indicated.
Second, models for non-Gaussian data will be discussed,
with a strong emphasis on generalized estimating
equations (GEE) and the generalized linear mixed model
(GLMM). To usefully introduce this theme, a brief review of
the classical generalized linear modeling framework will be
presented. Similarities and differences with the continuous
case will be discussed. The differences between marginal
models, such as GEE, and random-effects models, such
as the GLMM, will be explained in detail. Third, it is
oftentimes necessary to consider fully non-linear models
for longitudinal data. We will discuss such situations, and
place some emphasis on the non-linear mixed-effects
model. Fourth, non-linear mixed models will be discussed.
Applications in the PK/PD world will be brought to the
front. Fifth, when analyzing hierarchical and longitudinal
data, one is often confronted with missing observations,
i.e., scheduled measurements have not been made, due
to a variety of (known or unknown) reasons. It will be
shown that, if no appropriate measures are taken, missing
data can cause seriously jeopardize results, and interpretation difficulties are bound to occur. Methods to
properly analyze incomplete data, under flexible
assumptions, are presented. Key concepts of sensitivity
analysis are introduced. All developments will be illustrated
with worked examples using the SAS System. However,
the course is conceived such that it will be of benefit to
both SAS users and users of other platforms.
The targeted audience includes methodological and applied
statisticians and researchers in industry, public health organizations, contract research organizations, and academia.
Important: The course will also serve for the MSc in Statistics
students.
Prerequisites
Throughout the course, it will be assumed that the
participants are familiar with basic statistical modeling
concepts, including linear models (regression and analysis
of variance), as well as generalized linear models (logistic
and Poisson regression) and basic knowledge of mixed
and multilevel models. Moreover, pre-requisite knowledge
should also include general estimation and testing theory
(maximum likelihood, likelihood ratio). When registering for
this course, you have to mention the topics you have
followed before and/or indicate where you became
acquainted with the requested material.
Presenters
Geert Verbeke and Geert Molenberghs
Course Materials
A .pdf file with the course material will be made available.
Background reading:
• Verbeke, G. and Molenberghs, G. (2000) Linear Mixed
Models for Longitudinal Data. New York: Springer.
• Molenberghs, G. and Kenward, M.G. (2007) Missing
Data in Clinical Studies. Chichester: John Wiley & Sons.
• Molenberghs, G. and Verbeke, G. (2005) Models for
Repeated Discrete Data. New York: Springer.
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 11
e Data
Dates
October 22, 2014: 9.00 hr - 12.00 hr
October 23, 2014: 9.00 hr - 12.30 hr; 13.30 hr - 17.00 hr
October 24, 2014: 9.00 hr - 12.30 hr; 13.30 hr - 17.00 hr
November 17, 2014: 9.00 hr - 12.30 hr; 13.30 hr - 17.00 hr
November 18, 2014: 9.00 hr - 12.30 hr; 13.30 hr - 17.00 hr
November 19, 2014: 9.00 hr - 12.00 hr
Language
English
Price
• Staff and students KU Leuven and Association
KU Leuven: go to: https://icts.kuleuven.be/cursus/
• PhD students, non KU Leuven € 400
• Non profit/social sector € 625
• Private sector € 1500
11
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 12
Advanced R programming topics
Course outline
R is the lingua franca of statistical research and data
analysis. But in order to get you up and running with R,
and to get over the steep learning curve, you need to
know how to use it efficiently.
This course is a hands-on course covering the basic toolkit
you need to have in order to use R efficiently for data
analysis tasks.
It is an intermediate course aimed at users who have the
knowledge from the course ‘Essential tools for R’ and who
want to go further to improve and speed up their data
analysis tasks.
The following topics will be covered in detail
• The apply family of functions and basic parallel
programming for these, vectorisation, regular expressions,
string manipulation functions and commonly used
functions from the base package. Useful other packages
for data manipulation.
• Making a basic reproducible report using Sweave and
knitr including tables, graphs and literate programming
• If you want to build your own R package to distribute
your work, you need to understand S3 and S4 methods,
you need the basics of how generics work as well as R
environments, what are namespaces and how are they
useful. This will be covered to help you start up and
build an R package.
• Basic tips on how to organise and develop R code and
test it.
R users interested in getting the fundamentals you need
to know before you can create your own R package.
Business users who want to learn how to get the maximum
out of R by speeding up their code, learn vectorisation,
execute the basics of parallel programming and want to
learn how to build methods and code which is reproducible
in production environments.
Prerequisites
Initial experience in R ranging from a few weeks to several
years.
Course materials
A .pdf file with the course material will be made available.
Presenter
Jan Wijffels
Date
3 and 4 November 2014, 9.00 hr - 12.00 hr
and 13.00 hr - 16.00 hr
Language
English
Target audience
People who have had their initial use of R and want to go
one step further.
This covers people using R for a few months already to
several years. And more specifically users who want to
extend their data manipulation techniques to speed up
their day-to-day data analysis tasks.
Researchers from the university interested in making
reproducible research reports or users who want to use R
as a report generating tool.
12
Price
• Staff and students KU Leuven and Association
KU Leuven: go to: https://icts.kuleuven.be/cursus/
• PhD students, non KU Leuven € 160
• Non profit/social sector € 250
• Private sector € 600
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 13
Regression and Analysis of variance
Course outline
Prerequisites
Linear statistical models are widely used today in many
applications. Successfully applying these techniques require
a good understanding of the underlying theory and the
practical problems that you may encounter in real-life
situations.
Participants are familiar with basic statistical modeling
concepts (see topics described in Fundamental Statistical
Methods).
Course materials
DAY 1: REGRESSION ANALYSIS
• Correlation
• Simple linear regresssion
Ordinary least squares: estimating parameters, confidence
intervals and tests, diagnostics, prediction.
• Multiple regression
Ordinary least squares: estimating parameters, confidence
intervals and tests, diagnostics, prediction.
Variable selection techniques.
DAY 2: ANALYSIS OF VARIANCE
• One-way Anova
Comparing means
The Anova model: estimating parameters, hypothesis
tests, Anova tabel, F test
Multiple comparisons, Contrasts
• Two-way Anova
The two-way Anova Model
Main effects, interaction effects
Multiple comparisons
Target audience
This course is important for persons involved with
modeling data.
A .pdf file with the course material will be made available.
Presenter
Anna Ivanova, Marlies Lacante
Date
November 3, 4 2014: 9.00 hr - 12.00 hr; 13.00 hr 16.00 hr
Language
English
Price
• Staff and students KU Leuven and Association
KU Leuven: go to: https://icts.kuleuven.be/cursus/
• PhD students, non KU Leuven € 160
• Non profit/social sector € 250
• Private sector € 600
13
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 14
Regressie- en variantieanalyse, toeg e
CURSUS 1: REGRESSIE- EN VARIANTIEANALYSE, TOEGEPAST MET SPSS
Beschrijving
Lesgever
De technieken die aangeleerd werden bij Regressie- en
variantieanalyse, worden hier toegepast met SPSS. Aan
de hand van cases wordt geïllustreerd hoe men met SPSS
tot het modelleren van gegevens komt. Hierbij wordt de
nodige aandacht besteed aan het interpreteren van de
verkregen output. Er is voldoende tijd om zelf te werken
met deze software.
An Carbonez
Cursusmateriaal
Cursusmateriaal wordt als .pdf file ter beschikking gesteld.
Datum
Doelgroep
14 november 2014 van 9 u. - 12 u. en 13 u. - 16 u.
Iedereen die gegevens wenst te modelleren via SPSS.
Taal
Voorkennis
Nederlands
We veronderstellen een basiskennis van SPSS. Cursisten
dienen eveneens vertrouwd te zijn met de methodiek
aangebracht in Regressie- en variantieanalyse.
CURSUS 2: REGRESSIE- EN VARIANTIEANALYSE, TOEGEPAST MET
SAS EGUIDE
Beschrijving
Lesgever
De technieken die aangeleerd werden bij Regressie- en
variantieanalyse, worden hier toegepast met SAS Eguide.
Aan de hand van cases wordt geïllustreerd hoe men met
de SAS Eguide tot het modelleren van gegevens komt.
Hierbij wordt de nodige aandacht besteed aan het interpreteren van de verkregen output. Er is voldoende tijd om
zelf te werken met deze software.
Martine Beullens
Cursusmateriaal
Cursusmateriaal wordt als .pdf file ter beschikking gesteld.
Datum
Doelgroep
14 november 2014, 9 u. - 12 u. en 13 u. - 16 u.
Iedereen die gegevens wenst te modelleren via SAS Eguide.
Taal
Voorkennis
We veronderstellen een basiskennis van SAS Eguide.
Cursisten dienen eveneens vertrouwd te zijn met de
methodiek aangebracht in Regressie en variantie analyse.
14
Nederlands
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 15
g epast met SPSS, SAS Eguide en R
COURSE 3: REGRESSION AND ANALYSIS OF VARIANCE: APPLICATIONS WITH R
Course outline
Presenter
The linear models, provided by the course ‘Regression and
Analysis of Variance’, are applied on examples. In this course,
the R package is used. By means of cases, we illustrate
how to model your data in R and how to interpret the
corresponding output. There is a hands-on session to train
you with the functionality of R.
Anna Ivanova
Target audience
Everybody who wants to model data with R.
Course Materials
A .pdf file with the course material will be made available.
Date
13 November 2014, 9.00 hr - 12.00 hr
and 13.00 hr - 16.00 hr.
and 18 November from 9 hr - 12 hr.
Prerequisites
Everybody should be familiar with the techniques covered
in ‘Regression and Analysis of Variance’ and have a basic
knowledge of working with R.
Language
PRICE FOR COURSE 1 AND 2
PRICE FOR COURSE 3
• Staff and students KU Leuven and Association
KU Leuven: go to: https://icts.kuleuven.be/cursus/
• PhD students, non KU Leuven € 80
• Non profit/social sector € 125
• Private sector € 300
• Staff and students KU Leuven and Association
KU Leuven: go to: https://icts.kuleuven.be/cursus/
• PhD students, non KU Leuven € 120
• Non profit/social sector € 187,50
• Private sector € 450
English
15
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 16
Uitbreiding bij Regressieen variantieanalyse
Beschrijving
Voorkennis
De resultaten van een lineaire regressieanalyse zijn sterk
beïnvloedbaar door speciale datapunten. Het detecteren
van uitschieters en invloedrijke waarnemingen wordt in
deze cursus bestudeerd. Daarnaast wordt geïllustreerd
hoe men via robuuste regressie dit probleem kan
opvangen. De praktijk leert ook dat de resultaten van een
lineaire regressie ook sterk beïnvloed worden door
associaties tussen verklarende variabelen. Dit probleem
van multicollineariteit wordt besproken en geïllustreerd.
Verder is er een uitbreiding van variantieanalyse naar
specifieke deelhypothesen en covariantieanalyse. Er wordt
telkens geïllustreerd hoe de analyses met SAS Eguide en
SPSS kunnen uitgevoerd worden.
Cursisten dienen vertrouwd te zijn met de methodiek
aangebracht in ‘Regressie- en variantieanalyse’.
Lesgever
Marlies Lacante, An Carbonez
Cursusmateriaal
Cursusmateriaal wordt als .pdf file ter beschikking gesteld.
Datum
Inhoud van de cursus:
Dag 1: Uitbreiding van regressie
• Speciale datapunten: detectie van uitschieters en invloedrijke waarnemingen
• Inleiding tot robuuste regressie
• Multicollineariteit
24 november 2014 van 9 u. -12 u. en 13 u. -16 u. en
25 november 2014 van 9 u. -12 u.
Taal
Nederlands
Dag 2: (halve dag)
• Covariantie analyse
Prijs
Doelgroep
Deze cursus is bedoeld voor personen die regelmatig
lineaire regressieanalyse wensen te gebruiken.
16
• Personeel en studenten KU Leuven en Associatie
KU Leuven: zie https://icts.kuleuven.be/cursus/
• PhD studenten, niet KU Leuven € 120
• Non profit/sociale sector € 187,50
• Private sector € 450
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 17
Statistical Machine Learning with R
Course outline
Prerequisites
This course is a hands-on course covering the use
of statistical machine learning methods available in R.
The following basic learning methods will be covered and
used on common datasets.
• classification trees (rpart)
• feed-forward neural networks and multinomial regression
• random forests
• boosting for classification and regression
• bagging for classification and regression
• penalized regression modelling (lasso/ridge regularized
generalized linear models)
• model based recursive partitioning (trees with statistical
models at the nodes)
• training and evaluation will be done through the use of
the caret and ROCR packages
Initial experience in R ranging from a few weeks to several
years (at least understanding of the course 'Essential Tools
for R' is needed). Some practical experience in regression
modelling.
Course materials
A .pdf file with the course material will be made available.
Presenter
Jan Wijffels
Date
The course will cover the techniques from a high-level
viewpoint, useful for day-to-day R users.
Target audience
The course is for R users in industry/academics who are
interested in building predictive models in R which have
some experience with regressions but have less knowledge
of machine learning and techniques of artificial intelligence.
Also persons interested in the statistical learning techniques
itself will find this course usefull.
Or people with a data science background with less
knowledge of R and which are interested in machine
learning in general.
27 and 28 November 2014, 9.00 hr - 12.00 hr
and 13.00 hr - 16.00 hr
Language
English
Price
• Staff and students KU Leuven and Association
KU Leuven: go to: https://icts.kuleuven.be/cursus/
• PhD students, non KU Leuven € 160
• Non profit/social sector € 250
• Private sector € 600
17
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 18
Niet-parametrische statistiek
Beschrijving
Voorkennis
Deze cursus behandelt een aantal statistische technieken
- analoog aan parametrische statistiek (bv. t-test, variantieanalyse) - waarbij de klassieke onderstellingen uit de
parametrische statistiek niet hoeven gemaakt te worden
(distributievrije technieken), technieken gebaseerd op
‘ordeningen’ of ‘rankings’, alsook technieken specifiek
geschikt voor nominale gegevens.
Cursisten dienen vertrouwd te zijn met de methodiek
aangebracht in ‘Fundamentele Statistische technieken’ en
variantie analyse.
Inhoud van de cursus:
• Chi- kwadraat goodness of fit testen
• Testen mbt verschil tussen twee onafhankelijke
steekproeven
• Testen mbt verschil tussen twee afhankelijke
steekproeven
• Testen mbt verschil tussen meerdere onafhankelijke
steekproeven
• Testen mbt verschil tussen meerdere afhankelijke
steekproeven
• Kengetallen mbt de samenhang tussen variabelen
Lesgevers
Marlies Lacante
Cursusmateriaal
Cursusmateriaal wordt als .pdf file ter beschikking gesteld.
Datum
10 december 2014 van 9 u. tot 12 u.
Taal
Nederlands
Doelgroep
Gebruikers van basis statistische technieken (t-test - variantieanalyse)
Prijs
• Personeel en studenten KU Leuven en Associatie
KU Leuven:zie https://icts.kuleuven.be/cursus/
• PhD studenten, niet KU Leuven € 40
• Non profit/sociale sector € 62,50
• Private sector € 150
18
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 19
Experimental Design
Course outline
Prerequisites
This course discusses the design of factorial experiments.
Initially, the focus is on completely randomized experimental
designs. Next, the focus shifts to experimental designs
involving a restricted randomization. First, the concept of
blocking is discussed. Next, split-plot and strip-plot designs
are studied.
Prerequisites for the course are knowledge of basic statistics,
regression analysis (least squares, multicollinearity), and matrix
algebra (matrix products, inverse matrices, determinants).
The emphasis in the course is on the optimal design of
experiments. In optimal design of experiments, the experimental design is tailored to the problem at hand (unlike
classical experimental design, where standard designs
from catalogs are chosen).
The course builds on concepts from regression and
analysis of variance, such as fixed and random effects,
power calculations, variance inflation factors, multicollinearity,
confidence intervals, prediction and lack-of-fit tests.
Every topic in the course is introduced and illustrated by
means of a case study from industry. The case studies are
realistic in the sense that they involve quantitative and
qualitative experimental factors, experimenters have to
deal with limited budgets and difficulties to randomize,
and forbidden combinations of factor levels. In each of
the case studies, the goal is to enhance to performance
of a process or a product.
The statistical software package used is JMP.
Presenter
Peter Goos
Course materials
Textbook: Optimal Design of Experiments: A Case Study
Approach (Peter Goos & Bradley Jones)
Dates
February 10, 2015: 9.00 hr - 11.30 hr
February 17, 2015: 9.00 hr - 11.30 hr
February 24, 2015: 9.00 hr - 11.30 hr
March 3, 2015: 9.00 hr - 11.30 hr
March 10, 2015: 9.00 hr - 11.30 hr
March 17, 2015: 9.00 hr - 11.30 hr
March 31, 2015: 9.00 hr - 11.30 hr
April 28, 2015: 9.00 hr - 11.30 hr
Language
English
Target audience
The target audience for the course is master students or
Ph.D. students in statistics, engineers and engineering
students planning to perform experiments. The course is
ideal for Six Sigma black belts, quality managers and
people working in environments where collecting data is
expensive.
Price
• Staff and students KU Leuven and Association
KU Leuven: go to https://icts.kuleuven.be/cursus/
• PhD students, non KU Leuven € 320
• Non profit/social sector € 500
• Private sector € 1200
19
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 20
Chemometrics
Course outline
Prerequisites
The goal of the course is to teach students how to perform
multivariate sensor calibration. Students will become
familiar with the use of statistical concepts in chemometric
applications. Most attention will be given to the ideas
underlying the different methods and the application of
these methods to realistic examples. Theoretical considerations and equations will be limited to what is needed
to have sufficient insight to properly use the methods.
Most examples will be related to spectroscopy and
analytical chemistry, but the scope is broader. By using a
combination of lectures, computer sessions and take
home assignments the students will really learn how to
apply the chemometric methods. The following aspects of
chemometrics will be handled in this course:
• Classical modelling concepts for quantitative calibration:
Classical Least Squares (CLS), Inverse Least Squares
(ILS), Multivariate Linear Regression (MLR), Principle
Component Regression (PCR) and Partial Least Squares
(PLS).
• Necessary steps for the creation and successful deployment of calibrations; selection of calibration standards
and assessment of the reliability of the models: (Test set
validation vs. Cross-validation, model statistics). Special
attention will be given to the methods for the selection
of the number of principle components or latent variables
in the projection methods.
• Methods for data pre-processing with special attention
for the phenomena of light scattering and instrument
drift and the methods to deal with these phenomena:
derivatives, standard normal variate (SNV), multiplicative
signal correction (MSC) and extended multiplicative signal
correction (EMSC).
• Variable selection in a chemometric context and some
commonly used methods for this.
• Qualitative analysis in a chemometric context: discrimination and classification
• New trends in chemometrics such as functional data
analysis and augmented classical least squares (ACLS).
Knowledge of basic concepts of statistics and linear
algebra is required. Some notions of analytical chemistry,
sensor technology and multivariate statistics are a plus.
Target audience
The intended target audience includes PhD students and
researchers in a variety of fields, including statistics,
chemistry, biosciences and engineering. We aim at readers
who wish to learn more about multivariate calibration of
sensor systems and the use of statistical concepts in
chemometric applications.
20
Presenter
Wouter Saeys
Course Materials
Slides from the lectures
Papers discussed in the lectures
Software manual
Datasets for the take home assignments
Additional material (suggested)
• A user-friendly guide to Multivariate Calibration and
Classification by Naes, Isaksson, Fearn and Davies,
NIR Publications 2004
• Multivariate Calibration by Martens and Naes, 1989
Dates
February 11, 2015: 9.00 hr -12.00 hr
February 25, 2015: 9.00 hr - 12.00 hr
March 4, 2015: 9.00 hr - 12.00 hr
March 11, 2015: 9.00 hr - 12.00 hr
March 18, 2015: 9.00 hr - 12.00 hr
Language
English
Price
• Staff and students KU Leuven and Association
KU Leuven: go to: https://icts.kuleuven.be/cursus/
• PhD students, non KU Leuven € 200
• Non profit/social sector € 312,50
• Private sector € 750
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 21
Cluster analysis, principal component
analysis and exploratory factor
analysis with SAS, SPSS and R
Course outline
Target audience
Multivariate data consist of observations on two or more
variables for each individual or unit.
Data analysts and scientists involved in analysing multivariate data.
The variables will be generally correlated, and a variety of
techniques are available to analyse these data.
Prerequisites
The objective of cluster analysis is to form groups of
observations such that each group is as homogeneous as
possible with respect to certain characteristics. The groups
are as different as possible.
A practical knowledge of basic statistics will be assumed,
such as standard deviations and correlations.
Presenters
Principal component analysis is one of the popular tools
to summarize quantitative multivariate data.
Anne-Marie De Meyer, An Carbonez and Martine Beullens
During this course, PCA and exploratory factor analysis,
will be introduced and the relation between them examined.
Course Materials
The emphasis of the course will be on the interpretation
of the example data and on the results through the Biplot.
Mathematical details are kept to a minimum.
A.pdf file with course material will be made available
Dates
10, 11 and 12 March 2015 from 9.00 hr - 12.00 hr
For the exercises, participants can choose to use the
statistics package SAS (through Enterprise Guide), SPSS
or R.
Course content:
• Hierarchical cluster analysis
• Nonhierarchical cluster analysis
• Linear combination of variables
• Eigenvalues and eigenvectors
• PCA scores and Factor scores
• What is a loading or the factor pattern?
• Screenplot
• How many components or factors to retain?
• Communalities
• The biplot
• The Varimax rotation
Language
English
Price
• Staff and students KU Leuven: go to
https://icts.kuleuven.be/cursus/
• Staff and students Association KU Leuven
and PhD students, non-KU Leuven € 120
• Non profit/social sector € 187,50
• Private sector € 450
21
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 22
Fundamental Statistical Methods
Course outline
Presenters
This basic course in statistics emphasizes on selecting the
appropriate statistical method and drawing the right
conclusions from the obtained results. Mathematical details
will be kept to a minimum. The emphasis will be on understanding the concepts and on practical applications.
The adequate use of basis statistical summaries (measures
of central tendency, measures of dispersion, box-plots, ...)
will be illustrated. The foundations of confidence intervals
and of testing hypotheses will be dealt with.
Marlies Lacante
Course Materials
A .pdf file with the course material will be made available.
Dates
17, 18, 19 March 2015 from 9.00 hr - 12.00 hr
Course content:
• Descriptive statistics: graphical and numerical summaries
of the data
• Distributions: Binomial, Poisson, Exponential, Normal
and t-distribution
• Distribution of the sample mean
• Confidence intervals
• Hypothesis tests for a population mean (one and two
samples)
• Paired t-test
• Estimating and testing for proportions
Target audience
Anyone who wishes to understand basic statistical
techniques more thoroughly.
Prerequisites
There are no prerequisites
22
Language
English
Price
• Staff and students KU Leuven and Association
KU Leuven: go to: https://icts.kuleuven.be/cursus/
• PhD students, non KU Leuven € 120
• Non profit/social sector € 187,50
• Private sector € 450
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 23
Introduction to the analysis
of contingency tables
Course outline
Course Materials
In this course, chi-square tests and association measures
are used to identify if there is significant association in
contingency tables and to determine how strong this
association is. By use of examples, it is illustrated that
exact tests are necessary in certain situations. There is
enough time to practice with SAS Eguide, SPSS and R.
A .pdf file with the course material will be made available.
Course content:
• Construction of a contingency table
• Tests for independence: chi square tests
• Association measures
• Analysis of a 2x2 table: relative risk, odds ratio
• Exact test
Dates
19 March 2015 13.00 - 16.00 hr
20 March 2015 9.00 - 12.00 hr and 13.00 - 16.00 hr.
Language
English
Price
Target audience
Everyone who wants to analyse contingency tables.
• Staff and students KU Leuven and Association
KU Leuven: go to: https://icts.kuleuven.be/cursus/
• PhD students, non KU Leuven € 120
• Non profit/social sector € 187,50
• Private sector € 450
Prerequisites
Participants are familiar with basic statistical concepts
(which are e.g. introduced in the course ‘Fundamental
Statistical Methods’).
Presenter
An Carbonez
23
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 24
Concepts of Multilevel,
Longitudinal and Mixed models
Course outline
Course Materials
Starting from ANOVA models with random factor levels,
the concepts of mixed models are introduced and the basics
about inference in random-effects models will be explained.
Afterwards, the mixed ANOVA model is extended to general
linear mixed models for continuous data. Finally, extensions
to models for binary or count data will be briefly discussed.
Omitting all theoretical details, sufficient background will
be given such that practising statisticians can apply mixed
models in a variety of contexts, know how to use up-todate software, and are able to correctly interpret generated
outputs. Many applications, taken from various disciplines,
will be discussed.
A .pdf file with the copies of the transparencies used in
the course will be made available.
Target audience
The targeted audience includes methodological and
applied statisticians and researchers in industry, public
health organizations, contract research organizations and
academia. Important: this course will also serve the MSc
in Statistics students.
Prerequisites
The student knows the basics of statistical inference, and
statistical modeling (regression, Anova and general(ized)
linear models).
Presenter
Geert Verbeke
24
Dates
March 10, 2015 13.00 hr - 16.00 hr
March 31, 2015 13.00 hr - 16.00 hr
April 28, 2015 13.00 hr - 16.00 hr
May 5, 2015 13.00 hr - 16.00 hr
May 12, 2015 13.00 hr - 16.00 hr
Language
English
Price
• Staff and students KU Leuven and Association
KU Leuven: go to: https://icts.kuleuven.be/cursus/
• PhD students, non KU Leuven € 200
• Non profit/social sector € 312,50
• Private sector € 750
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 25
Introduction to correspondence
analysis and multiple correspondence
analysis with SAS, SPSS and R
Course outline
Prerequisites
Correspondence analysis (CA) , is an exploratory technique
to simultaneously score the categories and the column
categories in a bivariate contingency table in a lower
dimensional space The objective is also to clarify the
relationship between the row and the column variable.
It is well suited for large contingency tables. It can also be
used for continuous variables as Age, which can be grouped
in different age categories.
A working knowledge of basic statistics (e.g. Pearson
Chi-square Statistic) in contingency tables will be assumed.
In three-way and high dimensional contingency tables, an
introduction to multiple correspondence analysis (MCA) is
presented, also called Principal components for categorical
data. MCA is a method to visualize the joint properties of
more than 2 categorical variables The individual observations
and the categories of the variables can be displayed in the
same plot.
In this course, the focus will be on the data examples,
the interpretation of the results and the Biplot. SAS and/or
SPSS are used for the examples and exercises.
Course content:
• Introduction and short historical overview
• CA
- Revisit Pearson Chi-square statistic
- Inertia and Eigenvalues
- CA row and column coordinates
- CA plots and association between row and
columns
- Quality of the visual presentation
- Illustration in SAS and in SPSS
• MCA
- Super Indicator matrix
- The Burt matrix
- Joint presentation of categories
- Illustration in SAS and in SPSS
Presenter
Anne-Marie De Meyer
Course Materials
A.pdf file with the course material will be made available
Dates
23 and 24 March 2015 from 9.00 hr - 12.00 hr
Language
English
Price
• Staff and students KU Leuven: go to
https://icts.kuleuven.be/cursus/
• Staff and students Association KU Leuven
and PhD students, non-KU Leuven € 80
• Non profit/social sector € 125
• Private sector € 300
Target audience
Data analysts and scientists involved in analysing multivariate categorical data.
25
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 26
Web lecture on Sampling Theory
Course outline
Course Materials
Different methods for selecting a (survey) sample from an
existing population will be considered. Problems arising in
the sampling designs will be discussed. The focus will be
on the concepts rather than on the formulas. Nevertheless,
attention will be paid at the estimation of the population
parameters of interests.
A.pdf file with the course material will be made available
Target audience
Face to face question and answer session 1:
May 5 2015: 18.00-20.00 hr
Everyone with an interest in sampling theory, from an applied
and/or methodological point of view.
Prerequisites
Participants should have an intimate knowledge of basic
concepts of descriptive and inductive statistics.
26
Dates
Face to face lecture: April 23 2015: 16.30 hr - 20.00 hr
The other lectures will be accessible via the web.
Face to face question and answer session 2:
May 12 2015: 18.00-20.00 hr
Language
English
Presenter
Price
Geert Molenberghs
• Staff and students KU Leuven: go to
https://icts.kuleuven.be/cursus/
• Staff and students Association KU Leuven
and PhD students, non-KU Leuven € 200
• Non profit/social sector € 312.5
• Private sector € 750
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 27
Logistic Regression Models
with SAS and SPSS
Description
Presenter
The focus is on the statistical model with a categorical
outcome or response variable.
Anne-Marie De Meyer
A categorical response variable can be a binary variable,
an ordinal variable or a nominal variable and each type
requires a different model to describe its relationship with
the predictor variables.
Course Materials
We will define, interpret and illustrate the models for each
type of outcome and place the models in the framework
of the Generalized Linear Model.
SAS (through SAS Enterprise Guide) or SPSS are used in
the applications.
Outline
• Introduction
• Binary Logistic Regression
• Multinomial Logistic Regression for nominal outcome
variables
• Proportional Odds Model - Ordinal Logistic Regression
• Logistic regression in the framework of the Generalized
Linear Model
Target audience
Data analysts in all disciplines
A.pdf file with the course material will be made available.
Dates
Logistic regression models with SAS: 28 and 29 April
2015 from 9.00 hr - 12.00 hr
Logistic regression models with SPSS: 28 and 30 April
2015 from 9.00 hr - 12.00 hr
Language
English
Price
• Staff and students KU Leuven: go to
https://icts.kuleuven.be/cursus/
• Staff and students Association KU Leuven
and PhD students, non-KU Leuven € 80
• Non profit/social sector € 125
• Private sector € 300
Prerequisites
‘Fundamental Statistical Methods’ and ‘Introduction to the
analysis of contingency tables’.
Knowledge of the standard regression model
27
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 28
Nonparametric smoothing
techniques and applications
Course outline
Course Materials
Nonparametric smoothing techniques are an important
class of tools for identifying the true signal hidden in noisy
data. These tools are widely used in statistical analysis in
a variety of application areas. This course will provide the
students with a thorough overview of the most important
smoothing techniques (such as kernel smoothing, local
polynomial fitting, spline smoothing, wavelet decomposition,
regularization techniques, …). The course will address
theoretical and computational aspects. We will discuss
how to use these techniques in different settings (e.g. in a
univariate or multivariate regression setting, in case of
incomplete data, …). The course includes illustrations with
data examples and the use of the R software.
A .pdf file with the course material will be made available.
Target audience
PhD students or practitioners/researchers with a good
background knowledge of statistics and statistical inference.
Prerequisites
Participants should have a good background in statistics,
in particular in statistical inference.
Presenter
to be announced
28
Background reading:
• Fan, J. and Gijbels, I. (1996). Local Polynomial Modeling
and Its Applications. Chapman and Hall, New York.
• Hastie, T., Tibshirani, R. and Friedman, J. (2001).
The Elements of Statistical Learning. Springer, New York.
• Simonoff, J.S. (1996). Smoothing Methods in Statistics.
Springer, New York.
Dates
5 days in Spring 2015
Language
English
Price
• Staff and students KU Leuven and Association
KU Leuven: go to: https://icts.kuleuven.be/cursus/
• PhD students, non KU Leuven € 400
• Non profit/social sector € 625
• Private sector € 1500
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 29
Poisson regression with SAS
or SPSS
Description
Presenter
A Poisson regression model fits a count or the number of
occurrences of an event or the rate of occurrence of an
event as a function of some predictor variables. For example:
the number of occurrences or the rate of a certain disease.
The Poisson model is a special case of the Generalized
Linear model. SAS (through SAS Enterprise Guide) or SPSS
are used in the applications.
Anne-Marie De Meyer
Outline
• Introduction to Poisson regression
• The Poisson model in the framework of the Generalized
Linear model
• Correction for overdispersion
• The negative binomial model
• Poisson regression models for rates
Course Materials
A.pdf file with the course material will be made available.
Dates
Poisson regression with SAS: 6 and 7 May 2015 from
9.00 hr - 12.00 hr
Poisson regression with SPSS: 6 and 8 May 2015 from
9.00 hr - 12.00 hr
Language
Target audience
English
Data analysts in all disciplines
Price
Prerequisites
‘Fundamental Statistical Methods’ and ‘Introduction to the
analysis of contingency tables’.
Knowledge of the standard regression model
• Staff and students KU Leuven: go to
https://icts.kuleuven.be/cursus/
• Staff and students Association KU Leuven
and PhD students, non-KU Leuven € 80
• Non profit/social sector € 125
• Private sector € 300
29
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 30
Inleiding tot enquêtering
Beschrijving
Doelgroep
Via survey onderzoek wil men informatie verzamelen
omtrent mensen, ideeën, opinies, houdingen, plannen,
gezondheid, sociale - educatieve - of familiale achtergrond. Zulk soort onderzoek gebeurt vaak bij sociologische
vraagstellingen, in psychologie, bij marktonderzoek, enz…
Informatie over dergelijke onderwerpen kan men moeilijk
op ‘experimentele wijze’ verzamelen. Daarom moet men
de personen in kwestie bevragen. Dit kan via een interview,
een vragenlijst, een telefonische enquête, enz. ... Dit soort
bevragingen kent een eigen methodologie en eigen onderzoeksregels die moeten gerespecteerd worden. In deze
cursus wordt achtereenvolgens ingegaan op de verschillende
stappen in dit onderzoeksproces.
Gebruikers van vragenlijstonderzoek
Tevens zal een half dagdeel besteed worden aan de enquêteservice aan de KU Leuven, die gebaseerd is op de opensource software “Limesurvey”. Deze software laat gebruikers
toe om snel zeer krachtige online enquêtes te ontwikkelen.
Cursusmateriaal
Inhoud van de cursus:
• analyse van de onderzoeksvraag: wat wil men te
weten komen?
• verzamelen van de gevraagde informatie
• welke regels moet men in acht nemen bij het formuleren
van de vragen? (invloed van de vraagstelling op het
antwoord, betrouwbaarheid en validiteit)
• methoden van steekproeftrekkingen
• verwerken van de gegevens
• rapportering
• hoe werkt de enquêteservice van de KU Leuven
• algemene instellingen voor de enquête
• beschikbare vraagtypes en hun mogelijkheden
• werken met tokens
• uitnodigen van de respondenten en opvolgen van de
responses
• exporteren van de resultaten naar statistische pakketten
30
Voorkennis
Cursisten dienen vertrouwd te zijn met de methodiek aangebracht in ‘Fundamentele Statistische technieken’ en de
cursus ‘Regressie- en variantie analyse’.
Lesgevers
Marlies Lacante en Kristel Hoydonckx
Cursusmateriaal wordt als .pdf file ter beschikking gesteld.
Datum
6 en 7 mei 2015 telkens van 9 u. - 12 u.,
8 mei 2015 van 9 u. -12 u. en 13 u. - 16 u.
Taal
Nederlands
Prijs
• Personeel en studenten KU Leuven en Associatie
KU Leuven: zie https://icts.kuleuven.be/cursus/
• PhD studenten, niet KU Leuven € 160
• Non profit/sociale sector € 250
• Private sector € 600
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 31
Statistical consulting Service
Consulting was our historical embryo and remains a core business.
The statistical consulting service center acts as the main pivot to determine the ideal combination between the customer
and the most appropriate university entity.
We recommend that you contact us in an early stage of your project and write a short description of your problem and
send it to [email protected].
The LStat Statistical consulting Service covers
• Statistical support for researchers within the university. The LStat provides statistical help for university research
groups and for the central administration of the university. We help you with advice and we offer support with the
design of your study and with the statistical analysis of your data whether elementary or sophisticated. The first
hour of first-line consulting is provided free of charge.
• Statistical service and execution of projects for government and industry, in service or in partnership.
The LStat uses the administrative help of Leuven Research and Development (LRD) in the negotiation of the contracts
with industries and the private sector.
We have experience with major financial companies, international institutions, manufacturers, medical organizations,
marketing companies, FMCG as well as small and medium-sized enterprises.
Our solutions range from basic regression, multivariate techniques, analysis of variance, mixed models, data mining,
process control, to risk theory, categorical data analysis, longitudinal data analysis and tailor-made simulations and
calculations.
If you have any questions, do not hesitate to contact us at: [email protected] and take a look at our website:
http://lstat.kuleuven.be/consulting/
31
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 32
Practical Matters
• REGISTRATION COSTS
The indicated prices correspond to training for 1 person.
There are several fee categories:
- Students and staff from KU Leuven and Association KU Leuven
- PhD students from other universities € 80 / full day
- non-profit sector, social sector € 125 / full day
- private sector € 300 / full day
Prices include all course material. If the course takes a whole day and the course takes place at the Arenberg
campus in Heverlee a sandwich lunch is included as well.
Payments have to be settled before the start of the course.
• CONFIRMATION
You will receive a confirmation upon receipt of your application form. This confirmation gives information
on how to make the payment and on the course venue. Please contact us in case you do not
receive a confirmation letter.
• DISCOUNT
When you subscribe for several courses, you can get a discount of 10% if the total number of full training
days equals or exceeds 5 days per person and a discount of 20% is attributed if you follow courses for
at least 10 full days.
• CANCELLATION
- If you are unable to attend a course for which you have registered, you can let a colleague replace you.
- Full cancellation for a specific course always has to be done in writing. Administrative costs for cancellation
are set at € 20 when the cancellation is carried out more than 2 weeks before the course takes place.
After that, the full course fee will be charged.
INFORMATION
For other questions on registration and extra information contact:
tel. + 32 16 32 22 14
[email protected]
www.lstat.kuleuven.be
32
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 33
F a c u lty o f S c ie nce
Registration form Short courses in Statistics 2014-2015
Post or e mail this form to: LStat, Celestijnenlaan 200B, 3001 HEVERLEE, Belgium or [email protected]
or use the registration form at www.lstat.kuleuven.be
Staff and students of (Association) KU Leuven should register online: https://icts.kuleuven.be/cursus/
Applicants details:
Mr. / Mrs. / Ms.
Family name ____________________________________________ First name ______________________________
Company/Institute __________________________________________________
Street ________________________________________________________________________________ Number ______________________
P.O. Box ____________ Postcode ______________ City ____________________________________ Country ______________________
E mail address ____________________________________________________
Fee category:
n PhD students, non- KU Leuven
n Non profit/social sector
n Private sector
Price per full day: € 80
Price per full day: € 125
Price per full day: € 300
Indicate the courses that you wish to attend:
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
n
Flames: Essential Tools for R
Fundamentele statistische methoden
Fundamentele statistische methoden, toegepast met SAS Eguide
Fundamentele statistische methoden, toegepast met SPSS
Fundamental Statistical Methods, applications with R
Optimization and Numerical Methods in Statistics
Models for Longitudinal and Incomplete data
Advanced programming in R
Regression and Analysis of Variance
Regressie- en variantieanalyse toegepast met SPSS
Regressie- en variantieanalyse toegepast met SAS Eguide
Regression and Analysis of Variance, applications with R
Uitbreiding bij Regressie- en variantieanalyse
Statistical Machine Learning with R
Niet-parametrische statistiek
Experimental Design
n Chemometrics
n Fundamental statistical methods
n Cluster analysis, principal component analysis and exploratory factor
analysis with SAS, SPSS and R
n Introduction to the analysis of contingency tables.
n Concepts of multilevel, longitudinal and mixed models
n Introduction to correspondence analysis and multiple correspondence
analysis with SAS, SPSS and R
n Weblecture on Sampling Theory
n Logistic Regression Models with SAS
n Logistic Regression Models with SPSS
n Nonparametic Smoothing Techniques and Applications
n Poisson regression with SAS
n Poisson regression with SPSS
n Inleiding tot enquêtering
n Flames: Essential Tools for R
29, 30 September, 1, 2, October 2014
6, 7, 8 October 2014
14 October 2014
14 October 2014
14 October 2014
13,14 and 30,31 October 2014
22, 23, 24 October 2014, 17, 18, 19 November 2014
3, 4 November 2014
3, 4 November 2014
14 November 2014
14 November 2014
13, 18 November 2014
24, 25 November 2014
27, 28 November 2014
10 December 2014
10, 17, 24 February and 3, 10, 17, 31 March
and 28 April
11, 25 February, 4, 11, 18 March 2015
17, 18, 19 March 2015
10, 11, 12 March 2015
19, 20 March 2015
10, 31 March, 28 April, and 5, 12 May 2015
23, 24 March 2015
23 April 2015
28, 29 April 2015
28, 30 April 2015
data to be announced in 2015
6, 7 May 2015
6, 8 May 2015
6, 7, 8 May 2015
8, 9, 11, 12 June 2015
DOC_BRO_LSTAT_2014-2015_DOC_BRO_LSTAT_2014-2015 28/08/14 08:45 Pagina 34
v.u.: Prof. M. Lacante, Leuven Statistics Research Centre, Celestijnenlaan 200 B, 3001 HEVERLEE, België
LEUVEN STATISTICS
RESEARCH CENTRE
Celestijnenlaan 200 B
3001 HEVERLEE, België
tel. + 32 16 32 22 14
[email protected]
www.lstat.kuleuven.be