2.
Session: Data Visualization
World Development Indicators at a glance
- Advanced grouping techniques - Basic visualization tools for exploring one single variable - Basic visualization tools for exploring relationships between variables
3.
Session: Data correlations and hypothesis testing
Lending Club performance
- Correlation and AUC - Statistical estimation and error - Introduction to confidence intervals - Introduction to hypothesis testing
|
Methodology
The course format and methodological approach are based on a combination of explanations and practical parts. During the sessions participants will be provided with the material needed to follow this course. The material includes both the theoretical content of the different subjects to be discussed and the data needed to practice the concepts learned. Participants will be with provided with real data sets for practices and will work in groups to solve different challenges by applying quantitative methods.
The course is divided in three sessions, each one including practice cases coming from real business situations. Students may use their laptops/tablets on the lectures/practice sessions ONLY for the course activities. Use emailing, facebooking, tweeting, chatting, skyping, internet surfing, etc. should NOT be done during classes.
Assessment criteria
100% Class attendance R test
Bibliography
- Bishop, C. M. (2006). Pattern recognition. Machine Learning, 128.
- Manyika, J., Chui, M., Brown, B., Bughin, J., Dobbs, R., Roxburgh, C., & Byers, A. H. (2011). Big data: The next frontier for innovation, competition, and productivity.
- Siegel, E. (2013). Predictive analytics: The power to predict who will click, buy, lie, or die. John Wiley & Sons.
- Google's R Style Guide: https://google.github.io/styleguide/Rguide.html
- The tidyverse style guide: https://style.tidyverse.org/
- Radziwill, N. M. (2017). End-to-end solved problems with R: A catalog of 26 examples using statistical inference. Lapis Lucera.
- Radziwill, N. M. (2019). Statistics (the Easier Way) with R, 3rd Ed: an informal text on statistics and data science. Lapis Lucera.
- David M. Diez, Mine Çetinkaya-Rundel, Christopher D. Barr (2019). OpenIntro Statistics, 4th Ed.
Timetable and sections
Group
|
Teacher
|
Department
|
|
Ruben Coca Marin
|
Operaciones, Innovación y Data Sciences
|
Timetable
From 2022/1/10 to 2022/1/12: From Monday to Wednesday from 14:00 to 15:30. From Monday to Wednesday from 16:00 to 17:30.
|