This course bridges the gap between traditional econometrics and modern data science, offering both a theoretical understanding of machine learning and the practical skills to apply it. We examine how techniques like supervised and unsupervised learning, natural language processing, and the emerging field of Causal ML allow economists to tackle large, complex datasets. We also explore the transformative role of AI and Large Language Models in social science research.
Students should have completed a course in econometrics or statistics covering multivariate regression and hypothesis testing. No prior programming experience is required, but willingness to invest time and effort in learning the basics of Python in and outside of class is expected.
Weekly Questions (40%): Conceptual and practical questions assigned each week to consolidate learning.
Presentations (30%): Students will present a mini data project application of the methods (2 per week during Weeks 5, 6, 7, and 8).
Report (30%): A written report due alongside the presentation, containing all reproducible Python code used for the application.