Please note: This course will be taught online only. In-person study is not available for this course.
Matt W. Loftis is an Associate Professor of Political Science at Aarhus University in Aarhus, Denmark. His research focuses on political control of bureaucracy, political agenda setting, and digital and statistical methods.
Course Content: This course will cover automated data collection from a variety of internet-accessible data sources, including a wide range of websites and Application Programming Interfaces (APIs). Students will learn to write software to automate the data-collection process. Data collected automatically over the Internet is generally unstructured, so students will further learn to write software to clean, structure, and visualize their data in preparation for analysis.
Course objectives: Participants will gain a broad toolkit aimed at (arguably) the most time-consuming part of the research process: data collection, cleaning, and structuring. The course will prepare students to execute reliable and efficient large-scale data-collection projects. Students will also be prepared to think through the implications of their data-collection decisions for their entire research design. Researchers who plan to work with a wide variety of observational data touching both the public and private spheres will find these tools applicable in their future work. These tools can support a variety of research designs, from large-scale quantitative studies to descriptive qualitative projects focusing on a few cases.
Course prerequisites: Participants will find it very useful to have experience using the R statistical software. Some familiarity with Internet technologies like HTML and HTTP is also helpful but not required.
Background knowledge required
R = moderate