POL42050

Academic Year 2025/2026

Quantitative Text Analysis (POL42050)

Subject:: Politics
College:: Social Sciences & Law
School:: Politics & Int Relations
Level:: 4 (Masters)
Credits:: 10
Module Coordinator:: Assoc Professor Stefan Muller
Trimester:: Spring
Mode of Delivery:: On Campus
Internship Module:: No
How will I be graded?: Letter grades

Curricular information is subject to change.

Computational text analysis has become increasingly popular in political science in recent years. With the vast availability of text data on the web, political scientists increasingly view quantitative text analysis (or “text as data”) as a valuable approach for studying various forms of social and political behaviour.

This module introduces political science students to the quantitative analysis of textual data. It covers the theoretical foundations, practical applications, and technical implementations of these methods using the R statistical programming language. The module also explores advanced techniques, including word embeddings, speech transcription, and machine translation. Additionally, students will engage with the Hugging Face Python infrastructure, a cutting-edge resource for implementing transformer models and other state-of-the-art natural language processing methods.

Each session integrates lectures with practical, hands-on exercises to apply these methods to political texts. These exercises address practical challenges at each stage of the research process. Most of the methods follow a three-step framework: first, identifying texts and units of analysis; second, extracting measurable features from these texts and converting them into a quantitative feature matrix; and third, analysing this matrix using statistical techniques such as dictionary construction and application, scaling models, and topic models. Students will learn to apply these steps to various types of texts.

Building on this foundational framework, students will also gain hands-on experience with advanced techniques such as word embeddings, transformer models, and generative AI. These approaches will provide insights into the latest developments in text analysis and their applications to political science research.

About this Module

Learning Outcomes:

Upon successful completion of the course, students will be able to:

1. Understand fundamental issues in (quantitative) text analysis such as inter-coder agreement, reliability, validation, accuracy, and precision.

2. Convert texts into quantitative matrices of features, and then analyse those features using statistical methods, topic models, and scaling approaches.

3. Use human coding of texts to train supervised classifiers and fine-tune transformer models.

4. Apply these methods to their own text corpus to address a substantive research question.

5. Critically evaluate (social science) research that uses automated text analysis methods.

Indicative Module Content:

Statistical software and programming using R and RMarkdown; assumptions and workflow of quantitative text analysis approaches; tokenisation and document-feature matrix; dictionaries and sentiment analysis; describing and comparing texts; human coding and document classification; supervised and unsupervised scaling; multilingual text analysis; topic models; speech recognition; word embeddings

Student Effort Hours:

Student Effort Type	Hours
Autonomous Student Learning	226
Lectures	24
Total	250

Approaches to Teaching and Learning:

active/task-based learning; peer and group work; lectures; lab/studio work; enquiry & problem-based learning; case-based learning

Requirements, Exclusions and Recommendations

Learning Requirements:

NOTE: Prior familiarity with the statistical programming language R (or Python) is a prerequisite for this course due to its direct relevance to the content and assignments. Below are some reasons why prior experience with R or Python is crucial for students to follow the course and apply the methods effectively:

– Implementation of Text Analysis Methods: Text analysis is a central component of the course, and R is widely used for implementing text analysis techniques. R provides a comprehensive set of libraries and packages specifically designed for text processing, natural language processing (NLP), and sentiment analysis. Students with prior experience in R will be able to navigate and utilize these tools more efficiently, enabling them to implement text analysis methods covered in the course effectively.

– Course Content Alignment: The course content, lectures, and materials are designed with a focus on R-based implementation. The examples, code snippets, and demonstrations provided throughout the course will be predominantly in R. Some of the advanced methods are implemented in Python, but a good understanding of R will make it much easier to write and run code in Python. Without prior familiarity, students may struggle to comprehend and replicate these examples, hindering their understanding of the core concepts and methodologies.

– Homework Assignments and Research Papers: The assignments and research papers in this course will require students to apply the text analysis methods discussed in class to real-world data. Students without prior experience with R may find it challenging to write R code to preprocess large text corpora, visualise results, and interpret the findings. Their lack of proficiency in R could impede their ability to complete assignments accurately and efficiently.

Module Requisites and Incompatibles

Not applicable to this module.

Assessment Strategy

Description	Timing	Component Scale	Must Pass Component	% of Final Grade	In Module Component Repeat Offered
Assignment(Including Essay): Homework 1	Week 5	Other	No	20	No
Assignment(Including Essay): Homework 2	Week 10	Other	No	20	No
Assignment(Including Essay): Research Paper	Week 14	Standard conversion grade scale 40%	No	60	No

Carry forward of passed components

Resit In	Terminal Exam
Summer	No

Please see Student Jargon Buster for more information about remediation types and timing.

Feedback Strategy/Strategies

• Feedback individually to students, post-assessment

How will my Feedback be Delivered?

Feedback will be provided to students within 20 working days of the deadline for the assignment in accordance with university policy.

Explore UCD

About UCD

Students

Research & Innovation

Colleges

Engage

Key Services

POL42050

Quantitative Text Analysis (POL42050)

About this Module

Learning Outcomes:

Indicative Module Content:

Student Effort Hours:

Approaches to Teaching and Learning:

Requirements, Exclusions and Recommendations

Module Requisites and Incompatibles

Assessment Strategy

Carry forward of passed components

Feedback Strategy/Strategies

How will my Feedback be Delivered?

Explore UCD

About UCD

Students

Research & Innovation

Colleges

Engage

Key Services

POL42050

Quantitative Text Analysis (POL42050)

About this Module

What will I learn?

Learning Outcomes:

Indicative Module Content:

How will I learn?

Student Effort Hours:

Approaches to Teaching and Learning:

Am I eligible to take this module?

Requirements, Exclusions and Recommendations

Module Requisites and Incompatibles

How will I be assessed?

Assessment Strategy

Carry forward of passed components

What happens if I fail?

Assessment feedback

Feedback Strategy/Strategies

How will my Feedback be Delivered?

Ask a Question: