TDT13 - Advanced Text Analytics and Language Understanding
Norwegian title: Avansert tekstanalyse og språkforståelse, 2025
Given by Björn Gambäck.
Update 250805: The course will start on Thursday 28.8 at 13:15-15 (week 35).
Update 250821: New room (B22) due to over-registration for the course. Room B22 is on floor 1 of the Berg building (Map).
Update 250821: Due to room restrictions, students registering for the course after today will probably not be able to attend the lectures.
The course consists of a set of regular lectures and student presentations.
Note that the course content is tailored to the needs of students writing a Master's Thesis in Language Technology, and the course is primarily open to those students. It should still be possible to accommodate other interested students, but renovation work on the IT building restricts the available room sizes, so contact the lecturer before registering.
[Not in 2025: Preference will be given to students who have taken the course TDT4310 (Intelligent Text Analytics and Language Understanding/Intelligent tekstanalyse og språkforståelse), or something similar (e.g., during an exchange visit abroad). However, an introduction to and overview of language technology will be included at the beginning of the course.]
This year, we will in particular discuss:
- word embeddings and word-space modelling,
- transfer learning, transformers and self-attention,
- computational linguistic creativity,
- semantic representations and processing,
- and, in general, classification algorithms for language processing, applied to issues such as:
  - sentiment analysis
  - author profiling
  - hate speech
  - native language identification
  - figurative language
Course Material
The course material (slides, articles, etc.) will be published in the course Teams group.
Examination
Grading will be based on an oral student presentation and a written report on the same subject, with presentation/report themes selected by the students together with the lecturer. The project can be carried out individually or in groups of two students.
Course Schedule
The course will start on Thursday 28.8 at 13:15-15 (week 35) in room B22, which is on floor 1 of the Berg building (Map).
[NB: New room due to over-registration for the course.]
Preliminary schedule (with all meetings in room B22):
- Lecture 1 (Thursday 28.8, 13:15-15): Introduction (to the course, to language, and to Language Technology)
- Lecture 2 (Thursday 11.9, 13:15-15): Machine Learning and Deep Learning for Natural Language Processing
- Lecture 3 (Thursday 2.10, 13:15-15; might change): Linguistic Meaning, Semantics and Sentiment Analysis
- Lecture 4 (Thursday 30.10, 13:15-15; might change): Digital Forensics, Computational Linguistic Creativity and Evaluation
- Student project presentations (Tuesday 25.11, 13:15-16 and/or Wednesday 26.11, 09:15-12; might change)
The student presentations (examination) may instead be scheduled at other times during weeks 48 or 49.
The lectures and presentations will be held on site. Lectures may also be made available online, but only for registered students who have valid reasons for not attending a specific lecture in person. The student presentations will be in-person only.
Visiting hours
By appointment.
For more information about the course, please contact Prof. Björn Gambäck.