{"id":5,"date":"2024-04-04T01:32:13","date_gmt":"2024-04-04T01:32:13","guid":{"rendered":"https:\/\/sisu.ut.ee\/digitalmethods\/workshops\/"},"modified":"2024-04-04T01:32:22","modified_gmt":"2024-04-04T01:32:22","slug":"workshops","status":"publish","type":"page","link":"https:\/\/sisu.ut.ee\/digitalmethods\/workshops\/","title":{"rendered":"Workshops"},"content":{"rendered":"<p>\n\t<\/p><div class=\"accordion mb-3\">\n        <div class=\"accordion-item accordion-item--white\">\n        <h2 class=\"accordion-header\" id=\"accordion-6a511a5207c9e-heading\">\n            <button class=\"accordion-button collapsed\" type=\"button\" data-bs-toggle=\"collapse\" data-bs-target=\"#accordion-6a511a5207c9e-collapse\" aria-expanded=\"true\" aria-controls=\"accordion-6a511a5207c9e-collapse\"><strong>Peeter Tinits \u2013\u00a0<\/strong><b id=\"docs-internal-guid-d2c56d2b-7fff-52d6-5451-9b9de6903da9\">Introduction to R and Tidyverse<\/b><\/button>\n        <\/h2>\n        <div id=\"accordion-6a511a5207c9e-collapse\" class=\"accordion-collapse collapse\" aria-labelledby=\"accordion-6a511a5207c9e-heading\">\n            <div class=\"accordion-body\">\n\n<p>\n\tLecturer: Peeter Tinits (peeter.tinits@ut.ee), University of Tartu, Tallinn University\n<\/p>\n<p>\n\tDate: 24.08.\n<\/p>\n<p>\n\tRoom: \u00dclikooli 18-139\n<\/p>\n<p>\n\t\u00a0\n<\/p>\n<p>\n\t<strong>Description<\/strong>\n<\/p>\n\n<p dir=\"ltr\">\n<\/p><p dir=\"ltr\">\n\tR is a scripting language often used for data processing in humanities and social sciences. It provides the means to produce analyses as a reproducible workflow that is transparent to readers and easy to update. We will start with the very basics of R and RStudio, and quickly work our way through to simple data processing via tidyverse packages. Tidyverse is a set of packages that aims to make R easy to use especially for beginners. 
We will learn 1) basic R syntax, 2) reading data into R, 3) selecting data points and features, 4) making quick summaries of data, 5) creating variables, 6) transforming data and data frames, 7) joining datasets together.\n<\/p>\n<p dir=\"ltr\">\n\tThis is a very practical introduction to R. We will focus more on how to do these things in R, and less on the research questions that drive these needs. All the data processing is done in tidyverse, so if you know R but not tidyverse, it may be interesting for you too.\n<\/p>\n<p dir=\"ltr\">\n\tWe will rely on personal laptops in this tutorial, so you will need to install R (<a href=\"https:\/\/www.r-project.org\">https:\/\/www.r-project.org<\/a>) and RStudio (<a href=\"https:\/\/www.rstudio.com\">https:\/\/www.rstudio.com<\/a>) a few days beforehand. Short instructions will be shared.\n<\/p>\n<p dir=\"ltr\">\n\t<strong>If you have no previous experience in R, this tutorial is a requirement for attending other workshops using R in this summer school.<\/strong>\n<\/p>\n<p>\u00a0<\/p>\n<p dir=\"ltr\">\n\t<b id=\"docs-internal-guid-dd6c7c3a-7fff-30af-9234-7694b476a655\">References:<\/b>\n<\/p>\n<p dir=\"ltr\">\n\t\u2013 Grolemund, Garrett, and Wickham, Hadley (2017) R for Data Science: Import, Tidy, Transform, Visualize, and Model Data. O\u2019Reilly Media.\n<\/p>\n<p dir=\"ltr\">\n\t\u00a0\n<\/p>\n<p>\n\t<strong>About the instructor<\/strong>\n<\/p>\n<p>\n\t<strong>Peeter Tinits<\/strong> is a digital humanities specialist at the University of Tartu, and teaches various digital humanities courses. His own research has been on spelling standardization of Estonian, the rise of environmentalism in the 20th century, and structural changes in film production crews. 
He is a firm believer that anyone can learn to code, and that the humanities have a lot to gain from adopting reproducible research practices.\n<\/p>\n<p>\n\t<\/p><\/div>\n        <\/div>\n        <\/div>\n    <\/div>\n\n<p>\n\t<\/p><div class=\"accordion mb-3\">\n        <div class=\"accordion-item accordion-item--white\">\n        <h2 class=\"accordion-header\" id=\"accordion-6a511a5207cb6-heading\">\n            <button class=\"accordion-button collapsed\" type=\"button\" data-bs-toggle=\"collapse\" data-bs-target=\"#accordion-6a511a5207cb6-collapse\" aria-expanded=\"true\" aria-controls=\"accordion-6a511a5207cb6-collapse\"><b id=\"docs-internal-guid-726387ab-7fff-1ff1-6c6c-912c8b758b71\">Kristiina Vaik \u2013<\/b><strong>\u00a0Introduction to natural language processing using Pandas and spaCy<\/strong><\/button>\n        <\/h2>\n        <div id=\"accordion-6a511a5207cb6-collapse\" class=\"accordion-collapse collapse\" aria-labelledby=\"accordion-6a511a5207cb6-heading\">\n            <div class=\"accordion-body\">\n\n<p>\n\tLecturer: Kristiina Vaik (kristiina.vaik@ut.ee),\u00a0University of Tartu\n<\/p>\n<p>\n\tDate: 24.08.\n<\/p>\n<p>\n\tRoom:\u00a0Jakobi 2-438\n<\/p>\n<p>\n\t\u00a0\n<\/p>\n<p>\n\t<strong>Description<\/strong>\n<\/p>\n<p>\n\tThis workshop aims to introduce an alternative programming language used in natural language processing \u2013 Python. Python has a simple syntax and transparent semantics and is widely used for analyzing, understanding, and deriving information from structured and unstructured data. This course will start with a basic introduction to Python: we will quickly go through topics such as syntax, variables, data structures, conditionals, loops, and IO. This will be followed by an introduction to Pandas, a powerful Python data analysis toolkit that is used for data exploration and manipulation. Finally, we will get to know spaCy, a free open-source library with many built-in capabilities for text processing. 
We will use spaCy for data (pre)processing, e.g. noise removal, tokenization, lemmatization, etc. Additionally, we shall see how to apply pre-built models for different downstream tasks, e.g. morphological and syntactic parsing, named entity recognition, etc.\n<\/p>\n<p>\n\tStudents will be provided with Jupyter notebooks containing the code used in this tutorial. Knowledge of Python is not mandatory but highly recommended. <strong>I also recommend using your own laptop<\/strong>; instructions on what packages to download will be shared beforehand.\n<\/p>\n<p>\n\t<strong>\u00a0<\/strong>\n<\/p>\n<p>\n\t<strong>About the instructor<\/strong>\n<\/p>\n<p>\n\t<strong>Kristiina Vaik<\/strong> is a Ph.D. student at the University of Tartu. She has worked as a programmer in the Natural Language Processing Research Group at the University of Tartu and as a data analyst at TEXTA.<\/p><\/div>\n        <\/div>\n        <\/div>\n    <\/div>\n\n<p>\n\t<\/p><div class=\"accordion mb-3\">\n        <div class=\"accordion-item accordion-item--white\">\n        <h2 class=\"accordion-header\" id=\"accordion-6a511a5207cbc-heading\">\n            <button class=\"accordion-button collapsed\" type=\"button\" data-bs-toggle=\"collapse\" data-bs-target=\"#accordion-6a511a5207cbc-collapse\" aria-expanded=\"true\" aria-controls=\"accordion-6a511a5207cbc-collapse\"><b id=\"docs-internal-guid-e250c603-7fff-b90c-60d1-1e6c69ee8953\">Martin M\u00f6lder \u2013 <\/b><b id=\"docs-internal-guid-15f1077a-7fff-4b87-ee9c-a144f7523108\">Web-scraping with R<\/b><\/button>\n        <\/h2>\n        <div id=\"accordion-6a511a5207cbc-collapse\" class=\"accordion-collapse collapse\" aria-labelledby=\"accordion-6a511a5207cbc-heading\">\n            <div class=\"accordion-body\">\n\n<p>\n\tLecturer:\u00a0Martin M\u00f6lder (martin.molder@ut.ee), University of Tartu\n<\/p>\n<p>\n\tDate: 25.08.\n<\/p>\n<p>\n\tRoom: \u00dclikooli 
18-139\n<\/p>\n<p>\n\t\u00a0\n<\/p>\n<p>\n\t<strong>Description<\/strong>\n<\/p>\n<p>\n\tThe Internet is full of data \u2013 both textual and numerical \u2013 that we could use and analyse, and it is up to us to collect it, systematise and clean it, and finally to analyse it. This workshop focusses mostly on the first step in this process \u2013 how to automatically collect information from the Internet using R. This process is in general called <strong>web-scraping<\/strong>. R connects to the Internet and you can download web-pages into R. By familiarising yourself with the structure and functioning of a web-site you can write R code that will systematically go through the content of a web page and download the information \u2013 blog-posts, comments, articles, etc. \u2013 that you need. There are packages in R that can make this process rather smooth and streamlined, but some knowledge of what to look for and where on a web page is still necessary. In this workshop we will go through some simpler and more complicated examples of how to construct and automate this process of information gathering in R.\n<\/p>\n<p>\n\t<strong>\u00a0<\/strong>\n<\/p>\n<p>\n\t<strong>About the instructor<\/strong>\n<\/p>\n<p>\n\t<b id=\"docs-internal-guid-ab3b2c6f-7fff-7a5b-384f-15790db5cdf5\">Martin M\u00f6lder <\/b>is a researcher at the Johan Skytte Institute of Political Studies at the University of Tartu. He teaches about quantitative methods and party politics. 
Much of his current (and future) research interests and activities revolve around quantitative text analysis.<\/p><\/div>\n        <\/div>\n        <\/div>\n    <\/div>\n\n<p>\n\t<\/p><div class=\"accordion mb-3\">\n        <div class=\"accordion-item accordion-item--white\">\n        <h2 class=\"accordion-header\" id=\"accordion-6a511a5207cc4-heading\">\n            <button class=\"accordion-button collapsed\" type=\"button\" data-bs-toggle=\"collapse\" data-bs-target=\"#accordion-6a511a5207cc4-collapse\" aria-expanded=\"true\" aria-controls=\"accordion-6a511a5207cc4-collapse\"><b>Iza Romanowska \u2013\u00a0<\/b><strong>Practical introduction to agent-based modelling\u00a0<\/strong><b id=\"docs-internal-guid-bdb457bd-7fff-2c3a-bbca-318ad832508d\">(participating remotely)<\/b><\/button>\n        <\/h2>\n        <div id=\"accordion-6a511a5207cc4-collapse\" class=\"accordion-collapse collapse\" aria-labelledby=\"accordion-6a511a5207cc4-heading\">\n            <div class=\"accordion-body\">\n\n<p>\n\tLecturer: Iza Romanowska (participating remotely)\n<\/p>\n<p>\n\tCo-lecturer: Andres Kimber\n<\/p>\n<p>\n\tDate: 25.08.\n<\/p>\n<p>\n\tRoom: \u00dclikooli 18-128\n<\/p>\n<p>\n\t\u00a0\n<\/p>\n<p>\n\t<strong>Description<\/strong>\n<\/p>\n<p dir=\"ltr\">\n\tThe goal of the workshop is to provide a quick and easy introduction to the methodology of agent-based modelling and the software most commonly used in social simulation: NetLogo. Agent-based modelling is the easiest, most user-friendly and fun simulation technique enabling even non-coders to develop an artificial world and test their ideas on it. It is widely used across social and natural sciences to investigate complex phenomena, such as ecological systems, social dynamics, epidemiology or urban development. The workshop will focus on explaining the process of developing a simulation and will provide a practical hands-on introduction to NetLogo. 
It will consist of a practical session demonstrating the basics of modelling through an example of a simulation of migration. An extensive list of relevant resources will enable those interested to develop their ideas further.\n<\/p>\n<p dir=\"ltr\">\n\tNetLogo was chosen thanks to its versatility as an open-source platform for building agent-based models, which combines a user-friendly interface, a simple coding language and a vast library of model examples, making it an ideal starting point for entry-level agent-based modellers, as well as a useful prototyping tool for more experienced programmers. It was developed with schoolchildren in mind but is widely used in social sciences and ecology. No previous experience in coding or simulation is required to join the workshop, but please install NetLogo in advance (<a data-saferedirecturl=\"https:\/\/www.google.com\/url?q=https:\/\/ccl.northwestern.edu\/netlogo\/download.shtml&amp;source=gmail&amp;ust=1596956662948000&amp;usg=AFQjCNH7ihsfT4AVM2i77Jy8W0I8Sv3-Mw\" href=\"https:\/\/ccl.northwestern.edu\/netlogo\/download.shtml\" target=\"_blank\" rel=\"noopener\">https:\/\/ccl.northwestern.edu\/netlogo\/download.shtml<\/a>).\n<\/p>\n<p dir=\"ltr\">\n\t\u00a0\n<\/p>\n<p>\n\t<strong>About the instructor<\/strong>\n<\/p>\n<p>\n\t<strong>Iza Romanowska\u00a0<\/strong>is\u00a0a complexity scientist working on the interface between social sciences and computer science. She originally trained and worked as an archaeologist before switching to computer-based research. Currently, she is\u00a0working as a senior researcher and the head of the Social Simulation and Digital Humanities Research Group at the Barcelona Supercomputing Center leading a team of engineers and computer scientists who develop solutions for agent-based simulation (ABM) using High Performance Computing (for example, our supercomputer MareNostrum). 
We create models of mobility in ancient cities, look for patterns in demographic data, and create platforms for real-time pedestrian flow modelling. Dr Romanowska is a vocal advocate for a wider use of simulation in archaeological research, training next generations of ABM modellers through courses, workshops and published tutorials (e.g.,\u00a0<b><a href=\"http:\/\/tinyurl.com\/y7hhqc4d\" target=\"_blank\" rel=\"noopener\">tinyurl.com\/y7hhqc4d<\/a><\/b>). She is\u00a0also a co-author of an upcoming textbook on archaeological ABM.\n<\/p>\n<p>\n\t<\/p><\/div>\n        <\/div>\n        <\/div>\n    <\/div>\n\n<p>\n\t<\/p><div class=\"accordion mb-3\">\n        <div class=\"accordion-item accordion-item--white\">\n        <h2 class=\"accordion-header\" id=\"accordion-6a511a5207ccb-heading\">\n            <button class=\"accordion-button collapsed\" type=\"button\" data-bs-toggle=\"collapse\" data-bs-target=\"#accordion-6a511a5207ccb-collapse\" aria-expanded=\"true\" aria-controls=\"accordion-6a511a5207ccb-collapse\"><b>Cornelius Puschmann \u2013\u00a0Sentiment analysis with R (participating remotely)<\/b><\/button>\n        <\/h2>\n        <div id=\"accordion-6a511a5207ccb-collapse\" class=\"accordion-collapse collapse\" aria-labelledby=\"accordion-6a511a5207ccb-heading\">\n            <div class=\"accordion-body\">\n\n\n<p>\n\tLecturer: Cornelius Puschmann (puschmann@uni-bremen.de), University of Bremen (participating remotely)\n<\/p>\n<p>\n\tCo-lecturer: Sander Salvet (sander.salvet@ut.ee), University of Tartu\n<\/p>\n<p>\n\tDate: 25.08\n<\/p>\n<p>\n\tRoom: Jakobi 2-438\n<\/p>\n<p>\n\t\u00a0\n<\/p>\n<p>\n\t<strong>Description<\/strong>\n<\/p>\n<p>\n\tAudience:<br>MA\/PhD students and faculty in all fields interested in quantitative social media research, especially doctoral students in media &amp; communication research and related fields.<\/p>\n<p>Learning outcomes:<br>Participants will learn how to obtain and analyze large-scale social media data sets to answer 
questions relevant to the textual expression of sentiment\/emotions. In order to achieve this goal, they will be introduced to the use of R for content analysis with quanteda and additional software packages. They will also learn the fundamentals of interacting with social media platform APIs, as well as managing data and visualizing results.<\/p>\n<p>Prerequisites:<br>The course will assume familiarity with R (<a data-saferedirecturl=\"https:\/\/www.google.com\/url?q=http:\/\/r-project.org&amp;source=gmail&amp;ust=1595578036181000&amp;usg=AFQjCNH-9qQtkxfhYQAu-uBwTejHHFPTzA\" href=\"http:\/\/r-project.org\/\" target=\"_blank\" rel=\"noopener\">r-project.org<\/a>) and RStudio, especially R Notebooks. Participants should be able to read datasets in R, work with vectors and data frames, and run basic statistical analyses, such as linear regression.<\/p>\n<p>Content:<br>This class focuses on the types of questions that are relevant to communication and media studies, as well as political science, sociology and other fields interested in leveraging digital data from social media platforms in combination with innovative computational methods for content analysis (\u201cbig data\u201d research). The platforms used as examples include Twitter and Facebook, and the techniques covered will include sentiment analysis through the use of dictionaries and third-party APIs.<\/p>\n<p>Study materials and literature:<br>The course will use the open-source software R and the development environment RStudio, which greatly facilitates coding with R. 
Both R and RStudio are freely available, and each participant should have access to a laptop computer on which the current versions of R and RStudio are preinstalled, and on which they have the necessary permissions to install packages.<\/p>\n<p>Modes of study:<br>The course will follow a hands-on approach, with short theoretical sessions followed by coding challenges where participants will need to apply new methods.\n<\/p>\n<p>\n\t\u00a0\n<\/p>\n<p>\n\t<strong>About the instructor<\/strong>\n<\/p>\n<p>\n\tCornelius Puschmann is\u00a0a professor of media and communication at ZeMKI, University of Bremen\u00a0and an affiliate researcher at the Leibniz Institute for Media Research,\u00a0as well as the author of a popular German-language introduction to content analysis with R.\u00a0His\u00a0interests include digital media usage, online aggression, the role of algorithms for the selection of media content, and automated content analysis.\n<\/p>\n<p>\n\t<\/p><\/div>\n        <\/div>\n        <\/div>\n    <\/div>\n\n<p>\n\t<\/p><div class=\"accordion mb-3\">\n        <div class=\"accordion-item accordion-item--white\">\n        <h2 class=\"accordion-header\" id=\"accordion-6a511a5207cd1-heading\">\n            <button class=\"accordion-button collapsed\" type=\"button\" data-bs-toggle=\"collapse\" data-bs-target=\"#accordion-6a511a5207cd1-collapse\" aria-expanded=\"true\" aria-controls=\"accordion-6a511a5207cd1-collapse\"><b>Andres Karjus \u2013\u00a0<\/b><strong>Visualizing your data using R<\/strong><\/button>\n        <\/h2>\n        <div id=\"accordion-6a511a5207cd1-collapse\" class=\"accordion-collapse collapse\" aria-labelledby=\"accordion-6a511a5207cd1-heading\">\n            <div class=\"accordion-body\">\n\n<p>\n\tLecturer: Andres Karjus (andres.karjus@hotmail.com), University of Edinburgh\n<\/p>\n<p>\n\tDate: 26.08.\n<\/p>\n<p>\n\tRoom: \u00dclikooli 18-139\n<\/p>\n<p>\n\t\u00a0\n<\/p>\n<p>\n\t<strong>Description<\/strong>\n<\/p>\n<p>\n\tIn this workshop, we\u2019ll be 
focusing on visualizing different kinds of data using R, an excellent programming language for doing anything related to stats and data science. We will mostly be using ggplot2 and its add-ons, starting out with basic examples like scatterplots and time series, and how to balance legibility and the amount of information on a plot. We will also look into a few other packages for creating networks and maps, as well as interactive plots and animations that can be published on the web and included in slide presentations. Some time will also be dedicated to discussing the ethics of data visualization, or how to make sure you are not misleading your audience (and how to spot bad faith graphs in the wild).\n<\/p>\n<p>\n\tPre-workshop installation instructions and materials:\u00a0<br><a data-saferedirecturl=\"https:\/\/www.google.com\/url?q=https:\/\/andreskarjus.github.io\/artofthefigure\/tartu2020\/&amp;source=gmail&amp;ust=1598296569592000&amp;usg=AFQjCNF8SpH35RSADIdCTWhCwZ4Jy2ecPQ\" href=\"https:\/\/andreskarjus.github.io\/artofthefigure\/tartu2020\/\" target=\"_blank\" rel=\"noopener\">https:\/\/andreskarjus.github.io\/artofthefigure\/tartu2020\/<\/a>\n<\/p>\n<p>\n\t\u00a0\n<\/p>\n<p>\n\t<strong>About the instructor<\/strong>\n<\/p>\n<p>\n\t<strong>Andres Karjus <\/strong>is a PhD student at the Centre for Language Evolution at the University of Edinburgh, and a tutor at the School of Philosophy, Psychology &amp; Language Sciences. He uses R daily in his research and has been teaching occasional R workshops since 2015. He holds degrees in linguistics (BA, MA) and computer science (MSc). 
Personal website:\u00a0<a data-saferedirecturl=\"https:\/\/www.google.com\/url?q=http:\/\/andreskarjus.github.io&amp;source=gmail&amp;ust=1594467719854000&amp;usg=AFQjCNEPytar2E08iEIm9SeocMIUvXpl2g\" href=\"http:\/\/andreskarjus.github.io\/\" target=\"_blank\" rel=\"noopener\">andreskarjus.github.io<\/a>\n<\/p>\n<p>\n\t<\/p><\/div>\n        <\/div>\n        <\/div>\n    <\/div>\n\n<p>\n\t<\/p><div class=\"accordion mb-3\">\n        <div class=\"accordion-item accordion-item--white\">\n        <h2 class=\"accordion-header\" id=\"accordion-6a511a5207cd9-heading\">\n            <button class=\"accordion-button collapsed\" type=\"button\" data-bs-toggle=\"collapse\" data-bs-target=\"#accordion-6a511a5207cd9-collapse\" aria-expanded=\"true\" aria-controls=\"accordion-6a511a5207cd9-collapse\"><b id=\"docs-internal-guid-94ce4304-7fff-de98-812c-fcde58e3e399\">Simon Hengchen \u2013\u00a0<\/b><strong>Introduction to diachronic word embeddings with Python\u00a0<\/strong><b id=\"docs-internal-guid-54437566-7fff-8413-37ac-5b245453f450\"><strong>(<\/strong>participating remotely)<\/b><\/button>\n        <\/h2>\n        <div id=\"accordion-6a511a5207cd9-collapse\" class=\"accordion-collapse collapse\" aria-labelledby=\"accordion-6a511a5207cd9-heading\">\n            <div class=\"accordion-body\">\n\n<p>\n\tLecturer:\u00a0Simon Hengchen (simon.hengchen@gu.se), University of Gothenburg\u00a0(participating remotely)\n<\/p>\n<p>\n\tCo-lecturers: Peeter Tinits, Artjoms \u0160ela\n<\/p>\n<p>\n\tDate: 26.08.\n<\/p>\n<p>\n\tRoom: Jakobi 2-438\n<\/p>\n<p>\n\t\u00a0\n<\/p>\n<p>\n\t<strong>Description<\/strong>\n<\/p>\n\n<p dir=\"ltr\">\n\tThe increasing availability of textual data gives new opportunities for humanities and social sciences that we are only beginning to explore. The nature of the data can vary quite a bit ranging from old digitized newspapers to Twitter or forum posts that are born and live digitally. 
Provided that we can access the data, they allow quite diverse questions to be answered. Concurrently, these past years have seen the rise of computational methods to detect, track, qualify, and quantify how a word\u2019s sense \u2013 or senses \u2013 change over time.\n<\/p>\n<p dir=\"ltr\">\n\tIn this tutorial, we will learn how to get and prepare textual content to build word embedding models with Python. Word embeddings are a rough approximation of the distributional hypothesis (Harris 1954), which states that words occurring in the same context tend to have the same meaning. Using such models means that we represent words as vectors (a one-row table, filled with numbers) in multi-dimensional space \u2014 which in turn allows us to go beyond simple string comparison: we now have easy access to a word\u2019s sense(s), among other things.\u00a0\n<\/p>\n<p dir=\"ltr\">\n\tTo reuse a famous example: vector_king \u2013 vector_man + vector_woman \u2243 vector_queen.\n<\/p>\n<p dir=\"ltr\">\n\tWhen trained over diachronic data, these models allow for the detection and quantification of change in a word\u2019s sense.\n<\/p>\n<p dir=\"ltr\">\n\tIn this tutorial, students will learn about:\n<\/p>\n<p dir=\"ltr\">\n\t\u2013 diachronic textual data and where to find them\n<\/p>\n<p dir=\"ltr\">\n\t\u2013 training different embedding models, as well as how to use them\n<\/p>\n<p dir=\"ltr\">\n\t\u2013 different ways of tackling time\n<\/p>\n<p dir=\"ltr\">\n\tStudents will be provided with Jupyter notebooks containing the bulk of the code used in this tutorial \u2014 as such, they do not need to be proficient in Python, although this is recommended. 
Students should at least have read and understood the material in Sinclair and Rockwell (2015).\n<\/p>\n<p dir=\"ltr\">\n\t\u00a0\n<\/p>\n<p dir=\"ltr\">\n\tRequirements:\u00a0\u00a0\n<\/p>\n<p dir=\"ltr\">\n\t\u2013 The workshop will take place in a computer class where the software is preinstalled\n<\/p>\n<p dir=\"ltr\">\n\t\u2013 If you use your own computer, you must be able to run Jupyter notebooks, as well as install Python packages. We will be using Python 3. Instructions to do so are available in Sinclair and Rockwell (2015)\n<\/p>\n<p dir=\"ltr\">\n\t\u2013 If you want to have a go with your own data, please email Simon by August 10 for a go-ahead.\n<\/p>\n<p dir=\"ltr\">\n\t\u2013 Please read the readings\n<\/p>\n<p dir=\"ltr\">\n\t\u00a0\n<\/p>\n<p dir=\"ltr\">\n\tReadings:\n<\/p>\n<p dir=\"ltr\">\n\t\u2013 Tahmasebi, Nina and Hengchen, Simon, 2019. The Strengths and Pitfalls of Large-Scale Text Mining for Literary Studies. Samlaren: tidskrift f\u00f6r svensk litteraturvetenskaplig forskning, 140, pp.198-227. <a href=\"http:\/\/uu.diva-portal.org\/smash\/get\/diva2:1415010\/FULLTEXT01.pdf\">http:\/\/uu.diva-portal.org\/smash\/get\/diva2:1415010\/FULLTEXT01.pdf<\/a>\n<\/p>\n<p dir=\"ltr\">\n\t\u00a0\n<\/p>\n<p dir=\"ltr\">\n\tReferences:\n<\/p>\n<p dir=\"ltr\">\n\t\u2013 Sinclair, St\u00e9fan and Rockwell, Geoffrey, 2015. The Art of Literary Text Analysis:\u00a0<a href=\"https:\/\/github.com\/sgsinclair\/alta\/blob\/915579fc1c6926b8fcb2a38f95349a2d6cba00b5\/ipynb\/GettingSetup.ipynb\">https:\/\/github.com\/sgsinclair\/alta\/blob\/915579fc1c6926b8fcb2a38f95349a2d6cba00b5\/ipynb\/GettingSetup.ipynb<\/a>\n<\/p>\n<p dir=\"ltr\">\n\t\u2013 Harris, Zellig S., 1954. Distributional structure. 
Word, 10(2-3), pp.146-162.\n<\/p>\n<p>\n\t\u00a0\n<\/p>\n<p>\n\t<strong>About the instructor<\/strong>\n<\/p>\n\n<p dir=\"ltr\">\n\t<b id=\"docs-internal-guid-c383cea7-7fff-d971-5ec0-55131c8d55b7\">Simon Hengchen <\/b>holds degrees in language (MA) and information science (MSc, PhD). In his short career, he has been involved in and employed by digital humanities groups, most recently the Computational History group (COMHIS) in Helsinki.\n<\/p>\n<p dir=\"ltr\">\n\tHe is currently working at the Swedish Language Bank (Spr\u00e5kbanken Text) at the University of Gothenburg, where he focuses on his main research interest, computational lexical semantic change, within the Language Change project.\n<\/p>\n<p>\n\tMore information about his NLP and DH work, as well as current projects, can be found at\u00a0<a href=\"https:\/\/hengchen.net\">https:\/\/hengchen.net<\/a>.\n<\/p>\n<p>\n\t<\/p><\/div>\n        <\/div>\n        <\/div>\n    <\/div>\n\n<p>\n\t<\/p><div class=\"accordion mb-3\">\n        <div class=\"accordion-item accordion-item--white\">\n        <h2 class=\"accordion-header\" id=\"accordion-6a511a5207ce0-heading\">\n            <button class=\"accordion-button collapsed\" type=\"button\" data-bs-toggle=\"collapse\" data-bs-target=\"#accordion-6a511a5207ce0-collapse\" aria-expanded=\"true\" aria-controls=\"accordion-6a511a5207ce0-collapse\"><b id=\"docs-internal-guid-42bc388d-7fff-5429-c50d-bce4ca203337\">Artjoms \u0160ela \u2013\u00a0<\/b><b id=\"docs-internal-guid-78db4bea-7fff-87f3-6a32-7e47d5aa3d31\">Introduction to stylometry and multivariate text analysis in R<\/b><\/button>\n        <\/h2>\n        <div id=\"accordion-6a511a5207ce0-collapse\" class=\"accordion-collapse collapse\" aria-labelledby=\"accordion-6a511a5207ce0-heading\">\n            <div class=\"accordion-body\">\n\n\n<p>\n\tLecturer: Artjoms \u0160ela (artjoms.sela@ut.ee), University of Tartu,\u00a0Institute of Polish Language\n<\/p>\n<p>\n\tDate: 27.08.\n<\/p>\n<p>\n\tRoom: \u00dclikooli 
18-140\n<\/p>\n<p>\n\t\u00a0\n<\/p>\n<p>\n\t<strong>Description<\/strong>\n<\/p>\n\n<p dir=\"ltr\">\n\tStylometry \u2013 a discipline that measures variation of features within a text or a set of texts \u2013 appeared much earlier than computers, but the age of computation has made it possible to see style as a clearly distributed phenomenon: hundreds of textual features taken simultaneously seem to describe individuality much better than a handful of hand-picked examples. The usual and well-documented application of stylometric techniques has always been authorship attribution and forensics. In this workshop we will use the general principles behind the multivariate analysis of style and authorial identity to follow the workflow of almost any textual analysis: extracting features, dealing with texts as vectors of these features, surfing the multidimensional space of these vectors.\n<\/p>\n<p dir=\"ltr\">\n\tThe workshop starts by introducing the \u201cstylo\u201d package for R (Eder, Rybicki, Kestemont 2016), which is simple to use yet powerful enough to be customizable and open to the research needs. After covering the basics we will move to build our own simple stylometric tool using \u201ctidyverse\u201d and \u201ctidytext\u201d packages that will allow us to demystify the process. Finally we will discuss how to use stylometry beyond authorship attribution and will run a small experiment on supervised classification of text genres. Participants are encouraged to bring in their datasets, text collections and research questions!\n<\/p>\n<p dir=\"ltr\">\n\t\u00a0\n<\/p>\n<p>\n\t<strong>About the instructor<\/strong>\n<\/p>\n<p>\n\t<strong>Artjoms \u0160ela<\/strong> is a research fellow at the University of Tartu and is currently doing postdoctoral research at the Methodology Department of the Institute of Polish Language (Krakow). In 2018 he received his PhD in Russian literature at the University of Tartu. 
He teaches courses focusing on digital humanities, computational methods and literature.<\/p><\/div>\n        <\/div>\n        <\/div>\n    <\/div>\n\n<p>\n\t<\/p><div class=\"accordion mb-3\">\n        <div class=\"accordion-item accordion-item--white\">\n        <h2 class=\"accordion-header\" id=\"accordion-6a511a5207ce5-heading\">\n            <button class=\"accordion-button collapsed\" type=\"button\" data-bs-toggle=\"collapse\" data-bs-target=\"#accordion-6a511a5207ce5-collapse\" aria-expanded=\"true\" aria-controls=\"accordion-6a511a5207ce5-collapse\"><b>\u00dclo Maiv\u00e4li, Taavi P<\/b><strong>\u00e4ll \u2013\u00a0Introduction to Bayesian inference in RStan &amp; brms<\/strong><\/button>\n        <\/h2>\n        <div id=\"accordion-6a511a5207ce5-collapse\" class=\"accordion-collapse collapse\" aria-labelledby=\"accordion-6a511a5207ce5-heading\">\n            <div class=\"accordion-body\">\n\n\n<p>\n\tLecturer: \u00dclo Maiv\u00e4li (ulo.maivali@ut.ee), University of Tartu\n<\/p>\n<p>\n\tLecturer: Taavi P\u00e4ll (taavi.pall@ut.ee), University of Tartu\n<\/p>\n<p>\n\tDate: 27.08.\n<\/p>\n<p>\n\tRoom:\u00a0Jakobi 2-438\n<\/p>\n<p>\n\t\u00a0\n<\/p>\n<p>\n\t<strong>Description<\/strong>\n<\/p>\n\n<p dir=\"ltr\">\n\tIn this workshop, we introduce statistical applications of probability theory that are based on Bayes\u2019 theorem. We will learn to work with posterior samples using the simple example of the bootstrap, after which we will apply Bayes\u2019 theorem to binomial models. Then we will enter the world of Markov chain Monte Carlo (MCMC) simulation and briefly foray into the Stan programming language to fit some flexible models (binomial and others). However, we will mainly use the \u2018brms\u2019 package in R, which allows for specification of a large array of regression models in the common R modelling language. 
Finally, we will stick our snouts into the truffles of multilevel shrinkage models.\n<\/p>\n<p>\n\t\u00a0\n<\/p>\n<p>\n\t<strong>About the instructor<\/strong>\n<\/p>\n\n<p dir=\"ltr\">\n\t<b id=\"docs-internal-guid-94fc1e7f-7fff-997a-2d1e-72803a0a232f\">\u00dclo Maiv\u00e4li <\/b>works in the Institute of Technology, University of Tartu. He is interested in molecular biology of protein synthesis, biomedical data analysis, metascience, and small dogs; albeit not necessarily in that order.<br><strong>Taavi P\u00e4ll<\/strong> works in Department of Microbiology, Faculty of Medicine of the University of Tartu. He studies human virome, particularly in immunodeficiency disorders, using metagenomics. Taavi uses R, Python and Snakemake workflow management system.\n<\/p>\n<p>\n\t<\/p><\/div>\n        <\/div>\n        <\/div>\n    <\/div>\n","protected":false},"excerpt":{"rendered":"<p>Peeter Tinits \u2013\u00a0Introduction to R and Tidyverse Lecturer: Peeter Tinits (peeter.tinits@ut.ee), University of Tartu, Tallinn University Date: 24.08. Room: \u00dclikooli 18-139 \u00a0 Description R is a scripting language often used for data processing in humanities and social sciences. 
It provides &#8230;<\/p>\n","protected":false},"author":153,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_acf_changed":false,"inline_featured_image":false,"footnotes":""},"class_list":["post-5","page","type-page","status-publish","hentry"],"acf":[],"_links":{"self":[{"href":"https:\/\/sisu.ut.ee\/digitalmethods\/wp-json\/wp\/v2\/pages\/5","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sisu.ut.ee\/digitalmethods\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/sisu.ut.ee\/digitalmethods\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/sisu.ut.ee\/digitalmethods\/wp-json\/wp\/v2\/users\/153"}],"replies":[{"embeddable":true,"href":"https:\/\/sisu.ut.ee\/digitalmethods\/wp-json\/wp\/v2\/comments?post=5"}],"version-history":[{"count":1,"href":"https:\/\/sisu.ut.ee\/digitalmethods\/wp-json\/wp\/v2\/pages\/5\/revisions"}],"predecessor-version":[{"id":63,"href":"https:\/\/sisu.ut.ee\/digitalmethods\/wp-json\/wp\/v2\/pages\/5\/revisions\/63"}],"wp:attachment":[{"href":"https:\/\/sisu.ut.ee\/digitalmethods\/wp-json\/wp\/v2\/media?parent=5"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}