Linguistic Data: Quantitative Analysis and Visualisation for theoretical linguists — различия между версиями

Материалы по математике, 2018-19 учебный год
Перейти к: навигация, поиск
(Materials)
(Materials)
Строка 60: Строка 60:
 
|-
 
|-
 
|06.04 || Multiple linear regression || [http://rpubs.com/AllaT/lingdat-multreg mult-regression] [http://math-info.hse.ru/f/2018-19/ling-data/english.csv english.csv]
 
|06.04 || Multiple linear regression || [http://rpubs.com/AllaT/lingdat-multreg mult-regression] [http://math-info.hse.ru/f/2018-19/ling-data/english.csv english.csv]
 +
|| [https://cran.r-project.org/web/packages/jtools/vignettes/summ.html more] on visualising coefficients, [https://www.princeton.edu/~otorres/Regression101R.pdf more] tests
 +
|-
 +
|13.04 || Logistic regression || [https://raw.githubusercontent.com/LingData2019/LingData/master/seminars/2019-04-06/Lab10-practice.Rmd Lab10]
 
|| [https://cran.r-project.org/web/packages/jtools/vignettes/summ.html more] on visualising coefficients, [https://www.princeton.edu/~otorres/Regression101R.pdf more] tests
 
|| [https://cran.r-project.org/web/packages/jtools/vignettes/summ.html more] on visualising coefficients, [https://www.princeton.edu/~otorres/Regression101R.pdf more] tests
 
|}
 
|}

Версия 12:17, 13 апреля 2019

Course info

Dear students,

Here will be published the materials of the course "Linguistic Data: Quantitative Analysis and Visualisation", taught at the Master programme "Linguistic Theory and Language Description" in 2018-2019 academic year.

  • Instructors: Olga Lyashevskaya, George Moroz, Alla Tambovtseva and Ilya Schurov.
  • Modules: 3-4

Software

During this course we will use R as a programming language and RStudio as a GUI.

How to install R and RStudio?

1. Download R (you can choose another mirror here if you wish) and install it on your computer. Make sure you did it before installing RStudio.

2. Download RStudio (you need RStudio Desktop Open Source License) and install it on your computer. It is recommended to create a shortcut for RStudio during installation.

It is possible avoid installing anything on your PC, using online version of RStudio.

How to use RStudio?

Read the instruction here.

For successful submission of assignments you should be able to create and save R code files (.R) and RMarkdown files (.Rmd).

Materials

Date Topic of the lecture Seminar Optional
12.01 Something about data: population vs sample, descriptive statistics problems1 r-basics RMarkdown: official page, cheatsheet
19.01 Population and samples. Working with data in R problems2 samples artists.txt

r-vectors r-dataframes orientation.csv

more on basic graphs in R
26.01 Statistical hypotheses testing binom-test poetry.csv
02.02 Student's t-test. Central limit theorem: recall t-test icelandic.csv asp-paper (Coretta, 2017)
09.02 Confidence Intervals conf_ints poetry.csv icelandic.csv an interactive visualization of CI by K.Magnusson

more on overlapping CI's (by A.Knezevic)

16.02 Data manipulation with tidyverse. Visualisation with ggplot2 class materials
02.03 Chi-squared and Fisher's exact tests chisq-test elision.csv socling.csv
16.03 Correlation coefficients and a simple linear regression corr-regressioneducation.csv chekhov.csv guess correlation game
23.03 Multiple comparisons. ANOVA [anova-mlcomp] icelandic.csv spurious correlations
06.04 Multiple linear regression mult-regression english.csv more on visualising coefficients, more tests
13.04 Logistic regression Lab10 more on visualising coefficients, more tests

R seminars in pdf

12 January: r-basics, 19 January: r-vectors, r-dataframes, r-samples, 26 January: binom-test, 2 February: t-test, 9 February: conf-ints, 02 March: chisq-test, 16 March:

R seminars in .R and .Rmd

12 January: r-basics.R, r-basics.Rmd, 19 January: r-vectors.R, r-vectors.Rmd r-dataframes.R, r-dataframes.Rmd, r-samples.Rmd, 26 January: binom-test.Rmd, 2 February: t-test.R, t-test.Rmd, 9 February: conf-ints.Rmd, conf-ints.R, 2 March: chisq-test.Rmd, chisq-test.R

Homeworks

Final project