SUBTLEX-PT-BR is a 61-million-word corpus of conversational Brazilian Portuguese, compiled from subtitle texts.
Availability: The unigram version of the corpus is available at https://osf.io/vb5yp/
Citation:
Tang, K. (2012). A 61 Million Word Corpus of Brazilian Portuguese Film Subtitles as a Resource for Linguistic Research. UCL Working Papers in Linguistics 24.
SUBTLEX-KR is a 90-million-word corpus of conversational Korean, compiled from subtitle texts.
Availability: Forthcoming. Please email kevin.tang@hhu.de if you would like to be notified when it becomes available.
Citation:
Tang, K., & de Chene, B. (2014, forthcoming). A New Corpus of Colloquial Korean and its Applications. The 14th Laboratory Phonology Conference (LabPhon 14), Tachikawa, Tokyo, Japan. July 2014.
A selection of Praat scripts written for specific purposes is available at https://osf.io/8qaxp/ (please see the wiki section for details).
Please email kevin.tang@hhu.de if you have any questions or suggestions.