Beáta Megyesi

Professor in Computational Linguistics (leave of absence) at Department of Linguistics and Philology

Telephone:: +46 18 471 78 60
E-mail:: Beata.Megyesi@lingfil.uu.se
Visiting address:: Engelska parken
Thunbergsvägen 3H
Postal address:: Box 635
751 26 UPPSALA
Leave of absence:: 2023-08-01 - 2024-07-31

Download contact information for Beáta Megyesi at Department of Linguistics and Philology

CV:: Download CV
ORCID:: 0000-0002-4838-6518

Short presentation

I am a professor of computational linguistics and currently on leave from Uppsala University for a professorship at Stockholm University.

My main research area is natural language processing and digital philology. I conduct research on historical cryptology to develop methods to automatically crack historical ciphers. I also develop tools for the analysis of historical and modern texts in various genres to enable large, quantitative studies for humanities and social sciences.

Keywords

digital humanities
historical cryptology
natural language processing

Biography

Education

Professor of Computational Linguistics, Department of Linguistics and Philology, Uppsala University, 2021
Associate Professor in Computational Linguistics, Department of Linguistics and Philology, Uppsala University, 2013
PhD in Speech Communication, Department of Speech, Music and Hearing, KTH, 2002
B.A. in Computational Linguistics, Department of Linguistics, Stockholm University, 2000

Appointments

Present:

Vice chair and member of the Linguistics review panel at the Swedish Research Council, 2021-2023
Member of the nominating committee of the Northern European Association for Language Technology – NEALT, 2022-2025
Vice-chair and member of the board of the Center for Digital Humanities, Uppsala University, 2021-2023

Past:

President of the Northern European Association for Language Technology –
NEALT, 2020-2021
Head of Department of Linguistics and Philology, 2009-2018
Director of the English Park Campus, Uppsala University, 2017-2018
Vice-president of the Northern European Association for Language Technology –
NEALT (2018-2019)
Member of the board at the Dept. of Linguistics and Philology, 2007–2009, 2010-2012, 2012-2015, 2016-2018
Member of the board of the faculty of languages, Uppsala University, 2008-2011, 2011-2014, 2019-2020
Director of studies at the Department of linguistics and philology, 2007-2009
Program coordinator for the Language Technology Program, Uppsala University, 2004-2007
Member of the board at the Department of Speech, Music and Hearing, 2003-2004

Teaching

Basic level courses

Languages, computers, and text processing (in Swedish)
Advisor for Language Technology Project, 7.5 ECTS
BA thesis supervision

Advanced level courses

Research and Development, 15 ECTS
Digital Philology, 5/7.5 ECTS
Thesis work in language technology, 30 ECTS
Advisor for Language Technology Project, 7.5 ECTS
Master thesis supervision

PhD education

I was co-supervisor: Eva Petterson and Mojgan Seraji

Other things I like: my twins, traveling, Amnesty International, some workout like skiing, piloxing and pump, books, cello, chocolate, margaritas and cosmos, ladies of jazz, Bridges of Madison county, and of course my dearest best friends: girls, you know who you are!, and my (often empty) not-to-do list...

Things I don't like: greed, injustice, and ruling techniques

Research

Research interests

Historical Cryptology
Digital Philology focusing on the automatic analysis of historical texts and student writings
PoS tagging, morphological analysis, chunking, shallow parsing for different types of languages
Parallel corpora and treebanks
Text categorization

Projects

DECRYPT: Decryption of historical manuscripts (PI, Vetenskapsrådet: 2018-2024).
DECODE: Automatic decoding of historical manuscripts (PI, Vetenskapsrådet: 2015-2017)
SweLL - L2 infrastructure: Research Infrastructure for Swedish as a second language (RJ, 2017-2019)
SWE-CLARIN - SWEGRAM: Automatic annotation and analysis of Swedish texts (Swedish Research Council, 2014-2018, 2019-2022)
Multilingual parallel corpora
Swedish treebank
G rammar extraction
Basic Language Resource Kit for Swedish

Publications

Selection of publications

Recent publications

All publications

Articles

Books

Chapters

Conferences

Reports

Other

A Turkish-Swedish parallel corpus (2006)