I am a researcher at the Department of Knowledge Technologies at the Jožef Stefan Institute and at the Laboratory for Cognitive Modeling at the Faculty for Information and Communication Science, University of Ljubljana.
Publications
Google Scholar
Curriculum Vitae
Download my CV
Active projects
MaCoCu: Massive collection and curation of monolingual and bilingual data: focus on under-resourced languages (EU CEF, 2021-2023)
IMSyPP: Innovative Monitoring Systems and Prevention Policies of Online Hate Speech (EU REC, 2020-2022)
LiLaH: The linguistic landscape of hate speech on social media (Flemish-Slovene bilateral, 2019-2023)
News
- A BERTić model fine-tuned on the NER task has been added to HuggingFace
- The SotA transformer model for Bosnian, Croatian, Montenegrin and Serbian - BERTić - has been released via HuggingFace
- I am co-organizing the VarDial2021 evaluation campaign with the task of Social Media Geolocation, which includes geo-locating tweets written in Croatian, Bosnian, Montenegrin or Serbian
- The SotA NLP technologies for South Slavic languages are now available as a Python package
- I am co-organizing the WMT2020 shared task on similar language translation, which includes Slovene, Croatian and Serbian at WMT for the first time
- I am co-organizing the VarDial2020 evaluation campaign with the task of Social Media Geolocation, which includes geo-locating tweets written in Croatian, Bosnian, Montenegrin or Serbian
- I helped set up the CLASSLA knowledge centre for language technologies for South Slavic languages, part of CLARIN ERIC