This image shows Filip Miletić

Filip Miletić

Dr.

Institute for Natural Language Processing
Foundations of Computational Linguistics

Contact

Pfaffenwaldring 5 b
70569 Stuttgart
Room: 01.012

Subject

I am a postdoctoral researcher working on the SemChangeMWE project, headed by Prof. Dr. Sabine Schulte im Walde. I am broadly interested in socially grounded modeling of complex linguistic behaviors from naturally occurring data, with a focus on lexical semantics. Specific areas of interest include:

  • vector space models of lexical semantics
  • computational and variationist sociolinguistics
  • multiword expressions
  • varieties of English and language contact
  • resource creation and evaluation

Abdul Khaliq, M., Chang, P., Ma, M., Pflugfelder, B., Miletić, F. (2024). RAGAR, your falsehood radar: RAG-augmented reasoning for political fact-checking using multimodal large language models. To appear in Proceedings of the Seventh Fact Extraction and VERification Workshop (FEVER). [preprint]

Rassem, M., Tsigkouli, M., Jenkins, C., Miletić, F., Schulte im Walde, S. (2024). Visualising changes in semantic neighbourhoods of English noun compounds over time. To appear in Proceedings of the 4th International Conference on Natural Language Processing for Digital Humanities (NLP4DH).

Knupleš, U., Faleńska, A., Miletić, F. (2024). Gender identity in pretrained language models: An inclusive approach to data collection and probing. To appear in Findings of the Association for Computational Linguistics: EMNLP 2024.

Miletić, F., Przewozny-Desriaux, A., Tanguy, L. (2024). Modeling fine-grained sociolinguistic variation: The promises and pitfalls of Twitter corpora and neural word embeddings. In Kaunisto, M., Schilk, M., editors, Challenges in Corpus Linguistics: Rethinking Corpus Compilation and Analysis. Amsterdam: John Benjamins. [preview]

Chifu, A.-G., Glavaš, G., Iionescu, R. T., Ljubešić, N., Miletić, A., Miletić, F., Scherrer, Y., Vulić, I. (2024). VarDial evaluation campaign 2024: Commonsense reasoning in dialects and multi-label similar language identification. In Proceedings of The Eleventh Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial), Mexico City, Mexico. [pdf]

Miletić, A., Miletić, F. (2024). A gold standard with silver linings: Scaling up annotation for distinguishing Bosnian, Croatian, Montenegrin and Serbian. In Proceedings of the 4th Workshop on Human Evaluation of NLP Systems, Turin, Italy. [pdf]

Mahdizadeh, S., Rassem, M., Jenkins, C., Miletić, F., Schulte im Walde, S. (2024). What can diachronic contexts and topics tell us about the present-day compositionality of English noun compounds? In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), Turin, Italy. [pdf]

Miletić, F., Schulte im Walde, S. (2024). Semantics of multiword expressions in transformer-based models: A survey. Transactions of the Association for Computational Linguistics, 12. [pdf]

Hindennach, S., Shi, L., Miletić, F., Bulling, A. (2024). Mindful explanations: Prevalence and impact of mind attribution in XAI research. In Proceedings of the ACM: Human-Computer Interaction. [pdf]

Jenkins, C., Miletić, F., Schulte im Walde, S. (2023). To split or not to split: Composing compounds in contextual vector spaces. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), Singapore. [pdf]

Miletić, F., Przewozny-Desriaux, A., Tanguy, L. (2023). Understanding computational models of semantic change: New insights from the speech community. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), Singapore. [pdf]

Maurer, M., Jenkins, C., Miletić, F., Schulte im Walde, S. (2023). Classifying noun compounds for present-day compositionality: Contributions of diachronic frequency and productivity patterns. In Proceedings of the 19th Conference on Natural Language Processing (KONVENS 2023), Ingolstadt, Germany. [pdf]

Miletić, F., Schulte im Walde, S. (2023). A systematic search for compound semantics in pretrained BERT architectures. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Dubrovnik, Croatia. [pdf]

Miletić, F., Przewozny-Desriaux, A., Tanguy, L. (2021). Detecting contact-induced semantic shifts: What can embedding-based methods do in practice? In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP), Punta Cana, Dominican Republic. [pdf]

Miletić, F., Przewozny-Desriaux, A., Tanguy, L. (2020). Collecting tweets to investigate regional variation in Canadian English. In Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), Marseille, France. [pdf]

2022-present

Postdoctoral researcher
Institute for Natural Language Processing, University of Stuttgart

Project: Computational models of the emergence and diachronic change of multi-word expression meaning (SemChangeMWE)
PI: Prof. Dr. Sabine Schulte im Walde

2018-2022

PhD in Linguistics
CLLE, CNRS & Toulouse Jean Jaurès University (France)

Thesis: An investigation into contact-induced semantic shifts in Quebec English: Conciliating corpus-based vector models and variationist sociolinguistic inquiry
Advisors: Anne Przewozny-Desriaux & Ludovic Tanguy

2016-2018

MA in Modern Languages and Literatures for Cultural Services (English & French)
University of Genoa (Italy)

Thesis: Contact-induced lexical and morphosyntactic phenomena in Quebec English

2014-2018

Certificate in Humanities and Cultural Heritage
IANUA School for Advanced Studies, University of Genoa (Italy)

2013-2016

BA in Modern Languages and Cultures (English & French)
University of Genoa (Italy)

Thesis: Lexical and morphosyntactic features of Canadian English on Twitter

To the top of the page