Maria Antoniak

I’m an Assistant Professor in Computer Science at the University of Colorado Boulder, where I’m also affiliated with the Department of Information Science. My research focuses on natural language processing and cultural analytics.

Previously, I was a Young Investigator at the Allen Institute for AI and a postdoc at the Pioneer Centre for AI at the University of Copenhagen. I completed my PhD in Information Science at Cornell University, advised by David Mimno, and my master’s degree in Computational Linguistics at the University of Washington. I’ve also spent time at places like ETH Zurich, Microsoft Research FATE, Twitter Cortex, and Facebook Core Data Science.

Research Interests

My work is interdisciplinary, and I frequently collaborate with researchers in the humanities and healthcare. Cross-cutting themes in my work include creative uses of NLP methods and studies of online platforms.

📐 Measuring the Reliability of NLP Tools: I’ve shown that popular NLP methods ported to new domains can result in surprising instabilities and biases: for example, word vector similarities require additional stability tests when used to measure social biases. I’m generally interested in datasets, domains/genres, and probing of data-centric methods.

📚 Modeling Narratives: I’ve used NLP models to study storytelling, investigating questions like where people tell stories online and how we can extract narrative pathways and framing. With collaborators in the humanities, I’ve also studied narrative reception, examining topics like how readers of different genres write book reviews and what the “classics” mean to Goodreads reviewers.

⚕️ Person-Centered Healthcare: I’ve worked with interdisciplinary teams of clinicians and researchers at Microsoft, Facebook, the Hospital for General Surgery NY, and the Association of American Medical Colleges. My research has focused on using NLP technologies to examine people’s healthcare experiences: for example, how postpartum people share and frame their childbirth narratives and how an online community collaboratively makes sense of difficult healthcare decisions.

News

☀️ I accepted a tenure-track job offer in Computer Science at the University of Colorado Boulder!

🎓 I’m recruiting students to join CU Boulder in Fall 2026; check the FAQ to learn more!

📝 New work about research cultures and LLM adaptations accepted at ACL

Upcoming Travel & Talks

Jun 2025	Attending FAccT in Athens
May 2025	Invited talk at the Conference on "Social Science and Generative AI" at the Sciences Po médialab in Paris
May 2025	Presenting at the Workshop on Computational Models of Narrative in Geneva
Apr 2025	Leading a workshop at COMPTEXT 2025 in Vienna: slides
Mar 2025	Teaching a course at the Centre for Excellence in the Social Sciences at the University of Warsaw
Mar 2025	Attending the "Doing AI Differently" Workshop hosted by the Alan Turing Institute in London
Mar 2025	Attending the AI STORIES Workshop in Bergen
Feb 2025	Invited talk at the Institute for Analytical Sociology in Norrköping
Feb 2025	Invited to the Lorentz Center workshop on "Impressed by Reading: Measuring the Impact" in Leiden
Jan 2025	Invited talk at the Interdisciplinary Institute for Societal Computing in Saarbrücken
Dec 2024	Attending the Computational Humanities Research (CHR) conference in Aarhus
Oct 2024	Invited talk at the Institute for Natural Language Processing at the University of Stuttgart
Oct 2024	Invited talk at the Statistics Seminar at Uppsala University
Oct 2024	Attending the conference for Danish Digitization, Data Science, and AI in Nyborg
Oct 2024	Invited talk at the NLP Workshop at the Interacting Minds Center at Aarhus University
Sep 2024	Attending the Humanities and AI Virtual Institute at Oxford

Teaching

Fall 2025: NLP for Cultural Analytics (University of Colorado Boulder, Computer Science)

Winter 2023: NLP for Cultural Analytics (University of Washington, Linguistics)

I’ve led or co-led sessions at ICWSM, FAccT, Bell Labs, and the popular NLP+CSS 201 tutorial series. I’ve also taught similar public-facing courses for the Hertie School in Berlin, the Brown Institute at Columbia, and the IDEAS Summer School at Northeastern.

Outreach & Resources

I was one of the lead organizers for AI for Humanists, a series of tutorials and workshops that guide interdisciplinary researchers in using large language models.

I’m the lead builder and maintainer for some cultural analytics tools:

Little Mallet Wrapper (a Python wrapper around the topic modeling library MALLET)
Riveter (a tool to measure connotation frames via verb lexicons)

Media

Interviewed by Chris Potts (Stanford NLP) on his podcast about NLP, novels, the digital humanities, Ukraine, and more
Featured on the Diaries of Social Data Research podcast by Katherine Keith and Lucy Li about my work modeling birth stories
Interviewed by the University of Notre Dame about my career path from the humanities

Recent Service

Action Editor for ARR
Editorial Board Member for the Journal of Cultural Analytics
Advisory Board Member and Guest Editor for the Computational Humanities Research (CHR) Journal
Computational Humanities Research (CHR) 2024 Best Paper Selection Committee
Senior Area Chair for ACL 2025
Senior Area Chair for COLING 2025
Publicity Co-Chair for FAccT 2025
Ethics Co-Chair for NAACL 2024, 2025
Workshops Co-Chair for ICWSM 2024
Reviewer for FAccT 2024, COLM 2024, Workshop on Narrative Understanding (WNU) 2024, etc.

Other Things

I’m the founder and former President of Grads for Gender Inclusion in Computing at Cornell. I also maintain this this repository of resources and advice about preventing harassment in academic research.
My family is Ukrainian, and I spent a year teaching at the Ukrainian Catholic University in Lviv. Please consider taking action to support safety and freedom in Ukraine 🌻
books books books