I’m a researcher and data scientist working in natural language processing, computational social science, and cultural analytics. I’m interested in modeling how people communicate about their personal experiences through text, especially when these communications occur in online communities grounded in shared, structured experiences, like healthcare experiences or cultural objects like books or games. My research relies on unsupervised machine learning methods and re-explores the reliability and evaluation of these methods when applied to small, socially-specific datasets.

My past work has examined how postpartum people share and frame their birth experiences, how online book reviewers use and write about genres, and how the use of word embedding models require additional stability tests when used to measure biases.

I’m currently a PhD candidate in Information Science at Cornell University, where I’m advised by David Mimno. I have a master’s degree in Computational Linguistics from the University of Washington and have worked as a data scientist and research intern at places like Microsoft Research, Twitter Cortex, Facebook Core Data Science, and Pacific Northwest National Laboratory.

I’m on the job market! Please reach out if you think I’d be a good fit for your academic department or industry research group.


Dec 2021 Hosting a tutorial on BERT for the NLP+CSS 201 Tutorial Series
Oct 2021 Selected as a "Rising Star in EECS" by the MIT EECS Rising Stars workshop
Sept 2021 Our paper on storyteller personalities in Boccaccio's Decameron is accepted at the LaTeCH-CLfL Workshop at EMNLP
Sept 2021 I was featured on the Diaries of Social Data Research podcast along with my collaborator Karen Levy
Sept 2021 Speaking at Stanford NLP Seminar on "Modeling Shared Personal Experiences in Online Communities"
Fall 2021 TAing INFO 2950: Intro to Data Science with Matthew Wilkens
Aug 2021 Giving a career advice talk on "Wandering with a Purpose" to the Women in NLP group
Jun 2021 Presenting our paper on seed terms for bias measurement at ACL 2021 and at the Gender Bias in NLP Workshop
Jun 2021 Speaking on a book reviews panel and sharing a talk on "BERT for Humanists" at ACH 2021
Jun 2021 Interning at Twitter Cortex
May 2021 Our paper Bad Seeds: Evaluating Lexical Methods for Bias Measurement is accepted at ACL 2021
Dec 2020 Paper on Goodreads classics published at Post45 & Cultural Analytics
Feb 2021 Speaking at Microsoft's AI & Gaming Research Summit on self disclosure in online gaming communities
Dec 2020 Paper on genre and LibraryThing book reviews published at CSCW

Other Things