Research

Experience

NLP Researcher - UIowa Biocomputing Lab (January 2025 – Present)

I am developing NLP pipelines to uncover and validate new vocabulary for Retrograde Cricopharyngeal Dysfunction (RCPD) from Reddit. Methods include phrase mining and n-gram materialization; embedding-based seed expansion (Word2Vec + Transformer embeddings) with LLM-assisted vetting; topic modeling (BERTopic/LDA); clustering (UMAP/HDBSCAN); and MongoDB-backed ETL at scale. I track and mitigate semantic drift, organize terms into equivalence/inflection groups, and deliver explainable, clinician-facing visualizations and briefs to a multidisciplinary team in Computer Science and Otolaryngology.

Singh Lab (October 2023 – January 2025)

As an undergraduate research assistant in Dr. Rahul Singh's lab in the Computer Science department at the University of Iowa, I worked in a multidisciplinary group of computer scientists and biologists to collect data and present my results. My most recent project applied text analysis and database mining techniques to social media data to gain insights into opioid use trends. Previously, I worked on image segmentation for the development of deep learning models to evaluate the effectiveness of schistosomiasis treatment.

University of Iowa Attention & Perception Lab (October 2024 – Present)

As a research assistant in the Attention & Perception Lab, I work with Dr. Cathleen Moore and Michael Paavola; we used Unity/C# to build a continuous monitoring environment for studying lifeguard behavior. My work includes scene instrumentation, event/time logging, and integrating telemetry to support downstream analysis and training.

University of Iowa DSRI (formerly NADS) (October 2022 – May 2023)

I was an undergraduate research assistant at the University of Iowa Driving Safety Research Institute (DSRI) with Dr. Chris Schwarz. The project applied deep learning and computer vision techniques to a 1/10th-scale car and trailer to create an educational platform for local high schools.

Presentations/Reports

UIowa Biocomputing Lab

Current progress toward expanding the RCPD lexicon on Reddit. We use Word2Vec to surface semantically similar terms (cosine-nearest neighbors) and apply LLM-assisted vetting that emulates WordNet-style relations (synonymy, hypernym/hyponym, holonym/co-hyponym) to accept or reject candidates. The pipeline also includes phrase mining / n-gram materialization, grouping inflectional variants, monitoring and mitigating semantic drift across expansion rounds, and producing clinician-facing visualizations and explainable summaries of the evolving vocabulary.
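
As a rough illustration of the cosine-nearest-neighbor step described above, the sketch below expands a seed term against a handful of toy vectors. It is not the lab's pipeline: the terms, the vector values, the function names (cosine, nearest_neighbors, expand_seeds), and the min_similarity cutoff are all assumptions made up for this example, and the toy dictionary stands in for embeddings that would come from a trained Word2Vec model. Candidates that pass the cutoff would still go to LLM-assisted vetting before being accepted.

```python
import math

# Toy 3-dimensional vectors standing in for trained Word2Vec embeddings.
# Terms and values are invented for illustration only.
EMBEDDINGS = {
    "no_burp": [0.9, 0.1, 0.0],
    "cant_burp": [0.85, 0.15, 0.05],
    "gurgling": [0.6, 0.6, 0.1],
    "bloating": [0.5, 0.7, 0.2],
    "mortgage": [0.0, 0.1, 0.95],
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def nearest_neighbors(seed, embeddings, top_k=3):
    """Return the top_k terms most similar to the seed, best first."""
    seed_vec = embeddings[seed]
    scored = [
        (term, cosine(seed_vec, vec))
        for term, vec in embeddings.items()
        if term != seed
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:top_k]


def expand_seeds(seeds, embeddings, top_k=3, min_similarity=0.8):
    """Collect neighbors of every seed that clear the similarity cutoff.

    The returned candidates are proposals only; a separate vetting step
    (not shown) decides whether each one joins the lexicon.
    """
    candidates = {}
    for seed in seeds:
        for term, score in nearest_neighbors(seed, embeddings, top_k):
            if term not in seeds and score >= min_similarity:
                candidates[term] = max(score, candidates.get(term, 0.0))
    return candidates


candidates = expand_seeds({"no_burp"}, EMBEDDINGS)
for term, score in sorted(candidates.items(), key=lambda p: -p[1]):
    print(f"{term}\t{score:.3f}")
```

Keeping the cutoff fairly strict is one simple guard against semantic drift: terms that are only loosely related to the seed never enter the candidate pool, so later expansion rounds do not build on them.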

Singh Lab

University of Iowa DSRI