
Gender Gap Corpus Annotation
2025
Developed and deployed a corpus annotation pipeline using FastAPI (backend), HTML/JavaScript (frontend), and Docker, leveraging NLP techniques like dependency parsing, POS tagging, and web scraping, while ensuring annotation quality through Inter-annotator Agreement (IAA) analysis and optimizing search with Whoosh.